mirror of
https://github.com/ziglang/zig.git
synced 2025-12-06 14:23:09 +00:00
reduce AstGen.numberLiteral stack usage
At the moment, the LLVM IR we generate for this fn is
define internal fastcc void @AstGen.numberLiteral ... {
Entry:
...
%16 = alloca %"fmt.parse_float.decimal.Decimal(f128)", align 8
...
That `Decimal` is huuuge! It stores
pub const max_digits = 11564;
digits: [max_digits]u8,
on the stack.
It comes from `convertSlow` function, which LLVM happily inlined,
despite it being the cold path. Forbid inlining that to not penalize
callers with excessive stack usage.
Backstory: I was looking for needles memcpys in TigerBeetle, and came up
with this copyhound.zig tool for doing just that:
ee67e2ab95/src/copyhound.zig
Got curious, run it on the Zig's own code base, and looked at some of
the worst offenders.
List of worst offenders:
warning: crypto.kyber_d00.Kyber.SecretKey.decaps: 7776 bytes memcpy
warning: crypto.ff.Modulus.powPublic: 8160 bytes memcpy
warning: AstGen.numberLiteral: 11584 bytes memcpy
warning: crypto.tls.Client.init__anon_133566: 13984 bytes memcpy
warning: http.Client.connectUnproxied: 16896 bytes memcpy
warning: crypto.tls.Client.init__anon_133566: 16904 bytes memcpy
warning: objcopy.ElfFileHelper.tryCompressSection: 32768 bytes memcpy
Note from Andrew: I removed `noinline` from this commit since it should
be enough to set it to be cold.
This commit is contained in:
parent
e313584a48
commit
772debb03a
@ -32,7 +32,12 @@ pub fn getShift(n: usize) usize {
|
||||
///
|
||||
/// The algorithms described here are based on "Processing Long Numbers Quickly",
|
||||
/// available here: <https://arxiv.org/pdf/2101.11408.pdf#section.11>.
|
||||
///
|
||||
/// Note that this function needs a lot of stack space and is marked
|
||||
/// cold to hint against inlining into the caller.
|
||||
pub fn convertSlow(comptime T: type, s: []const u8) BiasedFp(T) {
|
||||
@setCold(true);
|
||||
|
||||
const MantissaT = mantissaType(T);
|
||||
const min_exponent = -(1 << (math.floatExponentBits(T) - 1)) + 1;
|
||||
const infinite_power = (1 << math.floatExponentBits(T)) - 1;
|
||||
|
||||
Loading…
x
Reference in New Issue
Block a user