mirror/zig - zig - Bouvais Git

mirror/zig

mirror of https://github.com/ziglang/zig.git synced 2026-02-20 08:14:48 +00:00

Author	SHA1	Message	Date
Andrew Kelley	16a2562c3f	zig fmt: implement container decls	2021-02-05 15:47:18 -07:00
Andrew Kelley	cf42ae178d	std.MultiArrayList: use `@memset` builtin for undefined See comment for more details	2021-02-05 15:45:33 -07:00
Isaac Freund	0f3fa4d654	zig fmt: array types	2021-02-05 11:36:19 -08:00
Isaac Freund	6f3b93e2e8	zig fmt: struct and anon array initialization	2021-02-05 10:51:45 -08:00
Isaac Freund	3e960cfffe	zig fmt: float literal with exponent	2021-02-05 10:51:45 -08:00
Isaac Freund	0b4bb9b84f	std.MultiArrayList: implement review comments	2021-02-05 10:51:45 -08:00
Andrew Kelley	7069459a76	zig fmt: implement struct init	2021-02-04 19:59:06 -07:00
Andrew Kelley	8e46d06650	zig fmt: implement fn protos and defers	2021-02-04 16:38:29 -07:00
Asherah Connor	4428acf0f7	zig fmt: deref, unwrap optional	2021-02-04 10:49:45 -08:00
Andrew Kelley	725adf8332	zig fmt: builtin calls and array access	2021-02-03 22:12:11 -07:00
Andrew Kelley	f5279cbada	zig fmt: implement top-level fields	2021-02-03 17:02:12 -07:00
Andrew Kelley	1a83b29bea	zig fmt: implement if, call, field access, assignment	2021-02-02 21:05:53 -07:00
Andrew Kelley	0c6b98b825	zig fmt: implement simple test with doc comments	2021-02-01 21:31:41 -07:00
Andrew Kelley	272a0ab359	zig fmt: implement "line comment followed by top-level comptime"	2021-02-01 20:11:55 -07:00
Andrew Kelley	20554d32c0	zig fmt: start reworking with new memory layout * start implementation of ast.Tree.firstToken and lastToken * clarify some ast.Node doc comments * reimplement renderToken	2021-02-01 17:23:49 -07:00
Andrew Kelley	bf8fafc37d	stage2: tokenizer does not emit line comments anymore only std.zig.render cares about these, and it can find them in the original source easily enough.	2021-01-31 21:57:48 -07:00
Andrew Kelley	4dca99d3f6	stage2: rework AST memory layout This is a proof-of-concept of switching to a new memory layout for tokens and AST nodes. The goal is threefold: * smaller memory footprint * faster performance for tokenization and parsing * most importantly, a proof-of-concept that can be also applied to ZIR and TZIR to improve the entire compiler pipeline in this way. I had a few key insights here: * Underlying premise: using less memory will make things faster, because of fewer allocations and better cache utilization. Also using less memory is valuable in and of itself. * Using a Struct-Of-Arrays for tokens and AST nodes, saves the bytes of padding between the enum tag (which kind of token is it; which kind of AST node is it) and the next fields in the struct. It also improves cache coherence, since one can peek ahead in the tokens array without having to load the source locations of tokens. * Token memory can be conserved by only having the tag (1 byte) and byte offset (4 bytes) for a total of 5 bytes per token. It is not necessary to store the token ending byte offset because one can always re-tokenize later, but also most tokens the length can be trivially determined from the tag alone, and for ones where it doesn't, string literals for example, one must parse the string literal again later anyway in astgen, making it free to re-tokenize. * AST nodes do not actually need to store more than 1 token index because one can poke left and right in the tokens array very cheaply. So far we are left with one big problem though: how can we put AST nodes into an array, since different AST nodes are different sizes? This is where my key observation comes in: one can have a hash table for the extra data for the less common AST nodes! But it gets even better than that: I defined this data that is always present for every AST Node: * tag (1 byte) - which AST node is it * main_token (4 bytes, index into tokens array) - the tag determines which token this points to * struct{lhs: u32, rhs: u32} - enough to store 2 indexes to other AST nodes, the tag determines how to interpret this data You can see how a binary operation, such as `a * b` would fit into this structure perfectly. A unary operation, such as `a` would also fit, and leave `rhs` unused. So this is a total of 13 bytes per AST node. And again, we don't have to pay for the padding to round up to 16 because we store in struct-of-arrays format. I made a further observation: the only kind of data AST nodes need to store other than the main_token is indexes to sub-expressions. That's it. The only purpose of an AST is to bring a tree structure to a list of tokens. This observation means all the data that nodes store are only sets of u32 indexes to other nodes. The other tokens can be found later by the compiler, by poking around in the tokens array, which again is super fast because it is struct-of-arrays, so you often only need to look at the token tags array, which is an array of bytes, very cache friendly. So for nearly every kind of AST node, you can store it in 13 bytes. For the rarer AST nodes that have 3 or more indexes to other nodes to store, either the lhs or the rhs will be repurposed to be an index into an extra_data array which contains the extra AST node indexes. In other words, no hash table needed, it's just 1 big ArrayList with the extra data for AST Nodes. Final observation, no need to have a canonical tag for a given AST. For example: The expression `foo(bar)` is a function call. Function calls can have any number of parameters. However in this example, we can encode the function call into the AST with a tag called `FunctionCallOnlyOneParam`, and use lhs for the function expr and rhs for the only parameter expr. Meanwhile if the code was `foo(bar, baz)` then the AST node would have to be `FunctionCall` with lhs still being the function expr, but rhs being the index into `extra_data`. Then because the tag is `FunctionCall` it means `extra_data[rhs]` is the "start" and `extra_data[rhs+1]` is the "end". Now the range `extra_data[start..end]` describes the list of parameters to the function. Point being, you only have to pay for the extra bytes if the AST actually requires it. There's no limit to the number of different AST tag encodings. Preliminary results: 15% improvement on cache-misses * 28% improvement on total instructions executed * 26% improvement on total CPU cycles * 22% improvement on wall clock time This is 1/4 items on the checklist before this can actually be merged: * [x] parser * [ ] render (zig fmt) * [ ] astgen * [ ] translate-c	2021-01-30 20:16:59 -07:00
Andrew Kelley	766b315b38	std.GeneralPurposeAllocator: logging improvements It now uses the log scope "gpa" instead of "std". Additionally, there is a new config option `verbose_log` which enables info log messages for every allocation. Can be useful when debugging. This option is off by default.	2021-01-30 20:15:26 -07:00
Andrew Kelley	0808d98e10	add std.MultiArrayList Also known as "Struct-Of-Arrays" or "SOA". The purpose of this data structure is to provide a similar API to ArrayList but instead of the element type being a struct, the fields of the struct are in N different arrays, all with the same length and capacity. Having this abstraction means we can put them in the same allocation, avoiding overhead with the allocator. It also saves a tiny bit of overhead from the redundant capacity and length fields, since each struct element shares the same value. This is an alternate implementation to #7854.	2021-01-30 20:12:13 -07:00
Joran Dirk Greef	881ecdc72f	Add MAX_RW_COUNT limit to std.os.pread() Fixes: https://github.com/ziglang/zig/issues/7805	2021-01-25 10:41:38 -08:00
Koakuma	09450419d3	Fix f128 NaN check on big-endian hosts On big-endian hosts, zig_f128_isNaN() takes the high and low halves from the wrong element, resulting in buggy NaN detection behavior. This fixes it.	2021-01-25 10:40:23 -08:00
Timon Kruiper	e23bc1f76a	render: fix bug when rendering struct initializer with length 1 This crashed the compiler when running translate-c. See the added test.	2021-01-25 10:40:00 -08:00
Andrew Kelley	4ca1f4ec2e	Merge pull request #7846 from LemonBoy/filtertest stage1: don't filter test blocks with empty label	2021-01-25 10:39:11 -08:00
Evan Haas	57b2176e28	translate-c: Improve array support 1. For incomplete arrays with initializer list (`int x[] = {1};`) use the initializer size as the array size. 2. For arrays initialized with a string literal translate it as an array of character literals instead of `[*c]const u8` 3. Don't crash if an empty initializer is used for an incomplete array. 4. Add a test for multi-character character constants Additionally lay some groundwork for supporting wide string literals. fixes #4831 #7832 #7842	2021-01-25 10:37:23 -08:00
Joran Dirk Greef	68a040aec7	linux: add fallocate() to io_uring	2021-01-25 10:34:20 -08:00
Timon Kruiper	9238d12537	windows: make sure to handle PATH_NOT_FOUND when deleting files Fixes #7879	2021-01-25 10:33:08 -08:00
Andrew Kelley	0cfa39304b	zig cc: recognize more coff linker options Related: #7874	2021-01-24 14:30:28 -07:00
Andrew Kelley	b56e916fa1	Merge branch 'FireFox317-deadlock-windows-fix' Merges #7861	2021-01-24 12:22:51 -07:00
Andrew Kelley	2b321c25ce	std.Progress: call refreshWithHeldLock as appropriate	2021-01-24 12:22:17 -07:00
Timon Kruiper	4f7d76f19c	fix windows bug in Progress.zig This bug caused the compiler to deadlock when multiple c objects were build in parallel. Thanks @kprotty for finding this bug!	2021-01-24 12:20:51 -07:00
Andrew Kelley	15278b7f4b	Merge pull request #7856 from ziglang/lto add LTO support	2021-01-24 11:09:48 -08:00
Andrew Kelley	0d4b6ac741	add LTO support The CLI gains -flto and -fno-lto options to override the default. However, the cool thing about this is that the defaults are great! In general when you use build-exe in release mode, Zig will enable LTO if it would work and it would help. zig cc supports detecting and honoring the -flto and -fno-lto flags as well. The linkWithLld functions are improved to all be the same with regards to copying the artifact instead of trying to pass single objects through LLD with -r. There is possibly a future improvement here as well; see the respective TODOs. stage1 is updated to support outputting LLVM bitcode instead of machine code when lto is enabled. This allows LLVM to optimize across the Zig and C/C++ code boundary. closes #2845	2021-01-23 18:18:07 -07:00
Andrew Kelley	ab4f3aee3d	stage2: wasm arch does not support -mred-zone flags	2021-01-22 23:35:32 -07:00
Andrew Kelley	3647784d05	stage2: add missing frexpl.c to mingw c source file list	2021-01-22 23:35:13 -07:00
LemonBoy	134f5fd3d6	std: Update `test ""` to `test` where it makes sense	2021-01-22 15:46:58 +01:00
LemonBoy	ac004e1bf1	stage1: Allow nameless test blocks Nameless blocks are never filtered, the test prefix is still applied.	2021-01-22 15:46:58 +01:00
Jakub Konka	843d91e75d	Bring back stack trace printing on ARM Darwin This temporary patch fixes a segfault caused by miscompilation by the LLD when generating stubs for initialization of thread local storage. We effectively bypass TLS in the default panic handler so that no segfault is generated and the stack trace is correctly reported back to the user. Note that, this is linked directly to a bigger issue with LLD ziglang/zig#7527 and when resolved, we only need to remove the `comptime` code path introduced with this patch to use the default panic handler that relies on TLS. Co-authored-by: Andrew Kelley <andrew@ziglang.org>	2021-01-21 23:20:42 +01:00
LemonBoy	fc5ae1c409	stage1: don't filter test blocks with empty label The common pattern of including a file containing all the tests in a empty-label test block breaks down when using --test-filter.	2021-01-21 09:48:57 +01:00
Evan Haas	bea791b639	translate-c: fix variadic function calls 1702b413 introduced a bug with variadic function calls - trying to access the paramType of non-existent parameters.	2021-01-20 22:26:18 -08:00
Jakub Konka	58344e0017	Merge pull request #7829 from kubkon/macho-safer stage2 macho: make int casts fallible where necessary	2021-01-20 17:28:31 +01:00
Andrew Kelley	8098b3f84c	stage2: implement TZIR printing for call instruction	2021-01-19 21:09:46 -07:00
Rafael Ristovski	41e6aa78bb	zig cc: Support reading input from stdin This fixes #6271, which allows using `zig cc` with meson.	2021-01-19 17:23:44 -08:00
Andrew Kelley	072d1e088c	stage2: fix anonymous Decl ty/val wrong arena string literals and error set types were allocating the ty/val fields of the anonymous Decl into the owner Decl's arena, rather than the new anonymous Decl's arena as intended. This caused use of undefined value later on in the pipeline.	2021-01-19 16:25:55 -07:00
Andrew Kelley	1af31baf0b	stage2: -Dlog enables all logging, log scopes can be set at runtime Previously you had to recompile if you wanted to change the log scopes that get printed. Now, log scopes can be set at runtime, and -Dlog controls whether all logging is available at runtime. Purpose here is a nicer development experience. Most likely stage2 developers will always want -Dlog enabled and then pass --debug-log scopes when debugging particular issues.	2021-01-19 15:49:08 -07:00
Jakub Konka	a26ab9afee	Backport Elf changes from d5d0619	2021-01-19 22:54:34 +01:00
Jakub Konka	0e56d4cc02	stage2: converge x86_64 and aarch64 tests on macOS	2021-01-19 22:39:49 +01:00
Jakub Konka	5d4401ceec	macho: fix overflowing u64 range	2021-01-19 22:39:49 +01:00
Jakub Konka	e726868b02	macho: reuse existing names from the string table	2021-01-19 22:39:49 +01:00
Jakub Konka	7d3aa58e16	macho: make int casts safer	2021-01-19 22:39:49 +01:00
Andrew Kelley	287f640cc9	stage2: ELF: fix crash when only 1 function and it gets updated	2021-01-19 14:08:43 -07:00

1 2 3 4 5 ...

12474 Commits