mirror/zig - zig - Bouvais Git

mirror of https://github.com/ziglang/zig.git synced 2025-12-24 07:03:11 +00:00

Author	SHA1	Message	Date
David Rubin	598413357d	Sema: use unwrapped generic owner in `getFuncInstanceIes`	2025-03-25 15:24:41 +01:00
Alex Rønne Petersen	5c44934e20	Move the compiler's LLVM bitcode builder to std.zig.llvm.	2025-02-27 01:32:49 -05:00
Alex Rønne Petersen	6ba785584a	compiler: Implement @disableIntrinsics() builtin function. Closes #21833. Closes #22110.	2025-02-23 04:08:56 +01:00
Meghan Denny	9142482372	std.ArrayList: popOrNull() -> pop() [v2] (#22720)	2025-02-10 04:21:31 +00:00
mlugg	3ce857d054	Sema: fix incorrectly succeeding type resolution Resolves: #21436	2025-02-05 18:31:39 +00:00
mlugg	3ca588bcc6	compiler: integrate importing ZON with incremental compilation The changes from a few commits earlier, where semantic analysis no longer occurs if any Zig files failed to lower to ZIR, mean `file` dependencies are no longer necessary! However, we now need them for ZON files, to be invalidated whenever a ZON file changes.	2025-02-04 16:20:29 +00:00
Mason Remaley	13c6eb0d71	compiler,std: implement ZON support This commit allows using ZON (Zig Object Notation) in a few ways. * `@import` can be used to load ZON at comptime and convert it to a normal Zig value. In this case, `@import` must have a result type. * `std.zon.parse` can be used to parse ZON at runtime, akin to the parsing logic in `std.json`. * `std.zon.stringify` can be used to convert arbitrary data structures to ZON at runtime, again akin to `std.json`.	2025-02-03 09:14:37 +00:00
Jacob Young	b9531f5de6	x86_64: rewrite float vector conversions	2025-01-31 23:00:34 -05:00
Matthew Lugg	3767b08039	Merge pull request #22602 from mlugg/incr-embedfile incremental: handle `@embedFile`	2025-01-26 01:41:56 +00:00
mlugg	f47b8de2ad	incremental: handle `@embedFile` Uses of `@embedFile` register dependencies on the corresponding `Zcu.EmbedFile`. At the start of every update, we iterate all embedded files and update them if necessary, and invalidate the dependencies if they changed. In order to properly integrate with the lazy analysis model, failed embed files are now reported by the `AnalUnit` which actually used `@embedFile`; the filesystem error is stored in the `Zcu.EmbedFile`. An incremental test is added covering incremental updates to embedded files, and I have verified locally that dependency invalidation is working correctly.	2025-01-25 06:07:08 +00:00
Jacob Young	c7433212d1	x86_64: rewrite scalar and vector int `@min` and `@max`	2025-01-24 21:02:32 -05:00
Jacob Young	b1fa89439a	x86_64: rewrite float vector `@abs` and equality comparisons	2025-01-24 20:56:11 -05:00
mlugg	1bce01de97	compiler: pass error return traces everywhere	2025-01-22 02:22:56 -05:00
mlugg	0ec6b2dd88	compiler: simplify generic functions, fix issues with inline calls The original motivation here was to fix regressions caused by #22414. However, while working on this, I ended up discussing a language simplification with Andrew, which changes things a little from how they worked before #22414. The main user-facing change here is that any reference to a prior function parameter, even if potentially comptime-known at the usage site or even not analyzed, now makes a function generic. This applies even if the parameter being referenced is not a `comptime` parameter, since it could still be populated when performing an inline call. This is a breaking language change. The detection of this is done in AstGen; when evaluating a parameter type or return type, we track whether it referenced any prior parameter, and if so, we mark this type as being "generic" in ZIR. This will cause Sema to not evaluate it until the time of instantiation or inline call. A lovely consequence of this from an implementation perspective is that it eliminates the need for most of the "generic poison" system. In particular, `error.GenericPoison` is now completely unnecessary, because we identify generic expressions earlier in the pipeline; this simplifies the compiler and avoids redundant work. This also entirely eliminates the concept of the "generic poison value". The only remnant of this system is the "generic poison type" (`Type.generic_poison` and `InternPool.Index.generic_poison_type`). This type is used in two places: * During semantic analysis, to represent an unknown result type. * When storing generic function types, to represent a generic parameter/return type. It's possible that these use cases should instead use `.none`, but I leave that investigation to a future adventurer. One last thing. Prior to #22414, inline calls were a little inefficient, because they re-evaluated even non-generic parameter types whenever they were called. Changing this behavior is what ultimately led to #22538. Well, because the new logic will mark a type expression as generic if there is any chance its resolved type could differ in an inline call, this redundant work is unnecessary! So, this is another way in which the new design reduces redundant work and complexity. Resolves: #22494 Resolves: #22532 Resolves: #22538	2025-01-21 02:41:42 +00:00
mlugg	9804cc8bc6	all: update to `std.builtin.Type.{Pointer,Array,StructField}` field renames	2025-01-16 12:49:58 +00:00
mlugg	d00e05f186	all: update to `std.builtin.Type.Pointer.Size` field renames This was done by regex substitution with `sed`. I then manually went over the entire diff and fixed any incorrect changes. This diff also changes a lot of `callconv(.C)` to `callconv(.c)`, since my regex happened to also trigger here. I opted to leave these changes in, since they are a correct migration, even if they're not the one I was trying to do!	2025-01-16 12:46:29 +00:00
Andrew Kelley	a7bd1a631b	wasm codegen: fix mistaking extern data as function	2025-01-15 15:11:37 -08:00
Andrew Kelley	eb943890d9	resolve merge conflicts with 497592c9b45a94fb7b6028bf45b80f183e395a9b	2025-01-15 15:11:36 -08:00
Andrew Kelley	91efc5c98b	wasm linker: fix calling imported functions and more disciplined type safety for output function indexes	2025-01-15 15:11:35 -08:00
Andrew Kelley	458f658b42	wasm linker: implement missing logic fix some compilation errors for reworked Emit now that it's actually referenced introduce DataSegment.Id for sorting data both from object files and from the Zcu. introduce optimization: data segment sorting includes a descending sort on reference count so that references to data can be smaller integers leading to better LEB encodings. this optimization is skipped for object files. implement uav address access function which is based on only 1 hash table lookup to find out the offset after sorting.	2025-01-15 15:11:35 -08:00
Andrew Kelley	d6b42e585b	wasm linker: implement name subsection unlike the previous implementation, we can simply iterate an array.	2025-01-15 15:11:35 -08:00
Andrew Kelley	c96e23632f	frontend: add const to more Zcu pointers	2025-01-15 15:11:35 -08:00
Jacob Young	02692ad78c	cbe: fix miscomps of the compiler	2025-01-10 06:10:15 -05:00
Jacob Young	dde3116e50	Dwarf: implement new incremental line number update API	2025-01-05 02:20:56 +00:00
mlugg	f01029c4af	incremental: new `AnalUnit` to group dependencies on `std.builtin` decls This commit reworks how values like the panic handler function are memoized during a compiler invocation. Previously, the value was resolved by whichever analysis requested it first, and cached on `Zcu`. This is problematic for incremental compilation, as after the initial resolution, no dependencies are marked by users of this memoized state. This is arguably acceptable for `std.builtin`, but it's definitely not acceptable for the panic handler/messages, because those can be set by the user (`std.builtin.Panic` checks `@import("root").Panic`). So, here we introduce a new kind of `AnalUnit`, called `memoized_state`. There are 3 such units: * `.{ .memoized_state = .va_list }` resolves the type `std.builtin.VaList` * `.{ .memoized_state = .panic }` resolves `std.Panic` * `.{ .memoized_state = .main }` resolves everything else we want These units essentially "bundle" the resolution of their corresponding declarations, storing the results into fields on `Zcu`. This way, when, for instance, a function wants to call the panic handler, it simply runs `ensureMemoizedStateResolved`, registering one dependency, and pulls the values from the `Zcu`. This "bundling" minimizes dependency edges. The 3 units are separated to allow them to act independently: for instance, the panic handler can use `std.builtin.Type` without triggering a dependency loop.	2025-01-04 07:51:19 +00:00
mlugg	f818098971	incremental: correctly return `error.AnalysisFail` when type structure changes `Zcu.PerThread.ensureTypeUpToDate` is set up in such a way that it only returns the updated type the first time it is called. In general, that's okay; however, the exception is that we want the function to continue returning `error.AnalysisFail` when the type has been lost, or its number of captures changed. Therefore, the check for this case now happens before the up-to-date success return. For simplicity, the number of captures is now handled by intentionally losing the instruction in `Zcu.mapOldZirToNew`, since there is nothing to gain from tracking a type when old instances of it can never be reused.	2025-01-04 05:44:29 +00:00
Jacob Young	ec60156f18	InternPool: fix leak when the last namespace bucket is full	2024-12-29 15:28:40 -05:00
mlugg	42dac40b3f	InternPool: fix segfault in `rehashTrackedInsts` The `.empty` map in a shard is weird: it claims to have capacity 1, but you're not actually allowed to use that capacity. That's fine for the normal insertion algorithm, because it always resizes to a higher capacity when inserting the initial element. However, `rehashTrackedInsts` was not aware of this caveat, so sometimes tried to store to the single element of the `empty` map. This system exists to avoid an extra branch in the main resizing logic (since `new_cap = old_cap * 2` only works if the capacity is never zero). However, it's fine for `rehashTrackedInsts` to have an extra branch to handle this case, since it's literally called once per update.	2024-12-26 02:19:02 +00:00
mlugg	3afda4322c	compiler: analyze type and value of global declaration separately This commit separates semantic analysis of the annotated type vs value of a global declaration, therefore allowing recursive and mutually recursive values to be declared. Every `Nav` which undergoes analysis now has two corresponding `AnalUnit`s: `.{ .nav_val = n }` and `.{ .nav_ty = n }`. The `nav_val` unit is responsible for fully resolving the `Nav`: determining its value, linksection, addrspace, etc. The `nav_ty` unit, on the other hand, resolves only the information necessary to construct a pointer to the `Nav`: its type, addrspace, etc. (It does also analyze its linksection, but that could be moved to `nav_val` I think; it doesn't make any difference). Analyzing a `nav_ty` for a declaration with no type annotation will just mark a dependency on the `nav_val`, analyze it, and finish. Conversely, analyzing a `nav_val` for a declaration with a type annotation will first mark a dependency on the `nav_ty` and analyze it, using this as the result type when evaluating the value body. The `nav_val` and `nav_ty` units always have references to one another: so, if a `Nav`'s type is referenced, its value implicitly is too, and vice versa. However, these dependencies are trivial, so, to save memory, are only known implicitly by logic in `resolveReferences`. In general, analyzing ZIR `decl_val` will only analyze `nav_ty` of the corresponding `Nav`. There are two exceptions to this. If the declaration is an `extern` declaration, then we immediately ensure the `Nav` value is resolved (which doesn't actually require any more analysis, since such a declaration has no value body anyway). Additionally, if the resolved type has type tag `.@"fn"`, we again immediately resolve the `Nav` value. The latter restriction is in place for two reasons: * Functions are special, in that their externs are allowed to trivially alias; i.e. with a declaration `extern fn foo(...)`, you can write `const bar = foo;`. This is not allowed for non-function externs, and it means that function types are the only place where it is possible for a declaration `Nav` to have a `.@"extern"` value without actually being declared `extern`. We need to identify this situation immediately so that the `decl_ref` can create a pointer to the real extern `Nav`, not this alias. * In certain situations, such as taking a pointer to a `Nav`, Sema needs to queue analysis of a runtime function if the value is a function. To do this, the function value needs to be known, so we need to resolve the value immediately upon `&foo` where `foo` is a function. This restriction is simple to codify into the eventual language specification, and doesn't limit the utility of this feature in practice. A consequence of this commit is that codegen and linking logic needs to be more careful when looking at `Nav`s. In general: * When `updateNav` or `updateFunc` is called, it is safe to assume that the `Nav` being updated (the owner `Nav` for `updateFunc`) is fully resolved. * Any `Nav` whose value is/will be an `@"extern"` or a function is fully resolved; see `Nav.getExtern` for a helper for a common case here. * Any other `Nav` may only have its type resolved. This didn't seem to be too tricky to satisfy in any of the existing codegen/linker backends. Resolves: #131	2024-12-24 02:18:41 +00:00
mlugg	40aafcd6a8	compiler: remove Cau The `Cau` abstraction originated from noting that one of the two primary roles of the legacy `Decl` type was to be the subject of comptime semantic analysis. However, the data stored in `Cau` has always had some level of redundancy. While preparing for #131, I went to remove that redundancy, and realised that `Cau` now had exactly one field: `owner`. This led me to conclude that `Cau` is, in fact, an unnecessary level of abstraction over what are in reality fundamentally different kinds of analysis unit (`AnalUnit`). Types, `Nav` vals, and `comptime` declarations are all analyzed in different ways, and trying to treat them as the same thing is counterproductive! So, these 3 cases are now different alternatives in `AnalUnit`. To avoid stealing bits from `InternPool`-based IDs, which are already a little starved for bits due to the sharding datastructures, `AnalUnit` is expanded to 64 bits (30 of which are currently unused). This doesn't impact memory usage too much by default, because we don't store `AnalUnit`s all too often; however, we do store them a lot under `-fincremental`, so a non-trivial bump to peak RSS can be observed there. This will be improved in the future when I make `InternPool.DepEntry` less memory-inefficient. `Zcu.PerThread.ensureCauAnalyzed` is split into 3 functions, for each of the 3 new types of `AnalUnit`. The new logic is much easier to understand, because it avoids conflating the logic of these fundamentally different cases.	2024-12-24 02:18:41 +00:00
mlugg	18362ebe13	Zir: refactor `declaration` instruction representation The new representation is often more compact. It is also more straightforward to understand: for instance, `extern` is represented on the `declaration` instruction itself rather than using a special instruction. The same applies to `var`, making both of these far more compact. This commit also separates the type and value bodies of a `declaration` instruction. This is a prerequisite for #131. In general, `declaration` now directly encodes details of the syntax form used, and the embedded ZIR bodies are for actual expressions. The only exception to this is functions, where ZIR is effectively designed as if we had #1717. `extern fn` declarations are modeled as `extern const` with a function type, and normal `fn` definitions are modeled as `const` with a `func{,_fancy,_inferred}` instruction. This may change in the future, but improving on this was out of scope for this commit.	2024-12-23 21:09:17 +00:00
Jacob Young	5776d8f270	lldb: add pretty printer for cau and nav indices	2024-12-20 22:51:20 -05:00
Jacob Young	5c76e08f49	lldb: add pretty printer for intern pool indices	2024-12-20 22:51:20 -05:00
mlugg	7408679234	compiler: disallow `callconv` etc from depending on function parameters Resolves: #22261	2024-12-18 23:06:35 +00:00
mlugg	242bb44695	compiler: move `RuntimeIndex` to `Sema` Just a small refactor.	2024-12-18 20:34:10 +00:00
Jacob Young	737154fcd8	InternPool: fix typo	2024-12-17 17:26:55 -05:00
Jacob Young	8c0628d0e2	Dwarf: include comptime-only values in debug info	2024-12-16 17:25:52 -05:00
Jacob Young	1983adb8ae	InternPool: we have pointer subtraction now!	2024-12-16 15:11:23 -05:00
Carl Åstholm	b352595aa2	Add compiler internals tests There are several test decls inside `/src` that are not currently being tested and have bitrotted as a result. This commit revives those tests and adds the `test-compiler-internals` set of tests which tests everything reachable from `/src/main.zig`.	2024-12-13 08:49:02 -05:00
Andrew Kelley	7575f21212	Merge pull request #22157 from mlugg/astgen-error-lazy compiler: allow semantic analysis of files with AstGen errors	2024-12-09 18:32:23 -05:00
mlugg	135c733eef	InternPool: fix crash in `rehashTrackedInsts` When a shard has zero elements, we don't need to reserve any capacity.	2024-12-08 10:53:51 +00:00
mlugg	7f3211a101	compiler: incremental compilation fixes The previous commit exposed some bugs in incremental compilation. This commit fixes those, and adds a little more logging for debugging incremental compilation. Also, allow `ast-check -t` to dump ZIR when there are non-fatal AstGen errors.	2024-12-05 19:58:42 +00:00
David Rubin	a6af55cc6e	ip: cleanup `@constCast` usages	2024-11-25 18:41:36 -05:00
mlugg	d11bbde5f9	compiler: remove anonymous struct types, unify all tuples This commit reworks how anonymous struct literals and tuples work. Previously, an untyped anonymous struct literal (e.g. `const x = .{ .a = 123 }`) was given an "anonymous struct type", which is a special kind of struct which coerces using structural equivalence. This mechanism was a holdover from before we used RLS / result types as the primary mechanism of type inference. This commit changes the language so that the type assigned here is a "normal" struct type. It uses a form of equivalence based on the AST node and the type's structure, much like a reified (`@Type`) type. Additionally, tuples have been simplified. The distinction between "simple" and "complex" tuple types is eliminated. All tuples, even those explicitly declared using `struct { ... }` syntax, use structural equivalence, and do not undergo staged type resolution. Tuples are very restricted: they cannot have non-`auto` layouts, cannot have aligned fields, and cannot have default values with the exception of `comptime` fields. Tuples currently do not have optimized layout, but this can be changed in the future. This change simplifies the language, and fixes some problematic coercions through pointers which led to unintuitive behavior. Resolves: #16865	2024-10-31 20:42:53 +00:00
Andrew Kelley	78f643c46d	Merge pull request #21758 from kcbanner/dll_storage_class Add `is_dll_import` to @extern, to support `__declspec(dllimport)` with the MSVC ABI	2024-10-23 15:35:54 -07:00
kcbanner	7edd69d8aa	tests: add tests for is_dll_import externs - tests/standalone/extern wasn't running its test step - add compile error tests for thread local / dll import @extern in a comptime scope	2024-10-22 18:46:14 -04:00
kcbanner	ee25757245	Add support for specifying `dll_storage_class` in @extern	2024-10-22 12:41:35 -04:00
mlugg	0b786059b5	compiler: avoid unreasonable eval branch quotas Using `@FieldType` (#21702).	2024-10-19 19:21:17 +01:00
mlugg	ec19086aa0	compiler: remove @setAlignStack This commit finishes implementing #21209 by removing the `@setAlignStack` builtin in favour of `CallingConvention` payloads. The x86_64 backend is updated to use the stack alignment given in the calling convention (the LLVM backend was already updated in a previous commit). Resolves: #21209	2024-10-19 19:15:23 +01:00
mlugg	bc797a97b1	std: update for new `CallingConvention` The old `CallingConvention` type is replaced with the new `NewCallingConvention`. References to `NewCallingConvention` in the compiler are updated accordingly. In addition, a few parts of the standard library are updated to use the new type correctly.	2024-10-19 19:15:23 +01:00

317 Commits