mirror/zig - zig - Bouvais Git

mirror/zig

mirror of https://github.com/ziglang/zig.git synced 2026-02-03 21:23:36 +00:00

Author	SHA1	Message	Date
mlugg	7e82398cfe	Sema: remove some incorrect calls to `requireRuntimeBlock` Most calls to `requireRuntimeBlock` in Sema are not correct. This function doesn't deal with all of them, but it does deal with ones which have, in combination with the past few commits, introduced real-world regressions. Related: #22353	2024-12-31 09:55:03 +00:00
mlugg	9a70eeeac5	compiler: ensure local `const`s in comptime scope are comptime-known This fixes a bug which exposed a compiler implementation detail (ZIR alloc elision). Previously, `const` declarations with a runtime-known value in a comptime scope were permitted only if AstGen was able to elide the alloc in ZIR, since the error was reported by storing to the comptime alloc. This just adds a new instruction to also emit this error when the alloc is elided.	2024-12-31 09:55:03 +00:00
mlugg	6026a5f217	compiler: ensure result of `block_comptime` is comptime-known To avoid this PR regressing error messages, most of the work here has gone towards improving error notes for why code was comptime-evaluated. ZIR `block_comptime` now stores a "comptime reason", the enum for which is also used by Sema. There are two types in Sema: * `ComptimeReason` represents the reason we started evaluating something at comptime. * `BlockComptimeReason` represents the reason a given block is evaluated at comptime; it's either a `ComptimeReason` with an attached source location, or it's because we're in a function which was called at comptime (and that function's `Block` should be consulted for the "parent" reason). Every `Block` stores a `?BlockComptimeReason`. The old `is_comptime` field is replaced with a trivial `isComptime()` method which returns whether that reason is non-`null`. Lastly, the handling for `block_comptime` has been simplified. It was previously going through an unnecessary runtime-handling path; now, it is a trivial sub block exited through a `break_inline` instruction. Resolves: #22296	2024-12-31 09:55:03 +00:00
mlugg	3afda4322c	compiler: analyze type and value of global declaration separately This commit separates semantic analysis of the annotated type vs value of a global declaration, therefore allowing recursive and mutually recursive values to be declared. Every `Nav` which undergoes analysis now has two corresponding `AnalUnit`s: `.{ .nav_val = n }` and `.{ .nav_ty = n }`. The `nav_val` unit is responsible for fully resolving the `Nav`: determining its value, linksection, addrspace, etc. The `nav_ty` unit, on the other hand, resolves only the information necessary to construct a pointer to the `Nav`: its type, addrspace, etc. (It does also analyze its linksection, but that could be moved to `nav_val` I think; it doesn't make any difference). Analyzing a `nav_ty` for a declaration with no type annotation will just mark a dependency on the `nav_val`, analyze it, and finish. Conversely, analyzing a `nav_val` for a declaration with a type annotation will first mark a dependency on the `nav_ty` and analyze it, using this as the result type when evaluating the value body. The `nav_val` and `nav_ty` units always have references to one another: so, if a `Nav`'s type is referenced, its value implicitly is too, and vice versa. However, these dependencies are trivial, so, to save memory, are only known implicitly by logic in `resolveReferences`. In general, analyzing ZIR `decl_val` will only analyze `nav_ty` of the corresponding `Nav`. There are two exceptions to this. If the declaration is an `extern` declaration, then we immediately ensure the `Nav` value is resolved (which doesn't actually require any more analysis, since such a declaration has no value body anyway). Additionally, if the resolved type has type tag `.@"fn"`, we again immediately resolve the `Nav` value. The latter restriction is in place for two reasons: * Functions are special, in that their externs are allowed to trivially alias; i.e. with a declaration `extern fn foo(...)`, you can write `const bar = foo;`. This is not allowed for non-function externs, and it means that function types are the only place where it is possible for a declaration `Nav` to have a `.@"extern"` value without actually being declared `extern`. We need to identify this situation immediately so that the `decl_ref` can create a pointer to the real extern `Nav`, not this alias. * In certain situations, such as taking a pointer to a `Nav`, Sema needs to queue analysis of a runtime function if the value is a function. To do this, the function value needs to be known, so we need to resolve the value immediately upon `&foo` where `foo` is a function. This restriction is simple to codify into the eventual language specification, and doesn't limit the utility of this feature in practice. A consequence of this commit is that codegen and linking logic needs to be more careful when looking at `Nav`s. In general: * When `updateNav` or `updateFunc` is called, it is safe to assume that the `Nav` being updated (the owner `Nav` for `updateFunc`) is fully resolved. * Any `Nav` whose value is/will be an `@"extern"` or a function is fully resolved; see `Nav.getExtern` for a helper for a common case here. * Any other `Nav` may only have its type resolved. This didn't seem to be too tricky to satisfy in any of the existing codegen/linker backends. Resolves: #131	2024-12-24 02:18:41 +00:00
mlugg	40aafcd6a8	compiler: remove Cau The `Cau` abstraction originated from noting that one of the two primary roles of the legacy `Decl` type was to be the subject of comptime semantic analysis. However, the data stored in `Cau` has always had some level of redundancy. While preparing for #131, I went to remove that redundany, and realised that `Cau` now had exactly one field: `owner`. This led me to conclude that `Cau` is, in fact, an unnecessary level of abstraction over what are in reality fundamentally different kinds of analysis unit (`AnalUnit`). Types, `Nav` vals, and `comptime` declarations are all analyzed in different ways, and trying to treat them as the same thing is counterproductive! So, these 3 cases are now different alternatives in `AnalUnit`. To avoid stealing bits from `InternPool`-based IDs, which are already a little starved for bits due to the sharding datastructures, `AnalUnit` is expanded to 64 bits (30 of which are currently unused). This doesn't impact memory usage too much by default, because we don't store `AnalUnit`s all too often; however, we do store them a lot under `-fincremental`, so a non-trivial bump to peak RSS can be observed there. This will be improved in the future when I made `InternPool.DepEntry` less memory-inefficient. `Zcu.PerThread.ensureCauAnalyzed` is split into 3 functions, for each of the 3 new types of `AnalUnit`. The new logic is much easier to understand, because it avoids conflating the logic of these fundamentally different cases.	2024-12-24 02:18:41 +00:00
mlugg	18362ebe13	Zir: refactor `declaration` instruction representation The new representation is often more compact. It is also more straightforward to understand: for instance, `extern` is represented on the `declaration` instruction itself rather than using a special instruction. The same applies to `var`, making both of these far more compact. This commit also separates the type and value bodies of a `declaration` instruction. This is a prerequisite for #131. In general, `declaration` now directly encodes details of the syntax form used, and the embedded ZIR bodies are for actual expressions. The only exception to this is functions, where ZIR is effectively designed as if we had #1717. `extern fn` declarations are modeled as `extern const` with a function type, and normal `fn` definitions are modeled as `const` with a `func{,_fancy,_inferred}` instruction. This may change in the future, but improving on this was out of scope for this commit.	2024-12-23 21:09:17 +00:00
mlugg	7408679234	compiler: disallow `callconv` etc from depending on function parameters Resolves: #22261	2024-12-18 23:06:35 +00:00
mlugg	242bb44695	compiler: move `RuntimeIndex` to `Sema` Just a small refactor.	2024-12-18 20:34:10 +00:00
mlugg	f154cd1fdc	Sema: disallow unsafe in-memory coercions The error messages here aren't amazing yet, but this is an improvement on status quo, because the current behavior allows false negative compile errors, so effectively miscompiles. Resolves: #15874	2024-12-16 14:53:54 +00:00
Matthew Lugg	3a0a9aa9b8	Merge pull request #22245 from mlugg/zir-no-doc-comments compiler: remove doc comments from Zir	2024-12-16 02:48:28 +00:00
mlugg	847c34ac66	compiler: remove doc comments from Zir This code was left over from the legacy Autodoc implementation. No component of the compiler pipeline actually requires doc comments, so it is a waste of time and space to store them in ZIR.	2024-12-15 16:32:55 +00:00
mlugg	72ba7f7e98	Sema: disallow runtime stores to pointers with comptime-only element types	2024-12-15 11:09:04 +00:00
mlugg	b5d3db5fc6	Sema: do not allow coercing undefined to opaque types	2024-12-15 11:09:04 +00:00
David Rubin	09a8fa2120	ensure `InstMap` capacity before remapping error code	2024-12-14 16:41:17 -05:00
Andrew Kelley	7575f21212	Merge pull request #22157 from mlugg/astgen-error-lazy compiler: allow semantic analysis of files with AstGen errors	2024-12-09 18:32:23 -05:00
Andrew Kelley	8245d7fac0	Merge pull request #22164 from mlugg/astgen-ref-dedup AstGen: correctly deduplicate `ref` of `param` and `alloc_inferred`	2024-12-09 18:04:26 -05:00
wooster0	a221b2fbf2	Sema: fix use of Zcu.LazySrcLoc in error message It currently prints as: :3:18: error: untagged union 'Zcu.LazySrcLoc{ .base_node_inst = InternPool.TrackedInst.Index(104), .offset = Zcu.LazySrcLoc.Offset{ .node_offset = Zcu.LazySrcLoc.Offset.TracedOffset{ .x = -2, .trace = (value tracing disabled) } } }' cannot be converted to integer	2024-12-08 17:23:57 +00:00
mlugg	03f5b967f0	AstGen: correctly deduplicate `ref` of `param` and `alloc_inferred` Both of these instructions were previously under a special case in `rvalue` which resulted in every reference to such an instruction adding a new `ref` instruction. This had the effect that, for instance, `&a != &a` for parameters. Deduplicating these `ref` instructions was problematic for different reasons. For `alloc_inferred`, the problem was that it's not valid to `ref` the alloc until the allocation has been resolved (`resolve_inferred_alloc`), but `AstGen.appendBodyWithFixups` would place the `ref` directly after the `alloc_inferred`. This is solved by bringing `resolve_inferred_alloc` in line with `make_ptr_const` by having it return the final pointer, rather than modifying `sema.inst_map` of the original `alloc_inferred`. That way, the `ref` refers to the `resolve_inferred_alloc` instruction, so is placed immediately after it, avoiding this issue. For `param`, the problem is a bit trickier: `param` instructions live in a body which must contain only `param` instructions, then a `func{,_inferred,_fancy}`, then a `break_inline`. Moreover, `param` instructions may be referenced not only by the function body, but also by other parameters, the return type expression, etc. Each of these bodies requires separate `ref` instructions. This is solved by pulling entries out of `ref_table` after evaluating each component of the function declaration, and appending the refs later on when actually putting the bodies together. This gives way to another issue: if you write `fn f(x: T) @TypeOf(x.foo())`, then since `x.foo()` takes a reference to `x`, this `ref` instruction is now in a comptime context (outside of the `@TypeOf` ZIR body), so emits a compile error. This is solved by loosening the rules around `ref` instructions; because they are not side-effecting, it is okay to allow `ref` of runtime values at comptime, resulting in a runtime-known value in a comptime scope. We already apply this mechanism in some cases; for instance, it's why `runtime_array.len` works in a `comptime` context. In future, we will want to give similar treatment to many operations in Sema: in general, it's fine to apply runtime operations at comptime provided they don't have side effects! Resolves: #22140	2024-12-08 10:53:50 +00:00
mlugg	4d7818a76a	compiler: allow files with AstGen errors to undergo semantic analysis This commit enhances AstGen to introduce a form of error resilience which allows valid ZIR to be emitted even when AstGen errors occur. When a non-fatal AstGen error (e.g. `appendErrorNode`) occurs, ZIR generation is not affected; the error is added to `astgen.errors` and ultimately to the errors stored in `extra`, but that doesn't stop us getting valid ZIR. Fatal AstGen errors (e.g. `failNode`) are a bit trickier. These errors return `error.AnalysisFail`, which is propagated up the stack. In theory, any parent expression can catch this error and handle it, continuing ZIR generation whilst throwing away whatever was lost. For now, we only do this in one place: when creating declarations. If a call to `fnDecl`, `comptimeDecl`, `globalVarDecl`, etc, returns `error.AnalysisFail`, the `declaration` instruction is still created, but its body simply contains the new `extended(astgen_error())` instruction, which instructs Sema to terminate semantic analysis with a transitive error. This means that a fatal AstGen error causes the innermost declaration containing the error to fail, but the rest of the file remains intact. If a source file contains parse errors, or an `error.AnalysisFail` happens when lowering the top-level struct (e.g. there is an error in one of its fields, or a name has multiple declarations), then lowering for the entire file fails. Alongside the existing `Zir.hasCompileErrors` query, this commit introduces `Zir.loweringFailed`, which returns `true` only in this case. The end result here is that files with AstGen failures will almost always still emit valid ZIR, and hence can undergo semantic analysis on the parts of the file which are (from AstGen's perspective) valid. This is a noteworthy improvement to UX, but the main motivation here is actually incremental compilation. Previously, AstGen failures caused lots of semantic analysis work to be thrown out, because all `AnalUnit`s in the file required re-analysis so as to trigger necessary transitive failures and remove stored compile errors which would no longer make sense (because a fresh compilation of this code would not emit those errors, as the units those errors applied to would fail sooner due to referencing a failed file). Now, this case only applies when a file has severe top-level errors, which is far less common than something like having an unused variable. Lastly, this commit changes a few errors in `AstGen` to become fatal when they were previously non-fatal and vice versa. If there is still a reasonable way to continue AstGen and lower to ZIR after an error, it is non-fatal; otherwise, it is fatal. For instance, `comptime const`, while redundant syntax, has a clear meaning we can lower; on the other hand, using an undeclared identifer has no sane lowering, so must trigger a fatal error.	2024-12-05 19:58:38 +00:00
Alex Kladov	6188cb8e50	sema: add a missing errdefer This fix doesn't matter at all in the grand scheme of things, but I think the story behind it is perhaps curious, as it might point at a design flaw in the Sema's error reporting API. So, a story: On lobsters, there's a rather heated discussion on the merits on RAII vs defer. I don't really like participating in heating discussions, but also sort of can't stop thinking about this. My own personal experience with Zig's defer and errdefer is that they are fiddly to get right consistency --- if a program has a lot of resource management to do, I _always_ mess up at least one defer/errdefer. I've found my internal peace by just avoiding spread-out, "pox" resource management, and instead centralizing resource ownership under one of the following patterns: * Either the thing is acquired and released in main * Or main allocates N instances of thing, and then the rest of the code explicitly juggles this finite pool of N. Notably, this juggling typically doesn't involve defer/errdefer at all, as, at this level of precision, there are no `try`s left, so you only code the happy path * Or there's some sort of arena thing, where a bunch of resources have a single owner, the user's don' bother cleaning up their resources, and instead the owner does it once at the end. So I wanted to make a lobster.rs comment in the vein of "yeah, if your program is mostly about resource management, then Zig could be kinda a pain, but that's friction tells you something: perhaps your program shouldn't be about resource management, and instead it should be doing what it is supposed to do?". And, as an evidence for my claim, I wanted to point out some large body of Zig code which doesn't have a lot of errdefers. So, I cracked opened Sema.zig, `ctrl+f` for `defer`, saw whopping 400 something occupancies, and my heart skipped a bit. Looking at the occurrences, _some_ of them were non-resource-related usages of defer. But a lot of them were the following pattern: ```zig const msg = try sema.errMsg(src, "comptime control flow inside runtime block", .{}); errdefer msg.destroy(sema.gpa); ``` This is exactly the thing that I know _I_ can't get right consistently! So, at this point, I made a prediction that at least one of `errdefer`s is missing. So, I looked at the first few `const msg = try` and of course found one without `errdefer`. I am at 0.8 that, even with this PR applied, the claim will still stand --- there will be `errdefer` missing. So it feels like some API re-design is in order, to make sure individual error messages are not resources. Could Sema just own all partially-constructed error messages, and, at a few known safe-points: * if the control flow is normal, assert that there are no in-progress error messages * if we are throwing an error, go and release messages immediately? I am unlikely to do the actual refactor here, but I think it's worth highlighting the overall pattern here. PS: I am only 0.9 sure that what I've found is indeed a bug! I don't understand the code, I did a dumb text search, so I _could_ have made a fool of myself here :P	2024-12-03 14:18:26 -05:00
David Rubin	1d78d4f8c1	sema: hotpath `++` and `**` for array-pointers	2024-11-28 18:05:36 -08:00
David Rubin	bc3094b278	sema: make `++` and `**` return immutable pointers	2024-11-27 20:39:23 -08:00
Jacob Young	c894ac09a3	dwarf: fix stepping through an inline loop containing one statement Previously, stepping from the single statement within the loop would always exit the loop because all of the code unrolled from the loop is associated with the same line and treated by the debugger as one line.	2024-11-24 17:28:12 -05:00
Alex Rønne Petersen	aef5c75602	compiler: Disallow align(0) everywhere in the language. Thus leaving the design space for this alignment value open, e.g. for packing.	2024-11-23 18:44:07 -05:00
xdBronch	5f3a70ed5f	Fix peer type resolution with allowzero pointers	2024-11-20 02:09:50 +02:00
Jacob Young	a8ec306b49	Sema: fix peer resolution alignment between slice and empty struct An empty struct that coerces to an empty array should not force `align(1)` on the resulting slice type.	2024-11-16 21:22:57 -05:00
Robin Voetter	62f4a6b4d8	Merge pull request #21937 from Snektron/spirv-vulkan-ptrs spirv: miscellaneous vulkan + zig stuff	2024-11-09 21:50:33 +01:00
Wooster	35201e9d93	Sema: fix wording in error message It's an FQN, not an actual file name.	2024-11-09 20:21:32 +00:00
Robin Voetter	d35dfc5a3f	add storage_buffer address space	2024-11-08 20:43:57 +01:00
Robin Voetter	688d7055e3	spirv: assembler hacky constant placeholders	2024-11-08 20:43:55 +01:00
Robin Voetter	08ad7afc1e	spirv: forbid pointer arithmetic	2024-11-08 20:38:20 +01:00
Alex Rønne Petersen	e4e3d7ab41	Sema: Disallow calling functions with certain special calling conventions.	2024-11-02 10:44:18 +01:00
Robin Voetter	ba5f57616f	Merge pull request #21861 from alichraghi/master spirv: push constants and small fixes	2024-11-01 03:44:37 +01:00
Ali Cheraghi	a1cb9563f6	spirv: Uniform/PushConstant variables - Rename GPU address spaces to match with SPIR-V spec. - Emit `Block` Decoration for Uniform/PushConstant variables. - Don't emit `OpTypeForwardPointer` for non-opencl targets. (there's still a false-positive about recursive structs) Signed-off-by: Ali Cheraghi <alichraghi@proton.me>	2024-11-01 02:03:33 +03:30
mlugg	d11bbde5f9	compiler: remove anonymous struct types, unify all tuples This commit reworks how anonymous struct literals and tuples work. Previously, an untyped anonymous struct literal (e.g. `const x = .{ .a = 123 }`) was given an "anonymous struct type", which is a special kind of struct which coerces using structural equivalence. This mechanism was a holdover from before we used RLS / result types as the primary mechanism of type inference. This commit changes the language so that the type assigned here is a "normal" struct type. It uses a form of equivalence based on the AST node and the type's structure, much like a reified (`@Type`) type. Additionally, tuples have been simplified. The distinction between "simple" and "complex" tuple types is eliminated. All tuples, even those explicitly declared using `struct { ... }` syntax, use structural equivalence, and do not undergo staged type resolution. Tuples are very restricted: they cannot have non-`auto` layouts, cannot have aligned fields, and cannot have default values with the exception of `comptime` fields. Tuples currently do not have optimized layout, but this can be changed in the future. This change simplifies the language, and fixes some problematic coercions through pointers which led to unintuitive behavior. Resolves: #16865	2024-10-31 20:42:53 +00:00
Robin Voetter	3450809e3d	Merge pull request #21826 from Snektron/spirv-vulkan spirv: vulkan setup	2024-10-29 19:58:51 +01:00
mlugg	05b445a276	Sema: add missing coercion to bool for condbr_inline Also, start using labeled switch statements when dispatching maybe-runtime instructions like condbr to comptime-only variants like condbr_inline. This can't be merged until we get a zig1.wasm update due to #21385. Resolves: #21405	2024-10-28 02:24:59 +00:00
Robin Voetter	49a067ccfe	spirv: forbid merging logical pointers Under some architecture/operating system combinations it is forbidden to return a pointer from a merge, as these pointers must point to a location at compile time. This adds a check for those cases when returning a pointer from a block merge.	2024-10-27 16:31:45 +01:00
Matthew Lugg	4adf63aefc	Merge pull request #21796 from Rexicon226/var-args fix callconv resolution for varargs	2024-10-25 18:31:01 +01:00
Alex Rønne Petersen	03d0e296cb	Merge pull request #21710 from alexrp/function-alignment Some improvements to the compiler's handling of function alignment	2024-10-25 11:10:28 +02:00
David Rubin	97e584a6b9	use `cCallingConvention` instead of `.C` in Sema using `.C` in Sema is incorrect since it will be resolved under the target that Zig was compiled with, not the target build configuration. This is easily solved by just calling `cCallingConvention` on the target to resolve it.	2024-10-25 00:08:41 -07:00
Veikka Tuominen	9ffee5abed	Sema: fix check for whether current AnalUnit is a test function Closes #21159	2024-10-24 13:34:07 +03:00
Andrew Kelley	ba71079837	combine codegen work queue and linker task queue these tasks have some shared data dependencies so they cannot be done simultaneously. Future work should untangle these data dependencies so that more can be done in parallel. for now this commit ensures correctness by making linker input parsing and codegen tasks part of the same queue.	2024-10-23 16:27:39 -07:00
Andrew Kelley	e567abb339	rework linker inputs * Compilation.objects changes to Compilation.link_inputs which stores objects, archives, windows resources, shared objects, and strings intended to be put directly into the dynamic section. Order is now preserved between all of these kinds of linker inputs. If it is determined the order does not matter for a particular kind of linker input, that item should be moved to a different array. * rename system_libs to windows_libs * untangle library lookup from CLI types * when doing library lookup, instead of using access syscalls, go ahead and open the files and keep the handles around for passing to the cache system and the linker. * during library lookup and cache file hashing, use positioned reads to avoid affecting the file seek position. * library directories are opened in the CLI and converted to Directory objects, warnings emitted for those that cannot be opened.	2024-10-23 16:27:38 -07:00
Andrew Kelley	78f643c46d	Merge pull request #21758 from kcbanner/dll_storage_class Add `is_dll_import` to @extern, to support `__declspec(dllimport)` with the MSVC ABI	2024-10-23 15:35:54 -07:00
kcbanner	b87fa93500	Change `ExternOptions.dll_storage_class` to `is_dll_import` It wouldn't make sense to have passe `.export` here, and that was in fact a compile error - so simply make this a bool instead.	2024-10-22 12:41:35 -04:00
kcbanner	a4690ecb1f	Cause a compilation error to occur if using @extern with is_dll_import in a comptime scope. Add a note about thread local / dll import being the cause.	2024-10-22 12:41:35 -04:00
kcbanner	ee25757245	Add support for specifying `dll_storage_class` in @extern	2024-10-22 12:41:35 -04:00
Alex Rønne Petersen	ef72b91ac2	compiler: Remove uses of defaultFunctionAlignment() in the frontend. minFunctionAlignment() is something we can know ahead of time for any given target because it's a matter of ABI. However, defaultFunctionAlignment() is a matter of optimization and every backend can do it differently depending on any number of factors. For example, LLVM will base the choice on the CPU model in its aarch64 backend. So just don't use this value in the frontend.	2024-10-20 09:21:17 +02:00
mlugg	8d5ac6bdea	Sema: add and improve some callconv compile errors	2024-10-19 19:46:07 +01:00

1 2 3 4 5 ...

2299 Commits