mirror/zig - zig - Bouvais Git

mirror/zig

mirror of https://github.com/ziglang/zig.git synced 2026-01-13 02:45:11 +00:00

Author	SHA1	Message	Date
Will Lillis	953355ebea	fix: error on non-exhaustive enums with zero width backing type (#21374 ) Co-authored-by: WillLillis <wlillis@umass.edu>	2025-02-02 03:36:16 +00:00
Andrew Kelley	963651bbf2	Merge pull request #22672 from jacobly0/x86_64-rewrite x86_64: rewrite float conversions	2025-02-01 14:32:43 -08:00
mlugg	3924f173af	compiler: do not propagate result type to `try` operand This commit effectively reverts 9e683f0, and hence un-accepts #19777. While nice in theory, this proposal turned out to have a few problems. Firstly, supplying a result type implicitly coerces the operand to this type -- that's the main point of result types! But for `try`, this is actually a bad idea; we want a redundant `try` to be a compile error, not to silently coerce the non-error value to an error union. In practice, this didn't always happen, because the implementation was buggy anyway; but when it did, it was really quite silly. For instance, `try try ... try .{ ... }` was an accepted expression, with the inner initializer being initially coerced to `E!E!...E!T`. Secondly, the result type inference here didn't play nicely with `return`. If you write `return try`, the operand would actually receive a result type of `E!E!T`, since the `return` gave a result type of `E!T` and the `try` wrapped it in another error union. More generally, the problem here is that `try` doesn't know when it should or shouldn't nest error unions. This occasionally broke code which looked like it should work. So, this commit prevents `try` from propagating result types through to its operand. A key motivation for the original proposal here was decl literals; so, as a special case, `try .foo(...)` is still an allowed syntax form, caught by AstGen and specially lowered. This does open the doors to allowing other special cases for decl literals in future, such as `.foo(...) catch ...`, but those proposals are for another time. Resolves: #21991 Resolves: #22633	2025-02-01 15:48:45 +00:00
mlugg	149031204c	Sema: skip aliasing check and runtime operation for `@memcpy` of zero-bit type This check isn't valid in such cases, because the source and destination pointers both refer to zero bits of memory, meaning they effectively never alias. Resolves: #21655	2025-02-01 09:48:18 +00:00
mlugg	d97441d37e	Sema: fix `@splat` of OPV arrays	2025-02-01 09:46:29 +00:00
Jacob Young	b9531f5de6	x86_64: rewrite float vector conversions	2025-01-31 23:00:34 -05:00
Jacob Young	afa74c6b21	Sema: introduce all_vector_instructions backend feature Sema is arbitrarily scalarizing some operations, which means that when I try to implement vectorized versions of those operations in a backend, they are impossible to test due to Sema not producing them. Now, I can implement them and then temporarily enable the new feature for that backend in order to test them. Once the backend supports all of them, the feature can be permanently enabled. This also deletes the Air instructions `int_from_bool` and `int_from_ptr`, which are just bitcasts with a fixed result type, since changing `un_op` to `ty_op` takes up the same amount of memory.	2025-01-31 23:00:34 -05:00
mlugg	b01d6b156c	compiler: add `intcast_safe` AIR instruction This instruction is like `intcast`, but includes two safety checks: * Checks that the int is in range of the destination type * If the destination type is an exhaustive enum, checks that the int is a named enum value This instruction is locked behind the `safety_checked_instructions` backend feature; if unsupported, Sema will emit a fallback, as with other safety-checked instructions. This instruction is used to add a missing safety check for `@enumFromInt` truncating bits. This check also has a fallback for backends which do not yet support `safety_checked_instructions`. Resolves: #21946	2025-01-30 14:47:59 +00:00
mlugg	107b65ec5d	Sema: explain why we tried to call an `extern fn` at comptime I recently saw a user hit the "comptime call of extern function" error, and get confused because they didn't know why the scope was `comptime`. So, use `explainWhyBlockIsComptime` on this and related errors to add all the relevant notes. The added test case shows the motivating situation.	2025-01-29 18:43:24 +00:00
mlugg	71d16106ad	Sema: `@memcpy` changes * The langspec definition of `@memcpy` has been changed so that the source and destination element types must be in-memory coercible, allowing all such calls to be raw copying operations, not actually applying any coercions. * Implement aliasing check for comptime `@memcpy`; a compile error will now be emitted if the arguments alias. * Implement more efficient comptime `@memcpy` by loading and storing a whole array at once, similar to how `@memset` is implemented.	2025-01-29 06:35:22 +00:00
Will Lillis	672bc8141f	fix: Only suggest try on destructure of error union if payload type can be destructured (#21510 )	2025-01-26 19:38:07 +01:00
Matthew Lugg	3767b08039	Merge pull request #22602 from mlugg/incr-embedfile incremental: handle `@embedFile`	2025-01-26 01:41:56 +00:00
Andrew Kelley	d0d5ca2b6c	Merge pull request #22581 from jacobly0/x86_64-rewrite x86_64: rewrite `@abs` on floats	2025-01-25 07:30:45 -08:00
mlugg	f47b8de2ad	incremental: handle `@embedFile` Uses of `@embedFile` register dependencies on the corresponding `Zcu.EmbedFile`. At the start of every update, we iterate all embedded files and update them if necessary, and invalidate the dependencies if they changed. In order to properly integrate with the lazy analysis model, failed embed files are now reported by the `AnalUnit` which actually used `@embedFile`; the filesystem error is stored in the `Zcu.EmbedFile`. An incremental test is added covering incremental updates to embedded files, and I have verified locally that dependency invalidation is working correctly.	2025-01-25 06:07:08 +00:00
Jacob Young	c7433212d1	x86_64: rewrite scalar and vector int `@min` and `@max`	2025-01-24 21:02:32 -05:00
Jacob Young	b1fa89439a	x86_64: rewrite float vector `@abs` and equality comparisons	2025-01-24 20:56:11 -05:00
mlugg	b6726913d3	Zcu: remove `null_stack_trace` The new simplifications to the panic handler have eliminated the need for this piece of memoized state.	2025-01-24 22:33:23 +00:00
mlugg	5a6666db55	all: update for `panic.unwrapError` and `panic.call` signature changes	2025-01-24 22:33:23 +00:00
mlugg	b0a8931690	Sema: prepare to remove `?*StackTrace` argument from `unwrapError` and `call` Now that we propagate the error return trace to all `callconv(.auto)` functions, passing it explicitly to panic handlers is redundant.	2025-01-24 20:50:20 +00:00
mlugg	83991efe10	compiler: yet more panic handler changes * `std.builtin.Panic` -> `std.builtin.panic`, because it is a namespace. * `root.Panic` -> `root.panic` for the same reason. There are type checks so that we still allow the legacy `pub fn panic` strategy in the 0.14.0 release. * `std.debug.SimplePanic` -> `std.debug.simple_panic`, same reason. * `std.debug.NoPanic` -> `std.debug.no_panic`, same reason. * `std.debug.FormattedPanic` is now a function `std.debug.FullPanic` which takes as input a `panicFn` and returns a namespace with all the panic functions. This handles the incredibly common case of just wanting to override how the message is printed, whilst keeping nice formatted panics. * Remove `std.builtin.panic.messages`; now, every safety panic has its own function. This reduces binary bloat, as calls to these functions no longer need to prepare any arguments (aside from the error return trace). * Remove some legacy declarations, since a zig1.wasm update has happened. Most of these were related to the panic handler, but a quick grep for "zig1" brought up a couple more results too. Also, add some missing type checks to Sema. Resolves: #22584 formatted -> full	2025-01-24 19:29:51 +00:00
Matthew Lugg	0e815c652d	Merge pull request #22572 from jacobly0/new-error-trace compiler: include error trace in all functions, implement for x86_64 backend	2025-01-22 16:48:27 +00:00
mlugg	1bce01de97	compiler: pass error return traces everywhere	2025-01-22 02:22:56 -05:00
mlugg	e864c38cc3	Sema: fix crash when `inline` loop condition is not comptime-known	2025-01-22 04:18:43 +00:00
mlugg	0ec6b2dd88	compiler: simplify generic functions, fix issues with inline calls The original motivation here was to fix regressions caused by #22414. However, while working on this, I ended up discussing a language simplification with Andrew, which changes things a little from how they worked before #22414. The main user-facing change here is that any reference to a prior function parameter, even if potentially comptime-known at the usage site or even not analyzed, now makes a function generic. This applies even if the parameter being referenced is not a `comptime` parameter, since it could still be populated when performing an inline call. This is a breaking language change. The detection of this is done in AstGen; when evaluating a parameter type or return type, we track whether it referenced any prior parameter, and if so, we mark this type as being "generic" in ZIR. This will cause Sema to not evaluate it until the time of instantiation or inline call. A lovely consequence of this from an implementation perspective is that it eliminates the need for most of the "generic poison" system. In particular, `error.GenericPoison` is now completely unnecessary, because we identify generic expressions earlier in the pipeline; this simplifies the compiler and avoids redundant work. This also entirely eliminates the concept of the "generic poison value". The only remnant of this system is the "generic poison type" (`Type.generic_poison` and `InternPool.Index.generic_poison_type`). This type is used in two places: * During semantic analysis, to represent an unknown result type. * When storing generic function types, to represent a generic parameter/return type. It's possible that these use cases should instead use `.none`, but I leave that investigation to a future adventurer. One last thing. Prior to #22414, inline calls were a little inefficient, because they re-evaluated even non-generic parameter types whenever they were called. Changing this behavior is what ultimately led to #22538. Well, because the new logic will mark a type expression as generic if there is any change its resolved type could differ in an inline call, this redundant work is unnecessary! So, this is another way in which the new design reduces redundant work and complexity. Resolves: #22494 Resolves: #22532 Resolves: #22538	2025-01-21 02:41:42 +00:00
mlugg	8bcb578507	Sema: fix `is_non_null_ptr` handling for runtime-known pointers We can still often determine a comptime result based on the type, even if the pointer is runtime-known. Also, we previously used load -> is non null instead of AIR `is_non_null_ptr` if the pointer is comptime-known, but that's a bad heuristic. Instead, we should check for the pointer to be comptime-known, and for the load to be comptime-known, and only in that case should we call `Sema.analyzeIsNonNull`. Resolves: #22556	2025-01-21 00:33:32 +00:00
mlugg	3b6e5ba490	Sema: don't try to initialize global union pointer at comptime Resolves: #19832	2025-01-18 14:30:06 +00:00
mlugg	f7b9f84df2	incremental: fix enum resolution bugs	2025-01-18 14:30:06 +00:00
mlugg	726c94d5f1	Sema: prepare for `sentinel` -> `sentinel_ptr` field rename The commit 2 after this will explain this diff.	2025-01-16 12:49:58 +00:00
mlugg	b6abe1dbf7	compiler: make it easier to apply breaking changes to `std.builtin` Documentation for this will be on the wiki shortly. Resolves: #21842	2025-01-16 12:49:48 +00:00
mlugg	d00e05f186	all: update to `std.builtin.Type.Pointer.Size` field renames This was done by regex substitution with `sed`. I then manually went over the entire diff and fixed any incorrect changes. This diff also changes a lot of `callconv(.C)` to `callconv(.c)`, since my regex happened to also trigger here. I opted to leave these changes in, since they are a correct migration, even if they're not the one I was trying to do!	2025-01-16 12:46:29 +00:00
Andrew Kelley	943dac3e85	compiler: add type safety for export indices	2025-01-15 15:11:35 -08:00
mlugg	4b910e525d	Sema: more validation for builtin decl types Also improve the source locations when this validation fails. Resolves: #22465	2025-01-14 22:44:18 +00:00
mlugg	5322459a0b	Sema: fix UB in error reporting And add test coverage for the compile error in question.	2025-01-14 21:17:46 +00:00
xdBronch	fb43e91b22	Sema: disallow non scalar sentinels in array types and reified types (#22473 )	2025-01-13 05:28:53 +00:00
mlugg	f78f9388fe	Sema: allow tail calls of function pointers Resolves: #22474	2025-01-13 02:57:15 +00:00
mlugg	04c9f50aec	compiler: improve "... contains reference to comptime var" errors `Sema.explainWhyValueContainsReferenceToComptimeVar` (concise name!) adds notes to an error explaining how to get from a given `Value` to a pointer to some `comptime var` (or a comptime field). Previously, this error could be very opaque in any case where it wasn't obvious where the comptime var pointer came from; particularly for type captures. Now, the error notes explain this to the user.	2025-01-11 08:54:47 +00:00
mlugg	e9bd2d45d4	Sema: rewrite semantic analysis of function calls This rewrite improves some error messages, hugely simplifies the logic, and fixes several bugs. One of these bugs is technically a new rule which Andrew and I agreed on: if a parameter has a comptime-only type but is not declared `comptime`, then the corresponding call argument should not be evaluated at comptime; only resolved. Implementing this required changing how function types work a little, which in turn required allowing a new kind of function coercion for some generic use cases: function coercions are now allowed to implicitly remove `comptime` annotations from parameters with comptime-only types. This is okay because removing the annotation affects only the call site. Resolves: #22262	2025-01-09 06:46:47 +00:00
David Rubin	40f5eac79c	Sema: fix invalid AIR from array concat	2025-01-07 06:17:40 -05:00
mlugg	137787edbb	Sema: fix incorrect type in `optional_payload` instruction Resolves: #22417	2025-01-05 19:38:19 +00:00
mlugg	b039a8b615	compiler: slightly simplify builtin decl memoization Rather than `Zcu.BuiltinDecl.Memoized` being a struct with fields, it can instead just be an array, indexed by the enum. This allows runtime indexing, avoiding a few now-unnecessary `inline` switch cases.	2025-01-05 05:52:02 +00:00
mlugg	f01029c4af	incremental: new `AnalUnit` to group dependencies on `std.builtin` decls This commit reworks how values like the panic handler function are memoized during a compiler invocation. Previously, the value was resolved by whichever analysis requested it first, and cached on `Zcu`. This is problematic for incremental compilation, as after the initial resolution, no dependencies are marked by users of this memoized state. This is arguably acceptable for `std.builtin`, but it's definitely not acceptable for the panic handler/messages, because those can be set by the user (`std.builtin.Panic` checks `@import("root").Panic`). So, here we introduce a new kind of `AnalUnit`, called `memoized_state`. There are 3 such units: * `.{ .memoized_state = .va_list }` resolves the type `std.builtin.VaList` * `.{ .memoized_state = .panic }` resolves `std.Panic` * `.{ .memoized_state = .main }` resolves everything else we want These units essentially "bundle" the resolution of their corresponding declarations, storing the results into fields on `Zcu`. This way, when, for instance, a function wants to call the panic handler, it simply runs `ensureMemoizedStateResolved`, registering one dependency, and pulls the values from the `Zcu`. This "bundling" minimizes dependency edges. The 3 units are separated to allow them to act independently: for instance, the panic handler can use `std.builtin.Type` without triggering a dependency loop.	2025-01-04 07:51:19 +00:00
mlugg	f818098971	incremental: correctly return `error.AnalysisFail` when type structure changes `Zcu.PerThead.ensureTypeUpToDate` is set up in such a way that it only returns the updated type the first time it is called. In general, that's okay; however, the exception is that we want the function to continue returning `error.AnalysisFail` when the type has been lost, or its number of captures changed. Therefore, the check for this case now happens before the up-to-date success return. For simplicity, the number of captures is now handled by intentionally losing the instruction in `Zcu.mapOldZirToNew`, since there is nothing to gain from tracking a type when old instances of it can never be reused.	2025-01-04 05:44:29 +00:00
mlugg	b4da8eef2a	Zir: split up start and end of range in `for_len` The old lowering was kind of neat, but it unintentionally allowed the syntax `for (123) \|_\| { ... }`, and there wasn't really a way to fix that. So, instead, we include both the start and the end of the range in the `for_len` instruction (each operand to `for` now has two entries in this multi-op instruction). This slightly increases the size of ZIR for loops of predominantly indexables, but the difference is small enough that it's not worth complicating ZIR to try and fix it.	2025-01-03 22:28:37 +00:00
mlugg	252c203101	Sema: correctly label `block_comptime` for restoring error return trace index Resolves: #22384	2025-01-02 16:26:50 +00:00
mlugg	5333d2443a	Sema: fix invalid coercion `[n:x]T` -> `[m]T` for `n != m` The change in `Sema.coerceExtra` is just to avoid an unhelpful error message, covered by the added test case. Resolves: #22373	2025-01-01 16:20:40 +00:00
mlugg	106df881d3	Sema: add doc comments for comptime reason types	2024-12-31 09:56:21 +00:00
mlugg	7e82398cfe	Sema: remove some incorrect calls to `requireRuntimeBlock` Most calls to `requireRuntimeBlock` in Sema are not correct. This function doesn't deal with all of them, but it does deal with ones which have, in combination with the past few commits, introduced real-world regressions. Related: #22353	2024-12-31 09:55:03 +00:00
mlugg	9a70eeeac5	compiler: ensure local `const`s in comptime scope are comptime-known This fixes a bug which exposed a compiler implementation detail (ZIR alloc elision). Previously, `const` declarations with a runtime-known value in a comptime scope were permitted only if AstGen was able to elide the alloc in ZIR, since the error was reported by storing to the comptime alloc. This just adds a new instruction to also emit this error when the alloc is elided.	2024-12-31 09:55:03 +00:00
mlugg	6026a5f217	compiler: ensure result of `block_comptime` is comptime-known To avoid this PR regressing error messages, most of the work here has gone towards improving error notes for why code was comptime-evaluated. ZIR `block_comptime` now stores a "comptime reason", the enum for which is also used by Sema. There are two types in Sema: * `ComptimeReason` represents the reason we started evaluating something at comptime. * `BlockComptimeReason` represents the reason a given block is evaluated at comptime; it's either a `ComptimeReason` with an attached source location, or it's because we're in a function which was called at comptime (and that function's `Block` should be consulted for the "parent" reason). Every `Block` stores a `?BlockComptimeReason`. The old `is_comptime` field is replaced with a trivial `isComptime()` method which returns whether that reason is non-`null`. Lastly, the handling for `block_comptime` has been simplified. It was previously going through an unnecessary runtime-handling path; now, it is a trivial sub block exited through a `break_inline` instruction. Resolves: #22296	2024-12-31 09:55:03 +00:00
mlugg	3afda4322c	compiler: analyze type and value of global declaration separately This commit separates semantic analysis of the annotated type vs value of a global declaration, therefore allowing recursive and mutually recursive values to be declared. Every `Nav` which undergoes analysis now has two corresponding `AnalUnit`s: `.{ .nav_val = n }` and `.{ .nav_ty = n }`. The `nav_val` unit is responsible for fully resolving the `Nav`: determining its value, linksection, addrspace, etc. The `nav_ty` unit, on the other hand, resolves only the information necessary to construct a pointer to the `Nav`: its type, addrspace, etc. (It does also analyze its linksection, but that could be moved to `nav_val` I think; it doesn't make any difference). Analyzing a `nav_ty` for a declaration with no type annotation will just mark a dependency on the `nav_val`, analyze it, and finish. Conversely, analyzing a `nav_val` for a declaration with a type annotation will first mark a dependency on the `nav_ty` and analyze it, using this as the result type when evaluating the value body. The `nav_val` and `nav_ty` units always have references to one another: so, if a `Nav`'s type is referenced, its value implicitly is too, and vice versa. However, these dependencies are trivial, so, to save memory, are only known implicitly by logic in `resolveReferences`. In general, analyzing ZIR `decl_val` will only analyze `nav_ty` of the corresponding `Nav`. There are two exceptions to this. If the declaration is an `extern` declaration, then we immediately ensure the `Nav` value is resolved (which doesn't actually require any more analysis, since such a declaration has no value body anyway). Additionally, if the resolved type has type tag `.@"fn"`, we again immediately resolve the `Nav` value. The latter restriction is in place for two reasons: * Functions are special, in that their externs are allowed to trivially alias; i.e. with a declaration `extern fn foo(...)`, you can write `const bar = foo;`. This is not allowed for non-function externs, and it means that function types are the only place where it is possible for a declaration `Nav` to have a `.@"extern"` value without actually being declared `extern`. We need to identify this situation immediately so that the `decl_ref` can create a pointer to the real extern `Nav`, not this alias. * In certain situations, such as taking a pointer to a `Nav`, Sema needs to queue analysis of a runtime function if the value is a function. To do this, the function value needs to be known, so we need to resolve the value immediately upon `&foo` where `foo` is a function. This restriction is simple to codify into the eventual language specification, and doesn't limit the utility of this feature in practice. A consequence of this commit is that codegen and linking logic needs to be more careful when looking at `Nav`s. In general: * When `updateNav` or `updateFunc` is called, it is safe to assume that the `Nav` being updated (the owner `Nav` for `updateFunc`) is fully resolved. * Any `Nav` whose value is/will be an `@"extern"` or a function is fully resolved; see `Nav.getExtern` for a helper for a common case here. * Any other `Nav` may only have its type resolved. This didn't seem to be too tricky to satisfy in any of the existing codegen/linker backends. Resolves: #131	2024-12-24 02:18:41 +00:00

1 2 3 4 5 ...

2345 Commits