mirror/zig - zig - Bouvais Git

mirror/zig

mirror of https://github.com/ziglang/zig.git synced 2025-12-30 01:53:16 +00:00

Author	SHA1	Message	Date
Matthew Lugg	c5383173a0	compiler: replace `@Type` with individual type-creating builtins The new builtins are: * `@EnumLiteral` * `@Int` * `@Fn` * `@Pointer` * `@Tuple` * `@Enum` * `@Union` * `@Struct` Their usage is documented in the language reference. There is no `@Array` because arrays can be created like this: if (sentinel) \|s\| [n:s]T else [n]T There is also no `@Float`. Instead, `std.meta.Float` can serve this use case if necessary. There is no `@ErrorSet` and intentionally no way to achieve this. Likewise, there is intentionally no way to reify tuples with comptime fields, or function types with comptime parameters. These decisions simplify the Zig language specification, and moreover make Zig code more readable by discouraging overly complex metaprogramming. Co-authored-by: Ali Cheraghi <alichraghi@proton.me> Resolves: #10710	2025-11-22 22:42:37 +00:00
Matthew Lugg	c091e27aac	compiler: spring cleaning I started this diff trying to remove a little dead code from the C backend, but ended up finding a bunch of dead code sprinkled all over the place: * `packed` handling in the C backend which was made dead by `Legalize` * Representation of pointers to runtime-known vector indices * Handling for the `vector_store_elem` AIR instruction (now removed) * Old tuple handling from when they used the InternPool repr of structs * Straightforward unused functions * TODOs in the LLVM backend for features which Zig just does not support	2025-11-12 16:00:15 +00:00
Jacob Young	906ce2ad7c	InternPool: use sequential string indices instead of byte offsets This allows more bytes to be referenced by a smaller index range. Closes #22867 Closes #25297 Closes #25339	2025-10-05 00:25:21 -04:00
Jacob Young	f58200e3f2	Elf2: create a new linker from scratch This iteration already has significantly better incremental support. Closes #24110	2025-09-21 14:09:14 -07:00
Andrew Kelley	0681bf06ab	frontend: packed struct field ptr no longer finds byte borders technically breaking, but I doubt anyone will notice.	2025-09-20 18:33:01 -07:00
Andrew Kelley	7b8a7989ef	frontend: additionally handle C pointers in ptrOptPayload	2025-09-20 18:33:00 -07:00
Andrew Kelley	f0fe55a9bb	frontend: fix too strict assertion field ptr can be based on C pointer	2025-09-20 18:33:00 -07:00
Andrew Kelley	150169f1e0	std.fmt: delete deprecated APIs std.fmt.Formatter -> std.fmt.Alt std.fmt.format -> std.Io.Writer.print	2025-08-31 12:49:18 -07:00
Andrew Kelley	79f267f6b9	std.Io: delete GenericReader and delete deprecated alias std.io	2025-08-29 17:14:26 -07:00
Justus Klausecker	8fda49ea8a	fix: remove incorrect '&'	2025-08-12 16:33:58 +02:00
Justus Klausecker	79e5c138c6	replace even more aggregate interns	2025-08-12 16:33:57 +02:00
Justus Klausecker	d0586da18e	Sema: Improve comptime arithmetic undef handling This commit expands on the foundations laid by https://github.com/ziglang/zig/pull/23177 and moves even more `Sema`-only functionality from `Value` to `Sema.arith`. Specifically all shift and bitwise operations, `@truncate`, `@bitReverse` and `@byteSwap` have been moved and adapted to the new rules around `undefined`. Especially the comptime shift operations have been basically rewritten, fixing many open issues in the process. New rules applied to operators: * `<<`, `@shlExact`, `@shlWithOverflow`, `>>`, `@shrExact`: compile error if any operand is undef * `<<\|`, `~`, `^`, `@truncate`, `@bitReverse`, `@byteSwap`: return undef if any operand is undef * `&`, `\|`: Return undef if both operands are undef, turn undef into actual `0xAA` bytes otherwise Additionally this commit canonicalizes the representation of aggregates with all-undefined members in the `InternPool` by disallowing them and enforcing the usage of a single typed `undef` value instead. This reduces the amount of edge cases and fixes a bunch of bugs related to partially undefined vecs. List of operations directly affected by this patch: * `<<`, `<<\|`, `@shlExact`, `@shlWithOverflow` * `>>`, `@shrExact` * `&`, `\|`, `~`, `^` and their atomic rmw + reduce pendants * `@truncate`, `@bitReverse`, `@byteSwap`	2025-08-12 16:33:57 +02:00
Andrew Kelley	5378fdb153	std.fmt: fully remove format string from format methods Introduces `std.fmt.alt` which is a helper for calling alternate format methods besides one named "format".	2025-07-07 22:43:53 -07:00
Andrew Kelley	941bc37193	compiler: update all instances of std.fmt.Formatter	2025-07-07 22:43:52 -07:00
Jacob Young	6b41beb370	big.int: implement float conversions These conversion routines accept a `round` argument to control how the result is rounded and return whether the result is exact. Most callers wanted this functionality and had hacks around it being missing. Also delete `std.math.big.rational` because it was only being used for float conversion, and using rationals for that is a lot more complex than necessary. It also required an allocator, whereas the new integer routines only need to be passed enough memory to store the result.	2025-06-15 14:15:18 -04:00
Daniel Kongsgaard	5e3c0b7af7	Allow more operators on bool vectors (#24131 ) * Sema: allow binary operations and boolean not on vectors of bool * langref: Clarify use of operators on vectors (`and` and `or` not allowed) closes #24093	2025-06-12 15:16:23 -07:00
Jacob Young	b483defc5a	Legalize: implement scalarization of binary operations	2025-05-31 18:54:28 -04:00
mlugg	92c63126e8	compiler: tlv pointers are not comptime-known Pointers to thread-local variables do not have their addresses known until runtime, so it is nonsensical for them to be comptime-known. There was logic in the compiler which was essentially attempting to treat them as not being comptime-known despite the pointer being an interned value. This was a bit of a mess, the check was frequent enough to actually show up in compiler profiles, and it was very awkward for backends to deal with, because they had to grapple with the fact that a "constant" they were lowering might actually require runtime operations. So, instead, do not consider these pointers to be comptime-known in any way. Never intern such a pointer; instead, when the address of a threadlocal is taken, emit an AIR instruction which computes the pointer at runtime. This avoids lots of special handling for TLVs across basically all codegen backends; of all somewhat-functional backends, the only one which wasn't improved by this change was the LLVM backend, because LLVM pretends this complexity around threadlocals doesn't exist. This change simplifies Sema and codegen, avoids a potential source of bugs, and potentially improves Sema performance very slightly by avoiding a non-trivial check on a hot path.	2025-05-27 19:23:11 +01:00
wooster0	a365971a33	std.meta.intToEnum -> std.enums.fromInt Also use an optional as the return type instead of an error code.	2025-05-13 07:28:41 +02:00
kcbanner	fa59153896	Value: ensure that extern structs have their layout resolved in ptrField	2025-04-11 15:30:20 -04:00
David Rubin	ae442e2c29	big.int: return normalized results from `{add,sub}Carry`	2025-03-24 06:58:01 -07:00
mlugg	2a4e06bcb3	Sema: rewrite comptime arithmetic This commit reworks how Sema handles arithmetic on comptime-known values, fixing many bugs in the process. The general pattern is that arithmetic on comptime-known values is now handled by the new namespace `Sema.arith`. Functions handling comptime arithmetic no longer live on `Value`; this is because some of them can emit compile errors, so some can't go on `Value`. Only semantic analysis should really be doing arithmetic on `Value`s anyway, so it makes sense for it to integrate more tightly with `Sema`. This commit also implements more coherent rules surrounding how `undefined` interacts with comptime and mixed-comptime-runtime arithmetic. The rules are as follows. * If an operation cannot trigger Illegal Behavior, and any operand is `undefined`, the result is `undefined`. This includes operations like `0 \| undef`, where the LHS logically could* be used to determine a defined result. This is partly to simplify the language, but mostly to permit codegen backends to represent `undefined` values as completely invalid states. * If an operation can trigger Illegal Behvaior, and any operand is `undefined`, then Illegal Behavior results. This occurs even if the operand in question isn't the one that "decides" illegal behavior; for instance, `undef / 1` is undefined. This is for the same reasons as described above. * An operation which would trigger Illegal Behavior, when evaluated at comptime, instead triggers a compile error. Additionally, if one operand is comptime-known undef, such that the other (runtime-known) operand isn't needed to determine that Illegal Behavior would occur, the compile error is triggered. * The only situation in which an operation with one comptime-known operand has a comptime-known result is if that operand is undefined, in which case the result is either undefined or a compile error per the above rules. This could potentially be loosened in future (for instance, `0 * rt` could be comptime-known 0 with a runtime assertion that `rt` is not undefined), but at least for now, defining it more conservatively simplifies the language and allows us to easily change this in future if desired. This commit fixes many bugs regarding the handling of `undefined`, particularly in vectors. Along with a collection of smaller tests, two very large test cases are added to check arithmetic on `undefined`. The operations which have been rewritten in this PR are: * `+`, `+%`, `+\|`, `@addWithOverflow` * `-`, `-%`, `-\|`, `@subWithOverflow` * ``, `%`, `\|`, `@mulWithOverflow` `/`, `@divFloor`, `@divTrunc`, `@divExact` * `%`, `@rem`, `@mod` Other arithmetic operations are currently unchanged. Resolves: #22743 Resolves: #22745 Resolves: #22748 Resolves: #22749 Resolves: #22914	2025-03-16 08:17:50 +00:00
kcbanner	981f84157c	Value: fix comparison of NaN in compareHeteroAdvanaced Sema: fix equality comparison of signed zeroes and NaN in compareScalar tests: add test coverage for vector float comparisons	2025-03-03 15:35:07 -05:00
Jacob Young	220f80e71d	Dwarf: fix lowering of comptime-only optional pointer `null` values Closes #22974	2025-02-22 22:47:32 -05:00
Mason Remaley	13c6eb0d71	compiler,std: implement ZON support This commit allows using ZON (Zig Object Notation) in a few ways. * `@import` can be used to load ZON at comptime and convert it to a normal Zig value. In this case, `@import` must have a result type. * `std.zon.parse` can be used to parse ZON at runtime, akin to the parsing logic in `std.json`. * `std.zon.stringify` can be used to convert arbitrary data structures to ZON at runtime, again akin to `std.json`.	2025-02-03 09:14:37 +00:00
mlugg	71d16106ad	Sema: `@memcpy` changes * The langspec definition of `@memcpy` has been changed so that the source and destination element types must be in-memory coercible, allowing all such calls to be raw copying operations, not actually applying any coercions. * Implement aliasing check for comptime `@memcpy`; a compile error will now be emitted if the arguments alias. * Implement more efficient comptime `@memcpy` by loading and storing a whole array at once, similar to how `@memset` is implemented.	2025-01-29 06:35:22 +00:00
mlugg	0ec6b2dd88	compiler: simplify generic functions, fix issues with inline calls The original motivation here was to fix regressions caused by #22414. However, while working on this, I ended up discussing a language simplification with Andrew, which changes things a little from how they worked before #22414. The main user-facing change here is that any reference to a prior function parameter, even if potentially comptime-known at the usage site or even not analyzed, now makes a function generic. This applies even if the parameter being referenced is not a `comptime` parameter, since it could still be populated when performing an inline call. This is a breaking language change. The detection of this is done in AstGen; when evaluating a parameter type or return type, we track whether it referenced any prior parameter, and if so, we mark this type as being "generic" in ZIR. This will cause Sema to not evaluate it until the time of instantiation or inline call. A lovely consequence of this from an implementation perspective is that it eliminates the need for most of the "generic poison" system. In particular, `error.GenericPoison` is now completely unnecessary, because we identify generic expressions earlier in the pipeline; this simplifies the compiler and avoids redundant work. This also entirely eliminates the concept of the "generic poison value". The only remnant of this system is the "generic poison type" (`Type.generic_poison` and `InternPool.Index.generic_poison_type`). This type is used in two places: * During semantic analysis, to represent an unknown result type. * When storing generic function types, to represent a generic parameter/return type. It's possible that these use cases should instead use `.none`, but I leave that investigation to a future adventurer. One last thing. Prior to #22414, inline calls were a little inefficient, because they re-evaluated even non-generic parameter types whenever they were called. Changing this behavior is what ultimately led to #22538. Well, because the new logic will mark a type expression as generic if there is any change its resolved type could differ in an inline call, this redundant work is unnecessary! So, this is another way in which the new design reduces redundant work and complexity. Resolves: #22494 Resolves: #22532 Resolves: #22538	2025-01-21 02:41:42 +00:00
mlugg	9804cc8bc6	all: update to `std.builtin.Type.{Pointer,Array,StructField}` field renames	2025-01-16 12:49:58 +00:00
mlugg	b6abe1dbf7	compiler: make it easier to apply breaking changes to `std.builtin` Documentation for this will be on the wiki shortly. Resolves: #21842	2025-01-16 12:49:48 +00:00
mlugg	d00e05f186	all: update to `std.builtin.Type.Pointer.Size` field renames This was done by regex substitution with `sed`. I then manually went over the entire diff and fixed any incorrect changes. This diff also changes a lot of `callconv(.C)` to `callconv(.c)`, since my regex happened to also trigger here. I opted to leave these changes in, since they are a correct migration, even if they're not the one I was trying to do!	2025-01-16 12:46:29 +00:00
Andrew Kelley	8abdebecdc	wasm linker: implement `@tagName` for sparse enums	2025-01-15 15:11:36 -08:00
mlugg	04c9f50aec	compiler: improve "... contains reference to comptime var" errors `Sema.explainWhyValueContainsReferenceToComptimeVar` (concise name!) adds notes to an error explaining how to get from a given `Value` to a pointer to some `comptime var` (or a comptime field). Previously, this error could be very opaque in any case where it wasn't obvious where the comptime var pointer came from; particularly for type captures. Now, the error notes explain this to the user.	2025-01-11 08:54:47 +00:00
David Rubin	b99dbb6fb5	Value: implement `orderAgainstZeroInner` for errors	2024-12-29 13:36:52 -05:00
mlugg	3afda4322c	compiler: analyze type and value of global declaration separately This commit separates semantic analysis of the annotated type vs value of a global declaration, therefore allowing recursive and mutually recursive values to be declared. Every `Nav` which undergoes analysis now has two corresponding `AnalUnit`s: `.{ .nav_val = n }` and `.{ .nav_ty = n }`. The `nav_val` unit is responsible for fully resolving the `Nav`: determining its value, linksection, addrspace, etc. The `nav_ty` unit, on the other hand, resolves only the information necessary to construct a pointer to the `Nav`: its type, addrspace, etc. (It does also analyze its linksection, but that could be moved to `nav_val` I think; it doesn't make any difference). Analyzing a `nav_ty` for a declaration with no type annotation will just mark a dependency on the `nav_val`, analyze it, and finish. Conversely, analyzing a `nav_val` for a declaration with a type annotation will first mark a dependency on the `nav_ty` and analyze it, using this as the result type when evaluating the value body. The `nav_val` and `nav_ty` units always have references to one another: so, if a `Nav`'s type is referenced, its value implicitly is too, and vice versa. However, these dependencies are trivial, so, to save memory, are only known implicitly by logic in `resolveReferences`. In general, analyzing ZIR `decl_val` will only analyze `nav_ty` of the corresponding `Nav`. There are two exceptions to this. If the declaration is an `extern` declaration, then we immediately ensure the `Nav` value is resolved (which doesn't actually require any more analysis, since such a declaration has no value body anyway). Additionally, if the resolved type has type tag `.@"fn"`, we again immediately resolve the `Nav` value. The latter restriction is in place for two reasons: * Functions are special, in that their externs are allowed to trivially alias; i.e. with a declaration `extern fn foo(...)`, you can write `const bar = foo;`. This is not allowed for non-function externs, and it means that function types are the only place where it is possible for a declaration `Nav` to have a `.@"extern"` value without actually being declared `extern`. We need to identify this situation immediately so that the `decl_ref` can create a pointer to the real extern `Nav`, not this alias. * In certain situations, such as taking a pointer to a `Nav`, Sema needs to queue analysis of a runtime function if the value is a function. To do this, the function value needs to be known, so we need to resolve the value immediately upon `&foo` where `foo` is a function. This restriction is simple to codify into the eventual language specification, and doesn't limit the utility of this feature in practice. A consequence of this commit is that codegen and linking logic needs to be more careful when looking at `Nav`s. In general: * When `updateNav` or `updateFunc` is called, it is safe to assume that the `Nav` being updated (the owner `Nav` for `updateFunc`) is fully resolved. * Any `Nav` whose value is/will be an `@"extern"` or a function is fully resolved; see `Nav.getExtern` for a helper for a common case here. * Any other `Nav` may only have its type resolved. This didn't seem to be too tricky to satisfy in any of the existing codegen/linker backends. Resolves: #131	2024-12-24 02:18:41 +00:00
mlugg	242bb44695	compiler: move `RuntimeIndex` to `Sema` Just a small refactor.	2024-12-18 20:34:10 +00:00
mlugg	d11bbde5f9	compiler: remove anonymous struct types, unify all tuples This commit reworks how anonymous struct literals and tuples work. Previously, an untyped anonymous struct literal (e.g. `const x = .{ .a = 123 }`) was given an "anonymous struct type", which is a special kind of struct which coerces using structural equivalence. This mechanism was a holdover from before we used RLS / result types as the primary mechanism of type inference. This commit changes the language so that the type assigned here is a "normal" struct type. It uses a form of equivalence based on the AST node and the type's structure, much like a reified (`@Type`) type. Additionally, tuples have been simplified. The distinction between "simple" and "complex" tuple types is eliminated. All tuples, even those explicitly declared using `struct { ... }` syntax, use structural equivalence, and do not undergo staged type resolution. Tuples are very restricted: they cannot have non-`auto` layouts, cannot have aligned fields, and cannot have default values with the exception of `comptime` fields. Tuples currently do not have optimized layout, but this can be changed in the future. This change simplifies the language, and fixes some problematic coercions through pointers which led to unintuitive behavior. Resolves: #16865	2024-10-31 20:42:53 +00:00
Andrew Kelley	78f643c46d	Merge pull request #21758 from kcbanner/dll_storage_class Add `is_dll_import` to @extern, to support `__declspec(dllimport)` with the MSVC ABI	2024-10-23 15:35:54 -07:00
kcbanner	a4690ecb1f	Cause a compilation error to occur if using @extern with is_dll_import in a comptime scope. Add a note about thread local / dll import being the cause.	2024-10-22 12:41:35 -04:00
mlugg	0b786059b5	compiler: avoid unreasonable eval branch quotas Using `@FieldType` (#21702).	2024-10-19 19:21:17 +01:00
mlugg	51706af908	compiler: introduce new `CallingConvention` This commit begins implementing accepted proposal #21209 by making `std.builtin.CallingConvention` a tagged union. The stage1 dance here is a little convoluted. This commit introduces the new type as `NewCallingConvention`, keeping the old `CallingConvention` around. The compiler uses `std.builtin.NewCallingConvention` exclusively, but when fetching the type from `std` when running the compiler (e.g. with `getBuiltinType`), the name `CallingConvention` is used. This allows a prior build of Zig to be used to build this commit. The next commit will update `zig1.wasm`, and then the compiler and standard library can be updated to completely replace `CallingConvention` with `NewCallingConvention`. The second half of #21209 is to remove `@setAlignStack`, which will be implemented in another commit after updating `zig1.wasm`.	2024-10-19 19:08:59 +01:00
Andrew Kelley	4f8d244e7e	remove formatted panics implements #17969	2024-09-26 12:35:14 -07:00
mlugg	4650e5b9fc	Sema: clean up cmpNumeric There is one minor language change here, which is that comparisons of the form `comptime_inf < runtime_f32` have their results comptime-known. This is consistent with comparisons against comptime NaN for instance, which are always comptime known. A corresponding behavior test is added. This fixes a bug with int comparison elision which my previous commit somehow triggered. `Sema.compareIntsOnlyPossibleResult` is much cleaner now!	2024-09-17 11:00:38 +01:00
mlugg	5e12ca9fe3	compiler: implement labeled switch/continue	2024-09-01 18:30:31 +01:00
mlugg	0fe3fd01dd	std: update `std.builtin.Type` fields to follow naming conventions The compiler actually doesn't need any functional changes for this: Sema does reification based on the tag indices of `std.builtin.Type` already! So, no zig1.wasm update is necessary. This change is necessary to disallow name clashes between fields and decls on a type, which is a prerequisite of #9938.	2024-08-28 08:39:59 +01:00
Jacob Young	8c3f6c72c0	Dwarf: fix and test string format	2024-08-27 02:09:59 -04:00
David Rubin	f777b29832	fix up merge conflicts with master	2024-08-25 22:43:57 -07:00
David Rubin	8d2b495b2a	value: remove deprecated `Module` usages	2024-08-25 15:16:46 -07:00
David Rubin	80cd53d3bb	sema: clean-up `{union,struct}FieldAlignment` and friends My main gripes with this design were that it was incorrectly namespaced, the naming was inconsistent and a bit wrong (`fooAlign` vs `fooAlignment`). This commit moves all the logic from `PerThread.zig` to use the zcu + tid system that the previous couple commits introduce. I've organized and merged the functions to be a bit more specific to their own purpose. - `fieldAlignment` takes a struct or union type, an index, and a Zcu (or the Sema version which takes a Pt), and gives you the alignment of the field at the index. - `structFieldAlignment` takes the field type itself, and provides the logic to handle special cases, such as externs. A design goal I had in mind was to avoid using the word 'struct' in the function name, when it worked for things that aren't structs, such as unions.	2024-08-25 15:16:46 -07:00
David Rubin	b4bb64ce78	sema: rework type resolution to use Zcu when possible	2024-08-25 15:16:42 -07:00
mlugg	548a087faf	compiler: split Decl into Nav and Cau The type `Zcu.Decl` in the compiler is problematic: over time it has gained many responsibilities. Every source declaration, container type, generic instantiation, and `@extern` has a `Decl`. The functions of these `Decl`s are in some cases entirely disjoint. After careful analysis, I determined that the two main responsibilities of `Decl` are as follows: * A `Decl` acts as the "subject" of semantic analysis at comptime. A single unit of analysis is either a runtime function body, or a `Decl`. It registers incremental dependencies, tracks analysis errors, etc. * A `Decl` acts as a "global variable": a pointer to it is consistent, and it may be lowered to a specific symbol by the codegen backend. This commit eliminates `Decl` and introduces new types to model these responsibilities: `Cau` (Comptime Analysis Unit) and `Nav` (Named Addressable Value). Every source declaration, and every container type requiring resolution (so not including `opaque`), has a `Cau`. For a source declaration, this `Cau` performs the resolution of its value. (When #131 is implemented, it is unsolved whether type and value resolution will share a `Cau` or have two distinct `Cau`s.) For a type, this `Cau` is the context in which type resolution occurs. Every non-`comptime` source declaration, every generic instantiation, and every distinct `extern` has a `Nav`. These are sent to codegen/link: the backends by definition do not care about `Cau`s. This commit has some minor technically-breaking changes surrounding `usingnamespace`. I don't think they'll impact anyone, since the changes are fixes around semantics which were previously inconsistent (the behavior changed depending on hashmap iteration order!). Aside from that, this changeset has no significant user-facing changes. Instead, it is an internal refactor which makes it easier to correctly model the responsibilities of different objects, particularly regarding incremental compilation. The performance impact should be negligible, but I will take measurements before merging this work into `master`. Co-authored-by: Jacob Young <jacobly0@users.noreply.github.com> Co-authored-by: Jakub Konka <kubkon@jakubkonka.com>	2024-08-11 07:29:41 +01:00

1 2

82 Commits