mirror/zig - zig - Bouvais Git

mirror/zig

mirror of https://github.com/ziglang/zig.git synced 2025-12-28 17:13:19 +00:00

Author	SHA1	Message	Date
mlugg	1033d71017	Sema,type: unify type query functions The following pairs of functions have been combined using the "advanced" pattern used for other type queries: * `Sema.fnHasRuntimeBits`, `Type.isFnOrHasRuntimeBits` * `Sema.typeRequiresComptime`, `Type.comptimeOnly`	2023-10-10 21:40:23 -07:00
Andrew Kelley	87d09edf2d	Merge pull request #17352 from kcbanner/extern_union_comptime_memory sema: Support reinterpreting extern/packed unions at comptime via field access	2023-10-03 11:20:08 -07:00
Xavier Bouchoux	405705cb76	codegen: fix byte-aligned field offsets in unaligned nested packed structs	2023-10-03 05:34:19 +00:00
Xavier Bouchoux	62d178e91a	codegen: fix field offsets in packed structs * add nested packed struct/union behavior tests * use ptr_info.packed_offset rather than trying to duplicate the logic from Sema.structFieldPtrByIndex() * use the container_ptr_info.packed_offset to account for non-aligned nested structs. * dedup type.packedStructFieldBitOffset() and module.structPackedFieldBitOffset()	2023-10-03 06:39:20 +02:00
kcbanner	d657b6c0e2	sema: support reinterpreting extern/packed unions at comptime via field access My previous change for reading / writing to unions at comptime did not handle union field read/writes correctly in all cases. Previously, if a field was written to a union, it would overwrite the entire value. This is problematic when a field of a larger size is subsequently read, because the value would not be long enough, causing a panic. Additionally, the writing behaviour itself was incorrect. Writing to a field of a packed or extern union should only overwrite the bits corresponding to that field, allowing for memory reintepretation via field writes / reads. I addressed these problems as follows: Add the concept of a "backing type" for extern / packed unions (`Type.unionBackingType`). For extern unions, this is a `u8` array, for packed unions it's an integer matching the `bitSize` of the union. Whenever union memory is read at comptime, it's read as this type. When union memory is written at comptime, the tag may still be known. If so, the memory is written using the tagged type. If the tag is unknown (because this union had previously been read from memory), it's simply written back out as the backing type. I added `write_packed` to the `reinterpret` field of `ComptimePtrMutationKit`. This causes writes of the operand to be packed - which is necessary when writing to a field of a packed union. Without this, writing a value to a `u1` field would overwrite the entire byte it occupied. The final case to address was reading a different (potentially larger) field from a union when it was written with a known tag. To handle this, a new kind of bitcast was introduced (`bitCastUnionFieldVal`) which supports reading a larger field by using a backing buffer that has the unwritten bits set to undefined. The reason to support this (vs always just writing the union as it's backing type), is that no reads to larger fields ever occur at comptime, it would be strictly worse to have spent time writing the full backing type.	2023-10-02 13:15:28 -04:00
antlilja	6a29646a55	Rename `@fabs` to `@abs` and accept integers Replaces the @fabs builtin with a new @abs builtins which accepts floats, signed integers and vectors of said types.	2023-09-27 11:15:53 -07:00
Veikka Tuominen	f4c884617f	Merge pull request #17215 from kcbanner/read_from_memory_union sema: add support for unions in readFromMemory and writeToMemory	2023-09-26 11:16:03 +03:00
kcbanner	e7bf143b36	type: handle the 0-length array case in abiSizeAdvanced This fixes a panic in `unionAbiSize` when a 0-length array of a union is used as a struct field. Because `resolveTypeLayout` does not resolve the `elem_ty` if `arrayLenIncludingSentinel` returns 0 for the array, the child union type is not guaranteed to have a resolved layout at this point. Fixed this case by just returning 0 here.	2023-09-25 05:24:55 -07:00
Andrew Kelley	eb072fa528	Merge pull request #17256 from ziglang/packed-bit-offsets compiler: packed structs cache bit offsets	2023-09-24 19:42:06 -07:00
Andrew Kelley	c08c0fc6ed	revert "compiler: packed structs cache bit offsets" This is mostly a revert of a7088fd9a3edb037f0f51bb402a3c557334634f3. Measurement revealed the commit actually regressed performance.	2023-09-24 14:37:36 -07:00
Andrew Kelley	a7088fd9a3	compiler: packed structs cache bit offsets Instead of linear search every time a packed struct field's bit or byte offset is wanted, they are computed once during resolution of the packed struct's backing int type, and stored in InternPool for O(1) lookup. Closes #17178	2023-09-23 23:06:08 -07:00
mlugg	09a57583a4	compiler: preserve result type information through address-of operator This commit introduces the new `ref_coerced_ty` result type into AstGen. This represents a expression which we want to treat as an lvalue, and the pointer will be coerced to a given type. This change gives known result types to many expressions, in particular struct and array initializations. This allows certain casts to work which previously required explicitly specifying types via `@as`. It also eliminates our dependence on anonymous struct types for expressions of the form `&.{ ... }` - this paves the way for #16865, and also results in less Sema magic happening for such initializations, also leading to potentially better runtime code. As part of these changes, this commit also implements #17194 by disallowing RLS on explicitly-typed struct and array initializations. Apologies for linking these changes - it seemed rather pointless to try and separate them, since they both make big changes to struct and array initializations in AstGen. The rationale for this change can be found in the proposal - in essence, performing RLS whilst maintaining the semantics of the intermediary type is a very difficult problem to solve. This allowed the problematic `coerce_result_ptr` ZIR instruction to be completely eliminated, which in turn also simplified the logic for inferred allocations in Sema - thanks to this, we almost break even on line count! In doing this, the ZIR instructions surrounding these initializations have been restructured - some have been added and removed, and others renamed for clarity (and their semantics changed slightly). In order to optimize ZIR tag count, the `struct_init_anon_ref` and `array_init_anon_ref` instructions have been removed in favour of using `ref` on a standard anonymous value initialization, since these instructions are now virtually never used. Lastly, it's worth noting that this commit introduces a slightly strange source of generic poison types: in the expression `@as(*anyopaque, &x)`, the sub-expression `x` has a generic poison result type, despite no generic code being involved. This turns out to be a logical choice, because we don't know the result type for `x`, and the generic poison type represents precisely this case, providing the semantics we need. Resolves: #16512 Resolves: #17194	2023-09-23 22:01:08 +01:00
kcbanner	4e9f5f25c8	type: resolve packed union type layouts in bitSizeAdvanced Before this change, packed structs containing packed unions could make it to codegen without having their layout resolved.	2023-09-23 13:22:22 -04:00
kcbanner	f2a24b48e1	sema: rework the comptime representation of comptime unions When the tag is not known, it's set to `.none`. In this case, the value is either an array of bytes (for extern unions) or an integer (for packed unions).	2023-09-23 13:05:04 -04:00
kcbanner	2fddd767ba	sema: add support for unions in readFromMemory and writeToMemory	2023-09-23 13:04:56 -04:00
Andrew Kelley	81b5df347a	compiler: fix structFieldName crash for tuples When struct types have no field names, the names are implicitly understood to be strings corresponding to the field indexes in declaration order. It used to be the case that a NullTerminatedString would be stored for each field in this case, however, now, callers must handle the possibility that there are no names stored at all. This commit introduces `legacyStructFieldName`, a function to fake the previous behavior. Probably something better could be done by reworking all the callsites of this function.	2023-09-21 17:29:34 -07:00
mlugg	9965d960b5	type: give empty unions 1-byte alignment Zero-byte alignment is no longer valid for runtime types. I made most of these changes in an earlier commit, but missed this case.	2023-09-21 14:48:41 -07:00
mlugg	1b672e41c5	InternPool,Sema,type,llvm: alignment fixes This changeset fixes the handling of alignment in several places. The new rules are: * `@alignOf(T)` where `T` is a runtime zero-bit type is at least 1, maybe greater. * Zero-bit fields in `extern` structs do force alignment, potentially offsetting following fields. * Zero-bit fields do have addresses within structs which can be observed and are consistent with `@offsetOf`. These are not necessarily all implemented correctly yet (see disabled test), but this commit fixes all regressions compared to master, and makes one new test pass.	2023-09-21 14:48:41 -07:00
Andrew Kelley	5ea3de55c4	Sema: fix dependency loop regression on struct field alignment	2023-09-21 14:48:40 -07:00
Andrew Kelley	baea62a8ad	fix regressions from this branch	2023-09-21 14:48:40 -07:00
Andrew Kelley	accd5701c2	compiler: move struct types into InternPool proper Structs were previously using `SegmentedList` to be given indexes, but were not actually backed by the InternPool arrays. After this, the only remaining uses of `SegmentedList` in the compiler are `Module.Decl` and `Module.Namespace`. Once those last two are migrated to become backed by InternPool arrays as well, we can introduce state serialization via writing these arrays to disk all at once. Unfortunately there are a lot of source code locations that touch the struct type API, so this commit is still work-in-progress. Once I get it compiling and passing the test suite, I can provide some interesting data points such as how it affected the InternPool memory size and performance comparison against master branch. I also couldn't resist migrating over a bunch of alignment API over to use the log2 Alignment type rather than a mismash of u32 and u64 byte units with 0 meaning something implicitly different and special at every location. Turns out you can do all the math you need directly on the log2 representation of alignments.	2023-09-21 14:48:40 -07:00
r00ster91	ee4ced9683	write function types consistently with a space before `fn` keyword Currently, the compiler (like @typeName) writes it `fn(...) Type` but zig fmt writes it `fn (...) Type` (notice the space after `fn`). This inconsistency is now resolved and function types are consistently written the zig fmt way. Before this there were more `fn (...) Type` occurrences than `fn(...) Type` already.	2023-09-19 15:15:05 +03:00
mlugg	6df78c3bc1	Sema: mark pointers to inline functions as comptime-only This is supposed to be the case, similar to how pointers to generic functions are comptime-only (several pieces of logic already assumed this). These types being considered runtime was causing `dbg_var_val` AIR instructions to be wrongly emitted for such values, causing codegen backends to create a runtime reference to the inline function, which (at least on the LLVM backend) triggers an error. Resolves: #38	2023-09-15 21:46:38 -07:00
Andrew Kelley	cb6201715a	InternPool: prevent anon struct UAF bugs with type safety Instead of using actual slices for InternPool.Key.AnonStructType, this commit changes to use Slice types instead, which store a long-lived index rather than a pointer. This is a follow-up to 7ef1eb1c27754cb0349fdc10db1f02ff2dddd99b.	2023-09-12 20:08:56 -04:00
Andrew Kelley	ada0010471	compiler: move unions into InternPool There are a couple concepts here worth understanding: Key.UnionType - This type is available before resolving the union's fields. The enum tag type, number of fields, and field names, field types, and field alignments are not available with this. InternPool.UnionType - This one can be obtained from the above type with `InternPool.loadUnionType` which asserts that the union's enum tag type has been resolved. This one has all the information available. Additionally: * ZIR: Turn an unused bit into `any_aligned_fields` flag to help semantic analysis know whether a union has explicit alignment on any fields (usually not). * Sema: delete `resolveTypeRequiresComptime` which had the same type signature and near-duplicate logic to `typeRequiresComptime`. - Make opaque types not report comptime-only (this was inconsistent between the two implementations of this function). * Implement accepted proposal #12556 which is a breaking change.	2023-08-22 13:54:14 -07:00
Andrew Kelley	7ef1eb1c27	InternPool: safer enum API The key changes in this commit are: ```diff - names: []const NullTerminatedString, + names: NullTerminatedString.Slice, - values: []const Index, + values: Index.Slice, ``` Which eliminates the slices from `InternPool.Key.EnumType` and replaces them with structs that contain `start` and `len` indexes. This makes the lifetime of `EnumType` change from expiring with updates to InternPool, to expiring when the InternPool is garbage-collected, which is currently never. This is gearing up for a larger change I started working on locally which moves union types into InternPool. As a bonus, I fixed some unnecessary instances of `@as`.	2023-08-17 18:16:03 -07:00
Andrew Kelley	b820d5df79	Merge pull request #16747 from jacobly0/llvm-wo-libllvm llvm: enable the backend even when not linked to llvm	2023-08-10 12:02:57 -07:00
mlugg	6917a8c258	AstGen: handle `ty` result location for struct and array init correctly Well, this was a journey! The original issue I was trying to fix is covered by the new behavior test in array.zig: in essence, `ty` and `coerced_ty` result locations were not correctly propagated. While fixing this, I noticed a similar bug in struct inits: the type was propagated to fields fine, but the actual struct init was unnecessarily anonymous, which could lead to unnecessary copies. Note that the behavior test added in struct.zig was already passing - the bug here didn't change any easy-to-test behavior - but I figured I'd add it anyway. This is a little harder than it seems, because the result type may not itself be an array/struct type: it could be an optional / error union wrapper. A new ZIR instruction is introduced to unwrap these. This is also made a little tricky by the fact that it's possible for result types to be unknown at the time of semantic analysis (due to `anytype` parameters), leading to generic poison. In these cases, we must essentially downgrade to an anonymous initialization. Fixing these issues exposed another bug, related to type resolution in Sema. That issue is now tracked by #16603. As a temporary workaround for this bug, a few result locations for builtin function operands have been disabled in AstGen. This is technically a breaking change, but it's very minor: I doubt it'll cause any breakage in the wild.	2023-08-09 19:46:55 +01:00
Jacob Young	66084b6c3f	Sema: remove `validateRunTimeType` This function does not seem to differ in any interesting way from `!typeRequiresComptime`, other than the `is_extern` param which is only used in one place, and some differences did not seem correct anyway. My reasoning for changing opaque types to be comptime-only is that `explainWhyTypeIsComptime` is quite happy to explain why they are. :D	2023-08-09 05:46:44 -04:00
Andrew Kelley	c804abc7f6	Sema: handle adhoc inferred error sets in helper functions There were two missing places. Regressed in the #16318 branch. Found from compiling Bun. Unfortunately we do not have a behavior test reduction for this bug.	2023-07-22 20:21:27 -07:00
Andrew Kelley	3f2a4720b1	compiler: fix branch regressions * getOwnedFunctionIndex no longer checks if the value is actually a function. * The callsites to `intern` that I added want to avoid the `getCoerced` call, so I added `intern2`. * Adding to inferred error sets should not happen if the destination error set is not the inferred error set of the current Sema instance. * adhoc_inferred_error_set_type can be seen by the backend. Treat it like anyerror.	2023-07-18 19:02:06 -07:00
Andrew Kelley	927f6ec8ca	frontend: fix inferred error sets of comptime/inline calls Previously, they shared function index with the owner decl, but that would clobber the data stored for inferred error sets of runtime calls. Now there is an adhoc_inferred_error_set_type which models the problem much more correctly.	2023-07-18 19:02:06 -07:00
Andrew Kelley	d98526f912	Type.print: fix inferred error set crash The index is a function, not an inferred error set.	2023-07-18 19:02:05 -07:00
Andrew Kelley	6d72f971af	InternPool: implement getExternFunc	2023-07-18 19:02:05 -07:00
Andrew Kelley	f3dc53f6b5	compiler: rework inferred error sets * move inferred error sets into InternPool. - they are now represented by pointing directly at the corresponding function body value. * inferred error set working memory is now in Sema and expires after the Sema for the function corresponding to the inferred error set is finished having its body analyzed. * error sets use a InternPool.Index.Slice rather than an actual slice to avoid lifetime issues.	2023-07-18 19:02:05 -07:00
Andrew Kelley	db33ee45b7	rework generic function calls Abridged summary: * Move `Module.Fn` into `InternPool`. * Delete a lot of confusing and problematic `Sema` logic related to generic function calls. This commit removes `Module.Fn` and replaces it with two new `InternPool.Tag` values: * `func_decl` - corresponding to a function declared in the source code. This one contains line/column numbers, zir_body_inst, etc. * `func_instance` - one for each monomorphization of a generic function. Contains a reference to the `func_decl` from whence the instantiation came, along with the `comptime` parameter values (or types in the case of `anytype`) Since `InternPool` provides deduplication on these values, these fields are now deleted from `Module`: * `monomorphed_func_keys` * `monomorphed_funcs` * `align_stack_fns` Instead of these, Sema logic for generic function instantiation now unconditionally evaluates the function prototype expression for every generic callsite. This is technically required in order for type coercions to work. The previous code had some dubious, probably wrong hacks to make things work, such as `hashUncoerced`. I'm not 100% sure how we were able to eliminate that function and still pass all the behavior tests, but I'm pretty sure things were still broken without doing type coercion for every generic function call argument. After the function prototype is evaluated, it produces a deduplicated `func_instance` `InternPool.Index` which can then be used for the generic function call. Some other nice things made by this simplification are the removal of `comptime_args_fn_inst` and `preallocated_new_func` from `Sema`, and the messy logic associated with them. I have not yet been able to measure the perf of this against master branch. On one hand, it reduces memory usage and pointer chasing of the most heavily used `InternPool` Tag - function bodies - but on the other hand, it does evaluate function prototype expressions more than before. We will soon find out.	2023-07-18 19:02:05 -07:00
kcbanner	82a9d5d78d	type: update optionalReprIsPayload to handle inferred error sets	2023-07-10 04:52:23 -04:00
mlugg	f26dda2117	all: migrate code to new cast builtin syntax Most of this migration was performed automatically with `zig fmt`. There were a few exceptions which I had to manually fix: * `@alignCast` and `@addrSpaceCast` cannot be automatically rewritten * `@truncate`'s fixup is incorrect for vectors * Test cases are not formatted, and their error locations change	2023-06-24 16:56:39 -07:00
Jacob Young	6aa88ecc54	Type/Value: garbage collect some methods	2023-06-22 11:45:33 -07:00
Andrew Kelley	12813d5912	Merge pull request #16105 from jacobly0/intern-pool-opt InternPool: various optimizations	2023-06-21 00:07:49 -07:00
Jacob Young	a257e33fff	Type: remove arbitrary restrictions on param and return types Opaque and `noreturn` makes sense since they don't represent real values, but `null` and `undefined` are perfectly normal comptime-only values. Closes #16088	2023-06-20 21:51:01 -07:00
Jacob Young	96cdd51c14	Type: delete legacy allocation functions	2023-06-20 14:02:09 -04:00
Jacob Young	52ec121469	Sema: optimize callers of `indexToKey`	2023-06-20 14:02:09 -04:00
Evan Haas	fb9d6b8bd9	codegen: Set c_char signedness based on the target	2023-06-20 00:26:42 -07:00
Eric Joldasov	50339f595a	all: zig fmt and rename "@XToY" to "@YFromX" Signed-off-by: Eric Joldasov <bratishkaerik@getgoogleoff.me>	2023-06-19 12:34:42 -07:00
Motiejus Jakštys	d41111d7ef	mem: rename alignGeneric to mem.align Anecdote 1: The generic version is way more popular than the non-generic one in Zig codebase: git grep -w alignForward \| wc -l 56 git grep -w alignForwardGeneric \| wc -l 149 git grep -w alignBackward \| wc -l 6 git grep -w alignBackwardGeneric \| wc -l 15 Anecdote 2: In my project (turbonss) that does much arithmetic and alignment I exclusively use the Generic functions. Anecdote 3: we used only the Generic versions in the Macho Man's linker workshop.	2023-06-17 12:49:13 -07:00
r00ster91	2593156068	migration: std.math.{min, min3, max, max3} -> `@min` & `@max`	2023-06-16 13:44:09 -07:00
mlugg	ce88c43a4e	Sema: allow indexing tuple and vector pointers Resolves: #13852 Resolves: #14705	2023-06-14 22:07:25 -07:00
Jacob Young	d37ebfcf23	InternPool: avoid as many slices pointing to `string_bytes` as possible These are frequently invalidated whenever a string is interned, so avoid creating pointers to `string_bytes` wherever possible. This is an attempt to fix random CI failures.	2023-06-11 23:45:09 -07:00
mlugg	54460e39ac	Autodoc: make it work under InternPool	2023-06-11 22:04:15 -07:00

1 2 3 4 5 ...

534 Commits