mirror/zig - zig - Bouvais Git

mirror/zig

mirror of https://github.com/ziglang/zig.git synced 2026-01-05 04:53:17 +00:00

Author	SHA1	Message	Date
Andrew Kelley	c7dc451a2a	stage2: more debuggable panics For now these errors are handled via `@panic` rather than `unreachable`. These are relatively likely bugs to occur at this early stage of development, and handling them as panics lets us ship release builds of the compiler without worrying about undefined behavior. Furthermore, in stage1, `@panic` is implemented to include an error return trace, while `unreachable` is not. In this case, the error return traces are extremely helpful in debugging the compiler.	2021-08-05 23:20:53 -07:00
Andrew Kelley	786e238a7f	AstGen: fix function declarations They were putting their return type expressions into the wrong ZIR block, resulting in a compiler crash.	2021-08-05 23:20:10 -07:00
Andrew Kelley	47f2463b5c	std.HashMap: fix getPtrAdapted. AstGen: fix fn param iteration There was a bug in stage2 regarding iteration of function parameter AST. This resulted in a false negative "unused parameter" compile error, which, when fixed, revealed a bug in the std lib HashMap implementation.	2021-08-05 23:17:29 -07:00
Andrew Kelley	c03a04a589	stage2: return type expressions of generic functions * ZIR encoding for function instructions have a body for the return type. This lets Sema for generic functions do the same thing it does for parameters, handling `error.GenericPoison` in the evaluation of the return type by marking the function as generic. * Sema: fix missing block around the new Decl arena finalization. This led to a memory corruption. * Added some floating point support to the LLVM backend but didn't get far enough to pass any new tests.	2021-08-05 19:19:19 -07:00
Andrew Kelley	e9e3a29946	stage2: implement generic function memoization Module has a new field `monomorphed_funcs` which stores the set of `*Module.Fn` objects which are generic function instantiations. The hash is based on hashes of comptime values of parameters known to be comptime based on an explicit comptime keyword or must-be-comptime type expressions that can be evaluated without performing monomorphization. This allows function calls to be semantically analyzed cheaply for generic functions which are already instantiated. The table is updated with a single `getOrPutAdapted` in the semantic analysis of `call` instructions, by pre-allocating the `Fn` object and passing it to the child `Sema`.	2021-08-05 16:37:21 -07:00
Andrew Kelley	f58cbef165	stage2: std.mem.eql works now * The `indexable_ptr_len` ZIR instruction now uses a `none_or_ref` ResultLoc. This prevents an unnecessary `ref` instruction from being emitted. * Sema: Fix `analyzeCall` using the incorrect ZIR object for the generic function callee. * LLVM backend: `genTypedValue` supports a `Slice` type encoded with the `decl_ref` `Value`.	2021-08-04 23:02:13 -07:00
Andrew Kelley	d4468affb7	stage2 generics improvements: anytype and param type exprs AstGen result locations now have a `coerced_ty` tag which is the same as `ty` except it assumes that Sema will do a coercion, so it does not redundantly add an `as` instruction into the ZIR code. This results in cleaner ZIR and about a 14% reduction of ZIR bytes. param and param_comptime ZIR instructions now have a block body for their type expressions. This allows Sema to skip evaluation of the block in the case that the parameter is comptime-provided. It also allows a new mechanism to function: when evaluating type expressions of generic functions, if it would depend on another parameter, it returns `error.GenericPoison` which bubbles up and then is caught by the param/param_comptime instruction and then handled. This allows parameters to be evaluated independently so that the type info for functions which have comptime or anytype parameters will still have types populated for parameters that do not depend on values of previous parameters (because evaluation of their param blocks will return successfully instead of `error.GenericPoison`). It also makes iteration over the block that contains function parameters slightly more efficient since it now only contains the param instructions. Finally, it fixes the case where a generic function type expression contains a function prototype. Formerly, this situation would cause shared state to clobber each other; now it is in a proper tree structure so that can't happen. This fix also required adding a field to Sema `comptime_args_fn_inst` to make sure that the `comptime_args` field passed into Sema is applied to the correct `func` instruction. Source location for `node_offset_asm_ret_ty` is fixed; it was pointing at the asm output name rather than the return type as intended. Generic function instantiation is fixed, notably with respect to parameter type expressions that depend on previous parameters, and with respect to types which must be always comptime-known. This involves passing all the comptime arguments at a callsite of a generic function, and allowing the generic function semantic analysis to coerce the values to the proper types (since it has access to the evaluated parameter type expressions) and then decide based on the type whether the parameter is runtime known or not. In the case of explicitly marked `comptime` parameters, there is a check at the semantic analysis of the `call` instruction. Semantic analysis of `call` instructions does type coercion on the arguments, which is needed both for generic functions and to make up for using `coerced_ty` result locations (mentioned above). Tasks left in this branch: * Implement the memoization table. * Add test coverage. * Improve error reporting and source locations for compile errors.	2021-08-04 21:11:31 -07:00
Andrew Kelley	382d201781	stage2: basic generic functions are working The general strategy is that Sema will pre-map comptime arguments into the inst_map, and then re-run the block body that contains the `param` and `func` instructions. This re-runs all the parameter type expressions except with comptime values populated. In Sema, param instructions are now handled specially: they detect whether they are comptime-elided or not. If so, they skip putting a value in the inst_map, since it is already pre-populated. If not, then they append to the `fields` field of `Sema` for use with the `func` instruction. So when the block body is re-run, a new function is generated with all the comptime arguments elided, and the new function type has only runtime parameters in it. TODO: give the generated Decls better names than "foo__anon_x". The new function is then added to the work queue to have its body analyzed and a runtime call AIR instruction to the new function is emitted. When the new function gets semantically analyzed, comptime parameters are pre-mapped to the corresponding `comptime_args` values rather than mapped to an `arg` AIR instruction. `comptime_args` is a new field that `Fn` has which is a `TypedValue` for each parameter. This field is non-null for generic function instantiations only. The values are the comptime arguments. For non-comptime parameters, a sentinel value is used. This is because we need to know the information of which parameters are comptime-known. Additionally: * AstGen: align and section expressions are evaluated in the scope that has comptime parameters in it. There are still some TODO items left; see the BRANCH_TODO file.	2021-08-03 22:34:22 -07:00
Andrew Kelley	609b84611d	stage2: rework runtime, comptime, inline function calls * ZIR function instructions encode the index of the block that contains the function instruction. This allows Zig to later scan the block and find the parameter instructions, which is needed for semantically analyzing function bodies. * Runtime function calls insert AIR arg instructions and then inserts Sema inst_map entries mapping the ZIR param instructions to them. * comptime/inline function call inserts Sema inst_map entries mapping the ZIR param instructions to the AIR callsite arguments. With this commit we are back to the tests passing.	2021-08-03 17:29:59 -07:00
Andrew Kelley	1472dc3ddb	stage2: update ZIR for generic functions ZIR encoding for functions is changed in preparation for generic function support. As an example: ```zig const std = @import("std"); const expect = std.testing.expect; test "example" { var x: usize = 0; x += checkSize(i32, 1); x += checkSize(bool, true); try expect(x == 5); } fn checkSize(comptime T: type, x: T) usize { _ = x; return @sizeOf(T); } ``` Previous ZIR for the `checkSize` function: ```zir [165] checkSize line(10) hash(0226f62e189fd0b1c5fca02cf4617562): %55 = block_inline({ %56 = decl_val("T") token_offset:11:35 %57 = as_node(@Ref.type_type, %56) node_offset:11:35 %69 = extended(func([comptime @Ref.type_type, %57], @Ref.usize_type, { %58 = arg("T") token_offset:11:23 %59 = as_node(@Ref.type_type, %58) node_offset:11:35 %60 = arg("x") token_offset:11:32 %61 = dbg_stmt(11, 4) ``` ZIR for the `checkSize` function after this commit: ```zir [157] checkSize line(10) hash(0226f62e189fd0b1c5fca02cf4617562): %55 = block_inline({ %56 = param_comptime("T", @Ref.type_type) token_offset:11:23 %57 = as_node(@Ref.type_type, %56) node_offset:11:35 %58 = param("x", %57) token_offset:11:32 %67 = func(@Ref.usize_type, { %59 = dbg_stmt(11, 4) ``` Noted differences: * Previously the type expression was redundantly repeated. * Previously the parameter names were redundantly stored in the ZIR extra array. * Instead of `arg` ZIR instructions as the first instructions within a function body, they are now outside the function body, in the same block as the `func` instruction. There are variants: - param - param_comptime - param_anytype - param_anytype_comptime * The param instructions additionally encode the type. * Because of the param instructions, the `func` instruction no longer encodes the list of parameter types or the comptime bits. It's implied that Sema will collect the parameters so that when a `func` instruction is encountered, they will be implicitly used to construct the function's type. This is so that we can satisfy all 3 ways of performing semantic analysis on a function: 1. runtime: Sema will insert AIR arg instructions for each parameter, and insert into the Sema inst_map ZIR param => AIR arg. 2. comptime/inline: Sema will insert into the inst_map ZIR param => callsite arguments. 3. generic: Sema will map only the comptime ZIR param instructions to the AIR instructions for the comptime arguments at the callsite, and then re-run Sema for the function's Decl. This will produce a new function which is the monomorphized function. Additionally: * AstGen: Update usage of deprecated `ensureCapacity` to `ensureUnusedCapacity` or `ensureTotalCapacity`. * Introduce `Type.fnInfo` for getting a bunch of data about a function type at once, and use it in `analyzeCall`. This commit starts a branch to implement generic functions in stage2. Test regressions have not been addressed yet.	2021-08-02 21:56:10 -07:00
Andrew Kelley	b465037a65	move some behavior tests to the "passing for stage2" section	2021-08-01 23:27:38 -07:00
Jakub Konka	eba153f88f	Merge pull request #9501 from ziglang/macho-objc-cleanup Add standalone Objective-C enabled on macOS only	2021-08-02 08:01:15 +02:00
Andrew Kelley	dae4c18aa7	stage2: ZIR encodes comptime parameters `func_extended` ZIR instructions now have a one of the unused flags used as a `has_comptime_bits` boolean. When set, it means 1 or more parameters are `comptime`. In this case, there is a u32 per every 32 parameters (usually just 1 u32) with each bit indicating whether the corresponding parameter is `comptime`. Sema uses this information to correctly mark generic functions as generic. There is now a TODO compile error in place in case a generic function call happens. A future commit will do the generic function call implementation.	2021-08-01 22:04:18 -07:00
Andrew Kelley	d5f173d28f	Merge pull request #9496 from Luukdegram/stage2-wasm stage2: wasm - Wrapping, intcast and optionals	2021-08-01 20:33:55 -04:00
Andrew Kelley	ddf14323ea	stage2: implement `@truncate`	2021-08-01 16:13:58 -07:00
Jakub Konka	0ce54a1416	add standalone Objective-C enabled on macOS only	2021-08-01 22:48:39 +02:00
Andrew Kelley	6ae0825e7f	Sema: implement comptime variables Sema now properly handles alloc_inferred and alloc_inferred_mut ZIR instructions inside a comptime execution context. In this case it creates Decl objects and points to them with the new `decl_ref_mut` Value Tag. `storePtr` is updated to mutate such Decl types and values. In this case it destroys the old arena and makes a new one, preventing memory growth during comptime code execution. Additionally: * Fix `storePtr` to emit a compile error for a pointer comptime-known to be undefined. * Fix `storePtr` to emit runtime instructions for all the cases that a pointer is comptime-known but does not support comptime dereferencing, such as `@intToPtr` on a hard-coded address, or an extern function. * Fix `ret_coerce` not coercing inside inline function call context.	2021-08-01 12:36:04 -07:00
Luuk de Gram	6e139d124b	wasm: Resolve feedback (wrapping arbitrary int sizes) - This ensures we honor the user's integer size when performing wrapping operations. - Also, instead of using ensureCapacity, we now use ensureUnusedCapacity.	2021-08-01 21:30:06 +02:00
Jakub Konka	d794f4cd2a	macho: add runaway section id when sorting sections	2021-08-01 18:05:07 +02:00
Meghan	7e52a096db	langref- fix packed struct error code	2021-08-01 12:42:48 +03:00
Meghan	32069d2330	langref- fix use after block error code	2021-08-01 12:42:05 +03:00
Luuk de Gram	a861b7d160	wasm: Test cases for optionals	2021-08-01 11:07:23 +02:00
Luuk de Gram	61de59e121	wasm: Implement optionals This uses the same approach as error unions, meaning it's a `WValue` with its tag set to `multi_value`. The initial index of the multi_value will contain the null-tag, used to check if the value is null or not. The other values will be the payload. To support the `.?` shorthand syntax, we save the result from checking the null-tag into a new local, which can then be loaded later in the block to either hit `unreachable` or set the actual payload value. Currently, it seems `.?` and `orelse unreachable` results in different AIR structure. TODO: Is this expected?	2021-08-01 11:07:23 +02:00
Luuk de Gram	e58976542b	wasm: Test cases for wrap+intcast instructions	2021-08-01 11:07:23 +02:00
Luuk de Gram	5667ab7dcd	wasm: Implement wrapping operands, add opcodes to wasm.zig - Some opcodes have the incorrect value set in std. - Some opcodes were missing and have now been added to std. - Adding wrapping operands for add,sub and mul. - Implement intCast which either extends or shortens the type.	2021-08-01 11:07:23 +02:00
Žiga Željko	5589edf45c	fix help for ast-check command	2021-08-01 03:57:34 -04:00
joachimschmidt557	0d09c6aed8	stage2 ARM: fix stack alignment Acording to the AAPCS32, the stack alignment at public interfaces should be 8, not 4.	2021-08-01 03:57:08 -04:00
Jakub Konka	0ce56f9305	macho: fix Trie and CodeSignature unit tests after the cleanup.	2021-08-01 09:06:56 +02:00
Jakub Konka	58bc713c17	macho: make Trie accept allocator as a param instead of storing it as a member of Trie struct.	2021-08-01 09:06:56 +02:00
Jakub Konka	d19fdf09ae	macho: make CodeSignature accept allocator as param instead storing it within the struct.	2021-08-01 09:06:56 +02:00
Jakub Konka	2e30bf23aa	macho: cleanup extracting objects from archives	2021-08-01 09:06:56 +02:00
Jakub Konka	0b15ba8334	macho: don't allocate Dylib on the heap instead, immediately transfer ownership to MachO struct. Also, revert back to try-ok-fail parsing approach of objects, archives, and dylibs. It seems easier to try and fail than check if the file is of a certain type given that a dylib may be a stub and parsing yaml twice in a row seems very wasteful. Hint for the future: if we optimise yaml/TAPI parsing, this approach may be rethought!	2021-08-01 09:06:56 +02:00
Jakub Konka	f023cdad7c	macho: don't allocate Archives on the heap instead, transfer ownership directly to MachO struct.	2021-08-01 09:06:56 +02:00
Jakub Konka	06396ddd7d	macho: don't allocate Objects on the heap instead, ownership is transferred to MachO. This makes Object management align closer with data-oriented design.	2021-08-01 09:06:56 +02:00
Jakub Konka	e73777333d	macho: don't store allocator in Dylib instance instead pass it in as an arg to a function that requires it.	2021-08-01 09:06:56 +02:00
Jakub Konka	586a19e3db	macho: don't store allocator in Archive instead pass it in functions as an arg when required.	2021-08-01 09:06:56 +02:00
Jakub Konka	c30cc4dbbf	macho: don't store allocator in Object instead, pass it in functions that require it. Also, when parsing relocs, make Object part of the context struct where we pass in additional goodies such as `MachO` or `Allocator`.	2021-08-01 09:06:56 +02:00
Andrew Kelley	1f95c50d9a	codegen: cmp lowering treats bools the same as unsigned int fixes a crash when lowering `a == b` and they are of type bool. I'm not worried about floats; I think we will probably add separate AIR instructions for floats.	2021-07-30 17:48:24 -07:00
Andrew Kelley	6e78c007df	Sema: improved AIR when one operand of bool cmp is known When doing `x == true` or `x == false` it is now lowered as either a no-op or a not, respectively, rather than a cmp instruction. This commit also extracts a zirCmpEq function out from zirCmp, reducing the amount of branching (on is_equality_cmp) in both functions.	2021-07-30 17:40:30 -07:00
Andrew Kelley	507dc1f2e7	stage2: fix hashing and comparison design flaw with Value * `Value.toType` accepts a buffer parameter instead of an allocator parameter and can no longer fail. * Module: remove the unused `mod: Module` parameter from various functions. `Value.compare` now accepts a `Type` parameter which indicates the type of both operands. There is also a `Value.compareHetero` which accepts only Value parameters and supports comparing mixed types. Likewise, `Value.eql` requires a `Type` parameter. * `Value.hash` is removed; instead the hash map context structs now have a `ty: Type` field, and the hash function lives there, where it has access to a Value's Type when it computes a hash. - This allowed the hash function to be greatly simplified and sound in the sense that the same Values, even with different representations, always hash to the same thing. * Sema: Fix source location of zirCmp when an operand is runtime known but needs to be comptime known. * Remove unused target parameter from `Value.floatCast`.	2021-07-30 16:17:59 -07:00
joachimschmidt557	84039a57e4	stage2 codegen: Implement genTypedValue for enums	2021-07-30 17:53:33 -04:00
Koakuma	7aaea20e7e	Add freeAndExit() implementation for Linux/SPARCv9	2021-07-30 13:23:41 -04:00
joachimschmidt557	f6b1fa9e29	stage2 codegen: genTypedValue for error unions and error sets	2021-07-30 13:22:33 -04:00
Andrew Kelley	e5e6ceda6a	Merge pull request #9486 from ziglang/comptime-pointers stage2: more principled approach to comptime pointers and garbage collection of unused anon decls	2021-07-30 01:40:32 -04:00
Andrew Kelley	040c6eaaa0	stage2: garbage collect unused anon decls After this change, the frontend and backend cooperate to keep track of which Decls are actually emitted into the machine code. When any backend sees a `decl_ref` Value, it must mark the corresponding Decl `alive` field to true. This prevents unused comptime data from spilling into the output object files. For example, if you do an `inline for` loop, previously, any intermediate value calculations would have gone into the object file. Now they are garbage collected immediately after the owner Decl has its machine code generated. In the frontend, when it is time to send a Decl to the linker, if it has not been marked "alive" then it is deleted instead. Additional improvements: * Resolve type ABI layouts after successful semantic analysis of a Decl. This is needed so that the backend has access to struct fields. * Sema: fix incorrect logic in resolveMaybeUndefVal. It should return "not comptime known" instead of a compile error for global variables. * `Value.pointerDeref` now returns `null` in the case that the pointer deref cannot happen at compile-time. This is true for global variables, for example. Another example is if a comptime known pointer has a hard coded address value. * Binary arithmetic sets the requireRuntimeBlock source location to the lhs_src or rhs_src as appropriate instead of on the operator node. * Fix LLVM codegen for slice_elem_val which had the wrong logic for when the operand was not a pointer. As noted in the comment in the implementation of deleteUnusedDecl, a future improvement will be to rework the frontend/linker interface to remove the frontend's responsibility of calling allocateDeclIndexes. I discovered some issues with the plan9 linker backend that are related to this, and worked around them for now.	2021-07-29 19:30:37 -07:00
Andrew Kelley	a5c6e51f03	stage2: more principled approach to comptime references * AIR no longer has a `variables` array. Instead of the `varptr` instruction, Sema emits a constant with a `decl_ref`. * AIR no longer has a `ref` instruction. There is no longer any instruction that takes a value and returns a pointer to it. If this is desired, Sema must either create an anynomous Decl and return a constant `decl_ref`, or in the case of a runtime value, emit an `alloc` instruction, `store` the value to it, and then return the `alloc`. * The `ref_val` Value Tag is eliminated. `decl_ref` should be used instead. Also added is `eu_payload_ptr` which points to the payload of an error union, given an error union pointer. In general, Sema should avoid calling `analyzeRef` if it can be helped. For example in the case of field_val and elem_val, there should never be a reason to create a temporary (alloc or decl). Recent previous commits made progress along that front. There is a new abstraction in Sema, which looks like this: var anon_decl = try block.startAnonDecl(); defer anon_decl.deinit(); // here 'anon_decl.arena()` may be used const decl = try anon_decl.finish(ty, val); // decl is typically now used with `decl_ref`. This pattern is used to upgrade `ref_val` usages to `decl_ref` usages. Additional improvements: * Sema: fix source location resolution for calling convention expression. * Sema: properly report "unable to resolve comptime value" for loads of global variables. There is now a set of functions which can be called if the callee wants to obtain the Value even if the tag is `variable` (indicating comptime-known address but runtime-known value). * Sema: `coerce` resolves builtin types before checking equality. * Sema: fix `u1_type` missing from `addType`, making this type have a slightly more efficient representation in AIR. * LLVM backend: fix `genTypedValue` for tags `decl_ref` and `variable` to properly do an LLVMConstBitCast. * Remove unused parameter from `Value.toEnum`. After this commit, some test cases are no longer passing. This is due to the more principled approach to comptime references causing more anonymous decls to get sent to the linker for codegen. However, in all these cases the decls are not actually referenced by the runtime machine code. A future commit in this branch will implement garbage collection of decls so that unused decls do not get sent to the linker for codegen. This will make the tests go back to passing.	2021-07-29 15:59:51 -07:00
Andrew Kelley	192b5d24cb	Merge pull request #8570 from vrischmann/thread-name add Thread.setName and Thread.getName	2021-07-29 13:25:13 -04:00
Vincent Rischmann	df4fe87163	thread: implement setName/getName	2021-07-29 12:02:12 +02:00
Vincent Rischmann	accde7fe2d	windows: add wrappers for LocalFree, SetThreadDescription and GetThreadDescription	2021-07-29 10:57:08 +02:00
Vincent Rischmann	0be1e74902	windows/kernel32: add LocalFree, SetThreadDescription and GetThreadDescription	2021-07-29 10:57:08 +02:00

1 2 3 4 5 ...

14878 Commits