mirror/zig - zig - Bouvais Git

mirror/zig

mirror of https://github.com/ziglang/zig.git synced 2025-12-06 06:13:07 +00:00

Author	SHA1	Message	Date
Frank Denis	d86685ac96	sha3: define block_length as the rate, not as the state size (#14132 ) In sponge-based constructions, the block size is not the same as the state size. For practical purposes, it's the same as the rate. Size this is a constant for a given type, we don't need to keep a copy of that value in the state itself. Just use the constant directly. This saves some bytes and may even be slightly faster. More importantly: Fixes #14128	2022-12-30 22:15:25 +00:00
Frank Denis	0d83487dd0	hkdf: add prk_length and extractInit() The HKDF extract function uses HMAC under the hood, but requiring applications to directly use HMAC functions reduces clarity and feels like the wrong abstraction. So, in order to get the PRK length, add a `prk_length` constant that applications can use directly. Also add an `extractInit()` function for cases where the keying material has to be provided as multiple chunks.	2022-12-29 17:56:50 -05:00
Veikka Tuominen	622311fb9a	update uses of overflow arithmetic builtins	2022-12-27 15:13:14 +02:00
Frank Denis	c9e3524d0b	HKDF allow expansion up to, and including <hash size> * 255 bytes (#14051 ) Fixes #14050	2022-12-23 21:38:27 +00:00
r00ster91	aac2d6b56f	std.builtin: rename Type.UnionField and Type.StructField's field_type to type	2022-12-17 14:11:33 +01:00
Veikka Tuominen	08b2d491bc	update usages of `@call`	2022-12-13 13:14:20 +02:00
Frank Denis	14416b522e	Revert "std.crypto.aes: use software implementation in comptime context (#13792 )" (#13798 ) This reverts commit d4adf4420071397d993bac629a9da27b33c67ca3. Unfortunately, this is not the right place to check if AES functions are being used at comptime or not.	2022-12-07 03:49:20 +00:00
Frank Denis	d4adf44200	std.crypto.aes: use software implementation in comptime context (#13792 ) Hardware-accelerated AES requires inline assembly code, which cannot work at comptime.	2022-12-06 22:48:19 +00:00
Frank Denis	397881fefb	treshold -> threshold	2022-12-05 19:25:10 -05:00
Frank Denis	4be1bb4aac	std.crypto benchmark: don't use a relative path to import std (#13772 )	2022-12-05 04:44:14 +00:00
Frank Denis	7411be3c9e	std.crypto.edwards25519: add a rejectLowOrder() function (#13668 ) Does what the name says: rejects generators of low-order groups. `clearCofactor()` was previously used to do it, but for e.g. cofactored signature verification, we don't need the result of an actual multiplication. Only check that we didn't end up with a low-order point, which is a faster operation.	2022-11-28 00:34:13 +01:00
Frank Denis	feb806a212	std.crypto.ed25519 incremental signatures: hash the fallback noise (#13643 ) If the noise parameter was null, we didn't use any noise at all. We unconditionally generated random noise (`noise2`) but didn't use it. Spotted by @cryptocode, thanks!	2022-11-24 12:13:37 +01:00
Frank Denis	ea05223b63	std.crypto.auth: add AEGIS MAC (#13607 ) * Update the AEGIS specification URL to the current draft * std.crypto.auth: add AEGIS MAC The Pelican-based authentication function of the AEGIS construction can be used independently from authenticated encryption, as a faster and more secure alternative to GHASH/POLYVAL/Poly1305. We already expose GHASH, POLYVAL and Poly1305 for use outside AES-GCM and ChaChaPoly, so there are no reasons not to expose the MAC from AEGIS as well. Like other 128-bit hash functions, finding a collision only requires ~2^64 attempts or inputs, which may still be acceptable for many practical applications. Benchmark (Apple M1): siphash128-1-3: 3222 MiB/s ghash: 8682 MiB/s aegis-128l mac: 12544 MiB/s Benchmark (Zen 2): siphash128-1-3: 4732 MiB/s ghash: 5563 MiB/s aegis-128l mac: 19270 MiB/s	2022-11-22 18:16:04 +01:00
Frank Denis	c45c6cd492	Add the POLYVAL universal hash function POLYVAL is GHASH's little brother, required by the AES-GCM-SIV construction. It's defined in RFC8452. The irreducible polynomial is a mirror of GHASH's (which doesn't change anything in our implementation that didn't reverse the raw bits to start with). But most importantly, POLYVAL encodes byte strings as little-endian instead of big-endian, which makes it a little bit faster on the vast majority of modern CPUs. So, both share the same code, just with comptime magic to use the correct endianness and only double the key for GHASH.	2022-11-20 18:13:19 -05:00
Frank Denis	4dd061a7ac	ghash: handle the .hi_lo case when no CLMUL acceleration is present, too	2022-11-17 23:54:21 +01:00
Frank Denis	3051e279a5	Reapply "std.crypto.onetimeauth.ghash: faster GHASH on modern CPUs (#13566 )" This reapplies commit 72d3f4b5dc0dda9fd0a048c2391f03604f4b30ac.	2022-11-17 23:52:58 +01:00
Andrew Kelley	72d3f4b5dc	Revert "std.crypto.onetimeauth.ghash: faster GHASH on modern CPUs (#13566 )" This reverts commit 7cfeae1ce7aa9f1b3a219d032c43bc2e694ba63b which is causing std lib tests to fail on wasm32-wasi.	2022-11-17 15:37:37 -07:00
Frank Denis	7cfeae1ce7	std.crypto.onetimeauth.ghash: faster GHASH on modern CPUs (#13566 ) * std.crypto.onetimeauth.ghash: faster GHASH on modern CPUs Carryless multiplication was slow on older Intel CPUs, justifying the need for using Karatsuba multiplication. This is not the case any more; using 4 multiplications to multiply two 128-bit numbers is actually faster than 3 multiplications + shifts and additions. This is also true on aarch64. Keep using Karatsuba only when targeting x86 (granted, this is a bit of a brutal shortcut, we should really list all the CPU models that had a slow clmul instruction). Also remove useless agg_2 treshold and restore the ability to precompute only H and H^2 in ReleaseSmall. Finally, avoid using u256. Using 128-bit registers is actually faster. * Use a switch, add some comments	2022-11-17 13:07:07 +01:00
Frank Denis	7eed028f9a	crypto.bcrypt: fix massive speed regression when using stage2 (#13518 ) state: State -> state: *const State Suggested by @nektro Fixes #13510	2022-11-14 16:37:19 +01:00
Naoki MATSUMOTO	b29057b6ab	std.crypto.ghash: fix uninitialized polynomial use (#13527 ) In the process of 'remaining blocks', the length of processed message can be from 1 to 79. The value of 'n-1' is ranged from 0 to 3. So, st.hx[i] must be initialized at least from st.hx[0] to st.hx[3]	2022-11-14 16:35:08 +01:00
Frank Denis	df7223c7f2	crypto.AesGcm: provision ghash for the final block	2022-11-11 18:04:22 +01:00
Frank Denis	59af6417bb	crypto.ghash: define aggregate tresholds as blocks, not bytes (#13507 ) These constants were read as a block count in initForBlockCount() but at the same time, as a size in update(). The unit could be blocks or bytes, but we should use the same one everywhere. So, use blocks as intended. Fixes #13506	2022-11-10 19:00:00 +01:00
Frank Denis	36e618aef1	crypto.ghash: compatibility with stage1 Defining the selector enum outside the function definition is required for stage1.	2022-11-08 16:59:53 +01:00
Frank Denis	7d48cb1138	std.crypto: make ghash faster, esp. for small messages (#13464 ) * std.crypto: make ghash faster, esp. for small messages Aggregated reduction requires 5 additional multiplications (to precompute the powers of H), in order to save 2 multiplications per batch. So, only use large batches when it's actually interesting to do so. For the last blocks, reuse the precomputations in order to perform a single reduction. Also, even in .ReleaseSmall, allow 2-block aggregation. The speedup is worth it, and the code increase is reasonable. And in .ReleaseFast, bump the upper batch size up to 16. Leverage comptime by the way instead of duplicating code. std/crypto/benchmark.zig on Apple M1: Zig 0.10.0: 2769 MiB/s Before: 6014 MiB/s After: 7334 MiB/s Normalize function names by the way. * Change clmul() to accept the half to be processed This avoids a bunch of truncate() calls. * Add more ghash tests to check all code paths	2022-11-07 21:45:29 +01:00
Frank Denis	32563e6829	crypto.core.aes: process 6 block in parallel instead of 8 on aarch64 (#13473 ) * crypto.core.aes: process 6 block in parallel instead of 8 on aarch64 At least on Apple Silicon, this is slightly faster than 8 blocks. * AES: add parallel blocks for tigerlake, rocketlake, alderlake, zen3	2022-11-07 12:28:37 +01:00
Frank Denis	907f3ef887	crypto.salsa20: make the number of rounds a comptime parameter (#13442 ) ...instead of hard-coding it to 20. - This is consistent with the ChaCha implementation - NaCl and libsodium, that this API is designed to interop with, also support 8 and 12 round variants. The 12 round variant, in particular, provides the same security level as the 20 round variant, but is obviously faster. - scrypt currently uses its own non optimized version of Salsa, just because it use 8 rounds instead of 20. This will help remove code duplication. No behavior nor public API changes. The Salsa20 and XSalsa20 still represent the 20-round variant.	2022-11-06 23:52:41 +01:00
Frank Denis	96793530cd	std.crypto.pwhash.bcrypt: inline the Feistel network function (#13416 ) std/crypto/benchmark.zig results: * Intel i5 before: 3.144 s/ops after: 1.922 s/ops * Apple M1 before: 2.067 s/ops after: 1.373 s/ops	2022-11-03 13:10:08 +01:00
Jacob Young	93d60d0de7	std: avoid vector usage with the C backend Vectors are not yet implemented in the C backend, so no reason to prevent code using the standard library from compiling in the meantime.	2022-11-01 20:38:37 -04:00
Frank Denis	0d192ee9ef	std.crypto.onetimeauth.Ghash: make GHASH 2 - 2.5x faster (#13374 ) Rewrite GHASH to use 128-bit multiplication over non-reversed integers, and up to 8 blocks aggregated reduction. lib/std/crypto/benchmark.zig results: Xeon E5: Before: 1604 MiB/s After: 4005 MiB/s Apple M1: Before: 2769 MiB/s After: 6014 MiB/s This also makes AES-GCM faster by the way.	2022-11-01 13:49:13 -04:00
Frank Denis	ddb9eac05c	ed25519: recommend using the seed to recover a key pair	2022-11-01 07:26:32 +01:00
Frank Denis	9e44710fc4	Ed25519.KeyPair.fromSecretKey() didn't compile after the API changes (#13386 ) Fixes #13378	2022-11-01 07:10:40 +01:00
Cody Tapscott	67fa3262b1	std.crypto: Use `featureSetHas` to gate intrinsics This also fixes a bug where the feature gating was not taking effect at comptime due to https://github.com/ziglang/zig/issues/6768	2022-10-28 17:17:08 -07:00
Cody Tapscott	f9fe548e41	std.crypto: Add `isComptime` guard around intrinsics Comptime code can't execute assembly code, so we need some way to force comptime code to use the generic path. This should be replaced with whatever is implemented for #868, when that day comes. I am seeing that the result for the hash is incorrect in stage1 and crashes stage2, so presumably this never worked correctly. I will follow up on that soon.	2022-10-28 15:21:10 -07:00
Cody Tapscott	4c1f71e866	std.crypto: Optimize SHA-256 intrinsics for AMD x86-64 This gets us most of the way back to the performance I had when I was using the LLVM intrinsics: - Intel Intel(R) Core(TM) i7-1068NG7 CPU @ 2.30GHz: 190.67 MB/s (w/o intrinsics) -> 1285.08 MB/s - AMD EPYC 7763 (VM) @ 2.45 GHz: 240.09 MB/s (w/o intrinsics) -> 1360.78 MB/s - Apple M1: 216.96 MB/s (w/o intrinsics) -> 2133.69 MB/s Minor changes to this source can swing performance from 400 MB/s to 1400 MB/s or... 20 MB/s, depending on how it interacts with the optimizer. I have a sneaking suspicion that despite LLVM inheriting GCC's extremely strict inline assembly semantics, its passes are rather skittish around inline assembly (and almost certainly, its instruction cost models can assume nothing)	2022-10-28 15:21:10 -07:00
Cody Tapscott	ee241c47ee	std.crypto: SHA-256 Properly gate comptime conditional This feature detection must be done at comptime so that we avoid generating invalid ASM for the target.	2022-10-28 15:21:10 -07:00
Cody Tapscott	10edb6d352	crypto.sha2: Use intrinsics for SHA-256 on x86-64 and AArch64 There's probably plenty of room to optimize these further in the future, but for the moment this gives ~3x improvement on Intel x86-64 processors, ~5x on AMD, and ~10x on M1 Macs. These extensions are very new - Most processors prior to 2020 do not support them. AVX-512 is a slightly older alternative that we could use on Intel for a much bigger performance bump, but it's been fused off on Intel's latest hybrid architectures and it relies on computing independent SHA hashes in parallel. In contrast, these SHA intrinsics provide the usual single-threaded, single-stream interface, and should continue working on new processors. AArch64 also has SHA-512 intrinsics that we could take advantage of in the future	2022-10-28 15:21:10 -07:00
Frank Denis	f28e4e03ee	std.sign.ecdsa: add support for incremental signatures (#13332 ) Similar to what was done for EdDSA, allow incremental creation and verification of ECDSA signatures. Doing so for ECDSA is trivial, and can be useful for TLS as well as the future package manager.	2022-10-28 16:25:37 +02:00
Frank Denis	9c0d975a09	Revamp the ed25519 API (#13309 )	2022-10-27 19:07:42 +02:00
Naoki MATSUMOTO	cd4865d88c	std.crypto.sign.ecdsa: accepts unusual parameters like EcdsaP384Sha256 (#13302 ) This commit accepts unusual parameters like EcdsaP384Sha256. Some certifictes(below certs are in /etc/ssl/certs/ca-certificates.crt on Ubuntu 22.04) use EcdsaP384Sha256 to sign itself. - Subject: C=GR, L=Athens, O=Hellenic Academic and Research Institutions Cert. Authority, CN=Hellenic Academic and Research Institutions ECC RootCA 2015 - Subject: C=US, ST=Texas, L=Houston, O=SSL Corporation, CN=SSL.com EV Root Certification Authority ECC - Subject: C=US, ST=Texas, L=Houston, O=SSL Corporation, CN=SSL.com Root Certification Authority ECC In verify(), hash array `h` is allocated to be larger than the scalar.encoded_length. The array is regarded as big-endian. Hash values are filled in the back of the array and the rest bytes in front are filled with zero. In sign(), the hash array is allocated and filled as same as verify(). In deterministicScalar(), hash bytes are insufficient to generate `k` To generate `k` without narrowing its value range, this commit uses algorithm stage h. in "Section 3.2 Generation of k" in RFC6979.	2022-10-26 13:18:06 +02:00
Frank Denis	22b71b1376	crypto/bcrypt: don't reimplement base64, just use a custom alphabet Now that std.base64 supports everything bcrypt needs to encode its parameters, we don't need to include another implementation.	2022-10-25 21:52:03 -07:00
Matheus C. França	b41b35f578	crypto/benchmark - replace testing allocator Fix error: Cannot use testing allocator outside of test block	2022-10-20 14:04:59 +03:00
Andrew Kelley	d3d24874c9	std: remove deprecated API for the upcoming release See #3811	2022-09-16 14:46:53 -04:00
Veikka Tuominen	62ff8871ed	stage2+stage1: remove type parameter from bit builtins Closes #12529 Closes #12511 Closes #6835	2022-08-22 11:19:20 +03:00
Veikka Tuominen	19af4b5cad	std: add workaround for stage2 bug	2022-08-09 16:19:55 +03:00
Frank Denis	fa321a07cd	crypto.sign.ed25519: include a context string in blind key signatures (#12316 ) The next revision of the specification is going to include a context string in the way blinded scalars are computed. See: https://github.com/cfrg/draft-irtf-cfrg-signature-key-blinding/issues/30#issuecomment-1180516152 https://github.com/cfrg/draft-irtf-cfrg-signature-key-blinding/pull/37	2022-08-03 15:25:15 +02:00
InKryption	a0d3a87ce1	std.fmt: require specifier for unwrapping ?T and E!T	2022-07-26 11:25:49 -07:00
r00ster	cff5d9c805	std.mem: add `first` method to `SplitIterator` and `SplitBackwardsIterator`	2022-07-25 22:04:30 +03:00
Andrew Kelley	934573fc5d	Revert "std.fmt: require specifier for unwrapping ?T and E!T." This reverts commit 7cbd586ace46a8e8cebab660ebca3cfc049305d9. This is causing a fail to build from source: ``` ./lib/std/fmt.zig:492:17: error: cannot format optional without a specifier (i.e. {?} or {any}) @compileError("cannot format optional without a specifier (i.e. {?} or {any})"); ^ ./src/link/MachO/Atom.zig:544:26: note: called from here log.debug(" RELA({s}) @ {x} => %{d} in object({d})", .{ ^ ``` I looked at the code to fix it but none of those args are optionals.	2022-07-24 11:50:10 -07:00
InKryption	7cbd586ace	std.fmt: require specifier for unwrapping ?T and E!T. Co-authored-by: Veikka Tuominen <git@vexu.eu>	2022-07-24 12:01:56 +03:00
Frank Denis	6f0807f50f	crypto.sign.ed25519: add support for blind key signatures (#11868 ) Key blinding allows public keys to be augmented with a secret scalar, making multiple signatures from the same signer unlinkable. https://datatracker.ietf.org/doc/draft-dew-cfrg-signature-key-blinding/ This is required by privacy-preserving applications such as Tor onion services and the PrivacyPass protocol.	2022-07-08 13:21:37 +02:00

1 2 3 4 5 ...

336 Commits