zig/rand at 1c7095cb7dfcba3537edf3624a61046c9b772b1f - zig

mirror of https://github.com/ziglang/zig.git synced 2025-12-06 06:13:07 +00:00

std.crypto.chacha: support larger vectors on AVX2 and AVX512 targets (#15809)

Ryzen 7 7700, ChaCha20/8 stream, long outputs:

Generic: 3268 MiB/s
AVX2   : 6023 MiB/s
AVX512 : 8086 MiB/s

Bump the rand.chacha buffer a tiny bit to take advantage of this.
More than 8 blocks doesn't seem to make any measurable difference.

ChaChaPoly also gets a small performance boost from this, although
Poly1305 remains the bottleneck.

Generic:  707 MiB/s
AVX2   :  981 MiB/s
AVX512 : 1202 MiB/s

aarch64 appears to generally benefit from 4-way vectorization.

Verified on Apple Silicon and on a Cortex-A72.

2023-05-22 20:33:35 +02:00
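
The target-dependent vector width described in the commit message above can be sketched as a compile-time choice. This is a minimal illustration, not the code in ChaCha.zig: the helper `rowsPerVector`, the constants `rows` and `Lanes`, and the specific degrees picked for AVX2 and AVX512 are assumptions made for the example. Only the 4-way figure for aarch64 comes from the commit message.

```zig
const std = @import("std");
const builtin = @import("builtin");

/// Hypothetical helper, not the std.crypto API: number of 128-bit ChaCha
/// rows packed into one SIMD vector for the current compilation target.
fn rowsPerVector() usize {
    return switch (builtin.cpu.arch) {
        .x86_64 => blk: {
            const features = builtin.cpu.features;
            // Assumption: a 512-bit register holds four 128-bit rows,
            // a 256-bit register holds two.
            if (std.Target.x86.featureSetHas(features, .avx512f)) break :blk 4;
            if (std.Target.x86.featureSetHas(features, .avx2)) break :blk 2;
            break :blk 1;
        },
        // The commit message reports that aarch64 benefits from 4-way vectorization.
        .aarch64 => 4,
        else => 1,
    };
}

// Evaluated at compile time because it initializes a container-level constant.
const rows = rowsPerVector();

/// One vector holds `rows` ChaCha rows of four 32-bit words each.
const Lanes = @Vector(4 * rows, u32);

pub fn main() void {
    std.debug.print("rows per vector: {d}, vector size: {d} bytes\n", .{ rows, @sizeOf(Lanes) });
}
```

Building the sketch with different `-mcpu` settings (for example a baseline x86_64 CPU versus one with AVX2) changes `rows` without any runtime dispatch, which is the general shape of a compile-time width selection.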

Ascon.zig

Remove Gimli and Xoodoo from the standard library (#14928)

2023-03-21 04:54:10 +00:00

benchmark.zig

Remove Gimli and Xoodoo from the standard library (#14928)

2023-03-21 04:54:10 +00:00

ChaCha.zig

std.crypto.chacha: support larger vectors on AVX2 and AVX512 targets (#15809)

2023-05-22 20:33:35 +02:00

Isaac64.zig

update codebase to use @memset and @memcpy