Frank Denis 6669885aa2
Faster BLAKE3 implementation (#25574)
This is a rewrite of the BLAKE3 implementation, with vectorization.

On Apple Silicon, the new implementation is about twice as fast as the previous one.

With AVX2, it is more than 4 times faster.

With AVX512, it is more than 7.5 times faster than the previous implementation (from 678 MB/s to 5086 MB/s).
2025-10-15 14:03:56 +02:00