]> git.ipfire.org Git - thirdparty/zlib-ng.git/commit
Small optimization in 256 bit wide chunkset
authorAdam Stylinski <kungfujesus06@gmail.com>
Tue, 23 Dec 2025 23:58:10 +0000 (18:58 -0500)
committerHans Kristian Rosbach <hk-github@circlestorm.org>
Sat, 27 Dec 2025 22:55:09 +0000 (23:55 +0100)
commit67b3edfd01b42cb4bba5c20fd39fbe2ad00fcf22
tree2ca9dd8e258a1938f4f0f90eb2a27cca925732da
parent84b46aada25f882a0bdda59e90867e25a9a407e3
Small optimization in 256 bit wide chunkset

It turns out Intel only parses the bottom 4 bits of the shuffle vector.
This makes it already a sufficient permutation vector and saves us a
small bit of latency.
arch/x86/chunkset_avx2.c
arch/x86/chunkset_avx512.c