git.ipfire.org Git - thirdparty/zlib-ng.git/commit

]> git.ipfire.org Git - thirdparty/zlib-ng.git/commit

projects / thirdparty / zlib-ng.git / commit

author	Nathan Moin Vaziri <nathan@nathanm.com>
	Tue, 31 Mar 2026 20:12:33 +0000 (13:12 -0700)
committer	Hans Kristian Rosbach <hk-github@circlestorm.org>
	Mon, 22 Jun 2026 18:03:57 +0000 (20:03 +0200)
commit	9071377c5926189c4ee58a1072b554a202e65ead
tree	b33a1b6cd3badf24b8452a534136e293dc35becb	tree \| snapshot
parent	b1e704fef333371754831d0bac4f7c6a0a2f3400	commit \| diff

Use vaddvq_u32 for adler32 NEON horizontal reduction

Replace interleaved pairwise reduction with vaddvq_u32 to break the
dependency chain between s1 and s2 modulo computations. The original
code merged both accumulators through a shared addp, serializing the
subsequent umull/lsr/msub chains. Independent reductions allow them
to execute in parallel.

On AArch64 this maps to the ADDV instruction. A compatibility shim
in neon_intrins.h emulates this on 32-bit ARM using vadd and vpadd.

arch/arm/adler32_neon.c		diff \| blob \| blame \| history
arch/arm/neon_intrins.h		diff \| blob \| blame \| history

Mirror of https://github.com/zlib-ng/zlib-ng.git

RSS Atom