]> git.ipfire.org Git - thirdparty/glibc.git/commit - sysdeps/x86_64/multiarch/memrchr-avx2.S
x86-64: Optimize memrchr with AVX2
authorH.J. Lu <hjl.tools@gmail.com>
Fri, 9 Jun 2017 12:44:30 +0000 (05:44 -0700)
committerH.J. Lu <hjl.tools@gmail.com>
Fri, 9 Jun 2017 12:44:41 +0000 (05:44 -0700)
commit5ac7aa1d7cce8580f8225c33c819991abca102b9
treea03a58010d692384042b6430b288e8197d77f1b4
parent8fe57365bfb5a417d911ab715a5671b3b1d7b155
x86-64: Optimize memrchr with AVX2

Optimize memrchr with AVX2 to search 32 bytes with a single vector
compare instruction.  It is as fast as SSE2 memrchr for small data
sizes and up to 1X faster for large data sizes on Haswell.  Select
AVX2 memrchr on AVX2 machines where vzeroupper is preferred and AVX
unaligned load is fast.

* sysdeps/x86_64/multiarch/Makefile (sysdep_routines): Add
memrchr-sse2 and memrchr-avx2.
* sysdeps/x86_64/multiarch/ifunc-impl-list.c
(__libc_ifunc_impl_list): Add tests for __memrchr_avx2 and
__memrchr_sse2.
* sysdeps/x86_64/multiarch/memrchr-avx2.S: New file.
* sysdeps/x86_64/multiarch/memrchr-sse2.S: Likewise.
* sysdeps/x86_64/multiarch/memrchr.c: Likewise.
ChangeLog
sysdeps/x86_64/multiarch/Makefile
sysdeps/x86_64/multiarch/ifunc-impl-list.c
sysdeps/x86_64/multiarch/memrchr-avx2.S [new file with mode: 0644]
sysdeps/x86_64/multiarch/memrchr-sse2.S [new file with mode: 0644]
sysdeps/x86_64/multiarch/memrchr.c [new file with mode: 0644]