]> git.ipfire.org Git - thirdparty/glibc.git/commit
x86: Optimize str{n}casecmp TOLOWER logic in strcmp-sse42.S
authorNoah Goldstein <goldstein.w.n@gmail.com>
Wed, 23 Mar 2022 21:57:38 +0000 (16:57 -0500)
committerSunil K Pandey <skpgkp2@gmail.com>
Tue, 17 May 2022 03:45:07 +0000 (20:45 -0700)
commitc523a2b1cc524fc689b4e3b04c013da57b9739d5
tree6a9b75c4db54cdf7dbffb19c05755f84ee0b8714
parent82e29f43496bf8a66de5680622ccd6e9e75fbbca
x86: Optimize str{n}casecmp TOLOWER logic in strcmp-sse42.S

Slightly faster method of doing TOLOWER that saves an
instruction.

Also replace the hard coded 5-byte no with .p2align 4. On builds with
CET enabled this misaligned entry to strcasecmp.

geometric_mean(N=40) of all benchmarks New / Original: .920

All string/memory tests pass.
Reviewed-by: H.J. Lu <hjl.tools@gmail.com>
(cherry picked from commit d154758e618ec9324f5d339c46db0aa27e8b1226)
sysdeps/x86_64/multiarch/strcmp-sse42.S