]> git.ipfire.org Git - thirdparty/glibc.git/commit
x86: Optimize str{n}casecmp TOLOWER logic in strcmp-sse42.S
authorNoah Goldstein <goldstein.w.n@gmail.com>
Wed, 23 Mar 2022 21:57:38 +0000 (16:57 -0500)
committerNoah Goldstein <goldstein.w.n@gmail.com>
Fri, 25 Mar 2022 16:46:13 +0000 (11:46 -0500)
commitd154758e618ec9324f5d339c46db0aa27e8b1226
tree1ab03f58b477730088bdb512e15b315196403047
parent670b54bc585ea4a94f3b2e9272ba44aa6b730b73
x86: Optimize str{n}casecmp TOLOWER logic in strcmp-sse42.S

Slightly faster method of doing TOLOWER that saves an
instruction.

Also replace the hard coded 5-byte no with .p2align 4. On builds with
CET enabled this misaligned entry to strcasecmp.

geometric_mean(N=40) of all benchmarks New / Original: .920

All string/memory tests pass.
Reviewed-by: H.J. Lu <hjl.tools@gmail.com>
sysdeps/x86_64/multiarch/strcmp-sse42.S