git.ipfire.org Git - thirdparty/glibc.git/commit
AArch64: Improve codegen of AdvSIMD expf family
author Joana Cruz <Joana.Cruz@arm.com>
Tue, 17 Dec 2024 14:50:33 +0000 (14:50 +0000)
committer Wilco Dijkstra <wilco.dijkstra@arm.com>
Tue, 17 Dec 2024 15:28:22 +0000 (15:28 +0000)
commit cff9648d0b50d19cdaf685f6767add040d4e1a8e
tree a39793fc33b1f96bd1f03f5b966aedb7338a1c66
parent 6914774b9d3460876d9ad4482782213ec01a752e

Load the polynomial evaluation coefficients into 2 vectors and use lanewise MLAs.
Also use intrinsics instead of native operations.
Throughput microbenchmark improvements on Neoverse V1: expf 3%, exp2f 5%,
exp10f 13%, coshf 14%.
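
The idea of packing coefficients into two vectors and feeding them to lanewise MLAs can be sketched roughly as below. This is an illustration only, not the code from this commit: the function name `poly8_eval`, the eight Taylor coefficients, the Estrin-style pairing, and the scalar fallback for non-AArch64 builds are all assumptions made for the example.

```c
#include <assert.h>
#include <stddef.h>

#if defined(__aarch64__)
#include <arm_neon.h>
#endif

/* Illustrative coefficients only: Taylor terms 1/k! of e^x, k = 0..7.
   These are NOT the constants used by glibc.  */
static const float poly_coeffs[8] = {
  1.0f, 1.0f, 1.0f / 2, 1.0f / 6, 1.0f / 24, 1.0f / 120, 1.0f / 720,
  1.0f / 5040
};

/* Hypothetical helper: evaluate p(x) = c0 + c1*x + ... + c7*x^7 for n
   inputs, where n must be a multiple of 4.  */
void
poly8_eval (const float *x, float *out, size_t n)
{
  assert (n % 4 == 0);
#if defined(__aarch64__)
  /* All eight coefficients live in just two vector registers.  */
  float32x4_t lo = vld1q_f32 (poly_coeffs);
  float32x4_t hi = vld1q_f32 (poly_coeffs + 4);
  /* The addend of an FMA must be a full vector, so the even coefficients
     are broadcast once, outside the loop.  */
  float32x4_t c0 = vdupq_laneq_f32 (lo, 0);
  float32x4_t c2 = vdupq_laneq_f32 (lo, 2);
  float32x4_t c4 = vdupq_laneq_f32 (hi, 0);
  float32x4_t c6 = vdupq_laneq_f32 (hi, 2);
  for (size_t i = 0; i < n; i += 4)
    {
      float32x4_t r = vld1q_f32 (x + i);
      float32x4_t r2 = vmulq_f32 (r, r);
      float32x4_t r4 = vmulq_f32 (r2, r2);
      /* Lanewise FMA: a + b * v[lane].  The odd coefficients are read
	 straight from a lane, with no separate broadcast.  */
      float32x4_t p01 = vfmaq_laneq_f32 (c0, r, lo, 1);
      float32x4_t p23 = vfmaq_laneq_f32 (c2, r, lo, 3);
      float32x4_t p45 = vfmaq_laneq_f32 (c4, r, hi, 1);
      float32x4_t p67 = vfmaq_laneq_f32 (c6, r, hi, 3);
      /* Combine the pairs: p = (p01 + r2*p23) + r4*(p45 + r2*p67).  */
      float32x4_t p03 = vfmaq_f32 (p01, r2, p23);
      float32x4_t p47 = vfmaq_f32 (p45, r2, p67);
      vst1q_f32 (out + i, vfmaq_f32 (p03, r4, p47));
    }
#else
  /* Scalar fallback with the same evaluation order, so the sketch also
     builds on other targets.  */
  const float *c = poly_coeffs;
  for (size_t i = 0; i < n; i++)
    {
      float r = x[i], r2 = r * r, r4 = r2 * r2;
      float p03 = (c[0] + c[1] * r) + r2 * (c[2] + c[3] * r);
      float p47 = (c[4] + c[5] * r) + r2 * (c[6] + c[7] * r);
      out[i] = p03 + r4 * p47;
    }
#endif
}
```

In this sketch the lane form removes the need for one broadcast register per odd coefficient, which is the kind of register and load saving the commit message points at.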

Reviewed-by: Wilco Dijkstra <Wilco.Dijkstra@arm.com>
sysdeps/aarch64/fpu/coshf_advsimd.c
sysdeps/aarch64/fpu/exp10f_advsimd.c
sysdeps/aarch64/fpu/exp2f_advsimd.c
sysdeps/aarch64/fpu/expf_advsimd.c
sysdeps/aarch64/fpu/v_expf_inline.h