]> git.ipfire.org Git - thirdparty/glibc.git/commit
powerpc: Optimized st{r,p}ncpy for POWER8/PPC64
authorAdhemerval Zanella <azanella@linux.vnet.ibm.com>
Wed, 31 Dec 2014 16:47:41 +0000 (11:47 -0500)
committerAdhemerval Zanella <azanella@linux.vnet.ibm.com>
Wed, 14 Jan 2015 12:58:02 +0000 (07:58 -0500)
commita38f68f12fd03374d599eeb0b6943e50b0ff7348
treee8a235f5f34fcb66ff5ff656b357bbd48dfa3a0e
parent4242356131256e54ca3e96b0c6f2af773b7a69c8
powerpc: Optimized st{r,p}ncpy for POWER8/PPC64

This patch adds an optimized POWER8 st{r,p}ncpy using unaligned accesses.
It shows 10%-80% improvement over the optimized POWER7 one that uses
only aligned accesses, specially on unaligned inputs.

The algorithm first read and check 16 bytes (if inputs do not cross a 4K
page size).  The it realign source to 16-bytes and issue a 16 bytes read
and compare loop to speedup null byte checks for large strings.  Also,
different from POWER7 optimization, the null pad is done inline in the
implementation using possible unaligned accesses, instead of realying on
a memset call.  Special case is added for page cross reads.
ChangeLog
NEWS
sysdeps/powerpc/powerpc64/multiarch/Makefile
sysdeps/powerpc/powerpc64/multiarch/ifunc-impl-list.c
sysdeps/powerpc/powerpc64/multiarch/stpncpy-power8.S [new file with mode: 0644]
sysdeps/powerpc/powerpc64/multiarch/stpncpy.c
sysdeps/powerpc/powerpc64/multiarch/strncpy-power8.S [new file with mode: 0644]
sysdeps/powerpc/powerpc64/multiarch/strncpy.c
sysdeps/powerpc/powerpc64/power8/stpncpy.S [new file with mode: 0644]
sysdeps/powerpc/powerpc64/power8/strncpy.S [new file with mode: 0644]