]> git.ipfire.org Git - thirdparty/glibc.git/commit
Added memset optimized with AVX512 for KNL hardware.
authorAndrew Senkevich <andrew.senkevich@intel.com>
Fri, 18 Dec 2015 23:47:28 +0000 (02:47 +0300)
committerAndrew Senkevich <andrew.senkevich@intel.com>
Fri, 18 Dec 2015 23:47:28 +0000 (02:47 +0300)
commit83d776f979342f923b5c3d2a5b43afab841c6086
tree180682939f88351b00817f2092e24817ddbdf07f
parent794950ed1d29853158d783d57f72260f5665afe5
Added memset optimized with AVX512 for KNL hardware.

It shows improvement up to 28% over AVX2 memset (performance results
attached at <https://sourceware.org/ml/libc-alpha/2015-12/msg00052.html>).

    * sysdeps/x86_64/multiarch/memset-avx512-no-vzeroupper.S: New file.
    * sysdeps/x86_64/multiarch/Makefile (sysdep_routines): Added new file.
    * sysdeps/x86_64/multiarch/ifunc-impl-list.c: Added new tests.
    * sysdeps/x86_64/multiarch/memset.S: Added new IFUNC branch.
    * sysdeps/x86_64/multiarch/memset_chk.S: Likewise.
    * sysdeps/x86/cpu-features.h (bit_Prefer_No_VZEROUPPER,
    index_Prefer_No_VZEROUPPER): New.
    * sysdeps/x86/cpu-features.c (init_cpu_features): Set the
    Prefer_No_VZEROUPPER for Knights Landing.
ChangeLog
sysdeps/x86/cpu-features.c
sysdeps/x86/cpu-features.h
sysdeps/x86_64/multiarch/Makefile
sysdeps/x86_64/multiarch/ifunc-impl-list.c
sysdeps/x86_64/multiarch/memset-avx512-no-vzeroupper.S [new file with mode: 0644]
sysdeps/x86_64/multiarch/memset.S
sysdeps/x86_64/multiarch/memset_chk.S