git.ipfire.org Git - thirdparty/vectorscan.git/commit
Speed up truffle with 256b TBL instructions (#290)
author Konstantinos Margaritis <markos@users.noreply.github.com>
Thu, 23 May 2024 06:38:24 +0000 (09:38 +0300)
committer GitHub <noreply@github.com>
Thu, 23 May 2024 06:38:24 +0000 (09:38 +0300)
commit 0e271ccf9a102d88fac2090938228bd5bd5f95ad
tree 3e57d41f80ca2261cf9e4ea40a570fcb6cd69b73
parent b312112e872d5c9c2a70762847c3bd76789dfdcb
parent 938c026256e344521a6dea4e4a6c509ebc907a16

256b wide SVE vectors allow some simplification of truffle. Up to 40%
speedup on Graviton3, going from 12500 MB/s to 17000 MB/s on the
microbenchmark.
SVE2 also offers this capability for 128b vectors, with a speedup of around
25% compared to normal SVE.

Add unit tests and benchmark for this wide variant
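
To illustrate the kind of simplification a 32-entry table lookup makes possible, here is a minimal scalar sketch. It is not the code from this commit. It assumes the classic truffle layout of two 16-byte tables selected by the top bit of each input byte, and a hypothetical wide layout with a single 32-byte table indexed by the low 5 bits. All names (`NarrowMasks`, `WideMask`, `build_narrow`, `build_wide`, `match_narrow`, `match_wide`, `models_agree`) are made up for this example.

```cpp
#include <array>
#include <cassert>  // used by the checks in the usage note
#include <cstdint>
#include <string>

// Classic layout (assumed): two 16-byte tables, chosen by the top bit of the byte.
struct NarrowMasks {
    std::array<uint8_t, 16> highclear{};  // bytes 0x00..0x7f
    std::array<uint8_t, 16> highset{};    // bytes 0x80..0xff
};

// Hypothetical wide layout: one 32-byte table, as a 256-bit table lookup could index.
using WideMask = std::array<uint8_t, 32>;

NarrowMasks build_narrow(const std::string &chars) {
    NarrowMasks m;
    for (unsigned char c : chars) {
        auto &table = (c < 0x80) ? m.highclear : m.highset;
        // Low nibble picks the table entry, bits 4..6 pick the bit in that entry.
        table[c & 0x0f] |= static_cast<uint8_t>(1u << ((c >> 4) & 7));
    }
    return m;
}

WideMask build_wide(const std::string &chars) {
    WideMask m{};
    for (unsigned char c : chars) {
        // Low 5 bits pick the table entry, top 3 bits pick the bit in that entry.
        m[c & 0x1f] |= static_cast<uint8_t>(1u << (c >> 5));
    }
    return m;
}

bool match_narrow(const NarrowMasks &m, uint8_t c) {
    // Two tables: one must be selected per byte before the bit test.
    const auto &table = (c < 0x80) ? m.highclear : m.highset;
    return (table[c & 0x0f] >> ((c >> 4) & 7)) & 1;
}

bool match_wide(const WideMask &m, uint8_t c) {
    // One table: a single lookup followed by a bit test.
    return (m[c & 0x1f] >> (c >> 5)) & 1;
}

// Returns true when both models classify all 256 byte values identically.
bool models_agree(const std::string &chars) {
    const NarrowMasks n = build_narrow(chars);
    const WideMask w = build_wide(chars);
    for (int c = 0; c < 256; ++c) {
        const uint8_t b = static_cast<uint8_t>(c);
        if (match_narrow(n, b) != match_wide(w, b)) {
            return false;
        }
    }
    return true;
}
```

A check such as `assert(models_agree("abcxyz"));` confirms that both layouts accept exactly the same set of bytes. In this model the wide variant drops the per-byte table selection, which is the sort of saving a wider table lookup could provide. The real vectorized code processes many bytes per instruction and may be organised differently.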