From: Konstantinos Margaritis Date: Thu, 23 May 2024 06:38:24 +0000 (+0300) Subject: Speed up truffle with 256b TBL instructions (#290) X-Git-Tag: vectorscan/5.4.12^2~31 X-Git-Url: http://git.ipfire.org/cgi-bin/gitweb.cgi?a=commitdiff_plain;h=0e271ccf9a102d88fac2090938228bd5bd5f95ad;p=thirdparty%2Fvectorscan.git Speed up truffle with 256b TBL instructions (#290) 256b wide SVE vectors allow some simplification of truffle. Up to 40% speedup on graviton3. Going from 12500 MB/s to 17000 MB/s onhe microbenchmark. SVE2 also offer this capability for 128b vector with a speedup around 25% compared to normal SVE Add unit tests and benchmark for this wide variant --- 0e271ccf9a102d88fac2090938228bd5bd5f95ad