]> git.ipfire.org Git - thirdparty/zstd.git/commit
deeper prefetching pipeline for decompressSequencesLong 2614/head
authorYann Collet <yann.collet.73@gmail.com>
Wed, 5 May 2021 17:04:03 +0000 (10:04 -0700)
committerYann Collet <yann.collet.73@gmail.com>
Wed, 5 May 2021 17:04:03 +0000 (10:04 -0700)
commit7ef6d7b36ca34eb4adef6f9780b0953d51643bb7
tree474d6fa1be37f1702cce3110b2a59fa349331178
parent455fd1a067cd1cac7c33ee58b886a3a365a1dd18
deeper prefetching pipeline for decompressSequencesLong

pipeline increased from 4 to 8 slots.
This change substantially improves decompression speed when there are long distance offsets.
example with enwik9 compressed at level 22 :
gcc-9 : 947 -> 1039 MB/s
clang-10: 884 -> 946 MB/s

I also checked the "cold dictionary" scenario,
and found a smaller benefit, around ~2%
(measurements are more noisy for this scenario).
lib/decompress/zstd_decompress_block.c