Baldur Karlsson [Tue, 13 Mar 2018 20:02:21 +0000 (20:02 +0000)]
Remove non-ASCII characters in header file comments
* Replaced a non-breaking space and an en dash with a plain space and
a hyphen.
* This means the files are simple ASCII and less likely to run into
codepage issues.
Yann Collet [Mon, 5 Mar 2018 21:50:07 +0000 (13:50 -0800)]
fix benchmark issue when measuring only decoding speed
The zstd bench module can focus on decompression speed _only_.
This is useful when trying to measure performance
on large input data compressed at a high level,
where compression time becomes problematic (too long).
This mode is triggered by the command: zstd -b -d
The problem was: in this mode,
measured decoding speed was > 10% slower
than in nominal mode (compression + decompression),
making the decompression benchmark mode much less useful.
This patch fixes the issue.
It's not completely clear why, but
moving the `memcpy()` operation sooner in the pipeline fixed it.
I can still measure some difference, but it is in the < 2% range,
so it's much more tolerable.
Also: it no longer matters in which order the
`-b` and `-d` commands are selected;
the combination always triggers bench_decodeOnly mode.
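For illustration, a minimal sketch of the decode-only measurement (the names below are hypothetical, not the actual bench.c code): the copy of the compressed data into the working buffer is done before the clock starts, so the timed region contains only `ZSTD_decompress()` calls.

```c
#include <string.h>
#include <time.h>
#include <zstd.h>

static double benchDecodeOnly_sketch(void* dstBuffer, size_t dstCapacity,
                                     const void* srcBuffer, size_t srcSize,
                                     void* workBuffer, unsigned nbRounds)
{
    clock_t begin, end;
    unsigned r;

    /* Moved out of the timed loop: copying the compressed input into the
     * working buffer is setup work, not decompression work. */
    memcpy(workBuffer, srcBuffer, srcSize);

    begin = clock();
    for (r = 0; r < nbRounds; r++) {
        size_t const dSize = ZSTD_decompress(dstBuffer, dstCapacity, workBuffer, srcSize);
        if (ZSTD_isError(dSize)) return -1.0;   /* signal error to caller */
    }
    end = clock();

    return ((double)(end - begin) / CLOCKS_PER_SEC) / nbRounds;
}
```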
Silence a Coverity warning about 'windowSize' being uninitialized.
(Yes, nothing that calls this routine actually uses the windowSize
value. Still, appeasing Coverity is pretty harmless in this case.)
Yann Collet [Mon, 26 Feb 2018 22:52:23 +0000 (14:52 -0800)]
minor cleaning of huff0
Update code documentation, and properly name a few "magic constants".
Also, HUF_compress_internal() gets a cleaner way
to determine the size of the tables inside its workspace.
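As an illustration of the workspace sizing idea (hypothetical types and names, not the actual huf_compress.c layout): describing the workspace as a struct lets table sizes come from `sizeof()` instead of hand-counted constants.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical sketch, not the actual huf_compress.c layout: the workspace
 * is described as a struct, so table sizes and offsets follow from the type
 * instead of hand-counted "magic constants". */
typedef struct {
    unsigned count[256];            /* symbol frequencies */
    unsigned short nodeTable[512];  /* tree-building scratch space */
} HUF_tables_sketch_t;

static void* HUF_initWorkspace_sketch(void* workSpace, size_t wkspSize)
{
    if (wkspSize < sizeof(HUF_tables_sketch_t)) return NULL;   /* workspace too small */
    memset(workSpace, 0, sizeof(HUF_tables_sketch_t));
    return workSpace;   /* caller uses it as HUF_tables_sketch_t* */
}
```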
Nick Terrell [Wed, 21 Feb 2018 03:34:43 +0000 (19:34 -0800)]
Split block compressor out of long range matcher
* `ZSTD_ldm_generateSequences()` generates the LDM sequences and
stores them in a table. It should work with any chunk size, but
is currently only called one block at a time.
* `ZSTD_ldm_blockCompress()` emits the pre-defined sequences, and
instead of encoding the literals directly, it passes them to a
secondary block compressor (see the sketch after this list).
The code to handle chunk sizes greater than the block size is
currently commented out, since it is unused.
The next PR will uncomment and exercise this code.
* During optimal parsing, ensure LDM `minMatchLength` is at least
`targetLength`. Also don't emit repcode matches in the LDM block
compressor. Enabling the LDM with the optimal parser now actually improves
the compression ratio.
* The compression ratio is very similar to before. It is very slightly
different, because the repcode handling is slightly different. If I remove
immediate repcode checking in both branches, the compressed size is exactly
the same.
* The speed looks to be the same or better than before.
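To make the split concrete, here is a simplified sketch of the second phase; the types and names are illustrative and do not match the actual zstd_ldm.c declarations.

```c
#include <stddef.h>

/* Simplified sketch; these types and names are illustrative only. */
typedef struct { size_t litLength; size_t matchLength; size_t offset; } rawSeq_sketch;
typedef struct { const rawSeq_sketch* seq; size_t size; } seqStore_sketch;

/* Literals between matches are handed to a secondary block compressor
 * instead of being encoded by the LDM itself. */
typedef void (*literalCompressor)(void* state, const unsigned char* lit, size_t litSize);

static void LDM_blockCompress_sketch(const seqStore_sketch* seqs,
                                     literalCompressor onLiterals, void* state,
                                     const unsigned char* src, size_t srcSize)
{
    const unsigned char* ip = src;
    size_t i;
    for (i = 0; i < seqs->size; i++) {
        const rawSeq_sketch s = seqs->seq[i];
        onLiterals(state, ip, s.litLength);   /* delegate literals */
        /* ... emit the (offset, matchLength) pair for this sequence ... */
        ip += s.litLength + s.matchLength;
    }
    onLiterals(state, ip, (size_t)(src + srcSize - ip));   /* trailing literals */
}
```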
Up Next (in a separate PR)
--------------------------
Allow sequence generation to happen prior to compression, and produce more
than a block worth of sequences. Expose some API for zstdmt to consume.
This will test out some currently untested code in
`ZSTD_ldm_blockCompress()`.
Yann Collet [Thu, 22 Feb 2018 07:52:45 +0000 (23:52 -0800)]
Implemented BMI2 functions directly within huf_decompress.c
This makes it easier to edit for maintenance and evolutions
(I plan to experiment with modifications to the Huffman decompression functions).
The methodology followed seems broadly applicable to other BMI2 modules.
Performance was tracked rigorously at each step;
there is no noticeable loss (nor win) of performance compared to the `#include` version.
Note however that 4X decoder variants tend to be extremely sensitive to code alignment.
This source code resulted in pretty good performance for gcc 7.2 and 7.3,
but future changes (even in other parts of the code) might trigger the issue again.
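The pattern, roughly (illustrative names, assuming GCC/Clang on x86_64; this is not the actual huf_decompress.c code): the hot loop is written once and expanded into two functions, one compiled with default flags and one with the "bmi2" target attribute, with a runtime flag selecting between them.

```c
#include <stddef.h>
#include <string.h>

/* The hot loop, written once (here a trivial stand-in for the decode loop). */
#define HOT_LOOP_BODY(dst, dstSize, src, srcSize) do {                      \
        size_t const n = (srcSize) < (dstSize) ? (srcSize) : (dstSize);     \
        memcpy((dst), (src), n);                                            \
        return n;                                                           \
    } while (0)

static size_t decode_default(void* dst, size_t dstSize, const void* src, size_t srcSize)
{
    HOT_LOOP_BODY(dst, dstSize, src, srcSize);
}

#if defined(__GNUC__) && defined(__x86_64__)
__attribute__((target("bmi2")))
static size_t decode_bmi2(void* dst, size_t dstSize, const void* src, size_t srcSize)
{
    HOT_LOOP_BODY(dst, dstSize, src, srcSize);   /* same body, compiled with BMI2 enabled */
}
#endif

/* Runtime dispatch: 'bmi2' is detected once (e.g. via cpuid) and passed down. */
size_t decode_internal(void* dst, size_t dstSize, const void* src, size_t srcSize, int bmi2)
{
#if defined(__GNUC__) && defined(__x86_64__)
    if (bmi2) return decode_bmi2(dst, dstSize, src, srcSize);
#endif
    (void)bmi2;
    return decode_default(dst, dstSize, src, srcSize);
}
```

Keeping both variants in the same translation unit is what makes them easy to edit together, at the cost of the code-alignment sensitivity noted above.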
Yann Collet [Tue, 20 Feb 2018 22:48:09 +0000 (14:48 -0800)]
improve benchmark measurement for small inputs
by invoking time() once per batch, instead of once per compression / decompression run.
The batch is dynamically resized so that each round lasts approximately 1 second.
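Roughly, the measurement loop looks like this (an illustrative sketch, not the actual bench.c code; clock() stands in for the timer used there):

```c
#include <time.h>

static double timePerRun_sketch(void (*run)(void* payload), void* payload)
{
    unsigned batchSize = 1;
    for (;;) {
        unsigned i;
        clock_t const begin = clock();   /* one timer call per batch, not per run */
        for (i = 0; i < batchSize; i++) run(payload);
        {   double const elapsedSec = (double)(clock() - begin) / CLOCKS_PER_SEC;
            if (elapsedSec >= 1.0) return elapsedSec / batchSize;
            batchSize *= 2;   /* round too short to measure reliably: grow the batch and retry */
        }
    }
}
```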
Yann Collet [Wed, 7 Feb 2018 22:22:35 +0000 (14:22 -0800)]
Merged ZSTD_preserveUnsortedMark() into ZSTD_reduceIndex()
as it's faster, due to one memory scan instead of two
(confirmed by microbenchmark).
Note: as ZSTD_reduceIndex() is rarely invoked,
the speedup does not translate into a visible gain.
Consider it an exercise in auto-vectorization and micro-benchmarking.
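The fused scan looks roughly like this (an illustrative sketch, not the actual ZSTD_reduceIndex() code): the marker check and the index reduction happen in the same pass over the table, which gives the compiler a chance to auto-vectorize it.

```c
typedef unsigned int U32;

/* Illustrative sketch: one pass both preserves the special "unsorted" marker
 * and applies the index reduction, instead of two separate table scans. */
static void reduceTable_sketch(U32* table, U32 size, U32 reducerValue, U32 unsortedMark)
{
    U32 u;
    for (u = 0; u < size; u++) {
        U32 const idx      = table[u];
        U32 const keepMark = (idx == unsortedMark);                  /* preserve the marker */
        U32 const reduced  = (idx < reducerValue) ? 0 : idx - reducerValue;
        table[u] = keepMark ? idx : reduced;
    }
}
```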
Mileage varies, depending on file and cpu type.
But a general rule is: x86 benefits less from "long-offset mode" than x64,
maybe due to register pressure.
On "entropy", long-mode is _never_ a win for x86.
On my laptop though, it may be, depending on file and compression level
(enwik8 benefits more from "long-mode" than silesia).