]> git.ipfire.org Git - thirdparty/git.git/blob - Documentation/git-pack-objects.txt
Merge branch 'ea/blame-use-oideq'
[thirdparty/git.git] / Documentation / git-pack-objects.txt
1 git-pack-objects(1)
2 ===================
3
4 NAME
5 ----
6 git-pack-objects - Create a packed archive of objects
7
8
9 SYNOPSIS
10 --------
11 [verse]
12 'git pack-objects' [-q | --progress | --all-progress] [--all-progress-implied]
13 [--no-reuse-delta] [--delta-base-offset] [--non-empty]
14 [--local] [--incremental] [--window=<n>] [--depth=<n>]
15 [--revs [--unpacked | --all]] [--keep-pack=<pack-name>]
16 [--stdout [--filter=<filter-spec>] | base-name]
17 [--shallow] [--keep-true-parents] [--[no-]sparse] < object-list
18
19
20 DESCRIPTION
21 -----------
22 Reads list of objects from the standard input, and writes either one or
23 more packed archives with the specified base-name to disk, or a packed
24 archive to the standard output.
25
26 A packed archive is an efficient way to transfer a set of objects
27 between two repositories as well as an access efficient archival
28 format. In a packed archive, an object is either stored as a
29 compressed whole or as a difference from some other object.
30 The latter is often called a delta.
31
32 The packed archive format (.pack) is designed to be self-contained
33 so that it can be unpacked without any further information. Therefore,
34 each object that a delta depends upon must be present within the pack.
35
36 A pack index file (.idx) is generated for fast, random access to the
37 objects in the pack. Placing both the index file (.idx) and the packed
38 archive (.pack) in the pack/ subdirectory of $GIT_OBJECT_DIRECTORY (or
39 any of the directories on $GIT_ALTERNATE_OBJECT_DIRECTORIES)
40 enables Git to read from the pack archive.
41
42 The 'git unpack-objects' command can read the packed archive and
43 expand the objects contained in the pack into "one-file
44 one-object" format; this is typically done by the smart-pull
45 commands when a pack is created on-the-fly for efficient network
46 transport by their peers.
47
48
49 OPTIONS
50 -------
51 base-name::
52 Write into pairs of files (.pack and .idx), using
53 <base-name> to determine the name of the created file.
54 When this option is used, the two files in a pair are written in
55 <base-name>-<SHA-1>.{pack,idx} files. <SHA-1> is a hash
56 based on the pack content and is written to the standard
57 output of the command.
58
59 --stdout::
60 Write the pack contents (what would have been written to
61 .pack file) out to the standard output.
62
63 --revs::
64 Read the revision arguments from the standard input, instead of
65 individual object names. The revision arguments are processed
66 the same way as 'git rev-list' with the `--objects` flag
67 uses its `commit` arguments to build the list of objects it
68 outputs. The objects on the resulting list are packed.
69 Besides revisions, `--not` or `--shallow <SHA-1>` lines are
70 also accepted.
71
72 --unpacked::
73 This implies `--revs`. When processing the list of
74 revision arguments read from the standard input, limit
75 the objects packed to those that are not already packed.
76
77 --all::
78 This implies `--revs`. In addition to the list of
79 revision arguments read from the standard input, pretend
80 as if all refs under `refs/` are specified to be
81 included.
82
83 --include-tag::
84 Include unasked-for annotated tags if the object they
85 reference was included in the resulting packfile. This
86 can be useful to send new tags to native Git clients.
87
88 --window=<n>::
89 --depth=<n>::
90 These two options affect how the objects contained in
91 the pack are stored using delta compression. The
92 objects are first internally sorted by type, size and
93 optionally names and compared against the other objects
94 within --window to see if using delta compression saves
95 space. --depth limits the maximum delta depth; making
96 it too deep affects the performance on the unpacker
97 side, because delta data needs to be applied that many
98 times to get to the necessary object.
99 +
100 The default value for --window is 10 and --depth is 50. The maximum
101 depth is 4095.
102
103 --window-memory=<n>::
104 This option provides an additional limit on top of `--window`;
105 the window size will dynamically scale down so as to not take
106 up more than '<n>' bytes in memory. This is useful in
107 repositories with a mix of large and small objects to not run
108 out of memory with a large window, but still be able to take
109 advantage of the large window for the smaller objects. The
110 size can be suffixed with "k", "m", or "g".
111 `--window-memory=0` makes memory usage unlimited. The default
112 is taken from the `pack.windowMemory` configuration variable.
113
114 --max-pack-size=<n>::
115 In unusual scenarios, you may not be able to create files
116 larger than a certain size on your filesystem, and this option
117 can be used to tell the command to split the output packfile
118 into multiple independent packfiles, each not larger than the
119 given size. The size can be suffixed with
120 "k", "m", or "g". The minimum size allowed is limited to 1 MiB.
121 This option
122 prevents the creation of a bitmap index.
123 The default is unlimited, unless the config variable
124 `pack.packSizeLimit` is set.
125
126 --honor-pack-keep::
127 This flag causes an object already in a local pack that
128 has a .keep file to be ignored, even if it would have
129 otherwise been packed.
130
131 --keep-pack=<pack-name>::
132 This flag causes an object already in the given pack to be
133 ignored, even if it would have otherwise been
134 packed. `<pack-name>` is the pack file name without
135 leading directory (e.g. `pack-123.pack`). The option could be
136 specified multiple times to keep multiple packs.
137
138 --incremental::
139 This flag causes an object already in a pack to be ignored
140 even if it would have otherwise been packed.
141
142 --local::
143 This flag causes an object that is borrowed from an alternate
144 object store to be ignored even if it would have otherwise been
145 packed.
146
147 --non-empty::
148 Only create a packed archive if it would contain at
149 least one object.
150
151 --progress::
152 Progress status is reported on the standard error stream
153 by default when it is attached to a terminal, unless -q
154 is specified. This flag forces progress status even if
155 the standard error stream is not directed to a terminal.
156
157 --all-progress::
158 When --stdout is specified then progress report is
159 displayed during the object count and compression phases
160 but inhibited during the write-out phase. The reason is
161 that in some cases the output stream is directly linked
162 to another command which may wish to display progress
163 status of its own as it processes incoming pack data.
164 This flag is like --progress except that it forces progress
165 report for the write-out phase as well even if --stdout is
166 used.
167
168 --all-progress-implied::
169 This is used to imply --all-progress whenever progress display
170 is activated. Unlike --all-progress this flag doesn't actually
171 force any progress display by itself.
172
173 -q::
174 This flag makes the command not to report its progress
175 on the standard error stream.
176
177 --no-reuse-delta::
178 When creating a packed archive in a repository that
179 has existing packs, the command reuses existing deltas.
180 This sometimes results in a slightly suboptimal pack.
181 This flag tells the command not to reuse existing deltas
182 but compute them from scratch.
183
184 --no-reuse-object::
185 This flag tells the command not to reuse existing object data at all,
186 including non deltified object, forcing recompression of everything.
187 This implies --no-reuse-delta. Useful only in the obscure case where
188 wholesale enforcement of a different compression level on the
189 packed data is desired.
190
191 --compression=<n>::
192 Specifies compression level for newly-compressed data in the
193 generated pack. If not specified, pack compression level is
194 determined first by pack.compression, then by core.compression,
195 and defaults to -1, the zlib default, if neither is set.
196 Add --no-reuse-object if you want to force a uniform compression
197 level on all data no matter the source.
198
199 --[no-]sparse::
200 Toggle the "sparse" algorithm to determine which objects to include in
201 the pack, when combined with the "--revs" option. This algorithm
202 only walks trees that appear in paths that introduce new objects.
203 This can have significant performance benefits when computing
204 a pack to send a small change. However, it is possible that extra
205 objects are added to the pack-file if the included commits contain
206 certain types of direct renames. If this option is not included,
207 it defaults to the value of `pack.useSparse`, which is true unless
208 otherwise specified.
209
210 --thin::
211 Create a "thin" pack by omitting the common objects between a
212 sender and a receiver in order to reduce network transfer. This
213 option only makes sense in conjunction with --stdout.
214 +
215 Note: A thin pack violates the packed archive format by omitting
216 required objects and is thus unusable by Git without making it
217 self-contained. Use `git index-pack --fix-thin`
218 (see linkgit:git-index-pack[1]) to restore the self-contained property.
219
220 --shallow::
221 Optimize a pack that will be provided to a client with a shallow
222 repository. This option, combined with --thin, can result in a
223 smaller pack at the cost of speed.
224
225 --delta-base-offset::
226 A packed archive can express the base object of a delta as
227 either a 20-byte object name or as an offset in the
228 stream, but ancient versions of Git don't understand the
229 latter. By default, 'git pack-objects' only uses the
230 former format for better compatibility. This option
231 allows the command to use the latter format for
232 compactness. Depending on the average delta chain
233 length, this option typically shrinks the resulting
234 packfile by 3-5 per-cent.
235 +
236 Note: Porcelain commands such as `git gc` (see linkgit:git-gc[1]),
237 `git repack` (see linkgit:git-repack[1]) pass this option by default
238 in modern Git when they put objects in your repository into pack files.
239 So does `git bundle` (see linkgit:git-bundle[1]) when it creates a bundle.
240
241 --threads=<n>::
242 Specifies the number of threads to spawn when searching for best
243 delta matches. This requires that pack-objects be compiled with
244 pthreads otherwise this option is ignored with a warning.
245 This is meant to reduce packing time on multiprocessor machines.
246 The required amount of memory for the delta search window is
247 however multiplied by the number of threads.
248 Specifying 0 will cause Git to auto-detect the number of CPU's
249 and set the number of threads accordingly.
250
251 --index-version=<version>[,<offset>]::
252 This is intended to be used by the test suite only. It allows
253 to force the version for the generated pack index, and to force
254 64-bit index entries on objects located above the given offset.
255
256 --keep-true-parents::
257 With this option, parents that are hidden by grafts are packed
258 nevertheless.
259
260 --filter=<filter-spec>::
261 Requires `--stdout`. Omits certain objects (usually blobs) from
262 the resulting packfile. See linkgit:git-rev-list[1] for valid
263 `<filter-spec>` forms.
264
265 --no-filter::
266 Turns off any previous `--filter=` argument.
267
268 --missing=<missing-action>::
269 A debug option to help with future "partial clone" development.
270 This option specifies how missing objects are handled.
271 +
272 The form '--missing=error' requests that pack-objects stop with an error if
273 a missing object is encountered. If the repository is a partial clone, an
274 attempt to fetch missing objects will be made before declaring them missing.
275 This is the default action.
276 +
277 The form '--missing=allow-any' will allow object traversal to continue
278 if a missing object is encountered. No fetch of a missing object will occur.
279 Missing objects will silently be omitted from the results.
280 +
281 The form '--missing=allow-promisor' is like 'allow-any', but will only
282 allow object traversal to continue for EXPECTED promisor missing objects.
283 No fetch of a missing object will occur. An unexpected missing object will
284 raise an error.
285
286 --exclude-promisor-objects::
287 Omit objects that are known to be in the promisor remote. (This
288 option has the purpose of operating only on locally created objects,
289 so that when we repack, we still maintain a distinction between
290 locally created objects [without .promisor] and objects from the
291 promisor remote [with .promisor].) This is used with partial clone.
292
293 --keep-unreachable::
294 Objects unreachable from the refs in packs named with
295 --unpacked= option are added to the resulting pack, in
296 addition to the reachable objects that are not in packs marked
297 with *.keep files. This implies `--revs`.
298
299 --pack-loose-unreachable::
300 Pack unreachable loose objects (and their loose counterparts
301 removed). This implies `--revs`.
302
303 --unpack-unreachable::
304 Keep unreachable objects in loose form. This implies `--revs`.
305
306 --delta-islands::
307 Restrict delta matches based on "islands". See DELTA ISLANDS
308 below.
309
310
311 DELTA ISLANDS
312 -------------
313
314 When possible, `pack-objects` tries to reuse existing on-disk deltas to
315 avoid having to search for new ones on the fly. This is an important
316 optimization for serving fetches, because it means the server can avoid
317 inflating most objects at all and just send the bytes directly from
318 disk. This optimization can't work when an object is stored as a delta
319 against a base which the receiver does not have (and which we are not
320 already sending). In that case the server "breaks" the delta and has to
321 find a new one, which has a high CPU cost. Therefore it's important for
322 performance that the set of objects in on-disk delta relationships match
323 what a client would fetch.
324
325 In a normal repository, this tends to work automatically. The objects
326 are mostly reachable from the branches and tags, and that's what clients
327 fetch. Any deltas we find on the server are likely to be between objects
328 the client has or will have.
329
330 But in some repository setups, you may have several related but separate
331 groups of ref tips, with clients tending to fetch those groups
332 independently. For example, imagine that you are hosting several "forks"
333 of a repository in a single shared object store, and letting clients
334 view them as separate repositories through `GIT_NAMESPACE` or separate
335 repos using the alternates mechanism. A naive repack may find that the
336 optimal delta for an object is against a base that is only found in
337 another fork. But when a client fetches, they will not have the base
338 object, and we'll have to find a new delta on the fly.
339
340 A similar situation may exist if you have many refs outside of
341 `refs/heads/` and `refs/tags/` that point to related objects (e.g.,
342 `refs/pull` or `refs/changes` used by some hosting providers). By
343 default, clients fetch only heads and tags, and deltas against objects
344 found only in those other groups cannot be sent as-is.
345
346 Delta islands solve this problem by allowing you to group your refs into
347 distinct "islands". Pack-objects computes which objects are reachable
348 from which islands, and refuses to make a delta from an object `A`
349 against a base which is not present in all of `A`'s islands. This
350 results in slightly larger packs (because we miss some delta
351 opportunities), but guarantees that a fetch of one island will not have
352 to recompute deltas on the fly due to crossing island boundaries.
353
354 When repacking with delta islands the delta window tends to get
355 clogged with candidates that are forbidden by the config. Repacking
356 with a big --window helps (and doesn't take as long as it otherwise
357 might because we can reject some object pairs based on islands before
358 doing any computation on the content).
359
360 Islands are configured via the `pack.island` option, which can be
361 specified multiple times. Each value is a left-anchored regular
362 expressions matching refnames. For example:
363
364 -------------------------------------------
365 [pack]
366 island = refs/heads/
367 island = refs/tags/
368 -------------------------------------------
369
370 puts heads and tags into an island (whose name is the empty string; see
371 below for more on naming). Any refs which do not match those regular
372 expressions (e.g., `refs/pull/123`) is not in any island. Any object
373 which is reachable only from `refs/pull/` (but not heads or tags) is
374 therefore not a candidate to be used as a base for `refs/heads/`.
375
376 Refs are grouped into islands based on their "names", and two regexes
377 that produce the same name are considered to be in the same
378 island. The names are computed from the regexes by concatenating any
379 capture groups from the regex, with a '-' dash in between. (And if
380 there are no capture groups, then the name is the empty string, as in
381 the above example.) This allows you to create arbitrary numbers of
382 islands. Only up to 14 such capture groups are supported though.
383
384 For example, imagine you store the refs for each fork in
385 `refs/virtual/ID`, where `ID` is a numeric identifier. You might then
386 configure:
387
388 -------------------------------------------
389 [pack]
390 island = refs/virtual/([0-9]+)/heads/
391 island = refs/virtual/([0-9]+)/tags/
392 island = refs/virtual/([0-9]+)/(pull)/
393 -------------------------------------------
394
395 That puts the heads and tags for each fork in their own island (named
396 "1234" or similar), and the pull refs for each go into their own
397 "1234-pull".
398
399 Note that we pick a single island for each regex to go into, using "last
400 one wins" ordering (which allows repo-specific config to take precedence
401 over user-wide config, and so forth).
402
403 SEE ALSO
404 --------
405 linkgit:git-rev-list[1]
406 linkgit:git-repack[1]
407 linkgit:git-prune-packed[1]
408
409 GIT
410 ---
411 Part of the linkgit:git[1] suite