]> git.ipfire.org Git - thirdparty/git.git/blame - Documentation/gitattributes.txt
The fourth batch for 2.18
[thirdparty/git.git] / Documentation / gitattributes.txt
CommitLineData
88e7fdf2
JH
1gitattributes(5)
2================
3
4NAME
5----
6gitattributes - defining attributes per path
7
8SYNOPSIS
9--------
e5b5c1d2 10$GIT_DIR/info/attributes, .gitattributes
88e7fdf2
JH
11
12
13DESCRIPTION
14-----------
15
16A `gitattributes` file is a simple text file that gives
17`attributes` to pathnames.
18
19Each line in `gitattributes` file is of form:
20
3f74c8e8 21 pattern attr1 attr2 ...
88e7fdf2 22
3f74c8e8 23That is, a pattern followed by an attributes list,
860a74d9
NTND
24separated by whitespaces. Leading and trailing whitespaces are
25ignored. Lines that begin with '#' are ignored. Patterns
26that begin with a double quote are quoted in C style.
27When the pattern matches the path in question, the attributes
28listed on the line are given to the path.
88e7fdf2
JH
29
30Each attribute can be in one of these states for a given path:
31
32Set::
33
34 The path has the attribute with special value "true";
35 this is specified by listing only the name of the
36 attribute in the attribute list.
37
38Unset::
39
40 The path has the attribute with special value "false";
41 this is specified by listing the name of the attribute
42 prefixed with a dash `-` in the attribute list.
43
44Set to a value::
45
46 The path has the attribute with specified string value;
47 this is specified by listing the name of the attribute
48 followed by an equal sign `=` and its value in the
49 attribute list.
50
51Unspecified::
52
3f74c8e8 53 No pattern matches the path, and nothing says if
b9d14ffb
JH
54 the path has or does not have the attribute, the
55 attribute for the path is said to be Unspecified.
88e7fdf2 56
3f74c8e8 57When more than one pattern matches the path, a later line
b9d14ffb 58overrides an earlier line. This overriding is done per
b635ed97
JK
59attribute.
60
61The rules by which the pattern matches paths are the same as in
62`.gitignore` files (see linkgit:gitignore[5]), with a few exceptions:
63
64 - negative patterns are forbidden
65
66 - patterns that match a directory do not recursively match paths
67 inside that directory (so using the trailing-slash `path/` syntax is
68 pointless in an attributes file; use `path/**` instead)
88e7fdf2 69
2de9b711 70When deciding what attributes are assigned to a path, Git
88e7fdf2
JH
71consults `$GIT_DIR/info/attributes` file (which has the highest
72precedence), `.gitattributes` file in the same directory as the
20ff3ec2
JM
73path in question, and its parent directories up to the toplevel of the
74work tree (the further the directory that contains `.gitattributes`
6df42ab9
PO
75is from the path in question, the lower its precedence). Finally
76global and system-wide files are considered (they have the lowest
77precedence).
88e7fdf2 78
40701adb
NTND
79When the `.gitattributes` file is missing from the work tree, the
80path in the index is used as a fall-back. During checkout process,
81`.gitattributes` in the index is used and then the file in the
82working tree is used as a fall-back.
83
90b22907 84If you wish to affect only a single repository (i.e., to assign
6df42ab9
PO
85attributes to files that are particular to
86one user's workflow for that repository), then
90b22907
JK
87attributes should be placed in the `$GIT_DIR/info/attributes` file.
88Attributes which should be version-controlled and distributed to other
89repositories (i.e., attributes of interest to all users) should go into
6df42ab9
PO
90`.gitattributes` files. Attributes that should affect all repositories
91for a single user should be placed in a file specified by the
da0005b8 92`core.attributesFile` configuration option (see linkgit:git-config[1]).
684e40f6
HKNN
93Its default value is $XDG_CONFIG_HOME/git/attributes. If $XDG_CONFIG_HOME
94is either not set or empty, $HOME/.config/git/attributes is used instead.
6df42ab9
PO
95Attributes for all users on a system should be placed in the
96`$(prefix)/etc/gitattributes` file.
90b22907 97
faa4e8ce 98Sometimes you would need to override a setting of an attribute
0922570c 99for a path to `Unspecified` state. This can be done by listing
88e7fdf2
JH
100the name of the attribute prefixed with an exclamation point `!`.
101
102
103EFFECTS
104-------
105
2de9b711 106Certain operations by Git can be influenced by assigning
ae7aa499
JH
107particular attributes to a path. Currently, the following
108operations are attributes-aware.
88e7fdf2
JH
109
110Checking-out and checking-in
111~~~~~~~~~~~~~~~~~~~~~~~~~~~~
112
3fed15f5 113These attributes affect how the contents stored in the
88e7fdf2 114repository are copied to the working tree files when commands
0b444cdb 115such as 'git checkout' and 'git merge' run. They also affect how
2de9b711 116Git stores the contents you prepare in the working tree in the
0b444cdb 117repository upon 'git add' and 'git commit'.
88e7fdf2 118
5ec3e670 119`text`
3fed15f5
JH
120^^^^^^
121
fd6cce9e
EB
122This attribute enables and controls end-of-line normalization. When a
123text file is normalized, its line endings are converted to LF in the
124repository. To control what line ending style is used in the working
125directory, use the `eol` attribute for a single file and the
942e7747 126`core.eol` configuration variable for all text files.
65237284 127Note that `core.autocrlf` overrides `core.eol`
3fed15f5 128
88e7fdf2
JH
129Set::
130
5ec3e670 131 Setting the `text` attribute on a path enables end-of-line
fd6cce9e
EB
132 normalization and marks the path as a text file. End-of-line
133 conversion takes place without guessing the content type.
88e7fdf2
JH
134
135Unset::
136
2de9b711 137 Unsetting the `text` attribute on a path tells Git not to
bbb896d8 138 attempt any end-of-line conversion upon checkin or checkout.
88e7fdf2 139
fd6cce9e 140Set to string value "auto"::
88e7fdf2 141
5ec3e670 142 When `text` is set to "auto", the path is marked for automatic
65237284
TB
143 end-of-line conversion. If Git decides that the content is
144 text, its line endings are converted to LF on checkin.
2e3a16b2 145 When the file has been committed with CRLF, no conversion is done.
88e7fdf2 146
88e7fdf2
JH
147Unspecified::
148
2de9b711 149 If the `text` attribute is unspecified, Git uses the
942e7747
EB
150 `core.autocrlf` configuration variable to determine if the
151 file should be converted.
88e7fdf2 152
2de9b711 153Any other value causes Git to act as if `text` has been left
fd6cce9e 154unspecified.
88e7fdf2 155
fd6cce9e
EB
156`eol`
157^^^^^
88e7fdf2 158
fd6cce9e 159This attribute sets a specific line-ending style to be used in the
65237284 160working directory. It enables end-of-line conversion without any
3bc4b8f7
BB
161content checks, effectively setting the `text` attribute. Note that
162setting this attribute on paths which are in the index with CRLF line
163endings may make the paths to be considered dirty. Adding the path to
164the index again will normalize the line endings in the index.
88e7fdf2 165
fd6cce9e 166Set to string value "crlf"::
88e7fdf2 167
2de9b711 168 This setting forces Git to normalize line endings for this
942e7747
EB
169 file on checkin and convert them to CRLF when the file is
170 checked out.
fd6cce9e
EB
171
172Set to string value "lf"::
173
2de9b711 174 This setting forces Git to normalize line endings to LF on
fd6cce9e 175 checkin and prevents conversion to CRLF when the file is
942e7747 176 checked out.
5ec3e670
EB
177
178Backwards compatibility with `crlf` attribute
179^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
180
181For backwards compatibility, the `crlf` attribute is interpreted as
182follows:
183
184------------------------
185crlf text
186-crlf -text
187crlf=input eol=lf
188------------------------
fd6cce9e
EB
189
190End-of-line conversion
191^^^^^^^^^^^^^^^^^^^^^^
192
2de9b711 193While Git normally leaves file contents alone, it can be configured to
fd6cce9e
EB
194normalize line endings to LF in the repository and, optionally, to
195convert them to CRLF when files are checked out.
196
fd6cce9e
EB
197If you simply want to have CRLF line endings in your working directory
198regardless of the repository you are working with, you can set the
65237284 199config variable "core.autocrlf" without using any attributes.
fd6cce9e
EB
200
201------------------------
202[core]
203 autocrlf = true
204------------------------
205
e28eae31 206This does not force normalization of text files, but does ensure
fd6cce9e
EB
207that text files that you introduce to the repository have their line
208endings normalized to LF when they are added, and that files that are
942e7747 209already normalized in the repository stay normalized.
fd6cce9e 210
e28eae31
TB
211If you want to ensure that text files that any contributor introduces to
212the repository have their line endings normalized, you can set the
213`text` attribute to "auto" for _all_ files.
88e7fdf2 214
fd6cce9e 215------------------------
5ec3e670 216* text=auto
fd6cce9e
EB
217------------------------
218
e28eae31
TB
219The attributes allow a fine-grained control, how the line endings
220are converted.
221Here is an example that will make Git normalize .txt, .vcproj and .sh
222files, ensure that .vcproj files have CRLF and .sh files have LF in
223the working directory, and prevent .jpg files from being normalized
224regardless of their content.
225
226------------------------
227* text=auto
228*.txt text
229*.vcproj text eol=crlf
230*.sh text eol=lf
231*.jpg -text
232------------------------
233
234NOTE: When `text=auto` conversion is enabled in a cross-platform
235project using push and pull to a central repository the text files
236containing CRLFs should be normalized.
fd6cce9e 237
e28eae31 238From a clean working directory:
fd6cce9e
EB
239
240-------------------------------------------------
e28eae31 241$ echo "* text=auto" >.gitattributes
9472935d 242$ git add --renormalize .
fd6cce9e 243$ git status # Show files that will be normalized
fd6cce9e
EB
244$ git commit -m "Introduce end-of-line normalization"
245-------------------------------------------------
246
247If any files that should not be normalized show up in 'git status',
5ec3e670 248unset their `text` attribute before running 'git add -u'.
fd6cce9e
EB
249
250------------------------
5ec3e670 251manual.pdf -text
fd6cce9e 252------------------------
88e7fdf2 253
2de9b711 254Conversely, text files that Git does not detect can have normalization
fd6cce9e 255enabled manually.
88e7fdf2 256
fd6cce9e 257------------------------
5ec3e670 258weirdchars.txt text
fd6cce9e 259------------------------
88e7fdf2 260
2de9b711 261If `core.safecrlf` is set to "true" or "warn", Git verifies if
21e5ad50 262the conversion is reversible for the current setting of
2de9b711
TA
263`core.autocrlf`. For "true", Git rejects irreversible
264conversions; for "warn", Git only prints a warning but accepts
21e5ad50
SP
265an irreversible conversion. The safety triggers to prevent such
266a conversion done to the files in the work tree, but there are a
267few exceptions. Even though...
268
0b444cdb 269- 'git add' itself does not touch the files in the work tree, the
21e5ad50
SP
270 next checkout would, so the safety triggers;
271
0b444cdb 272- 'git apply' to update a text file with a patch does touch the files
21e5ad50
SP
273 in the work tree, but the operation is about text files and CRLF
274 conversion is about fixing the line ending inconsistencies, so the
275 safety does not trigger;
276
0b444cdb
TR
277- 'git diff' itself does not touch the files in the work tree, it is
278 often run to inspect the changes you intend to next 'git add'. To
21e5ad50
SP
279 catch potential problems early, safety triggers.
280
88e7fdf2 281
3fed15f5
JH
282`ident`
283^^^^^^^
284
2de9b711 285When the attribute `ident` is set for a path, Git replaces
2c850f12 286`$Id$` in the blob object with `$Id:`, followed by the
3fed15f5
JH
28740-character hexadecimal blob object name, followed by a dollar
288sign `$` upon checkout. Any byte sequence that begins with
af9b54bb
AP
289`$Id:` and ends with `$` in the worktree file is replaced
290with `$Id$` upon check-in.
3fed15f5
JH
291
292
aa4ed402
JH
293`filter`
294^^^^^^^^
295
c05ef938 296A `filter` attribute can be set to a string value that names a
aa4ed402
JH
297filter driver specified in the configuration.
298
c05ef938 299A filter driver consists of a `clean` command and a `smudge`
aa4ed402 300command, either of which can be left unspecified. Upon
c05ef938
WC
301checkout, when the `smudge` command is specified, the command is
302fed the blob object from its standard input, and its standard
303output is used to update the worktree file. Similarly, the
304`clean` command is used to convert the contents of worktree file
edcc8581
LS
305upon checkin. By default these commands process only a single
306blob and terminate. If a long running `process` filter is used
307in place of `clean` and/or `smudge` filters, then Git can process
308all blobs with a single filter command invocation for the entire
309life of a single Git command, for example `git add --all`. If a
310long running `process` filter is configured then it always takes
311precedence over a configured single blob filter. See section
312below for the description of the protocol used to communicate with
313a `process` filter.
aa4ed402 314
36daaaca
JB
315One use of the content filtering is to massage the content into a shape
316that is more convenient for the platform, filesystem, and the user to use.
317For this mode of operation, the key phrase here is "more convenient" and
318not "turning something unusable into usable". In other words, the intent
319is that if someone unsets the filter driver definition, or does not have
320the appropriate filter program, the project should still be usable.
321
322Another use of the content filtering is to store the content that cannot
323be directly used in the repository (e.g. a UUID that refers to the true
2de9b711 324content stored outside Git, or an encrypted content) and turn it into a
36daaaca
JB
325usable form upon checkout (e.g. download the external content, or decrypt
326the encrypted content).
327
328These two filters behave differently, and by default, a filter is taken as
329the former, massaging the contents into more convenient shape. A missing
330filter driver definition in the config, or a filter driver that exits with
331a non-zero status, is not an error but makes the filter a no-op passthru.
332
333You can declare that a filter turns a content that by itself is unusable
334into a usable content by setting the filter.<driver>.required configuration
335variable to `true`.
aa4ed402 336
9472935d
TB
337Note: Whenever the clean filter is changed, the repo should be renormalized:
338$ git add --renormalize .
339
d79f5d17
NS
340For example, in .gitattributes, you would assign the `filter`
341attribute for paths.
342
343------------------------
344*.c filter=indent
345------------------------
346
347Then you would define a "filter.indent.clean" and "filter.indent.smudge"
348configuration in your .git/config to specify a pair of commands to
349modify the contents of C programs when the source files are checked
350in ("clean" is run) and checked out (no change is made because the
351command is "cat").
352
353------------------------
354[filter "indent"]
355 clean = indent
356 smudge = cat
357------------------------
358
f217f0e8
EB
359For best results, `clean` should not alter its output further if it is
360run twice ("clean->clean" should be equivalent to "clean"), and
361multiple `smudge` commands should not alter `clean`'s output
362("smudge->smudge->clean" should be equivalent to "clean"). See the
363section on merging below.
364
365The "indent" filter is well-behaved in this regard: it will not modify
366input that is already correctly indented. In this case, the lack of a
367smudge filter means that the clean filter _must_ accept its own output
368without modifying it.
369
36daaaca
JB
370If a filter _must_ succeed in order to make the stored contents usable,
371you can declare that the filter is `required`, in the configuration:
372
373------------------------
374[filter "crypt"]
375 clean = openssl enc ...
376 smudge = openssl enc -d ...
377 required
378------------------------
379
a2b665de
PW
380Sequence "%f" on the filter command line is replaced with the name of
381the file the filter is working on. A filter might use this in keyword
382substitution. For example:
383
384------------------------
385[filter "p4"]
386 clean = git-p4-filter --clean %f
387 smudge = git-p4-filter --smudge %f
388------------------------
389
52db4b04
JH
390Note that "%f" is the name of the path that is being worked on. Depending
391on the version that is being filtered, the corresponding file on disk may
392not exist, or may have different contents. So, smudge and clean commands
393should not try to access the file on disk, but only act as filters on the
394content provided to them on standard input.
aa4ed402 395
edcc8581
LS
396Long Running Filter Process
397^^^^^^^^^^^^^^^^^^^^^^^^^^^
398
399If the filter command (a string value) is defined via
400`filter.<driver>.process` then Git can process all blobs with a
401single filter invocation for the entire life of a single Git
addad105
JT
402command. This is achieved by using the long-running process protocol
403(described in technical/long-running-process-protocol.txt).
404
405When Git encounters the first file that needs to be cleaned or smudged,
406it starts the filter and performs the handshake. In the handshake, the
407welcome message sent by Git is "git-filter-client", only version 2 is
408suppported, and the supported capabilities are "clean", "smudge", and
409"delay".
edcc8581
LS
410
411Afterwards Git sends a list of "key=value" pairs terminated with
412a flush packet. The list will contain at least the filter command
413(based on the supported capabilities) and the pathname of the file
414to filter relative to the repository root. Right after the flush packet
415Git sends the content split in zero or more pkt-line packets and a
416flush packet to terminate content. Please note, that the filter
417must not send any response before it received the content and the
c6b0831c
LS
418final flush packet. Also note that the "value" of a "key=value" pair
419can contain the "=" character whereas the key would never contain
420that character.
edcc8581
LS
421------------------------
422packet: git> command=smudge
423packet: git> pathname=path/testfile.dat
424packet: git> 0000
425packet: git> CONTENT
426packet: git> 0000
427------------------------
428
429The filter is expected to respond with a list of "key=value" pairs
430terminated with a flush packet. If the filter does not experience
431problems then the list must contain a "success" status. Right after
432these packets the filter is expected to send the content in zero
433or more pkt-line packets and a flush packet at the end. Finally, a
434second list of "key=value" pairs terminated with a flush packet
435is expected. The filter can change the status in the second list
436or keep the status as is with an empty list. Please note that the
437empty list must be terminated with a flush packet regardless.
438
439------------------------
440packet: git< status=success
441packet: git< 0000
442packet: git< SMUDGED_CONTENT
443packet: git< 0000
444packet: git< 0000 # empty list, keep "status=success" unchanged!
445------------------------
446
447If the result content is empty then the filter is expected to respond
448with a "success" status and a flush packet to signal the empty content.
449------------------------
450packet: git< status=success
451packet: git< 0000
452packet: git< 0000 # empty content!
453packet: git< 0000 # empty list, keep "status=success" unchanged!
454------------------------
455
456In case the filter cannot or does not want to process the content,
457it is expected to respond with an "error" status.
458------------------------
459packet: git< status=error
460packet: git< 0000
461------------------------
462
463If the filter experiences an error during processing, then it can
464send the status "error" after the content was (partially or
465completely) sent.
466------------------------
467packet: git< status=success
468packet: git< 0000
469packet: git< HALF_WRITTEN_ERRONEOUS_CONTENT
470packet: git< 0000
471packet: git< status=error
472packet: git< 0000
473------------------------
474
475In case the filter cannot or does not want to process the content
476as well as any future content for the lifetime of the Git process,
477then it is expected to respond with an "abort" status at any point
478in the protocol.
479------------------------
480packet: git< status=abort
481packet: git< 0000
482------------------------
483
484Git neither stops nor restarts the filter process in case the
485"error"/"abort" status is set. However, Git sets its exit code
486according to the `filter.<driver>.required` flag, mimicking the
487behavior of the `filter.<driver>.clean` / `filter.<driver>.smudge`
488mechanism.
489
490If the filter dies during the communication or does not adhere to
491the protocol then Git will stop the filter process and restart it
492with the next file that needs to be processed. Depending on the
493`filter.<driver>.required` flag Git will interpret that as error.
494
2841e8f8
LS
495Delay
496^^^^^
497
498If the filter supports the "delay" capability, then Git can send the
499flag "can-delay" after the filter command and pathname. This flag
500denotes that the filter can delay filtering the current blob (e.g. to
501compensate network latencies) by responding with no content but with
502the status "delayed" and a flush packet.
503------------------------
504packet: git> command=smudge
505packet: git> pathname=path/testfile.dat
506packet: git> can-delay=1
507packet: git> 0000
508packet: git> CONTENT
509packet: git> 0000
510packet: git< status=delayed
511packet: git< 0000
512------------------------
513
514If the filter supports the "delay" capability then it must support the
515"list_available_blobs" command. If Git sends this command, then the
516filter is expected to return a list of pathnames representing blobs
517that have been delayed earlier and are now available.
518The list must be terminated with a flush packet followed
519by a "success" status that is also terminated with a flush packet. If
520no blobs for the delayed paths are available, yet, then the filter is
521expected to block the response until at least one blob becomes
522available. The filter can tell Git that it has no more delayed blobs
523by sending an empty list. As soon as the filter responds with an empty
524list, Git stops asking. All blobs that Git has not received at this
525point are considered missing and will result in an error.
526
527------------------------
528packet: git> command=list_available_blobs
529packet: git> 0000
530packet: git< pathname=path/testfile.dat
531packet: git< pathname=path/otherfile.dat
532packet: git< 0000
533packet: git< status=success
534packet: git< 0000
535------------------------
536
537After Git received the pathnames, it will request the corresponding
538blobs again. These requests contain a pathname and an empty content
539section. The filter is expected to respond with the smudged content
540in the usual way as explained above.
541------------------------
542packet: git> command=smudge
543packet: git> pathname=path/testfile.dat
544packet: git> 0000
545packet: git> 0000 # empty content!
546packet: git< status=success
547packet: git< 0000
548packet: git< SMUDGED_CONTENT
549packet: git< 0000
550packet: git< 0000 # empty list, keep "status=success" unchanged!
551------------------------
552
553Example
554^^^^^^^
555
0f71fa27
LS
556A long running filter demo implementation can be found in
557`contrib/long-running-filter/example.pl` located in the Git
558core repository. If you develop your own long running filter
edcc8581
LS
559process then the `GIT_TRACE_PACKET` environment variables can be
560very helpful for debugging (see linkgit:git[1]).
561
562Please note that you cannot use an existing `filter.<driver>.clean`
563or `filter.<driver>.smudge` command with `filter.<driver>.process`
564because the former two use a different inter process communication
565protocol than the latter one.
566
567
aa4ed402
JH
568Interaction between checkin/checkout attributes
569^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
570
571In the check-in codepath, the worktree file is first converted
572with `filter` driver (if specified and corresponding driver
573defined), then the result is processed with `ident` (if
5ec3e670 574specified), and then finally with `text` (again, if specified
aa4ed402
JH
575and applicable).
576
577In the check-out codepath, the blob content is first converted
5ec3e670 578with `text`, and then `ident` and fed to `filter`.
aa4ed402
JH
579
580
f217f0e8
EB
581Merging branches with differing checkin/checkout attributes
582^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
583
584If you have added attributes to a file that cause the canonical
585repository format for that file to change, such as adding a
586clean/smudge filter or text/eol/ident attributes, merging anything
587where the attribute is not in place would normally cause merge
588conflicts.
589
2de9b711 590To prevent these unnecessary merge conflicts, Git can be told to run a
f217f0e8
EB
591virtual check-out and check-in of all three stages of a file when
592resolving a three-way merge by setting the `merge.renormalize`
593configuration variable. This prevents changes caused by check-in
594conversion from causing spurious merge conflicts when a converted file
595is merged with an unconverted file.
596
597As long as a "smudge->clean" results in the same output as a "clean"
598even on files that are already smudged, this strategy will
599automatically resolve all filter-related conflicts. Filters that do
600not act in this way may cause additional merge conflicts that must be
601resolved manually.
602
603
88e7fdf2
JH
604Generating diff text
605~~~~~~~~~~~~~~~~~~~~
606
4f73e240
JN
607`diff`
608^^^^^^
609
2de9b711
TA
610The attribute `diff` affects how Git generates diffs for particular
611files. It can tell Git whether to generate a textual patch for the path
678852d9 612or to treat the path as a binary file. It can also affect what line is
2de9b711
TA
613shown on the hunk header `@@ -k,l +n,m @@` line, tell Git to use an
614external command to generate the diff, or ask Git to convert binary
678852d9 615files to a text format before generating the diff.
88e7fdf2
JH
616
617Set::
618
619 A path to which the `diff` attribute is set is treated
620 as text, even when they contain byte values that
621 normally never appear in text files, such as NUL.
622
623Unset::
624
625 A path to which the `diff` attribute is unset will
678852d9
JK
626 generate `Binary files differ` (or a binary patch, if
627 binary patches are enabled).
88e7fdf2
JH
628
629Unspecified::
630
631 A path to which the `diff` attribute is unspecified
632 first gets its contents inspected, and if it looks like
6bf3b813
NTND
633 text and is smaller than core.bigFileThreshold, it is treated
634 as text. Otherwise it would generate `Binary files differ`.
88e7fdf2 635
2cc3167c
JH
636String::
637
678852d9
JK
638 Diff is shown using the specified diff driver. Each driver may
639 specify one or more options, as described in the following
640 section. The options for the diff driver "foo" are defined
641 by the configuration variables in the "diff.foo" section of the
2de9b711 642 Git config file.
2cc3167c
JH
643
644
678852d9
JK
645Defining an external diff driver
646^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
2cc3167c
JH
647
648The definition of a diff driver is done in `gitconfig`, not
649`gitattributes` file, so strictly speaking this manual page is a
650wrong place to talk about it. However...
651
678852d9 652To define an external diff driver `jcdiff`, add a section to your
2cc3167c
JH
653`$GIT_DIR/config` file (or `$HOME/.gitconfig` file) like this:
654
655----------------------------------------------------------------
656[diff "jcdiff"]
657 command = j-c-diff
658----------------------------------------------------------------
659
2de9b711 660When Git needs to show you a diff for the path with `diff`
2cc3167c
JH
661attribute set to `jcdiff`, it calls the command you specified
662with the above configuration, i.e. `j-c-diff`, with 7
663parameters, just like `GIT_EXTERNAL_DIFF` program is called.
9e1f0a85 664See linkgit:git[1] for details.
88e7fdf2
JH
665
666
ae7aa499
JH
667Defining a custom hunk-header
668^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
669
c882c01e 670Each group of changes (called a "hunk") in the textual diff output
ae7aa499
JH
671is prefixed with a line of the form:
672
673 @@ -k,l +n,m @@ TEXT
674
c882c01e
GD
675This is called a 'hunk header'. The "TEXT" portion is by default a line
676that begins with an alphabet, an underscore or a dollar sign; this
677matches what GNU 'diff -p' output uses. This default selection however
678is not suited for some contents, and you can use a customized pattern
679to make a selection.
ae7aa499 680
c882c01e 681First, in .gitattributes, you would assign the `diff` attribute
ae7aa499
JH
682for paths.
683
684------------------------
685*.tex diff=tex
686------------------------
687
edb7e82f 688Then, you would define a "diff.tex.xfuncname" configuration to
ae7aa499 689specify a regular expression that matches a line that you would
c4c86d23
JK
690want to appear as the hunk header "TEXT". Add a section to your
691`$GIT_DIR/config` file (or `$HOME/.gitconfig` file) like this:
ae7aa499
JH
692
693------------------------
694[diff "tex"]
45d9414f 695 xfuncname = "^(\\\\(sub)*section\\{.*)$"
ae7aa499
JH
696------------------------
697
698Note. A single level of backslashes are eaten by the
699configuration file parser, so you would need to double the
700backslashes; the pattern above picks a line that begins with a
02783075 701backslash, and zero or more occurrences of `sub` followed by
ae7aa499
JH
702`section` followed by open brace, to the end of line.
703
704There are a few built-in patterns to make this easier, and `tex`
705is one of them, so you do not have to write the above in your
706configuration file (you still need to enable this with the
d08ed6d6
GH
707attribute mechanism, via `.gitattributes`). The following built in
708patterns are available:
709
e90d065e
AJ
710- `ada` suitable for source code in the Ada language.
711
23b5beb2
GH
712- `bibtex` suitable for files with BibTeX coded references.
713
80c49c3d
TR
714- `cpp` suitable for source code in the C and C++ languages.
715
b221207d
PO
716- `csharp` suitable for source code in the C# language.
717
0719f3ee
WD
718- `css` suitable for cascading style sheets.
719
909a5494
BC
720- `fortran` suitable for source code in the Fortran language.
721
69f9c87d
ZB
722- `fountain` suitable for Fountain documents.
723
1dbf0c0a
AG
724- `golang` suitable for source code in the Go language.
725
af9ce1ff
AE
726- `html` suitable for HTML/XHTML documents.
727
b66e00f1 728- `java` suitable for source code in the Java language.
d08ed6d6 729
53b10a14
GH
730- `matlab` suitable for source code in the MATLAB language.
731
5d1e958e
JS
732- `objc` suitable for source code in the Objective-C language.
733
d08ed6d6
GH
734- `pascal` suitable for source code in the Pascal/Delphi language.
735
71a5d4bc
JN
736- `perl` suitable for source code in the Perl language.
737
af9ce1ff
AE
738- `php` suitable for source code in the PHP language.
739
7c17205b
KS
740- `python` suitable for source code in the Python language.
741
d08ed6d6
GH
742- `ruby` suitable for source code in the Ruby language.
743
744- `tex` suitable for source code for LaTeX documents.
ae7aa499
JH
745
746
80c49c3d
TR
747Customizing word diff
748^^^^^^^^^^^^^^^^^^^^^
749
882749a0 750You can customize the rules that `git diff --word-diff` uses to
80c49c3d 751split words in a line, by specifying an appropriate regular expression
ae3b970a 752in the "diff.*.wordRegex" configuration variable. For example, in TeX
80c49c3d
TR
753a backslash followed by a sequence of letters forms a command, but
754several such commands can be run together without intervening
c4c86d23
JK
755whitespace. To separate them, use a regular expression in your
756`$GIT_DIR/config` file (or `$HOME/.gitconfig` file) like this:
80c49c3d
TR
757
758------------------------
759[diff "tex"]
ae3b970a 760 wordRegex = "\\\\[a-zA-Z]+|[{}]|\\\\.|[^\\{}[:space:]]+"
80c49c3d
TR
761------------------------
762
763A built-in pattern is provided for all languages listed in the
764previous section.
765
766
678852d9
JK
767Performing text diffs of binary files
768^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
769
770Sometimes it is desirable to see the diff of a text-converted
771version of some binary files. For example, a word processor
772document can be converted to an ASCII text representation, and
773the diff of the text shown. Even though this conversion loses
774some information, the resulting diff is useful for human
775viewing (but cannot be applied directly).
776
777The `textconv` config option is used to define a program for
778performing such a conversion. The program should take a single
779argument, the name of a file to convert, and produce the
780resulting text on stdout.
781
782For example, to show the diff of the exif information of a
783file instead of the binary information (assuming you have the
c4c86d23
JK
784exif tool installed), add the following section to your
785`$GIT_DIR/config` file (or `$HOME/.gitconfig` file):
678852d9
JK
786
787------------------------
788[diff "jpg"]
789 textconv = exif
790------------------------
791
792NOTE: The text conversion is generally a one-way conversion;
793in this example, we lose the actual image contents and focus
794just on the text data. This means that diffs generated by
795textconv are _not_ suitable for applying. For this reason,
796only `git diff` and the `git log` family of commands (i.e.,
797log, whatchanged, show) will perform text conversion. `git
798format-patch` will never generate this output. If you want to
799send somebody a text-converted diff of a binary file (e.g.,
800because it quickly conveys the changes you have made), you
801should generate it separately and send it as a comment _in
802addition to_ the usual binary diff that you might send.
803
d9bae1a1 804Because text conversion can be slow, especially when doing a
2de9b711 805large number of them with `git log -p`, Git provides a mechanism
d9bae1a1
JK
806to cache the output and use it in future diffs. To enable
807caching, set the "cachetextconv" variable in your diff driver's
808config. For example:
809
810------------------------
811[diff "jpg"]
812 textconv = exif
813 cachetextconv = true
814------------------------
815
816This will cache the result of running "exif" on each blob
817indefinitely. If you change the textconv config variable for a
2de9b711 818diff driver, Git will automatically invalidate the cache entries
d9bae1a1
JK
819and re-run the textconv filter. If you want to invalidate the
820cache manually (e.g., because your version of "exif" was updated
821and now produces better output), you can remove the cache
822manually with `git update-ref -d refs/notes/textconv/jpg` (where
823"jpg" is the name of the diff driver, as in the example above).
678852d9 824
55601c6a
JK
825Choosing textconv versus external diff
826^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
827
828If you want to show differences between binary or specially-formatted
829blobs in your repository, you can choose to use either an external diff
830command, or to use textconv to convert them to a diff-able text format.
831Which method you choose depends on your exact situation.
832
833The advantage of using an external diff command is flexibility. You are
834not bound to find line-oriented changes, nor is it necessary for the
835output to resemble unified diff. You are free to locate and report
836changes in the most appropriate way for your data format.
837
838A textconv, by comparison, is much more limiting. You provide a
2de9b711 839transformation of the data into a line-oriented text format, and Git
55601c6a
JK
840uses its regular diff tools to generate the output. There are several
841advantages to choosing this method:
842
8431. Ease of use. It is often much simpler to write a binary to text
844 transformation than it is to perform your own diff. In many cases,
845 existing programs can be used as textconv filters (e.g., exif,
846 odt2txt).
847
8482. Git diff features. By performing only the transformation step
2de9b711 849 yourself, you can still utilize many of Git's diff features,
55601c6a
JK
850 including colorization, word-diff, and combined diffs for merges.
851
8523. Caching. Textconv caching can speed up repeated diffs, such as those
853 you might trigger by running `git log -p`.
854
855
ab435611
JK
856Marking files as binary
857^^^^^^^^^^^^^^^^^^^^^^^
858
859Git usually guesses correctly whether a blob contains text or binary
860data by examining the beginning of the contents. However, sometimes you
861may want to override its decision, either because a blob contains binary
862data later in the file, or because the content, while technically
863composed of text characters, is opaque to a human reader. For example,
f745acb0 864many postscript files contain only ASCII characters, but produce noisy
ab435611
JK
865and meaningless diffs.
866
867The simplest way to mark a file as binary is to unset the diff
868attribute in the `.gitattributes` file:
869
870------------------------
871*.ps -diff
872------------------------
873
2de9b711 874This will cause Git to generate `Binary files differ` (or a binary
ab435611
JK
875patch, if binary patches are enabled) instead of a regular diff.
876
877However, one may also want to specify other diff driver attributes. For
878example, you might want to use `textconv` to convert postscript files to
f745acb0 879an ASCII representation for human viewing, but otherwise treat them as
ab435611
JK
880binary files. You cannot specify both `-diff` and `diff=ps` attributes.
881The solution is to use the `diff.*.binary` config option:
882
883------------------------
884[diff "ps"]
885 textconv = ps2ascii
886 binary = true
887------------------------
888
88e7fdf2
JH
889Performing a three-way merge
890~~~~~~~~~~~~~~~~~~~~~~~~~~~~
891
4f73e240
JN
892`merge`
893^^^^^^^
894
b547ce0b 895The attribute `merge` affects how three versions of a file are
88e7fdf2 896merged when a file-level merge is necessary during `git merge`,
57f6ec02 897and other commands such as `git revert` and `git cherry-pick`.
88e7fdf2
JH
898
899Set::
900
901 Built-in 3-way merge driver is used to merge the
2fd02c92 902 contents in a way similar to 'merge' command of `RCS`
88e7fdf2
JH
903 suite. This is suitable for ordinary text files.
904
905Unset::
906
907 Take the version from the current branch as the
908 tentative merge result, and declare that the merge has
b547ce0b 909 conflicts. This is suitable for binary files that do
88e7fdf2
JH
910 not have a well-defined merge semantics.
911
912Unspecified::
913
914 By default, this uses the same built-in 3-way merge
b547ce0b
AS
915 driver as is the case when the `merge` attribute is set.
916 However, the `merge.default` configuration variable can name
917 different merge driver to be used with paths for which the
88e7fdf2
JH
918 `merge` attribute is unspecified.
919
2cc3167c 920String::
88e7fdf2
JH
921
922 3-way merge is performed using the specified custom
923 merge driver. The built-in 3-way merge driver can be
924 explicitly specified by asking for "text" driver; the
925 built-in "take the current branch" driver can be
b9d14ffb 926 requested with "binary".
88e7fdf2
JH
927
928
0e545f75
JH
929Built-in merge drivers
930^^^^^^^^^^^^^^^^^^^^^^
931
932There are a few built-in low-level merge drivers defined that
933can be asked for via the `merge` attribute.
934
935text::
936
937 Usual 3-way file level merge for text files. Conflicted
938 regions are marked with conflict markers `<<<<<<<`,
939 `=======` and `>>>>>>>`. The version from your branch
940 appears before the `=======` marker, and the version
941 from the merged branch appears after the `=======`
942 marker.
943
944binary::
945
946 Keep the version from your branch in the work tree, but
947 leave the path in the conflicted state for the user to
948 sort out.
949
950union::
951
952 Run 3-way file level merge for text files, but take
953 lines from both versions, instead of leaving conflict
954 markers. This tends to leave the added lines in the
955 resulting file in random order and the user should
956 verify the result. Do not use this if you do not
957 understand the implications.
958
959
88e7fdf2
JH
960Defining a custom merge driver
961^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
962
0e545f75
JH
963The definition of a merge driver is done in the `.git/config`
964file, not in the `gitattributes` file, so strictly speaking this
965manual page is a wrong place to talk about it. However...
88e7fdf2
JH
966
967To define a custom merge driver `filfre`, add a section to your
968`$GIT_DIR/config` file (or `$HOME/.gitconfig` file) like this:
969
970----------------------------------------------------------------
971[merge "filfre"]
972 name = feel-free merge driver
ef45bb1f 973 driver = filfre %O %A %B %L %P
88e7fdf2
JH
974 recursive = binary
975----------------------------------------------------------------
976
977The `merge.*.name` variable gives the driver a human-readable
978name.
979
980The `merge.*.driver` variable's value is used to construct a
981command to run to merge ancestor's version (`%O`), current
982version (`%A`) and the other branches' version (`%B`). These
983three tokens are replaced with the names of temporary files that
984hold the contents of these versions when the command line is
16758621
BW
985built. Additionally, %L will be replaced with the conflict marker
986size (see below).
88e7fdf2
JH
987
988The merge driver is expected to leave the result of the merge in
989the file named with `%A` by overwriting it, and exit with zero
990status if it managed to merge them cleanly, or non-zero if there
991were conflicts.
992
993The `merge.*.recursive` variable specifies what other merge
994driver to use when the merge driver is called for an internal
995merge between common ancestors, when there are more than one.
996When left unspecified, the driver itself is used for both
997internal merge and the final merge.
998
ef45bb1f
JH
999The merge driver can learn the pathname in which the merged result
1000will be stored via placeholder `%P`.
1001
88e7fdf2 1002
4c734803
JH
1003`conflict-marker-size`
1004^^^^^^^^^^^^^^^^^^^^^^
1005
1006This attribute controls the length of conflict markers left in
1007the work tree file during a conflicted merge. Only setting to
1008the value to a positive integer has any meaningful effect.
1009
1010For example, this line in `.gitattributes` can be used to tell the merge
1011machinery to leave much longer (instead of the usual 7-character-long)
1012conflict markers when merging the file `Documentation/git-merge.txt`
1013results in a conflict.
1014
1015------------------------
1016Documentation/git-merge.txt conflict-marker-size=32
1017------------------------
1018
1019
cf1b7869
JH
1020Checking whitespace errors
1021~~~~~~~~~~~~~~~~~~~~~~~~~~
1022
1023`whitespace`
1024^^^^^^^^^^^^
1025
1026The `core.whitespace` configuration variable allows you to define what
2fd02c92 1027'diff' and 'apply' should consider whitespace errors for all paths in
5162e697 1028the project (See linkgit:git-config[1]). This attribute gives you finer
cf1b7869
JH
1029control per path.
1030
1031Set::
1032
2de9b711 1033 Notice all types of potential whitespace errors known to Git.
f4b05a49
JS
1034 The tab width is taken from the value of the `core.whitespace`
1035 configuration variable.
cf1b7869
JH
1036
1037Unset::
1038
1039 Do not notice anything as error.
1040
1041Unspecified::
1042
f4b05a49 1043 Use the value of the `core.whitespace` configuration variable to
cf1b7869
JH
1044 decide what to notice as error.
1045
1046String::
1047
1048 Specify a comma separate list of common whitespace problems to
f4b05a49 1049 notice in the same format as the `core.whitespace` configuration
cf1b7869
JH
1050 variable.
1051
1052
8a33dd8b
JH
1053Creating an archive
1054~~~~~~~~~~~~~~~~~~~
1055
08b51f51
JH
1056`export-ignore`
1057^^^^^^^^^^^^^^^
1058
1059Files and directories with the attribute `export-ignore` won't be added to
1060archive files.
1061
8a33dd8b
JH
1062`export-subst`
1063^^^^^^^^^^^^^^
1064
2de9b711 1065If the attribute `export-subst` is set for a file then Git will expand
8a33dd8b 1066several placeholders when adding this file to an archive. The
08b51f51 1067expansion depends on the availability of a commit ID, i.e., if
8a33dd8b
JH
1068linkgit:git-archive[1] has been given a tree instead of a commit or a
1069tag then no replacement will be done. The placeholders are the same
1070as those for the option `--pretty=format:` of linkgit:git-log[1],
1071except that they need to be wrapped like this: `$Format:PLACEHOLDERS$`
1072in the file. E.g. the string `$Format:%H$` will be replaced by the
1073commit hash.
1074
1075
975457f1
NG
1076Packing objects
1077~~~~~~~~~~~~~~~
1078
1079`delta`
1080^^^^^^^
1081
1082Delta compression will not be attempted for blobs for paths with the
1083attribute `delta` set to false.
1084
1085
a2df1fb2
AG
1086Viewing files in GUI tools
1087~~~~~~~~~~~~~~~~~~~~~~~~~~
1088
1089`encoding`
1090^^^^^^^^^^
1091
1092The value of this attribute specifies the character encoding that should
1093be used by GUI tools (e.g. linkgit:gitk[1] and linkgit:git-gui[1]) to
1094display the contents of the relevant file. Note that due to performance
1095considerations linkgit:gitk[1] does not use this attribute unless you
1096manually enable per-file encodings in its options.
1097
1098If this attribute is not set or has an invalid value, the value of the
1099`gui.encoding` configuration variable is used instead
1100(See linkgit:git-config[1]).
1101
1102
0922570c 1103USING MACRO ATTRIBUTES
bbb896d8
JH
1104----------------------
1105
1106You do not want any end-of-line conversions applied to, nor textual diffs
1107produced for, any binary file you track. You would need to specify e.g.
1108
1109------------
5ec3e670 1110*.jpg -text -diff
bbb896d8
JH
1111------------
1112
1113but that may become cumbersome, when you have many attributes. Using
0922570c 1114macro attributes, you can define an attribute that, when set, also
98e84066 1115sets or unsets a number of other attributes at the same time. The
0922570c 1116system knows a built-in macro attribute, `binary`:
bbb896d8
JH
1117
1118------------
1119*.jpg binary
1120------------
1121
98e84066 1122Setting the "binary" attribute also unsets the "text" and "diff"
0922570c 1123attributes as above. Note that macro attributes can only be "Set",
98e84066
MH
1124though setting one might have the effect of setting or unsetting other
1125attributes or even returning other attributes to the "Unspecified"
1126state.
bbb896d8
JH
1127
1128
0922570c 1129DEFINING MACRO ATTRIBUTES
bbb896d8
JH
1130-------------------------
1131
e78e6967
MH
1132Custom macro attributes can be defined only in top-level gitattributes
1133files (`$GIT_DIR/info/attributes`, the `.gitattributes` file at the
1134top level of the working tree, or the global or system-wide
1135gitattributes files), not in `.gitattributes` files in working tree
1136subdirectories. The built-in macro attribute "binary" is equivalent
1137to:
bbb896d8
JH
1138
1139------------
155a4b71 1140[attr]binary -diff -merge -text
bbb896d8
JH
1141------------
1142
1143
88e7fdf2
JH
1144EXAMPLE
1145-------
1146
1147If you have these three `gitattributes` file:
1148
1149----------------------------------------------------------------
1150(in $GIT_DIR/info/attributes)
1151
1152a* foo !bar -baz
1153
1154(in .gitattributes)
1155abc foo bar baz
1156
1157(in t/.gitattributes)
1158ab* merge=filfre
1159abc -foo -bar
1160*.c frotz
1161----------------------------------------------------------------
1162
1163the attributes given to path `t/abc` are computed as follows:
1164
11651. By examining `t/.gitattributes` (which is in the same
2de9b711 1166 directory as the path in question), Git finds that the first
88e7fdf2
JH
1167 line matches. `merge` attribute is set. It also finds that
1168 the second line matches, and attributes `foo` and `bar`
1169 are unset.
1170
11712. Then it examines `.gitattributes` (which is in the parent
1172 directory), and finds that the first line matches, but
1173 `t/.gitattributes` file already decided how `merge`, `foo`
1174 and `bar` attributes should be given to this path, so it
1175 leaves `foo` and `bar` unset. Attribute `baz` is set.
1176
5c759f96 11773. Finally it examines `$GIT_DIR/info/attributes`. This file
88e7fdf2
JH
1178 is used to override the in-tree settings. The first line is
1179 a match, and `foo` is set, `bar` is reverted to unspecified
1180 state, and `baz` is unset.
1181
02783075 1182As the result, the attributes assignment to `t/abc` becomes:
88e7fdf2
JH
1183
1184----------------------------------------------------------------
1185foo set to true
1186bar unspecified
1187baz set to false
1188merge set to string value "filfre"
1189frotz unspecified
1190----------------------------------------------------------------
1191
1192
cde15181
MH
1193SEE ALSO
1194--------
1195linkgit:git-check-attr[1].
8460b2fc 1196
88e7fdf2
JH
1197GIT
1198---
9e1f0a85 1199Part of the linkgit:git[1] suite