]> git.ipfire.org Git - thirdparty/git.git/blame - Documentation/gitattributes.txt
add: introduce "--renormalize"
[thirdparty/git.git] / Documentation / gitattributes.txt
CommitLineData
88e7fdf2
JH
1gitattributes(5)
2================
3
4NAME
5----
6gitattributes - defining attributes per path
7
8SYNOPSIS
9--------
e5b5c1d2 10$GIT_DIR/info/attributes, .gitattributes
88e7fdf2
JH
11
12
13DESCRIPTION
14-----------
15
16A `gitattributes` file is a simple text file that gives
17`attributes` to pathnames.
18
19Each line in `gitattributes` file is of form:
20
3f74c8e8 21 pattern attr1 attr2 ...
88e7fdf2 22
3f74c8e8 23That is, a pattern followed by an attributes list,
860a74d9
NTND
24separated by whitespaces. Leading and trailing whitespaces are
25ignored. Lines that begin with '#' are ignored. Patterns
26that begin with a double quote are quoted in C style.
27When the pattern matches the path in question, the attributes
28listed on the line are given to the path.
88e7fdf2
JH
29
30Each attribute can be in one of these states for a given path:
31
32Set::
33
34 The path has the attribute with special value "true";
35 this is specified by listing only the name of the
36 attribute in the attribute list.
37
38Unset::
39
40 The path has the attribute with special value "false";
41 this is specified by listing the name of the attribute
42 prefixed with a dash `-` in the attribute list.
43
44Set to a value::
45
46 The path has the attribute with specified string value;
47 this is specified by listing the name of the attribute
48 followed by an equal sign `=` and its value in the
49 attribute list.
50
51Unspecified::
52
3f74c8e8 53 No pattern matches the path, and nothing says if
b9d14ffb
JH
54 the path has or does not have the attribute, the
55 attribute for the path is said to be Unspecified.
88e7fdf2 56
3f74c8e8 57When more than one pattern matches the path, a later line
b9d14ffb 58overrides an earlier line. This overriding is done per
3f74c8e8
JS
59attribute. The rules how the pattern matches paths are the
60same as in `.gitignore` files; see linkgit:gitignore[5].
82dce998 61Unlike `.gitignore`, negative patterns are forbidden.
88e7fdf2 62
2de9b711 63When deciding what attributes are assigned to a path, Git
88e7fdf2
JH
64consults `$GIT_DIR/info/attributes` file (which has the highest
65precedence), `.gitattributes` file in the same directory as the
20ff3ec2
JM
66path in question, and its parent directories up to the toplevel of the
67work tree (the further the directory that contains `.gitattributes`
6df42ab9
PO
68is from the path in question, the lower its precedence). Finally
69global and system-wide files are considered (they have the lowest
70precedence).
88e7fdf2 71
40701adb
NTND
72When the `.gitattributes` file is missing from the work tree, the
73path in the index is used as a fall-back. During checkout process,
74`.gitattributes` in the index is used and then the file in the
75working tree is used as a fall-back.
76
90b22907 77If you wish to affect only a single repository (i.e., to assign
6df42ab9
PO
78attributes to files that are particular to
79one user's workflow for that repository), then
90b22907
JK
80attributes should be placed in the `$GIT_DIR/info/attributes` file.
81Attributes which should be version-controlled and distributed to other
82repositories (i.e., attributes of interest to all users) should go into
6df42ab9
PO
83`.gitattributes` files. Attributes that should affect all repositories
84for a single user should be placed in a file specified by the
da0005b8 85`core.attributesFile` configuration option (see linkgit:git-config[1]).
684e40f6
HKNN
86Its default value is $XDG_CONFIG_HOME/git/attributes. If $XDG_CONFIG_HOME
87is either not set or empty, $HOME/.config/git/attributes is used instead.
6df42ab9
PO
88Attributes for all users on a system should be placed in the
89`$(prefix)/etc/gitattributes` file.
90b22907 90
faa4e8ce 91Sometimes you would need to override a setting of an attribute
0922570c 92for a path to `Unspecified` state. This can be done by listing
88e7fdf2
JH
93the name of the attribute prefixed with an exclamation point `!`.
94
95
96EFFECTS
97-------
98
2de9b711 99Certain operations by Git can be influenced by assigning
ae7aa499
JH
100particular attributes to a path. Currently, the following
101operations are attributes-aware.
88e7fdf2
JH
102
103Checking-out and checking-in
104~~~~~~~~~~~~~~~~~~~~~~~~~~~~
105
3fed15f5 106These attributes affect how the contents stored in the
88e7fdf2 107repository are copied to the working tree files when commands
0b444cdb 108such as 'git checkout' and 'git merge' run. They also affect how
2de9b711 109Git stores the contents you prepare in the working tree in the
0b444cdb 110repository upon 'git add' and 'git commit'.
88e7fdf2 111
5ec3e670 112`text`
3fed15f5
JH
113^^^^^^
114
fd6cce9e
EB
115This attribute enables and controls end-of-line normalization. When a
116text file is normalized, its line endings are converted to LF in the
117repository. To control what line ending style is used in the working
118directory, use the `eol` attribute for a single file and the
942e7747 119`core.eol` configuration variable for all text files.
65237284 120Note that `core.autocrlf` overrides `core.eol`
3fed15f5 121
88e7fdf2
JH
122Set::
123
5ec3e670 124 Setting the `text` attribute on a path enables end-of-line
fd6cce9e
EB
125 normalization and marks the path as a text file. End-of-line
126 conversion takes place without guessing the content type.
88e7fdf2
JH
127
128Unset::
129
2de9b711 130 Unsetting the `text` attribute on a path tells Git not to
bbb896d8 131 attempt any end-of-line conversion upon checkin or checkout.
88e7fdf2 132
fd6cce9e 133Set to string value "auto"::
88e7fdf2 134
5ec3e670 135 When `text` is set to "auto", the path is marked for automatic
65237284
TB
136 end-of-line conversion. If Git decides that the content is
137 text, its line endings are converted to LF on checkin.
2e3a16b2 138 When the file has been committed with CRLF, no conversion is done.
88e7fdf2 139
88e7fdf2
JH
140Unspecified::
141
2de9b711 142 If the `text` attribute is unspecified, Git uses the
942e7747
EB
143 `core.autocrlf` configuration variable to determine if the
144 file should be converted.
88e7fdf2 145
2de9b711 146Any other value causes Git to act as if `text` has been left
fd6cce9e 147unspecified.
88e7fdf2 148
fd6cce9e
EB
149`eol`
150^^^^^
88e7fdf2 151
fd6cce9e 152This attribute sets a specific line-ending style to be used in the
65237284 153working directory. It enables end-of-line conversion without any
3bc4b8f7
BB
154content checks, effectively setting the `text` attribute. Note that
155setting this attribute on paths which are in the index with CRLF line
156endings may make the paths to be considered dirty. Adding the path to
157the index again will normalize the line endings in the index.
88e7fdf2 158
fd6cce9e 159Set to string value "crlf"::
88e7fdf2 160
2de9b711 161 This setting forces Git to normalize line endings for this
942e7747
EB
162 file on checkin and convert them to CRLF when the file is
163 checked out.
fd6cce9e
EB
164
165Set to string value "lf"::
166
2de9b711 167 This setting forces Git to normalize line endings to LF on
fd6cce9e 168 checkin and prevents conversion to CRLF when the file is
942e7747 169 checked out.
5ec3e670
EB
170
171Backwards compatibility with `crlf` attribute
172^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
173
174For backwards compatibility, the `crlf` attribute is interpreted as
175follows:
176
177------------------------
178crlf text
179-crlf -text
180crlf=input eol=lf
181------------------------
fd6cce9e
EB
182
183End-of-line conversion
184^^^^^^^^^^^^^^^^^^^^^^
185
2de9b711 186While Git normally leaves file contents alone, it can be configured to
fd6cce9e
EB
187normalize line endings to LF in the repository and, optionally, to
188convert them to CRLF when files are checked out.
189
fd6cce9e
EB
190If you simply want to have CRLF line endings in your working directory
191regardless of the repository you are working with, you can set the
65237284 192config variable "core.autocrlf" without using any attributes.
fd6cce9e
EB
193
194------------------------
195[core]
196 autocrlf = true
197------------------------
198
e28eae31 199This does not force normalization of text files, but does ensure
fd6cce9e
EB
200that text files that you introduce to the repository have their line
201endings normalized to LF when they are added, and that files that are
942e7747 202already normalized in the repository stay normalized.
fd6cce9e 203
e28eae31
TB
204If you want to ensure that text files that any contributor introduces to
205the repository have their line endings normalized, you can set the
206`text` attribute to "auto" for _all_ files.
88e7fdf2 207
fd6cce9e 208------------------------
5ec3e670 209* text=auto
fd6cce9e
EB
210------------------------
211
e28eae31
TB
212The attributes allow a fine-grained control, how the line endings
213are converted.
214Here is an example that will make Git normalize .txt, .vcproj and .sh
215files, ensure that .vcproj files have CRLF and .sh files have LF in
216the working directory, and prevent .jpg files from being normalized
217regardless of their content.
218
219------------------------
220* text=auto
221*.txt text
222*.vcproj text eol=crlf
223*.sh text eol=lf
224*.jpg -text
225------------------------
226
227NOTE: When `text=auto` conversion is enabled in a cross-platform
228project using push and pull to a central repository the text files
229containing CRLFs should be normalized.
fd6cce9e 230
e28eae31 231From a clean working directory:
fd6cce9e
EB
232
233-------------------------------------------------
e28eae31 234$ echo "* text=auto" >.gitattributes
9472935d 235$ git add --renormalize .
fd6cce9e 236$ git status # Show files that will be normalized
fd6cce9e
EB
237$ git commit -m "Introduce end-of-line normalization"
238-------------------------------------------------
239
240If any files that should not be normalized show up in 'git status',
5ec3e670 241unset their `text` attribute before running 'git add -u'.
fd6cce9e
EB
242
243------------------------
5ec3e670 244manual.pdf -text
fd6cce9e 245------------------------
88e7fdf2 246
2de9b711 247Conversely, text files that Git does not detect can have normalization
fd6cce9e 248enabled manually.
88e7fdf2 249
fd6cce9e 250------------------------
5ec3e670 251weirdchars.txt text
fd6cce9e 252------------------------
88e7fdf2 253
2de9b711 254If `core.safecrlf` is set to "true" or "warn", Git verifies if
21e5ad50 255the conversion is reversible for the current setting of
2de9b711
TA
256`core.autocrlf`. For "true", Git rejects irreversible
257conversions; for "warn", Git only prints a warning but accepts
21e5ad50
SP
258an irreversible conversion. The safety triggers to prevent such
259a conversion done to the files in the work tree, but there are a
260few exceptions. Even though...
261
0b444cdb 262- 'git add' itself does not touch the files in the work tree, the
21e5ad50
SP
263 next checkout would, so the safety triggers;
264
0b444cdb 265- 'git apply' to update a text file with a patch does touch the files
21e5ad50
SP
266 in the work tree, but the operation is about text files and CRLF
267 conversion is about fixing the line ending inconsistencies, so the
268 safety does not trigger;
269
0b444cdb
TR
270- 'git diff' itself does not touch the files in the work tree, it is
271 often run to inspect the changes you intend to next 'git add'. To
21e5ad50
SP
272 catch potential problems early, safety triggers.
273
88e7fdf2 274
3fed15f5
JH
275`ident`
276^^^^^^^
277
2de9b711 278When the attribute `ident` is set for a path, Git replaces
2c850f12 279`$Id$` in the blob object with `$Id:`, followed by the
3fed15f5
JH
28040-character hexadecimal blob object name, followed by a dollar
281sign `$` upon checkout. Any byte sequence that begins with
af9b54bb
AP
282`$Id:` and ends with `$` in the worktree file is replaced
283with `$Id$` upon check-in.
3fed15f5
JH
284
285
aa4ed402
JH
286`filter`
287^^^^^^^^
288
c05ef938 289A `filter` attribute can be set to a string value that names a
aa4ed402
JH
290filter driver specified in the configuration.
291
c05ef938 292A filter driver consists of a `clean` command and a `smudge`
aa4ed402 293command, either of which can be left unspecified. Upon
c05ef938
WC
294checkout, when the `smudge` command is specified, the command is
295fed the blob object from its standard input, and its standard
296output is used to update the worktree file. Similarly, the
297`clean` command is used to convert the contents of worktree file
edcc8581
LS
298upon checkin. By default these commands process only a single
299blob and terminate. If a long running `process` filter is used
300in place of `clean` and/or `smudge` filters, then Git can process
301all blobs with a single filter command invocation for the entire
302life of a single Git command, for example `git add --all`. If a
303long running `process` filter is configured then it always takes
304precedence over a configured single blob filter. See section
305below for the description of the protocol used to communicate with
306a `process` filter.
aa4ed402 307
36daaaca
JB
308One use of the content filtering is to massage the content into a shape
309that is more convenient for the platform, filesystem, and the user to use.
310For this mode of operation, the key phrase here is "more convenient" and
311not "turning something unusable into usable". In other words, the intent
312is that if someone unsets the filter driver definition, or does not have
313the appropriate filter program, the project should still be usable.
314
315Another use of the content filtering is to store the content that cannot
316be directly used in the repository (e.g. a UUID that refers to the true
2de9b711 317content stored outside Git, or an encrypted content) and turn it into a
36daaaca
JB
318usable form upon checkout (e.g. download the external content, or decrypt
319the encrypted content).
320
321These two filters behave differently, and by default, a filter is taken as
322the former, massaging the contents into more convenient shape. A missing
323filter driver definition in the config, or a filter driver that exits with
324a non-zero status, is not an error but makes the filter a no-op passthru.
325
326You can declare that a filter turns a content that by itself is unusable
327into a usable content by setting the filter.<driver>.required configuration
328variable to `true`.
aa4ed402 329
9472935d
TB
330Note: Whenever the clean filter is changed, the repo should be renormalized:
331$ git add --renormalize .
332
d79f5d17
NS
333For example, in .gitattributes, you would assign the `filter`
334attribute for paths.
335
336------------------------
337*.c filter=indent
338------------------------
339
340Then you would define a "filter.indent.clean" and "filter.indent.smudge"
341configuration in your .git/config to specify a pair of commands to
342modify the contents of C programs when the source files are checked
343in ("clean" is run) and checked out (no change is made because the
344command is "cat").
345
346------------------------
347[filter "indent"]
348 clean = indent
349 smudge = cat
350------------------------
351
f217f0e8
EB
352For best results, `clean` should not alter its output further if it is
353run twice ("clean->clean" should be equivalent to "clean"), and
354multiple `smudge` commands should not alter `clean`'s output
355("smudge->smudge->clean" should be equivalent to "clean"). See the
356section on merging below.
357
358The "indent" filter is well-behaved in this regard: it will not modify
359input that is already correctly indented. In this case, the lack of a
360smudge filter means that the clean filter _must_ accept its own output
361without modifying it.
362
36daaaca
JB
363If a filter _must_ succeed in order to make the stored contents usable,
364you can declare that the filter is `required`, in the configuration:
365
366------------------------
367[filter "crypt"]
368 clean = openssl enc ...
369 smudge = openssl enc -d ...
370 required
371------------------------
372
a2b665de
PW
373Sequence "%f" on the filter command line is replaced with the name of
374the file the filter is working on. A filter might use this in keyword
375substitution. For example:
376
377------------------------
378[filter "p4"]
379 clean = git-p4-filter --clean %f
380 smudge = git-p4-filter --smudge %f
381------------------------
382
52db4b04
JH
383Note that "%f" is the name of the path that is being worked on. Depending
384on the version that is being filtered, the corresponding file on disk may
385not exist, or may have different contents. So, smudge and clean commands
386should not try to access the file on disk, but only act as filters on the
387content provided to them on standard input.
aa4ed402 388
edcc8581
LS
389Long Running Filter Process
390^^^^^^^^^^^^^^^^^^^^^^^^^^^
391
392If the filter command (a string value) is defined via
393`filter.<driver>.process` then Git can process all blobs with a
394single filter invocation for the entire life of a single Git
395command. This is achieved by using a packet format (pkt-line,
396see technical/protocol-common.txt) based protocol over standard
397input and standard output as follows. All packets, except for the
398"*CONTENT" packets and the "0000" flush packet, are considered
399text and therefore are terminated by a LF.
400
401Git starts the filter when it encounters the first file
402that needs to be cleaned or smudged. After the filter started
403Git sends a welcome message ("git-filter-client"), a list of supported
404protocol version numbers, and a flush packet. Git expects to read a welcome
405response message ("git-filter-server"), exactly one protocol version number
406from the previously sent list, and a flush packet. All further
407communication will be based on the selected version. The remaining
408protocol description below documents "version=2". Please note that
409"version=42" in the example below does not exist and is only there
410to illustrate how the protocol would look like with more than one
411version.
412
413After the version negotiation Git sends a list of all capabilities that
414it supports and a flush packet. Git expects to read a list of desired
415capabilities, which must be a subset of the supported capabilities list,
416and a flush packet as response:
417------------------------
418packet: git> git-filter-client
419packet: git> version=2
420packet: git> version=42
421packet: git> 0000
422packet: git< git-filter-server
423packet: git< version=2
424packet: git< 0000
425packet: git> capability=clean
426packet: git> capability=smudge
427packet: git> capability=not-yet-invented
428packet: git> 0000
429packet: git< capability=clean
430packet: git< capability=smudge
431packet: git< 0000
432------------------------
2841e8f8
LS
433Supported filter capabilities in version 2 are "clean", "smudge",
434and "delay".
edcc8581
LS
435
436Afterwards Git sends a list of "key=value" pairs terminated with
437a flush packet. The list will contain at least the filter command
438(based on the supported capabilities) and the pathname of the file
439to filter relative to the repository root. Right after the flush packet
440Git sends the content split in zero or more pkt-line packets and a
441flush packet to terminate content. Please note, that the filter
442must not send any response before it received the content and the
c6b0831c
LS
443final flush packet. Also note that the "value" of a "key=value" pair
444can contain the "=" character whereas the key would never contain
445that character.
edcc8581
LS
446------------------------
447packet: git> command=smudge
448packet: git> pathname=path/testfile.dat
449packet: git> 0000
450packet: git> CONTENT
451packet: git> 0000
452------------------------
453
454The filter is expected to respond with a list of "key=value" pairs
455terminated with a flush packet. If the filter does not experience
456problems then the list must contain a "success" status. Right after
457these packets the filter is expected to send the content in zero
458or more pkt-line packets and a flush packet at the end. Finally, a
459second list of "key=value" pairs terminated with a flush packet
460is expected. The filter can change the status in the second list
461or keep the status as is with an empty list. Please note that the
462empty list must be terminated with a flush packet regardless.
463
464------------------------
465packet: git< status=success
466packet: git< 0000
467packet: git< SMUDGED_CONTENT
468packet: git< 0000
469packet: git< 0000 # empty list, keep "status=success" unchanged!
470------------------------
471
472If the result content is empty then the filter is expected to respond
473with a "success" status and a flush packet to signal the empty content.
474------------------------
475packet: git< status=success
476packet: git< 0000
477packet: git< 0000 # empty content!
478packet: git< 0000 # empty list, keep "status=success" unchanged!
479------------------------
480
481In case the filter cannot or does not want to process the content,
482it is expected to respond with an "error" status.
483------------------------
484packet: git< status=error
485packet: git< 0000
486------------------------
487
488If the filter experiences an error during processing, then it can
489send the status "error" after the content was (partially or
490completely) sent.
491------------------------
492packet: git< status=success
493packet: git< 0000
494packet: git< HALF_WRITTEN_ERRONEOUS_CONTENT
495packet: git< 0000
496packet: git< status=error
497packet: git< 0000
498------------------------
499
500In case the filter cannot or does not want to process the content
501as well as any future content for the lifetime of the Git process,
502then it is expected to respond with an "abort" status at any point
503in the protocol.
504------------------------
505packet: git< status=abort
506packet: git< 0000
507------------------------
508
509Git neither stops nor restarts the filter process in case the
510"error"/"abort" status is set. However, Git sets its exit code
511according to the `filter.<driver>.required` flag, mimicking the
512behavior of the `filter.<driver>.clean` / `filter.<driver>.smudge`
513mechanism.
514
515If the filter dies during the communication or does not adhere to
516the protocol then Git will stop the filter process and restart it
517with the next file that needs to be processed. Depending on the
518`filter.<driver>.required` flag Git will interpret that as error.
519
2841e8f8
LS
520After the filter has processed a command it is expected to wait for
521a "key=value" list containing the next command. Git will close
edcc8581
LS
522the command pipe on exit. The filter is expected to detect EOF
523and exit gracefully on its own. Git will wait until the filter
524process has stopped.
525
2841e8f8
LS
526Delay
527^^^^^
528
529If the filter supports the "delay" capability, then Git can send the
530flag "can-delay" after the filter command and pathname. This flag
531denotes that the filter can delay filtering the current blob (e.g. to
532compensate network latencies) by responding with no content but with
533the status "delayed" and a flush packet.
534------------------------
535packet: git> command=smudge
536packet: git> pathname=path/testfile.dat
537packet: git> can-delay=1
538packet: git> 0000
539packet: git> CONTENT
540packet: git> 0000
541packet: git< status=delayed
542packet: git< 0000
543------------------------
544
545If the filter supports the "delay" capability then it must support the
546"list_available_blobs" command. If Git sends this command, then the
547filter is expected to return a list of pathnames representing blobs
548that have been delayed earlier and are now available.
549The list must be terminated with a flush packet followed
550by a "success" status that is also terminated with a flush packet. If
551no blobs for the delayed paths are available, yet, then the filter is
552expected to block the response until at least one blob becomes
553available. The filter can tell Git that it has no more delayed blobs
554by sending an empty list. As soon as the filter responds with an empty
555list, Git stops asking. All blobs that Git has not received at this
556point are considered missing and will result in an error.
557
558------------------------
559packet: git> command=list_available_blobs
560packet: git> 0000
561packet: git< pathname=path/testfile.dat
562packet: git< pathname=path/otherfile.dat
563packet: git< 0000
564packet: git< status=success
565packet: git< 0000
566------------------------
567
568After Git received the pathnames, it will request the corresponding
569blobs again. These requests contain a pathname and an empty content
570section. The filter is expected to respond with the smudged content
571in the usual way as explained above.
572------------------------
573packet: git> command=smudge
574packet: git> pathname=path/testfile.dat
575packet: git> 0000
576packet: git> 0000 # empty content!
577packet: git< status=success
578packet: git< 0000
579packet: git< SMUDGED_CONTENT
580packet: git< 0000
581packet: git< 0000 # empty list, keep "status=success" unchanged!
582------------------------
583
584Example
585^^^^^^^
586
0f71fa27
LS
587A long running filter demo implementation can be found in
588`contrib/long-running-filter/example.pl` located in the Git
589core repository. If you develop your own long running filter
edcc8581
LS
590process then the `GIT_TRACE_PACKET` environment variables can be
591very helpful for debugging (see linkgit:git[1]).
592
593Please note that you cannot use an existing `filter.<driver>.clean`
594or `filter.<driver>.smudge` command with `filter.<driver>.process`
595because the former two use a different inter process communication
596protocol than the latter one.
597
598
aa4ed402
JH
599Interaction between checkin/checkout attributes
600^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
601
602In the check-in codepath, the worktree file is first converted
603with `filter` driver (if specified and corresponding driver
604defined), then the result is processed with `ident` (if
5ec3e670 605specified), and then finally with `text` (again, if specified
aa4ed402
JH
606and applicable).
607
608In the check-out codepath, the blob content is first converted
5ec3e670 609with `text`, and then `ident` and fed to `filter`.
aa4ed402
JH
610
611
f217f0e8
EB
612Merging branches with differing checkin/checkout attributes
613^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
614
615If you have added attributes to a file that cause the canonical
616repository format for that file to change, such as adding a
617clean/smudge filter or text/eol/ident attributes, merging anything
618where the attribute is not in place would normally cause merge
619conflicts.
620
2de9b711 621To prevent these unnecessary merge conflicts, Git can be told to run a
f217f0e8
EB
622virtual check-out and check-in of all three stages of a file when
623resolving a three-way merge by setting the `merge.renormalize`
624configuration variable. This prevents changes caused by check-in
625conversion from causing spurious merge conflicts when a converted file
626is merged with an unconverted file.
627
628As long as a "smudge->clean" results in the same output as a "clean"
629even on files that are already smudged, this strategy will
630automatically resolve all filter-related conflicts. Filters that do
631not act in this way may cause additional merge conflicts that must be
632resolved manually.
633
634
88e7fdf2
JH
635Generating diff text
636~~~~~~~~~~~~~~~~~~~~
637
4f73e240
JN
638`diff`
639^^^^^^
640
2de9b711
TA
641The attribute `diff` affects how Git generates diffs for particular
642files. It can tell Git whether to generate a textual patch for the path
678852d9 643or to treat the path as a binary file. It can also affect what line is
2de9b711
TA
644shown on the hunk header `@@ -k,l +n,m @@` line, tell Git to use an
645external command to generate the diff, or ask Git to convert binary
678852d9 646files to a text format before generating the diff.
88e7fdf2
JH
647
648Set::
649
650 A path to which the `diff` attribute is set is treated
651 as text, even when they contain byte values that
652 normally never appear in text files, such as NUL.
653
654Unset::
655
656 A path to which the `diff` attribute is unset will
678852d9
JK
657 generate `Binary files differ` (or a binary patch, if
658 binary patches are enabled).
88e7fdf2
JH
659
660Unspecified::
661
662 A path to which the `diff` attribute is unspecified
663 first gets its contents inspected, and if it looks like
6bf3b813
NTND
664 text and is smaller than core.bigFileThreshold, it is treated
665 as text. Otherwise it would generate `Binary files differ`.
88e7fdf2 666
2cc3167c
JH
667String::
668
678852d9
JK
669 Diff is shown using the specified diff driver. Each driver may
670 specify one or more options, as described in the following
671 section. The options for the diff driver "foo" are defined
672 by the configuration variables in the "diff.foo" section of the
2de9b711 673 Git config file.
2cc3167c
JH
674
675
678852d9
JK
676Defining an external diff driver
677^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
2cc3167c
JH
678
679The definition of a diff driver is done in `gitconfig`, not
680`gitattributes` file, so strictly speaking this manual page is a
681wrong place to talk about it. However...
682
678852d9 683To define an external diff driver `jcdiff`, add a section to your
2cc3167c
JH
684`$GIT_DIR/config` file (or `$HOME/.gitconfig` file) like this:
685
686----------------------------------------------------------------
687[diff "jcdiff"]
688 command = j-c-diff
689----------------------------------------------------------------
690
2de9b711 691When Git needs to show you a diff for the path with `diff`
2cc3167c
JH
692attribute set to `jcdiff`, it calls the command you specified
693with the above configuration, i.e. `j-c-diff`, with 7
694parameters, just like `GIT_EXTERNAL_DIFF` program is called.
9e1f0a85 695See linkgit:git[1] for details.
88e7fdf2
JH
696
697
ae7aa499
JH
698Defining a custom hunk-header
699^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
700
c882c01e 701Each group of changes (called a "hunk") in the textual diff output
ae7aa499
JH
702is prefixed with a line of the form:
703
704 @@ -k,l +n,m @@ TEXT
705
c882c01e
GD
706This is called a 'hunk header'. The "TEXT" portion is by default a line
707that begins with an alphabet, an underscore or a dollar sign; this
708matches what GNU 'diff -p' output uses. This default selection however
709is not suited for some contents, and you can use a customized pattern
710to make a selection.
ae7aa499 711
c882c01e 712First, in .gitattributes, you would assign the `diff` attribute
ae7aa499
JH
713for paths.
714
715------------------------
716*.tex diff=tex
717------------------------
718
edb7e82f 719Then, you would define a "diff.tex.xfuncname" configuration to
ae7aa499 720specify a regular expression that matches a line that you would
c4c86d23
JK
721want to appear as the hunk header "TEXT". Add a section to your
722`$GIT_DIR/config` file (or `$HOME/.gitconfig` file) like this:
ae7aa499
JH
723
724------------------------
725[diff "tex"]
45d9414f 726 xfuncname = "^(\\\\(sub)*section\\{.*)$"
ae7aa499
JH
727------------------------
728
729Note. A single level of backslashes are eaten by the
730configuration file parser, so you would need to double the
731backslashes; the pattern above picks a line that begins with a
02783075 732backslash, and zero or more occurrences of `sub` followed by
ae7aa499
JH
733`section` followed by open brace, to the end of line.
734
735There are a few built-in patterns to make this easier, and `tex`
736is one of them, so you do not have to write the above in your
737configuration file (you still need to enable this with the
d08ed6d6
GH
738attribute mechanism, via `.gitattributes`). The following built in
739patterns are available:
740
e90d065e
AJ
741- `ada` suitable for source code in the Ada language.
742
23b5beb2
GH
743- `bibtex` suitable for files with BibTeX coded references.
744
80c49c3d
TR
745- `cpp` suitable for source code in the C and C++ languages.
746
b221207d
PO
747- `csharp` suitable for source code in the C# language.
748
0719f3ee
WD
749- `css` suitable for cascading style sheets.
750
909a5494
BC
751- `fortran` suitable for source code in the Fortran language.
752
69f9c87d
ZB
753- `fountain` suitable for Fountain documents.
754
af9ce1ff
AE
755- `html` suitable for HTML/XHTML documents.
756
b66e00f1 757- `java` suitable for source code in the Java language.
d08ed6d6 758
53b10a14
GH
759- `matlab` suitable for source code in the MATLAB language.
760
5d1e958e
JS
761- `objc` suitable for source code in the Objective-C language.
762
d08ed6d6
GH
763- `pascal` suitable for source code in the Pascal/Delphi language.
764
71a5d4bc
JN
765- `perl` suitable for source code in the Perl language.
766
af9ce1ff
AE
767- `php` suitable for source code in the PHP language.
768
7c17205b
KS
769- `python` suitable for source code in the Python language.
770
d08ed6d6
GH
771- `ruby` suitable for source code in the Ruby language.
772
773- `tex` suitable for source code for LaTeX documents.
ae7aa499
JH
774
775
80c49c3d
TR
776Customizing word diff
777^^^^^^^^^^^^^^^^^^^^^
778
882749a0 779You can customize the rules that `git diff --word-diff` uses to
80c49c3d 780split words in a line, by specifying an appropriate regular expression
ae3b970a 781in the "diff.*.wordRegex" configuration variable. For example, in TeX
80c49c3d
TR
782a backslash followed by a sequence of letters forms a command, but
783several such commands can be run together without intervening
c4c86d23
JK
784whitespace. To separate them, use a regular expression in your
785`$GIT_DIR/config` file (or `$HOME/.gitconfig` file) like this:
80c49c3d
TR
786
787------------------------
788[diff "tex"]
ae3b970a 789 wordRegex = "\\\\[a-zA-Z]+|[{}]|\\\\.|[^\\{}[:space:]]+"
80c49c3d
TR
790------------------------
791
792A built-in pattern is provided for all languages listed in the
793previous section.
794
795
678852d9
JK
796Performing text diffs of binary files
797^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
798
799Sometimes it is desirable to see the diff of a text-converted
800version of some binary files. For example, a word processor
801document can be converted to an ASCII text representation, and
802the diff of the text shown. Even though this conversion loses
803some information, the resulting diff is useful for human
804viewing (but cannot be applied directly).
805
806The `textconv` config option is used to define a program for
807performing such a conversion. The program should take a single
808argument, the name of a file to convert, and produce the
809resulting text on stdout.
810
811For example, to show the diff of the exif information of a
812file instead of the binary information (assuming you have the
c4c86d23
JK
813exif tool installed), add the following section to your
814`$GIT_DIR/config` file (or `$HOME/.gitconfig` file):
678852d9
JK
815
816------------------------
817[diff "jpg"]
818 textconv = exif
819------------------------
820
821NOTE: The text conversion is generally a one-way conversion;
822in this example, we lose the actual image contents and focus
823just on the text data. This means that diffs generated by
824textconv are _not_ suitable for applying. For this reason,
825only `git diff` and the `git log` family of commands (i.e.,
826log, whatchanged, show) will perform text conversion. `git
827format-patch` will never generate this output. If you want to
828send somebody a text-converted diff of a binary file (e.g.,
829because it quickly conveys the changes you have made), you
830should generate it separately and send it as a comment _in
831addition to_ the usual binary diff that you might send.
832
d9bae1a1 833Because text conversion can be slow, especially when doing a
2de9b711 834large number of them with `git log -p`, Git provides a mechanism
d9bae1a1
JK
835to cache the output and use it in future diffs. To enable
836caching, set the "cachetextconv" variable in your diff driver's
837config. For example:
838
839------------------------
840[diff "jpg"]
841 textconv = exif
842 cachetextconv = true
843------------------------
844
845This will cache the result of running "exif" on each blob
846indefinitely. If you change the textconv config variable for a
2de9b711 847diff driver, Git will automatically invalidate the cache entries
d9bae1a1
JK
848and re-run the textconv filter. If you want to invalidate the
849cache manually (e.g., because your version of "exif" was updated
850and now produces better output), you can remove the cache
851manually with `git update-ref -d refs/notes/textconv/jpg` (where
852"jpg" is the name of the diff driver, as in the example above).
678852d9 853
55601c6a
JK
854Choosing textconv versus external diff
855^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
856
857If you want to show differences between binary or specially-formatted
858blobs in your repository, you can choose to use either an external diff
859command, or to use textconv to convert them to a diff-able text format.
860Which method you choose depends on your exact situation.
861
862The advantage of using an external diff command is flexibility. You are
863not bound to find line-oriented changes, nor is it necessary for the
864output to resemble unified diff. You are free to locate and report
865changes in the most appropriate way for your data format.
866
867A textconv, by comparison, is much more limiting. You provide a
2de9b711 868transformation of the data into a line-oriented text format, and Git
55601c6a
JK
869uses its regular diff tools to generate the output. There are several
870advantages to choosing this method:
871
8721. Ease of use. It is often much simpler to write a binary to text
873 transformation than it is to perform your own diff. In many cases,
874 existing programs can be used as textconv filters (e.g., exif,
875 odt2txt).
876
8772. Git diff features. By performing only the transformation step
2de9b711 878 yourself, you can still utilize many of Git's diff features,
55601c6a
JK
879 including colorization, word-diff, and combined diffs for merges.
880
8813. Caching. Textconv caching can speed up repeated diffs, such as those
882 you might trigger by running `git log -p`.
883
884
ab435611
JK
885Marking files as binary
886^^^^^^^^^^^^^^^^^^^^^^^
887
888Git usually guesses correctly whether a blob contains text or binary
889data by examining the beginning of the contents. However, sometimes you
890may want to override its decision, either because a blob contains binary
891data later in the file, or because the content, while technically
892composed of text characters, is opaque to a human reader. For example,
f745acb0 893many postscript files contain only ASCII characters, but produce noisy
ab435611
JK
894and meaningless diffs.
895
896The simplest way to mark a file as binary is to unset the diff
897attribute in the `.gitattributes` file:
898
899------------------------
900*.ps -diff
901------------------------
902
2de9b711 903This will cause Git to generate `Binary files differ` (or a binary
ab435611
JK
904patch, if binary patches are enabled) instead of a regular diff.
905
906However, one may also want to specify other diff driver attributes. For
907example, you might want to use `textconv` to convert postscript files to
f745acb0 908an ASCII representation for human viewing, but otherwise treat them as
ab435611
JK
909binary files. You cannot specify both `-diff` and `diff=ps` attributes.
910The solution is to use the `diff.*.binary` config option:
911
912------------------------
913[diff "ps"]
914 textconv = ps2ascii
915 binary = true
916------------------------
917
88e7fdf2
JH
918Performing a three-way merge
919~~~~~~~~~~~~~~~~~~~~~~~~~~~~
920
4f73e240
JN
921`merge`
922^^^^^^^
923
b547ce0b 924The attribute `merge` affects how three versions of a file are
88e7fdf2 925merged when a file-level merge is necessary during `git merge`,
57f6ec02 926and other commands such as `git revert` and `git cherry-pick`.
88e7fdf2
JH
927
928Set::
929
930 Built-in 3-way merge driver is used to merge the
2fd02c92 931 contents in a way similar to 'merge' command of `RCS`
88e7fdf2
JH
932 suite. This is suitable for ordinary text files.
933
934Unset::
935
936 Take the version from the current branch as the
937 tentative merge result, and declare that the merge has
b547ce0b 938 conflicts. This is suitable for binary files that do
88e7fdf2
JH
939 not have a well-defined merge semantics.
940
941Unspecified::
942
943 By default, this uses the same built-in 3-way merge
b547ce0b
AS
944 driver as is the case when the `merge` attribute is set.
945 However, the `merge.default` configuration variable can name
946 different merge driver to be used with paths for which the
88e7fdf2
JH
947 `merge` attribute is unspecified.
948
2cc3167c 949String::
88e7fdf2
JH
950
951 3-way merge is performed using the specified custom
952 merge driver. The built-in 3-way merge driver can be
953 explicitly specified by asking for "text" driver; the
954 built-in "take the current branch" driver can be
b9d14ffb 955 requested with "binary".
88e7fdf2
JH
956
957
0e545f75
JH
958Built-in merge drivers
959^^^^^^^^^^^^^^^^^^^^^^
960
961There are a few built-in low-level merge drivers defined that
962can be asked for via the `merge` attribute.
963
964text::
965
966 Usual 3-way file level merge for text files. Conflicted
967 regions are marked with conflict markers `<<<<<<<`,
968 `=======` and `>>>>>>>`. The version from your branch
969 appears before the `=======` marker, and the version
970 from the merged branch appears after the `=======`
971 marker.
972
973binary::
974
975 Keep the version from your branch in the work tree, but
976 leave the path in the conflicted state for the user to
977 sort out.
978
979union::
980
981 Run 3-way file level merge for text files, but take
982 lines from both versions, instead of leaving conflict
983 markers. This tends to leave the added lines in the
984 resulting file in random order and the user should
985 verify the result. Do not use this if you do not
986 understand the implications.
987
988
88e7fdf2
JH
989Defining a custom merge driver
990^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
991
0e545f75
JH
992The definition of a merge driver is done in the `.git/config`
993file, not in the `gitattributes` file, so strictly speaking this
994manual page is a wrong place to talk about it. However...
88e7fdf2
JH
995
996To define a custom merge driver `filfre`, add a section to your
997`$GIT_DIR/config` file (or `$HOME/.gitconfig` file) like this:
998
999----------------------------------------------------------------
1000[merge "filfre"]
1001 name = feel-free merge driver
ef45bb1f 1002 driver = filfre %O %A %B %L %P
88e7fdf2
JH
1003 recursive = binary
1004----------------------------------------------------------------
1005
1006The `merge.*.name` variable gives the driver a human-readable
1007name.
1008
1009The `merge.*.driver` variable's value is used to construct a
1010command to run to merge ancestor's version (`%O`), current
1011version (`%A`) and the other branches' version (`%B`). These
1012three tokens are replaced with the names of temporary files that
1013hold the contents of these versions when the command line is
16758621
BW
1014built. Additionally, %L will be replaced with the conflict marker
1015size (see below).
88e7fdf2
JH
1016
1017The merge driver is expected to leave the result of the merge in
1018the file named with `%A` by overwriting it, and exit with zero
1019status if it managed to merge them cleanly, or non-zero if there
1020were conflicts.
1021
1022The `merge.*.recursive` variable specifies what other merge
1023driver to use when the merge driver is called for an internal
1024merge between common ancestors, when there are more than one.
1025When left unspecified, the driver itself is used for both
1026internal merge and the final merge.
1027
ef45bb1f
JH
1028The merge driver can learn the pathname in which the merged result
1029will be stored via placeholder `%P`.
1030
88e7fdf2 1031
4c734803
JH
1032`conflict-marker-size`
1033^^^^^^^^^^^^^^^^^^^^^^
1034
1035This attribute controls the length of conflict markers left in
1036the work tree file during a conflicted merge. Only setting to
1037the value to a positive integer has any meaningful effect.
1038
1039For example, this line in `.gitattributes` can be used to tell the merge
1040machinery to leave much longer (instead of the usual 7-character-long)
1041conflict markers when merging the file `Documentation/git-merge.txt`
1042results in a conflict.
1043
1044------------------------
1045Documentation/git-merge.txt conflict-marker-size=32
1046------------------------
1047
1048
cf1b7869
JH
1049Checking whitespace errors
1050~~~~~~~~~~~~~~~~~~~~~~~~~~
1051
1052`whitespace`
1053^^^^^^^^^^^^
1054
1055The `core.whitespace` configuration variable allows you to define what
2fd02c92 1056'diff' and 'apply' should consider whitespace errors for all paths in
5162e697 1057the project (See linkgit:git-config[1]). This attribute gives you finer
cf1b7869
JH
1058control per path.
1059
1060Set::
1061
2de9b711 1062 Notice all types of potential whitespace errors known to Git.
f4b05a49
JS
1063 The tab width is taken from the value of the `core.whitespace`
1064 configuration variable.
cf1b7869
JH
1065
1066Unset::
1067
1068 Do not notice anything as error.
1069
1070Unspecified::
1071
f4b05a49 1072 Use the value of the `core.whitespace` configuration variable to
cf1b7869
JH
1073 decide what to notice as error.
1074
1075String::
1076
1077 Specify a comma separate list of common whitespace problems to
f4b05a49 1078 notice in the same format as the `core.whitespace` configuration
cf1b7869
JH
1079 variable.
1080
1081
8a33dd8b
JH
1082Creating an archive
1083~~~~~~~~~~~~~~~~~~~
1084
08b51f51
JH
1085`export-ignore`
1086^^^^^^^^^^^^^^^
1087
1088Files and directories with the attribute `export-ignore` won't be added to
1089archive files.
1090
8a33dd8b
JH
1091`export-subst`
1092^^^^^^^^^^^^^^
1093
2de9b711 1094If the attribute `export-subst` is set for a file then Git will expand
8a33dd8b 1095several placeholders when adding this file to an archive. The
08b51f51 1096expansion depends on the availability of a commit ID, i.e., if
8a33dd8b
JH
1097linkgit:git-archive[1] has been given a tree instead of a commit or a
1098tag then no replacement will be done. The placeholders are the same
1099as those for the option `--pretty=format:` of linkgit:git-log[1],
1100except that they need to be wrapped like this: `$Format:PLACEHOLDERS$`
1101in the file. E.g. the string `$Format:%H$` will be replaced by the
1102commit hash.
1103
1104
975457f1
NG
1105Packing objects
1106~~~~~~~~~~~~~~~
1107
1108`delta`
1109^^^^^^^
1110
1111Delta compression will not be attempted for blobs for paths with the
1112attribute `delta` set to false.
1113
1114
a2df1fb2
AG
1115Viewing files in GUI tools
1116~~~~~~~~~~~~~~~~~~~~~~~~~~
1117
1118`encoding`
1119^^^^^^^^^^
1120
1121The value of this attribute specifies the character encoding that should
1122be used by GUI tools (e.g. linkgit:gitk[1] and linkgit:git-gui[1]) to
1123display the contents of the relevant file. Note that due to performance
1124considerations linkgit:gitk[1] does not use this attribute unless you
1125manually enable per-file encodings in its options.
1126
1127If this attribute is not set or has an invalid value, the value of the
1128`gui.encoding` configuration variable is used instead
1129(See linkgit:git-config[1]).
1130
1131
0922570c 1132USING MACRO ATTRIBUTES
bbb896d8
JH
1133----------------------
1134
1135You do not want any end-of-line conversions applied to, nor textual diffs
1136produced for, any binary file you track. You would need to specify e.g.
1137
1138------------
5ec3e670 1139*.jpg -text -diff
bbb896d8
JH
1140------------
1141
1142but that may become cumbersome, when you have many attributes. Using
0922570c 1143macro attributes, you can define an attribute that, when set, also
98e84066 1144sets or unsets a number of other attributes at the same time. The
0922570c 1145system knows a built-in macro attribute, `binary`:
bbb896d8
JH
1146
1147------------
1148*.jpg binary
1149------------
1150
98e84066 1151Setting the "binary" attribute also unsets the "text" and "diff"
0922570c 1152attributes as above. Note that macro attributes can only be "Set",
98e84066
MH
1153though setting one might have the effect of setting or unsetting other
1154attributes or even returning other attributes to the "Unspecified"
1155state.
bbb896d8
JH
1156
1157
0922570c 1158DEFINING MACRO ATTRIBUTES
bbb896d8
JH
1159-------------------------
1160
e78e6967
MH
1161Custom macro attributes can be defined only in top-level gitattributes
1162files (`$GIT_DIR/info/attributes`, the `.gitattributes` file at the
1163top level of the working tree, or the global or system-wide
1164gitattributes files), not in `.gitattributes` files in working tree
1165subdirectories. The built-in macro attribute "binary" is equivalent
1166to:
bbb896d8
JH
1167
1168------------
155a4b71 1169[attr]binary -diff -merge -text
bbb896d8
JH
1170------------
1171
1172
88e7fdf2
JH
1173EXAMPLE
1174-------
1175
1176If you have these three `gitattributes` file:
1177
1178----------------------------------------------------------------
1179(in $GIT_DIR/info/attributes)
1180
1181a* foo !bar -baz
1182
1183(in .gitattributes)
1184abc foo bar baz
1185
1186(in t/.gitattributes)
1187ab* merge=filfre
1188abc -foo -bar
1189*.c frotz
1190----------------------------------------------------------------
1191
1192the attributes given to path `t/abc` are computed as follows:
1193
11941. By examining `t/.gitattributes` (which is in the same
2de9b711 1195 directory as the path in question), Git finds that the first
88e7fdf2
JH
1196 line matches. `merge` attribute is set. It also finds that
1197 the second line matches, and attributes `foo` and `bar`
1198 are unset.
1199
12002. Then it examines `.gitattributes` (which is in the parent
1201 directory), and finds that the first line matches, but
1202 `t/.gitattributes` file already decided how `merge`, `foo`
1203 and `bar` attributes should be given to this path, so it
1204 leaves `foo` and `bar` unset. Attribute `baz` is set.
1205
5c759f96 12063. Finally it examines `$GIT_DIR/info/attributes`. This file
88e7fdf2
JH
1207 is used to override the in-tree settings. The first line is
1208 a match, and `foo` is set, `bar` is reverted to unspecified
1209 state, and `baz` is unset.
1210
02783075 1211As the result, the attributes assignment to `t/abc` becomes:
88e7fdf2
JH
1212
1213----------------------------------------------------------------
1214foo set to true
1215bar unspecified
1216baz set to false
1217merge set to string value "filfre"
1218frotz unspecified
1219----------------------------------------------------------------
1220
1221
cde15181
MH
1222SEE ALSO
1223--------
1224linkgit:git-check-attr[1].
8460b2fc 1225
88e7fdf2
JH
1226GIT
1227---
9e1f0a85 1228Part of the linkgit:git[1] suite