]> git.ipfire.org Git - thirdparty/gcc.git/blob - gcc/doc/gty.texi
Revert "sphinx: remove texinfo files"
[thirdparty/gcc.git] / gcc / doc / gty.texi
1 @c Copyright (C) 2002-2022 Free Software Foundation, Inc.
2 @c This is part of the GCC manual.
3 @c For copying conditions, see the file gcc.texi.
4
5 @node Type Information
6 @chapter Memory Management and Type Information
7 @cindex GGC
8 @findex GTY
9
10 GCC uses some fairly sophisticated memory management techniques, which
11 involve determining information about GCC's data structures from GCC's
12 source code and using this information to perform garbage collection and
13 implement precompiled headers.
14
15 A full C++ parser would be too complicated for this task, so a limited
16 subset of C++ is interpreted and special markers are used to determine
17 what parts of the source to look at. All @code{struct}, @code{union}
18 and @code{template} structure declarations that define data structures
19 that are allocated under control of the garbage collector must be
20 marked. All global variables that hold pointers to garbage-collected
21 memory must also be marked. Finally, all global variables that need
22 to be saved and restored by a precompiled header must be marked. (The
23 precompiled header mechanism can only save static variables if they're
24 scalar. Complex data structures must be allocated in garbage-collected
25 memory to be saved in a precompiled header.)
26
27 The full format of a marker is
28 @smallexample
29 GTY (([@var{option}] [(@var{param})], [@var{option}] [(@var{param})] @dots{}))
30 @end smallexample
31 @noindent
32 but in most cases no options are needed. The outer double parentheses
33 are still necessary, though: @code{GTY(())}. Markers can appear:
34
35 @itemize @bullet
36 @item
37 In a structure definition, before the open brace;
38 @item
39 In a global variable declaration, after the keyword @code{static} or
40 @code{extern}; and
41 @item
42 In a structure field definition, before the name of the field.
43 @end itemize
44
45 Here are some examples of marking simple data structures and globals.
46
47 @smallexample
48 struct GTY(()) @var{tag}
49 @{
50 @var{fields}@dots{}
51 @};
52
53 typedef struct GTY(()) @var{tag}
54 @{
55 @var{fields}@dots{}
56 @} *@var{typename};
57
58 static GTY(()) struct @var{tag} *@var{list}; /* @r{points to GC memory} */
59 static GTY(()) int @var{counter}; /* @r{save counter in a PCH} */
60 @end smallexample
61
62 The parser understands simple typedefs such as
63 @code{typedef struct @var{tag} *@var{name};} and
64 @code{typedef int @var{name};}.
65 These don't need to be marked.
66
67 However, in combination with GTY, avoid using typedefs such as
68 @code{typedef int_hash<@dots{}> @var{name};}
69 for these generate infinite-recursion code.
70 See @uref{https://gcc.gnu.org/PR103157,PR103157}.
71 Instead, you may use
72 @code{struct @var{name} : int_hash<@dots{}> @{@};},
73 for example.
74
75 Since @code{gengtype}'s understanding of C++ is limited, there are
76 several constructs and declarations that are not supported inside
77 classes/structures marked for automatic GC code generation. The
78 following C++ constructs produce a @code{gengtype} error on
79 structures/classes marked for automatic GC code generation:
80
81 @itemize @bullet
82 @item
83 Type definitions inside classes/structures are not supported.
84 @item
85 Enumerations inside classes/structures are not supported.
86 @end itemize
87
88 If you have a class or structure using any of the above constructs,
89 you need to mark that class as @code{GTY ((user))} and provide your
90 own marking routines (see section @ref{User GC} for details).
91
92 It is always valid to include function definitions inside classes.
93 Those are always ignored by @code{gengtype}, as it only cares about
94 data members.
95
96 @menu
97 * GTY Options:: What goes inside a @code{GTY(())}.
98 * Inheritance and GTY:: Adding GTY to a class hierarchy.
99 * User GC:: Adding user-provided GC marking routines.
100 * GGC Roots:: Making global variables GGC roots.
101 * Files:: How the generated files work.
102 * Invoking the garbage collector:: How to invoke the garbage collector.
103 * Troubleshooting:: When something does not work as expected.
104 @end menu
105
106 @node GTY Options
107 @section The Inside of a @code{GTY(())}
108
109 Sometimes the C code is not enough to fully describe the type
110 structure. Extra information can be provided with @code{GTY} options
111 and additional markers. Some options take a parameter, which may be
112 either a string or a type name, depending on the parameter. If an
113 option takes no parameter, it is acceptable either to omit the
114 parameter entirely, or to provide an empty string as a parameter. For
115 example, @code{@w{GTY ((skip))}} and @code{@w{GTY ((skip ("")))}} are
116 equivalent.
117
118 When the parameter is a string, often it is a fragment of C code. Four
119 special escapes may be used in these strings, to refer to pieces of
120 the data structure being marked:
121
122 @cindex % in GTY option
123 @table @code
124 @item %h
125 The current structure.
126 @item %1
127 The structure that immediately contains the current structure.
128 @item %0
129 The outermost structure that contains the current structure.
130 @item %a
131 A partial expression of the form @code{[i1][i2]@dots{}} that indexes
132 the array item currently being marked.
133 @end table
134
135 For instance, suppose that you have a structure of the form
136 @smallexample
137 struct A @{
138 @dots{}
139 @};
140 struct B @{
141 struct A foo[12];
142 @};
143 @end smallexample
144 @noindent
145 and @code{b} is a variable of type @code{struct B}. When marking
146 @samp{b.foo[11]}, @code{%h} would expand to @samp{b.foo[11]},
147 @code{%0} and @code{%1} would both expand to @samp{b}, and @code{%a}
148 would expand to @samp{[11]}.
149
150 As in ordinary C, adjacent strings will be concatenated; this is
151 helpful when you have a complicated expression.
152 @smallexample
153 @group
154 GTY ((chain_next ("TREE_CODE (&%h.generic) == INTEGER_TYPE"
155 " ? TYPE_NEXT_VARIANT (&%h.generic)"
156 " : TREE_CHAIN (&%h.generic)")))
157 @end group
158 @end smallexample
159
160 The available options are:
161
162 @table @code
163 @findex length
164 @item length ("@var{expression}")
165
166 There are two places the type machinery will need to be explicitly told
167 the length of an array of non-atomic objects. The first case is when a
168 structure ends in a variable-length array, like this:
169 @smallexample
170 struct GTY(()) rtvec_def @{
171 int num_elem; /* @r{number of elements} */
172 rtx GTY ((length ("%h.num_elem"))) elem[1];
173 @};
174 @end smallexample
175
176 In this case, the @code{length} option is used to override the specified
177 array length (which should usually be @code{1}). The parameter of the
178 option is a fragment of C code that calculates the length.
179
180 The second case is when a structure or a global variable contains a
181 pointer to an array, like this:
182 @smallexample
183 struct gimple_omp_for_iter * GTY((length ("%h.collapse"))) iter;
184 @end smallexample
185 In this case, @code{iter} has been allocated by writing something like
186 @smallexample
187 x->iter = ggc_alloc_cleared_vec_gimple_omp_for_iter (collapse);
188 @end smallexample
189 and the @code{collapse} provides the length of the field.
190
191 This second use of @code{length} also works on global variables, like:
192 @verbatim
193 static GTY((length("reg_known_value_size"))) rtx *reg_known_value;
194 @end verbatim
195
196 Note that the @code{length} option is only meant for use with arrays of
197 non-atomic objects, that is, objects that contain pointers pointing to
198 other GTY-managed objects. For other GC-allocated arrays and strings
199 you should use @code{atomic} or @code{string_length}.
200
201 @findex string_length
202 @item string_length ("@var{expression}")
203
204 In order to simplify production of PCH, a structure member that is a plain
205 array of bytes (an optionally @code{const} and/or @code{unsigned} @code{char
206 *}) is treated specially by the infrastructure. Even if such an array has not
207 been allocated in GC-controlled memory, it will still be written properly into
208 a PCH. The machinery responsible for this needs to know the length of the
209 data; by default, the length is determined by calling @code{strlen} on the
210 pointer. The @code{string_length} option specifies an alternate way to
211 determine the length, such as by inspecting another struct member:
212
213 @smallexample
214 struct GTY(()) non_terminated_string @{
215 size_t sz;
216 const char * GTY((string_length ("%h.sz"))) data;
217 @};
218 @end smallexample
219
220 @findex skip
221 @item skip
222
223 If @code{skip} is applied to a field, the type machinery will ignore it.
224 This is somewhat dangerous; the only safe use is in a union when one
225 field really isn't ever used.
226
227 @findex callback
228 @item callback
229
230 @code{callback} should be applied to fields with pointer to function type
231 and causes the field to be ignored similarly to @code{skip}, except when
232 writing PCH and the field is non-NULL it will remember the field's address
233 for relocation purposes if the process writing PCH has different load base
234 from a process reading PCH.
235
236 @findex for_user
237 @item for_user
238
239 Use this to mark types that need to be marked by user gc routines, but are not
240 refered to in a template argument. So if you have some user gc type T1 and a
241 non user gc type T2 you can give T2 the for_user option so that the marking
242 functions for T1 can call non mangled functions to mark T2.
243
244 @findex desc
245 @findex tag
246 @findex default
247 @item desc ("@var{expression}")
248 @itemx tag ("@var{constant}")
249 @itemx default
250
251 The type machinery needs to be told which field of a @code{union} is
252 currently active. This is done by giving each field a constant
253 @code{tag} value, and then specifying a discriminator using @code{desc}.
254 The value of the expression given by @code{desc} is compared against
255 each @code{tag} value, each of which should be different. If no
256 @code{tag} is matched, the field marked with @code{default} is used if
257 there is one, otherwise no field in the union will be marked.
258
259 In the @code{desc} option, the ``current structure'' is the union that
260 it discriminates. Use @code{%1} to mean the structure containing it.
261 There are no escapes available to the @code{tag} option, since it is a
262 constant.
263
264 For example,
265 @smallexample
266 struct GTY(()) tree_binding
267 @{
268 struct tree_common common;
269 union tree_binding_u @{
270 tree GTY ((tag ("0"))) scope;
271 struct cp_binding_level * GTY ((tag ("1"))) level;
272 @} GTY ((desc ("BINDING_HAS_LEVEL_P ((tree)&%0)"))) xscope;
273 tree value;
274 @};
275 @end smallexample
276
277 In this example, the value of BINDING_HAS_LEVEL_P when applied to a
278 @code{struct tree_binding *} is presumed to be 0 or 1. If 1, the type
279 mechanism will treat the field @code{level} as being present and if 0,
280 will treat the field @code{scope} as being present.
281
282 The @code{desc} and @code{tag} options can also be used for inheritance
283 to denote which subclass an instance is. See @ref{Inheritance and GTY}
284 for more information.
285
286 @findex cache
287 @item cache
288
289 When the @code{cache} option is applied to a global variable gt_cleare_cache is
290 called on that variable between the mark and sweep phases of garbage
291 collection. The gt_clear_cache function is free to mark blocks as used, or to
292 clear pointers in the variable.
293
294 @findex deletable
295 @item deletable
296
297 @code{deletable}, when applied to a global variable, indicates that when
298 garbage collection runs, there's no need to mark anything pointed to
299 by this variable, it can just be set to @code{NULL} instead. This is used
300 to keep a list of free structures around for re-use.
301
302 @findex maybe_undef
303 @item maybe_undef
304
305 When applied to a field, @code{maybe_undef} indicates that it's OK if
306 the structure that this fields points to is never defined, so long as
307 this field is always @code{NULL}. This is used to avoid requiring
308 backends to define certain optional structures. It doesn't work with
309 language frontends.
310
311 @findex nested_ptr
312 @item nested_ptr (@var{type}, "@var{to expression}", "@var{from expression}")
313
314 The type machinery expects all pointers to point to the start of an
315 object. Sometimes for abstraction purposes it's convenient to have
316 a pointer which points inside an object. So long as it's possible to
317 convert the original object to and from the pointer, such pointers
318 can still be used. @var{type} is the type of the original object,
319 the @var{to expression} returns the pointer given the original object,
320 and the @var{from expression} returns the original object given
321 the pointer. The pointer will be available using the @code{%h}
322 escape.
323
324 @findex chain_next
325 @findex chain_prev
326 @findex chain_circular
327 @item chain_next ("@var{expression}")
328 @itemx chain_prev ("@var{expression}")
329 @itemx chain_circular ("@var{expression}")
330
331 It's helpful for the type machinery to know if objects are often
332 chained together in long lists; this lets it generate code that uses
333 less stack space by iterating along the list instead of recursing down
334 it. @code{chain_next} is an expression for the next item in the list,
335 @code{chain_prev} is an expression for the previous item. For singly
336 linked lists, use only @code{chain_next}; for doubly linked lists, use
337 both. The machinery requires that taking the next item of the
338 previous item gives the original item. @code{chain_circular} is similar
339 to @code{chain_next}, but can be used for circular single linked lists.
340
341 @findex reorder
342 @item reorder ("@var{function name}")
343
344 Some data structures depend on the relative ordering of pointers. If
345 the precompiled header machinery needs to change that ordering, it
346 will call the function referenced by the @code{reorder} option, before
347 changing the pointers in the object that's pointed to by the field the
348 option applies to. The function must take four arguments, with the
349 signature @samp{@w{void *, void *, gt_pointer_operator, void *}}.
350 The first parameter is a pointer to the structure that contains the
351 object being updated, or the object itself if there is no containing
352 structure. The second parameter is a cookie that should be ignored.
353 The third parameter is a routine that, given a pointer, will update it
354 to its correct new value. The fourth parameter is a cookie that must
355 be passed to the second parameter.
356
357 PCH cannot handle data structures that depend on the absolute values
358 of pointers. @code{reorder} functions can be expensive. When
359 possible, it is better to depend on properties of the data, like an ID
360 number or the hash of a string instead.
361
362 @findex atomic
363 @item atomic
364
365 The @code{atomic} option can only be used with pointers. It informs
366 the GC machinery that the memory that the pointer points to does not
367 contain any pointers, and hence it should be treated by the GC and PCH
368 machinery as an ``atomic'' block of memory that does not need to be
369 examined when scanning memory for pointers. In particular, the
370 machinery will not scan that memory for pointers to mark them as
371 reachable (when marking pointers for GC) or to relocate them (when
372 writing a PCH file).
373
374 The @code{atomic} option differs from the @code{skip} option.
375 @code{atomic} keeps the memory under Garbage Collection, but makes the
376 GC ignore the contents of the memory. @code{skip} is more drastic in
377 that it causes the pointer and the memory to be completely ignored by
378 the Garbage Collector. So, memory marked as @code{atomic} is
379 automatically freed when no longer reachable, while memory marked as
380 @code{skip} is not.
381
382 The @code{atomic} option must be used with great care, because all
383 sorts of problem can occur if used incorrectly, that is, if the memory
384 the pointer points to does actually contain a pointer.
385
386 Here is an example of how to use it:
387 @smallexample
388 struct GTY(()) my_struct @{
389 int number_of_elements;
390 unsigned int * GTY ((atomic)) elements;
391 @};
392 @end smallexample
393 In this case, @code{elements} is a pointer under GC, and the memory it
394 points to needs to be allocated using the Garbage Collector, and will
395 be freed automatically by the Garbage Collector when it is no longer
396 referenced. But the memory that the pointer points to is an array of
397 @code{unsigned int} elements, and the GC must not try to scan it to
398 find pointers to mark or relocate, which is why it is marked with the
399 @code{atomic} option.
400
401 Note that, currently, global variables cannot be marked with
402 @code{atomic}; only fields of a struct can. This is a known
403 limitation. It would be useful to be able to mark global pointers
404 with @code{atomic} to make the PCH machinery aware of them so that
405 they are saved and restored correctly to PCH files.
406
407 @findex special
408 @item special ("@var{name}")
409
410 The @code{special} option is used to mark types that have to be dealt
411 with by special case machinery. The parameter is the name of the
412 special case. See @file{gengtype.cc} for further details. Avoid
413 adding new special cases unless there is no other alternative.
414
415 @findex user
416 @item user
417
418 The @code{user} option indicates that the code to mark structure
419 fields is completely handled by user-provided routines. See section
420 @ref{User GC} for details on what functions need to be provided.
421 @end table
422
423 @node Inheritance and GTY
424 @section Support for inheritance
425 gengtype has some support for simple class hierarchies. You can use
426 this to have gengtype autogenerate marking routines, provided:
427
428 @itemize @bullet
429 @item
430 There must be a concrete base class, with a discriminator expression
431 that can be used to identify which subclass an instance is.
432 @item
433 Only single inheritance is used.
434 @item
435 None of the classes within the hierarchy are templates.
436 @end itemize
437
438 If your class hierarchy does not fit in this pattern, you must use
439 @ref{User GC} instead.
440
441 The base class and its discriminator must be identified using the ``desc''
442 option. Each concrete subclass must use the ``tag'' option to identify
443 which value of the discriminator it corresponds to.
444
445 Every class in the hierarchy must have a @code{GTY(())} marker, as
446 gengtype will only attempt to parse classes that have such a marker
447 @footnote{Classes lacking such a marker will not be identified as being
448 part of the hierarchy, and so the marking routines will not handle them,
449 leading to a assertion failure within the marking routines due to an
450 unknown tag value (assuming that assertions are enabled).}.
451
452 @smallexample
453 class GTY((desc("%h.kind"), tag("0"))) example_base
454 @{
455 public:
456 int kind;
457 tree a;
458 @};
459
460 class GTY((tag("1"))) some_subclass : public example_base
461 @{
462 public:
463 tree b;
464 @};
465
466 class GTY((tag("2"))) some_other_subclass : public example_base
467 @{
468 public:
469 tree c;
470 @};
471 @end smallexample
472
473 The generated marking routines for the above will contain a ``switch''
474 on ``kind'', visiting all appropriate fields. For example, if kind is
475 2, it will cast to ``some_other_subclass'' and visit fields a, b, and c.
476
477 @node User GC
478 @section Support for user-provided GC marking routines
479 @cindex user gc
480 The garbage collector supports types for which no automatic marking
481 code is generated. For these types, the user is required to provide
482 three functions: one to act as a marker for garbage collection, and
483 two functions to act as marker and pointer walker for pre-compiled
484 headers.
485
486 Given a structure @code{struct GTY((user)) my_struct}, the following functions
487 should be defined to mark @code{my_struct}:
488
489 @smallexample
490 void gt_ggc_mx (my_struct *p)
491 @{
492 /* This marks field 'fld'. */
493 gt_ggc_mx (p->fld);
494 @}
495
496 void gt_pch_nx (my_struct *p)
497 @{
498 /* This marks field 'fld'. */
499 gt_pch_nx (tp->fld);
500 @}
501
502 void gt_pch_nx (my_struct *p, gt_pointer_operator op, void *cookie)
503 @{
504 /* For every field 'fld', call the given pointer operator. */
505 op (&(tp->fld), NULL, cookie);
506 @}
507 @end smallexample
508
509 In general, each marker @code{M} should call @code{M} for every
510 pointer field in the structure. Fields that are not allocated in GC
511 or are not pointers must be ignored.
512
513 For embedded lists (e.g., structures with a @code{next} or @code{prev}
514 pointer), the marker must follow the chain and mark every element in
515 it.
516
517 Note that the rules for the pointer walker @code{gt_pch_nx (my_struct
518 *, gt_pointer_operator, void *)} are slightly different. In this
519 case, the operation @code{op} must be applied to the @emph{address} of
520 every pointer field.
521
522 @subsection User-provided marking routines for template types
523 When a template type @code{TP} is marked with @code{GTY}, all
524 instances of that type are considered user-provided types. This means
525 that the individual instances of @code{TP} do not need to be marked
526 with @code{GTY}. The user needs to provide template functions to mark
527 all the fields of the type.
528
529 The following code snippets represent all the functions that need to
530 be provided. Note that type @code{TP} may reference to more than one
531 type. In these snippets, there is only one type @code{T}, but there
532 could be more.
533
534 @smallexample
535 template<typename T>
536 void gt_ggc_mx (TP<T> *tp)
537 @{
538 extern void gt_ggc_mx (T&);
539
540 /* This marks field 'fld' of type 'T'. */
541 gt_ggc_mx (tp->fld);
542 @}
543
544 template<typename T>
545 void gt_pch_nx (TP<T> *tp)
546 @{
547 extern void gt_pch_nx (T&);
548
549 /* This marks field 'fld' of type 'T'. */
550 gt_pch_nx (tp->fld);
551 @}
552
553 template<typename T>
554 void gt_pch_nx (TP<T *> *tp, gt_pointer_operator op, void *cookie)
555 @{
556 /* For every field 'fld' of 'tp' with type 'T *', call the given
557 pointer operator. */
558 op (&(tp->fld), NULL, cookie);
559 @}
560
561 template<typename T>
562 void gt_pch_nx (TP<T> *tp, gt_pointer_operator, void *cookie)
563 @{
564 extern void gt_pch_nx (T *, gt_pointer_operator, void *);
565
566 /* For every field 'fld' of 'tp' with type 'T', call the pointer
567 walker for all the fields of T. */
568 gt_pch_nx (&(tp->fld), op, cookie);
569 @}
570 @end smallexample
571
572 Support for user-defined types is currently limited. The following
573 restrictions apply:
574
575 @enumerate
576 @item Type @code{TP} and all the argument types @code{T} must be
577 marked with @code{GTY}.
578
579 @item Type @code{TP} can only have type names in its argument list.
580
581 @item The pointer walker functions are different for @code{TP<T>} and
582 @code{TP<T *>}. In the case of @code{TP<T>}, references to
583 @code{T} must be handled by calling @code{gt_pch_nx} (which
584 will, in turn, walk all the pointers inside fields of @code{T}).
585 In the case of @code{TP<T *>}, references to @code{T *} must be
586 handled by calling the @code{op} function on the address of the
587 pointer (see the code snippets above).
588 @end enumerate
589
590 @node GGC Roots
591 @section Marking Roots for the Garbage Collector
592 @cindex roots, marking
593 @cindex marking roots
594
595 In addition to keeping track of types, the type machinery also locates
596 the global variables (@dfn{roots}) that the garbage collector starts
597 at. Roots must be declared using one of the following syntaxes:
598
599 @itemize @bullet
600 @item
601 @code{extern GTY(([@var{options}])) @var{type} @var{name};}
602 @item
603 @code{static GTY(([@var{options}])) @var{type} @var{name};}
604 @end itemize
605 @noindent
606 The syntax
607 @itemize @bullet
608 @item
609 @code{GTY(([@var{options}])) @var{type} @var{name};}
610 @end itemize
611 @noindent
612 is @emph{not} accepted. There should be an @code{extern} declaration
613 of such a variable in a header somewhere---mark that, not the
614 definition. Or, if the variable is only used in one file, make it
615 @code{static}.
616
617 @node Files
618 @section Source Files Containing Type Information
619 @cindex generated files
620 @cindex files, generated
621
622 Whenever you add @code{GTY} markers to a source file that previously
623 had none, or create a new source file containing @code{GTY} markers,
624 there are three things you need to do:
625
626 @enumerate
627 @item
628 You need to add the file to the list of source files the type
629 machinery scans. There are four cases:
630
631 @enumerate a
632 @item
633 For a back-end file, this is usually done
634 automatically; if not, you should add it to @code{target_gtfiles} in
635 the appropriate port's entries in @file{config.gcc}.
636
637 @item
638 For files shared by all front ends, add the filename to the
639 @code{GTFILES} variable in @file{Makefile.in}.
640
641 @item
642 For files that are part of one front end, add the filename to the
643 @code{gtfiles} variable defined in the appropriate
644 @file{config-lang.in}.
645 Headers should appear before non-headers in this list.
646
647 @item
648 For files that are part of some but not all front ends, add the
649 filename to the @code{gtfiles} variable of @emph{all} the front ends
650 that use it.
651 @end enumerate
652
653 @item
654 If the file was a header file, you'll need to check that it's included
655 in the right place to be visible to the generated files. For a back-end
656 header file, this should be done automatically. For a front-end header
657 file, it needs to be included by the same file that includes
658 @file{gtype-@var{lang}.h}. For other header files, it needs to be
659 included in @file{gtype-desc.cc}, which is a generated file, so add it to
660 @code{ifiles} in @code{open_base_file} in @file{gengtype.cc}.
661
662 For source files that aren't header files, the machinery will generate a
663 header file that should be included in the source file you just changed.
664 The file will be called @file{gt-@var{path}.h} where @var{path} is the
665 pathname relative to the @file{gcc} directory with slashes replaced by
666 @verb{|-|}, so for example the header file to be included in
667 @file{cp/parser.cc} is called @file{gt-cp-parser.h}. The
668 generated header file should be included after everything else in the
669 source file.
670
671 @end enumerate
672
673 For language frontends, there is another file that needs to be included
674 somewhere. It will be called @file{gtype-@var{lang}.h}, where
675 @var{lang} is the name of the subdirectory the language is contained in.
676
677 Plugins can add additional root tables. Run the @code{gengtype}
678 utility in plugin mode as @code{gengtype -P pluginout.h @var{source-dir}
679 @var{file-list} @var{plugin*.c}} with your plugin files
680 @var{plugin*.c} using @code{GTY} to generate the @var{pluginout.h} file.
681 The GCC build tree is needed to be present in that mode.
682
683
684 @node Invoking the garbage collector
685 @section How to invoke the garbage collector
686 @cindex garbage collector, invocation
687 @findex ggc_collect
688
689 The GCC garbage collector GGC is only invoked explicitly. In contrast
690 with many other garbage collectors, it is not implicitly invoked by
691 allocation routines when a lot of memory has been consumed. So the
692 only way to have GGC reclaim storage is to call the @code{ggc_collect}
693 function explicitly.
694 With @var{mode} @code{GGC_COLLECT_FORCE} or otherwise (default
695 @code{GGC_COLLECT_HEURISTIC}) when the internal heuristic decides to
696 collect, this call is potentially an expensive operation, as it may
697 have to scan the entire heap. Beware that local variables (on the GCC
698 call stack) are not followed by such an invocation (as many other
699 garbage collectors do): you should reference all your data from static
700 or external @code{GTY}-ed variables, and it is advised to call
701 @code{ggc_collect} with a shallow call stack. The GGC is an exact mark
702 and sweep garbage collector (so it does not scan the call stack for
703 pointers). In practice GCC passes don't often call @code{ggc_collect}
704 themselves, because it is called by the pass manager between passes.
705
706 At the time of the @code{ggc_collect} call all pointers in the GC-marked
707 structures must be valid or @code{NULL}. In practice this means that
708 there should not be uninitialized pointer fields in the structures even
709 if your code never reads or writes those fields at a particular
710 instance. One way to ensure this is to use cleared versions of
711 allocators unless all the fields are initialized manually immediately
712 after allocation.
713
714 @node Troubleshooting
715 @section Troubleshooting the garbage collector
716 @cindex garbage collector, troubleshooting
717
718 With the current garbage collector implementation, most issues should
719 show up as GCC compilation errors. Some of the most commonly
720 encountered issues are described below.
721
722 @itemize @bullet
723 @item Gengtype does not produce allocators for a @code{GTY}-marked type.
724 Gengtype checks if there is at least one possible path from GC roots to
725 at least one instance of each type before outputting allocators. If
726 there is no such path, the @code{GTY} markers will be ignored and no
727 allocators will be output. Solve this by making sure that there exists
728 at least one such path. If creating it is unfeasible or raises a ``code
729 smell'', consider if you really must use GC for allocating such type.
730
731 @item Link-time errors about undefined @code{gt_ggc_r_foo_bar} and
732 similarly-named symbols. Check if your @file{foo_bar} source file has
733 @code{#include "gt-foo_bar.h"} as its very last line.
734
735 @end itemize