1 @c Copyright (C) 2002-2014 Free Software Foundation, Inc.
2 @c This is part of the GCC manual.
3 @c For copying conditions, see the file gcc.texi.
4
5 @node Type Information
6 @chapter Memory Management and Type Information
7 @cindex GGC
8 @findex GTY
9
10 GCC uses some fairly sophisticated memory management techniques, which
11 involve determining information about GCC's data structures from GCC's
12 source code and using this information to perform garbage collection and
13 implement precompiled headers.
14
15 A full C++ parser would be too complicated for this task, so a limited
16 subset of C++ is interpreted and special markers are used to determine
17 what parts of the source to look at. All @code{struct}, @code{union}
18 and @code{template} structure declarations that define data structures
19 that are allocated under control of the garbage collector must be
20 marked. All global variables that hold pointers to garbage-collected
21 memory must also be marked. Finally, all global variables that need
22 to be saved and restored by a precompiled header must be marked. (The
23 precompiled header mechanism can only save static variables if they're
24 scalar. Complex data structures must be allocated in garbage-collected
25 memory to be saved in a precompiled header.)
26
27 The full format of a marker is
28 @smallexample
29 GTY (([@var{option}] [(@var{param})], [@var{option}] [(@var{param})] @dots{}))
30 @end smallexample
31 @noindent
32 but in most cases no options are needed. The outer double parentheses
33 are still necessary, though: @code{GTY(())}. Markers can appear:
34
35 @itemize @bullet
36 @item
37 In a structure definition, before the open brace;
38 @item
39 In a global variable declaration, after the keyword @code{static} or
40 @code{extern}; and
41 @item
42 In a structure field definition, before the name of the field.
43 @end itemize
44
45 Here are some examples of marking simple data structures and globals.
46
47 @smallexample
48 struct GTY(()) @var{tag}
49 @{
50 @var{fields}@dots{}
51 @};
52
53 typedef struct GTY(()) @var{tag}
54 @{
55 @var{fields}@dots{}
56 @} *@var{typename};
57
58 static GTY(()) struct @var{tag} *@var{list}; /* @r{points to GC memory} */
59 static GTY(()) int @var{counter}; /* @r{save counter in a PCH} */
60 @end smallexample
61
62 The parser understands simple typedefs such as
63 @code{typedef struct @var{tag} *@var{name};} and
64 @code{typedef int @var{name};}.
65 These don't need to be marked.
66
67 Since @code{gengtype}'s understanding of C++ is limited, there are
68 several constructs and declarations that are not supported inside
69 classes/structures marked for automatic GC code generation. The
70 following C++ constructs produce a @code{gengtype} error on
71 structures/classes marked for automatic GC code generation:
72
73 @itemize @bullet
74 @item
75 Type definitions inside classes/structures are not supported.
76 @item
77 Enumerations inside classes/structures are not supported.
78 @end itemize
79
80 If you have a class or structure using any of the above constructs,
81 you need to mark that class as @code{GTY ((user))} and provide your
82 own marking routines (see section @ref{User GC} for details).
83
84 It is always valid to include function definitions inside classes.
85 Those are always ignored by @code{gengtype}, as it only cares about
86 data members.
87
88 @menu
89 * GTY Options:: What goes inside a @code{GTY(())}.
90 * Inheritance and GTY:: Adding GTY to a class hierarchy.
91 * User GC:: Adding user-provided GC marking routines.
92 * GGC Roots:: Making global variables GGC roots.
93 * Files:: How the generated files work.
94 * Invoking the garbage collector:: How to invoke the garbage collector.
95 * Troubleshooting:: When something does not work as expected.
96 @end menu
97
98 @node GTY Options
99 @section The Inside of a @code{GTY(())}
100
101 Sometimes the C code is not enough to fully describe the type
102 structure. Extra information can be provided with @code{GTY} options
103 and additional markers. Some options take a parameter, which may be
104 either a string or a type name, depending on the parameter. If an
105 option takes no parameter, it is acceptable either to omit the
106 parameter entirely, or to provide an empty string as a parameter. For
107 example, @code{@w{GTY ((skip))}} and @code{@w{GTY ((skip ("")))}} are
108 equivalent.
109
110 When the parameter is a string, often it is a fragment of C code. Four
111 special escapes may be used in these strings, to refer to pieces of
112 the data structure being marked:
113
114 @cindex % in GTY option
115 @table @code
116 @item %h
117 The current structure.
118 @item %1
119 The structure that immediately contains the current structure.
120 @item %0
121 The outermost structure that contains the current structure.
122 @item %a
123 A partial expression of the form @code{[i1][i2]@dots{}} that indexes
124 the array item currently being marked.
125 @end table
126
127 For instance, suppose that you have a structure of the form
128 @smallexample
129 struct A @{
130 @dots{}
131 @};
132 struct B @{
133 struct A foo[12];
134 @};
135 @end smallexample
136 @noindent
137 and @code{b} is a variable of type @code{struct B}. When marking
138 @samp{b.foo[11]}, @code{%h} would expand to @samp{b.foo[11]},
139 @code{%0} and @code{%1} would both expand to @samp{b}, and @code{%a}
140 would expand to @samp{[11]}.
141
142 As in ordinary C, adjacent strings will be concatenated; this is
143 helpful when you have a complicated expression.
144 @smallexample
145 @group
146 GTY ((chain_next ("TREE_CODE (&%h.generic) == INTEGER_TYPE"
147 " ? TYPE_NEXT_VARIANT (&%h.generic)"
148 " : TREE_CHAIN (&%h.generic)")))
149 @end group
150 @end smallexample
151
152 The available options are:
153
154 @table @code
155 @findex length
156 @item length ("@var{expression}")
157
There are two places where the type machinery must be told explicitly
the length of an array of non-atomic objects.  The first case is when a
structure ends in a variable-length array, like this:
161 @smallexample
162 struct GTY(()) rtvec_def @{
163 int num_elem; /* @r{number of elements} */
164 rtx GTY ((length ("%h.num_elem"))) elem[1];
165 @};
166 @end smallexample
167
168 In this case, the @code{length} option is used to override the specified
169 array length (which should usually be @code{1}). The parameter of the
170 option is a fragment of C code that calculates the length.
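
As a rough illustration of what @code{length ("%h.num_elem")} tells the
type machinery, the following standalone sketch walks a trailing array
using the element count stored in the object rather than the declared
bound of 1.  The names @code{vec_model}, @code{make_vec} and
@code{count_non_null} are hypothetical and are not part of GCC; the
real marking code is generated by @code{gengtype}.

@verbatim
#include <cstddef>
#include <cstdlib>

/* Hypothetical stand-in for a GTY-marked structure that ends in a
   variable-length array; not a real GCC type.  */
struct vec_model
{
  int num_elem;         /* number of elements actually allocated */
  const void *elem[1];  /* declared with one element, really num_elem */
};

/* Allocate a vec_model with room for N elements, all cleared.  */
static vec_model *
make_vec (int n)
{
  std::size_t extra = n > 1 ? static_cast<std::size_t> (n - 1) : 0;
  void *mem = std::calloc (1, sizeof (vec_model)
                              + extra * sizeof (const void *));
  vec_model *v = static_cast<vec_model *> (mem);
  if (v != nullptr)
    v->num_elem = n;
  return v;
}

/* Walk the elements as a marker guided by length ("%h.num_elem")
   would: the bound comes from the object, not from the declaration.  */
static int
count_non_null (const vec_model *v)
{
  int seen = 0;
  for (int i = 0; i < v->num_elem; i++)
    if (v->elem[i] != nullptr)
      seen++;
  return seen;
}
@end verbatim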
171
172 The second case is when a structure or a global variable contains a
173 pointer to an array, like this:
174 @smallexample
175 struct gimple_omp_for_iter * GTY((length ("%h.collapse"))) iter;
176 @end smallexample
177 In this case, @code{iter} has been allocated by writing something like
178 @smallexample
179 x->iter = ggc_alloc_cleared_vec_gimple_omp_for_iter (collapse);
180 @end smallexample
181 and the @code{collapse} provides the length of the field.
182
183 This second use of @code{length} also works on global variables, like:
184 @verbatim
185 static GTY((length("reg_known_value_size"))) rtx *reg_known_value;
186 @end verbatim
187
188 Note that the @code{length} option is only meant for use with arrays of
189 non-atomic objects, that is, objects that contain pointers pointing to
190 other GTY-managed objects. For other GC-allocated arrays and strings
191 you should use @code{atomic}.
192
193 @findex skip
194 @item skip
195
196 If @code{skip} is applied to a field, the type machinery will ignore it.
197 This is somewhat dangerous; the only safe use is in a union when one
198 field really isn't ever used.
199
200 @findex desc
201 @findex tag
202 @findex default
203 @item desc ("@var{expression}")
204 @itemx tag ("@var{constant}")
205 @itemx default
206
207 The type machinery needs to be told which field of a @code{union} is
208 currently active. This is done by giving each field a constant
209 @code{tag} value, and then specifying a discriminator using @code{desc}.
210 The value of the expression given by @code{desc} is compared against
211 each @code{tag} value, each of which should be different. If no
212 @code{tag} is matched, the field marked with @code{default} is used if
213 there is one, otherwise no field in the union will be marked.
214
215 In the @code{desc} option, the ``current structure'' is the union that
216 it discriminates. Use @code{%1} to mean the structure containing it.
217 There are no escapes available to the @code{tag} option, since it is a
218 constant.
219
220 For example,
221 @smallexample
222 struct GTY(()) tree_binding
223 @{
224 struct tree_common common;
225 union tree_binding_u @{
226 tree GTY ((tag ("0"))) scope;
227 struct cp_binding_level * GTY ((tag ("1"))) level;
228 @} GTY ((desc ("BINDING_HAS_LEVEL_P ((tree)&%0)"))) xscope;
229 tree value;
230 @};
231 @end smallexample
232
In this example, the value of @code{BINDING_HAS_LEVEL_P} when applied to a
@code{struct tree_binding *} is presumed to be 0 or 1.  If 1, the type
mechanism will treat the field @code{level} as being present and if 0,
will treat the field @code{scope} as being present.
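
The effect of @code{desc} and @code{tag} can be pictured as a
@code{switch} on the discriminator that marks only the active union
member.  The following standalone sketch is a hand-written model of
that idea, not the code that @code{gengtype} emits; all of the type
and function names in it are hypothetical.

@verbatim
/* Hypothetical model of a discriminated union; the names are
   illustrative and not taken from GCC.  */
struct scope_model { bool marked; };
struct level_model { bool marked; };

struct binding_model
{
  int has_level;          /* plays the role of the desc expression */
  union
  {
    scope_model *scope;   /* would carry tag ("0") */
    level_model *level;   /* would carry tag ("1") */
  } xscope;
};

/* Mark only the union member selected by the discriminator.  */
static void
mark_binding (binding_model *b)
{
  switch (b->has_level)
    {
    case 0:
      if (b->xscope.scope != nullptr)
        b->xscope.scope->marked = true;
      break;
    case 1:
      if (b->xscope.level != nullptr)
        b->xscope.level->marked = true;
      break;
    default:
      /* No tag matched and no member is the default: mark nothing.  */
      break;
    }
}
@end verbatim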
237
238 The @code{desc} and @code{tag} options can also be used for inheritance
239 to denote which subclass an instance is. See @ref{Inheritance and GTY}
240 for more information.
241
242 @findex param_is
243 @findex use_param
244 @item param_is (@var{type})
245 @itemx use_param
246
247 Sometimes it's convenient to define some data structure to work on
248 generic pointers (that is, @code{PTR}) and then use it with a specific
249 type. @code{param_is} specifies the real type pointed to, and
250 @code{use_param} says where in the generic data structure that type
251 should be put.
252
253 For instance, to have a @code{htab_t} that points to trees, one would
254 write the definition of @code{htab_t} like this:
255 @smallexample
256 typedef struct GTY(()) @{
257 @dots{}
258 void ** GTY ((use_param, @dots{})) entries;
259 @dots{}
260 @} htab_t;
261 @end smallexample
262 and then declare variables like this:
263 @smallexample
264 static htab_t GTY ((param_is (union tree_node))) ict;
265 @end smallexample
266
267 @findex param@var{n}_is
268 @findex use_param@var{n}
269 @item param@var{n}_is (@var{type})
270 @itemx use_param@var{n}
271
272 In more complicated cases, the data structure might need to work on
273 several different types, which might not necessarily all be pointers.
274 For this, @code{param1_is} through @code{param9_is} may be used to
275 specify the real type of a field identified by @code{use_param1} through
276 @code{use_param9}.
277
278 @findex use_params
279 @item use_params
280
When a structure contains another structure that is parameterized,
there's no need to do anything special; the inner structure inherits the
parameters of the outer one.  When a structure contains a pointer to a
parameterized structure, the type machinery won't automatically detect
this (it could, it just doesn't yet), so it's necessary to tell it that
the pointed-to structure should use the same parameters as the outer
structure.  This is done by marking the pointer with the
@code{use_params} option.
289
290 @findex deletable
291 @item deletable
292
@code{deletable}, when applied to a global variable, indicates that when
garbage collection runs, there's no need to mark anything pointed to
by this variable; it can just be set to @code{NULL} instead.  This is used
to keep a list of free structures around for re-use.
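
A minimal sketch of such a free-list cache is shown below.  It is a
standalone model: @code{node_model}, @code{release_node},
@code{reuse_node} and @code{model_collect} are hypothetical names, and
@code{model_collect} only imitates the one effect described above,
namely that the collector resets a @code{deletable} root instead of
marking through it.

@verbatim
struct node_model { node_model *next; };

/* Hypothetical cache of unused nodes.  In GCC such a variable would
   be declared along the lines of
     static GTY((deletable)) struct node *free_nodes;  */
static node_model *free_nodes;

/* Put N on the free list for later re-use.  */
static void
release_node (node_model *n)
{
  n->next = free_nodes;
  free_nodes = n;
}

/* Take a node from the free list, or return a null pointer if the
   list is empty and the caller has to allocate a fresh node.  */
static node_model *
reuse_node ()
{
  node_model *n = free_nodes;
  if (n != nullptr)
    free_nodes = n->next;
  return n;
}

/* Model of what a collection does to a deletable root: the cache is
   simply forgotten, so its nodes are not kept alive by it.  */
static void
model_collect ()
{
  free_nodes = nullptr;
}
@end verbatim

Code using such a cache must therefore always be prepared to find it
empty.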
297
298 @findex if_marked
299 @item if_marked ("@var{expression}")
300
301 Suppose you want some kinds of object to be unique, and so you put them
302 in a hash table. If garbage collection marks the hash table, these
303 objects will never be freed, even if the last other reference to them
304 goes away. GGC has special handling to deal with this: if you use the
305 @code{if_marked} option on a global hash table, GGC will call the
306 routine whose name is the parameter to the option on each hash table
307 entry. If the routine returns nonzero, the hash table entry will
308 be marked as usual. If the routine returns zero, the hash table entry
309 will be deleted.
310
311 The routine @code{ggc_marked_p} can be used to determine if an element
312 has been marked already; in fact, the usual case is to use
313 @code{if_marked ("ggc_marked_p")}.
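
The behaviour described above can be modelled with an ordinary
container.  In this standalone sketch, which assumes nothing about the
real hash table implementation, @code{model_marked_p} stands in for
@code{ggc_marked_p} and @code{sweep_table} drops every entry whose
object was not marked through some other reference.  All names are
hypothetical.

@verbatim
#include <cstddef>
#include <unordered_set>

struct object_model { bool marked; };

/* Stand-in for ggc_marked_p: has O been reached from elsewhere?  */
static bool
model_marked_p (const object_model *o)
{
  return o->marked;
}

/* Keep the entries for which the predicate returns nonzero and
   delete the others.  Returns the number of entries removed.  */
static std::size_t
sweep_table (std::unordered_set<object_model *> &table)
{
  std::size_t removed = 0;
  for (auto it = table.begin (); it != table.end (); )
    if (model_marked_p (*it))
      ++it;
    else
      {
        it = table.erase (it);
        ++removed;
      }
  return removed;
}
@end verbatim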
314
315 @findex mark_hook
316 @item mark_hook ("@var{hook-routine-name}")
317
318 If provided for a structure or union type, the given
319 @var{hook-routine-name} (between double-quotes) is the name of a
320 routine called when the garbage collector has just marked the data as
321 reachable. This routine should not change the data, or call any ggc
322 routine. Its only argument is a pointer to the just marked (const)
323 structure or union.
324
325 @findex maybe_undef
326 @item maybe_undef
327
When applied to a field, @code{maybe_undef} indicates that it's OK if
the structure that this field points to is never defined, so long as
this field is always @code{NULL}.  This is used to avoid requiring
backends to define certain optional structures.  It doesn't work with
language frontends.
333
334 @findex nested_ptr
335 @item nested_ptr (@var{type}, "@var{to expression}", "@var{from expression}")
336
337 The type machinery expects all pointers to point to the start of an
338 object. Sometimes for abstraction purposes it's convenient to have
339 a pointer which points inside an object. So long as it's possible to
340 convert the original object to and from the pointer, such pointers
341 can still be used. @var{type} is the type of the original object,
342 the @var{to expression} returns the pointer given the original object,
343 and the @var{from expression} returns the original object given
344 the pointer. The pointer will be available using the @code{%h}
345 escape.
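
The pair of conversions can be illustrated with a standalone sketch in
which clients hold a pointer to a member in the middle of an object.
Here @code{to_inner} plays the role of the @var{to expression} and
@code{from_inner} that of the @var{from expression}; both names, and
the @code{outer_model} type, are hypothetical.

@verbatim
#include <cstddef>

/* Hypothetical object whose PAYLOAD member is what clients point at.  */
struct outer_model
{
  int header;
  int payload;
};

/* "to" direction: the interior pointer, given the original object.  */
static int *
to_inner (outer_model *o)
{
  return o != nullptr ? &o->payload : nullptr;
}

/* "from" direction: the start of the object, given the interior
   pointer, so that the collector can find the real allocation.  */
static outer_model *
from_inner (int *p)
{
  if (p == nullptr)
    return nullptr;
  char *start = reinterpret_cast<char *> (p)
                - offsetof (outer_model, payload);
  return reinterpret_cast<outer_model *> (start);
}
@end verbatim

Both directions map a null pointer to a null pointer, so that an empty
field survives the round trip.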
346
347 @findex chain_next
348 @findex chain_prev
349 @findex chain_circular
350 @item chain_next ("@var{expression}")
351 @itemx chain_prev ("@var{expression}")
352 @itemx chain_circular ("@var{expression}")
353
It's helpful for the type machinery to know if objects are often
chained together in long lists; this lets it generate code that uses
less stack space by iterating along the list instead of recursing down
it.  @code{chain_next} is an expression for the next item in the list,
@code{chain_prev} is an expression for the previous item.  For singly
linked lists, use only @code{chain_next}; for doubly linked lists, use
both.  The machinery requires that taking the next item of the
previous item gives the original item.  @code{chain_circular} is similar
to @code{chain_next}, but can be used for circular singly linked lists.
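
The stack-space argument can be seen in a small standalone model.  The
marker below loops along @code{next}, as it could if the structure
carried @code{chain_next ("%h.next")}, and recurses only into the
other pointer field, so a very long list costs a constant recursion
depth.  The names @code{item_model} and @code{mark_item} are
hypothetical, and the generated code differs in detail.

@verbatim
struct item_model
{
  item_model *next;    /* the link a chain_next option would name */
  item_model *child;   /* an ordinary pointer field */
  bool marked;
};

/* Mark X and everything reachable from it.  The list is walked with
   a loop; only CHILD pointers cause recursion.  */
static void
mark_item (item_model *x)
{
  for (; x != nullptr && !x->marked; x = x->next)
    {
      x->marked = true;
      mark_item (x->child);
    }
}
@end verbatim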
363
364 @findex reorder
365 @item reorder ("@var{function name}")
366
Some data structures depend on the relative ordering of pointers.  If
the precompiled header machinery needs to change that ordering, it
will call the function referenced by the @code{reorder} option, before
changing the pointers in the object that's pointed to by the field the
option applies to.  The function must take four arguments, with the
signature @samp{@w{void *, void *, gt_pointer_operator, void *}}.
The first parameter is a pointer to the structure that contains the
object being updated, or the object itself if there is no containing
structure.  The second parameter is a cookie that should be ignored.
The third parameter is a routine that, given a pointer, will update it
to its correct new value.  The fourth parameter is a cookie that must
be passed as the second argument to the routine given as the third
parameter.
379
380 PCH cannot handle data structures that depend on the absolute values
381 of pointers. @code{reorder} functions can be expensive. When
382 possible, it is better to depend on properties of the data, like an ID
383 number or the hash of a string instead.
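
One plausible shape for a @code{reorder} function is sketched below
for an array that is kept sorted by address.  It asks the supplied
routine where each entry is going to live and re-sorts by those future
addresses, leaving the stored pointers themselves untouched.  This is
a standalone sketch under assumptions: the @code{gt_pointer_operator}
typedef is written out here with two @code{void *} arguments and may
not match the declaration in a given GCC version, and
@code{sorted_model}, @code{relocated} and @code{resort_sorted_model}
are hypothetical names.

@verbatim
#include <algorithm>
#include <functional>

/* Assumed shape of the pointer operator: it receives the address of
   a pointer and a cookie, and rewrites the pointer in place.  */
typedef void (*gt_pointer_operator) (void *, void *);

struct sorted_model
{
  int len;
  void **elems;   /* kept sorted by the address each entry holds */
};

/* State shared with the comparison, set by resort_sorted_model.  */
static gt_pointer_operator resort_op;
static void *resort_cookie;

/* Return the address P will have after relocation.  P is a local
   copy, so the caller's pointer is not modified.  */
static void *
relocated (void *p)
{
  resort_op (&p, resort_cookie);
  return p;
}

/* Sketch of a reorder function: re-sort ELEMS by the addresses the
   entries are about to receive.  The second argument is ignored.  */
static void
resort_sorted_model (void *obj, void *, gt_pointer_operator new_value,
                     void *cookie)
{
  sorted_model *s = static_cast<sorted_model *> (obj);
  resort_op = new_value;
  resort_cookie = cookie;
  std::sort (s->elems, s->elems + s->len,
             [] (void *a, void *b)
             {
               return std::less<void *> () (relocated (a), relocated (b));
             });
}
@end verbatim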
384
385 @findex atomic
386 @item atomic
387
388 The @code{atomic} option can only be used with pointers. It informs
389 the GC machinery that the memory that the pointer points to does not
390 contain any pointers, and hence it should be treated by the GC and PCH
391 machinery as an ``atomic'' block of memory that does not need to be
392 examined when scanning memory for pointers. In particular, the
393 machinery will not scan that memory for pointers to mark them as
394 reachable (when marking pointers for GC) or to relocate them (when
395 writing a PCH file).
396
397 The @code{atomic} option differs from the @code{skip} option.
398 @code{atomic} keeps the memory under Garbage Collection, but makes the
399 GC ignore the contents of the memory. @code{skip} is more drastic in
400 that it causes the pointer and the memory to be completely ignored by
401 the Garbage Collector. So, memory marked as @code{atomic} is
402 automatically freed when no longer reachable, while memory marked as
403 @code{skip} is not.
404
The @code{atomic} option must be used with great care, because all
sorts of problems can occur if it is used incorrectly, that is, if the
memory the pointer points to does actually contain a pointer.
408
409 Here is an example of how to use it:
410 @smallexample
411 struct GTY(()) my_struct @{
412 int number_of_elements;
413 unsigned int * GTY ((atomic)) elements;
414 @};
415 @end smallexample
416 In this case, @code{elements} is a pointer under GC, and the memory it
417 points to needs to be allocated using the Garbage Collector, and will
418 be freed automatically by the Garbage Collector when it is no longer
419 referenced. But the memory that the pointer points to is an array of
420 @code{unsigned int} elements, and the GC must not try to scan it to
421 find pointers to mark or relocate, which is why it is marked with the
422 @code{atomic} option.
423
Note that, currently, global variables cannot be marked with
@code{atomic}; only fields of a struct can.  This is a known
limitation.  It would be useful to be able to mark global pointers
with @code{atomic} to make the PCH machinery aware of them so that
they are saved and restored correctly to PCH files.
429
430 @findex special
431 @item special ("@var{name}")
432
433 The @code{special} option is used to mark types that have to be dealt
434 with by special case machinery. The parameter is the name of the
435 special case. See @file{gengtype.c} for further details. Avoid
436 adding new special cases unless there is no other alternative.
437
438 @findex user
439 @item user
440
441 The @code{user} option indicates that the code to mark structure
442 fields is completely handled by user-provided routines. See section
443 @ref{User GC} for details on what functions need to be provided.
444 @end table
445
446 @node Inheritance and GTY
447 @section Support for inheritance
448 gengtype has some support for simple class hierarchies. You can use
449 this to have gengtype autogenerate marking routines, provided:
450
451 @itemize @bullet
452 @item
453 There must be a concrete base class, with a discriminator expression
454 that can be used to identify which subclass an instance is.
455 @item
456 Only single inheritance is used.
457 @item
458 None of the classes within the hierarchy are templates.
459 @end itemize
460
461 If your class hierarchy does not fit in this pattern, you must use
462 @ref{User GC} instead.
463
The base class and its discriminator must be identified using the
@code{desc} option.  Each concrete subclass must use the @code{tag}
option to identify which value of the discriminator it corresponds to.

Every class in the hierarchy must have a @code{GTY(())} marker, as
@code{gengtype} will only attempt to parse classes that have such a marker
@footnote{Classes lacking such a marker will not be identified as being
part of the hierarchy, and so the marking routines will not handle them,
leading to an assertion failure within the marking routines due to an
unknown tag value (assuming that assertions are enabled).}.
474
475 @smallexample
class GTY((desc("%h.kind"), tag("0"))) example_base
@{
public:
    int kind;
    tree a;
@};

class GTY((tag("1"))) some_subclass : public example_base
@{
public:
    tree b;
@};

class GTY((tag("2"))) some_other_subclass : public example_base
@{
public:
    tree c;
@};
494 @end smallexample
495
The generated marking routines for the above will contain a @code{switch}
on @code{kind}, visiting all appropriate fields.  For example, if
@code{kind} is 2, it will cast to @code{some_other_subclass} and visit
fields @code{a} and @code{c}.
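
A hand-written model of such a marking routine might look like the
following.  It is only a sketch of the dispatch on the discriminator:
@code{tree_model}, @code{mark} and @code{mark_example} are
hypothetical, the @code{GTY} markers are omitted so that the fragment
stands alone, and the routine that @code{gengtype} generates is
organized differently.

@verbatim
struct tree_model { bool marked; };

/* Stand-in for marking one pointer field.  */
static void
mark (tree_model *t)
{
  if (t != nullptr)
    t->marked = true;
}

class example_base
{
public:
  int kind;
  tree_model *a;
};

class some_subclass : public example_base
{
public:
  tree_model *b;
};

class some_other_subclass : public example_base
{
public:
  tree_model *c;
};

/* Visit the base-class fields, then the fields of the subclass that
   the discriminator selects.  */
static void
mark_example (example_base *x)
{
  mark (x->a);
  switch (x->kind)
    {
    case 0:
      break;
    case 1:
      mark (static_cast<some_subclass *> (x)->b);
      break;
    case 2:
      mark (static_cast<some_other_subclass *> (x)->c);
      break;
    default:
      /* Unknown tag: the real routines would fail an assertion.  */
      break;
    }
}
@end verbatim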
499
500 @node User GC
501 @section Support for user-provided GC marking routines
502 @cindex user gc
503 The garbage collector supports types for which no automatic marking
504 code is generated. For these types, the user is required to provide
505 three functions: one to act as a marker for garbage collection, and
506 two functions to act as marker and pointer walker for pre-compiled
507 headers.
508
509 Given a structure @code{struct GTY((user)) my_struct}, the following functions
510 should be defined to mark @code{my_struct}:
511
512 @smallexample
void gt_ggc_mx (my_struct *p)
@{
  /* This marks field 'fld'.  */
  gt_ggc_mx (p->fld);
@}

void gt_pch_nx (my_struct *p)
@{
  /* This marks field 'fld'.  */
  gt_pch_nx (p->fld);
@}

void gt_pch_nx (my_struct *p, gt_pointer_operator op, void *cookie)
@{
  /* For every field 'fld', call the given pointer operator.  */
  op (&(p->fld), cookie);
@}
530 @end smallexample
531
532 In general, each marker @code{M} should call @code{M} for every
533 pointer field in the structure. Fields that are not allocated in GC
534 or are not pointers must be ignored.
535
536 For embedded lists (e.g., structures with a @code{next} or @code{prev}
537 pointer), the marker must follow the chain and mark every element in
538 it.
539
540 Note that the rules for the pointer walker @code{gt_pch_nx (my_struct
541 *, gt_pointer_operator, void *)} are slightly different. In this
542 case, the operation @code{op} must be applied to the @emph{address} of
543 every pointer field.
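
The reason for passing the address is that the operator has to be able
to overwrite the field.  The following standalone sketch shows this
with a toy operator that redirects every pointer to the object given
as the cookie.  It assumes a two-argument
@code{gt_pointer_operator}, which may not match the typedef in a given
GCC version, and @code{my_struct_model}, @code{walk_my_struct} and
@code{redirect_to_cookie} are hypothetical names.

@verbatim
/* Assumed shape of the pointer operator: address of the pointer to
   update, then a cookie.  */
typedef void (*gt_pointer_operator) (void *, void *);

struct my_struct_model
{
  int *fld;    /* a pointer field that the walker must report */
  int count;   /* not a pointer: ignored by the walker */
};

/* Pointer walker: hand the address of each pointer field to OP.  */
static void
walk_my_struct (my_struct_model *p, gt_pointer_operator op, void *cookie)
{
  op (&(p->fld), cookie);
}

/* Toy operator: make the pointer at PTR_ADDR refer to the object
   passed as COOKIE.  A real operator would compute the relocated
   address instead.  */
static void
redirect_to_cookie (void *ptr_addr, void *cookie)
{
  *static_cast<int **> (ptr_addr) = static_cast<int *> (cookie);
}
@end verbatim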
544
545 @subsection User-provided marking routines for template types
546 When a template type @code{TP} is marked with @code{GTY}, all
547 instances of that type are considered user-provided types. This means
548 that the individual instances of @code{TP} do not need to be marked
549 with @code{GTY}. The user needs to provide template functions to mark
550 all the fields of the type.
551
The following code snippets represent all the functions that need to
be provided.  Note that type @code{TP} may refer to more than one
type.  In these snippets, there is only one type @code{T}, but there
could be more.
556
557 @smallexample
template<typename T>
void gt_ggc_mx (TP<T> *tp)
@{
  extern void gt_ggc_mx (T&);

  /* This marks field 'fld' of type 'T'.  */
  gt_ggc_mx (tp->fld);
@}

template<typename T>
void gt_pch_nx (TP<T> *tp)
@{
  extern void gt_pch_nx (T&);

  /* This marks field 'fld' of type 'T'.  */
  gt_pch_nx (tp->fld);
@}

template<typename T>
void gt_pch_nx (TP<T *> *tp, gt_pointer_operator op, void *cookie)
@{
  /* For every field 'fld' of 'tp' with type 'T *', call the given
     pointer operator.  */
  op (&(tp->fld), cookie);
@}

template<typename T>
void gt_pch_nx (TP<T> *tp, gt_pointer_operator op, void *cookie)
@{
  extern void gt_pch_nx (T *, gt_pointer_operator, void *);

  /* For every field 'fld' of 'tp' with type 'T', call the pointer
     walker for all the fields of T.  */
  gt_pch_nx (&(tp->fld), op, cookie);
@}
593 @end smallexample
594
595 Support for user-defined types is currently limited. The following
596 restrictions apply:
597
598 @enumerate
599 @item Type @code{TP} and all the argument types @code{T} must be
600 marked with @code{GTY}.
601
602 @item Type @code{TP} can only have type names in its argument list.
603
604 @item The pointer walker functions are different for @code{TP<T>} and
605 @code{TP<T *>}. In the case of @code{TP<T>}, references to
606 @code{T} must be handled by calling @code{gt_pch_nx} (which
607 will, in turn, walk all the pointers inside fields of @code{T}).
608 In the case of @code{TP<T *>}, references to @code{T *} must be
609 handled by calling the @code{op} function on the address of the
610 pointer (see the code snippets above).
611 @end enumerate
612
613 @node GGC Roots
614 @section Marking Roots for the Garbage Collector
615 @cindex roots, marking
616 @cindex marking roots
617
618 In addition to keeping track of types, the type machinery also locates
619 the global variables (@dfn{roots}) that the garbage collector starts
620 at. Roots must be declared using one of the following syntaxes:
621
622 @itemize @bullet
623 @item
624 @code{extern GTY(([@var{options}])) @var{type} @var{name};}
625 @item
626 @code{static GTY(([@var{options}])) @var{type} @var{name};}
627 @end itemize
628 @noindent
629 The syntax
630 @itemize @bullet
631 @item
632 @code{GTY(([@var{options}])) @var{type} @var{name};}
633 @end itemize
634 @noindent
635 is @emph{not} accepted. There should be an @code{extern} declaration
636 of such a variable in a header somewhere---mark that, not the
637 definition. Or, if the variable is only used in one file, make it
638 @code{static}.
639
640 @node Files
641 @section Source Files Containing Type Information
642 @cindex generated files
643 @cindex files, generated
644
645 Whenever you add @code{GTY} markers to a source file that previously
646 had none, or create a new source file containing @code{GTY} markers,
647 there are three things you need to do:
648
649 @enumerate
650 @item
651 You need to add the file to the list of source files the type
652 machinery scans. There are four cases:
653
654 @enumerate a
655 @item
656 For a back-end file, this is usually done
657 automatically; if not, you should add it to @code{target_gtfiles} in
658 the appropriate port's entries in @file{config.gcc}.
659
660 @item
661 For files shared by all front ends, add the filename to the
662 @code{GTFILES} variable in @file{Makefile.in}.
663
664 @item
665 For files that are part of one front end, add the filename to the
666 @code{gtfiles} variable defined in the appropriate
667 @file{config-lang.in}.
668 Headers should appear before non-headers in this list.
669
670 @item
671 For files that are part of some but not all front ends, add the
672 filename to the @code{gtfiles} variable of @emph{all} the front ends
673 that use it.
674 @end enumerate
675
676 @item
677 If the file was a header file, you'll need to check that it's included
678 in the right place to be visible to the generated files. For a back-end
679 header file, this should be done automatically. For a front-end header
680 file, it needs to be included by the same file that includes
681 @file{gtype-@var{lang}.h}. For other header files, it needs to be
682 included in @file{gtype-desc.c}, which is a generated file, so add it to
683 @code{ifiles} in @code{open_base_file} in @file{gengtype.c}.
684
For source files that aren't header files, the machinery will generate a
header file that should be included in the source file you just changed.
The file will be called @file{gt-@var{path}.h} where @var{path} is the
pathname relative to the @file{gcc} directory with slashes replaced by
@verb{|-|}, so for example the header file to be included in
@file{cp/parser.c} is called @file{gt-cp-parser.h}.  The
generated header file should be included after everything else in the
source file.  Don't forget to mention this file as a dependency in the
@file{Makefile}!
694
695 @end enumerate
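
The naming rule for the generated @file{gt-@var{path}.h} header of a
non-header source file can be written down as a small function.  This
is a standalone sketch of the rule as stated in this section only;
@code{gt_header_name} is a hypothetical helper, not a function in
@file{gengtype.c}, and the real tool handles further cases (such as
back-end files) that are not modelled here.

@verbatim
#include <string>

/* Given PATH relative to the gcc directory, drop the file extension,
   replace each slash by '-', and wrap the result in "gt-" ... ".h".  */
static std::string
gt_header_name (const std::string &path)
{
  std::string stem = path;
  std::string::size_type dot = stem.rfind ('.');
  std::string::size_type slash = stem.rfind ('/');
  /* Only treat the dot as an extension if it is in the last
     component of the path.  */
  if (dot != std::string::npos
      && (slash == std::string::npos || dot > slash))
    stem.erase (dot);
  for (char &c : stem)
    if (c == '/')
      c = '-';
  return "gt-" + stem + ".h";
}
@end verbatim

With this rule, @code{gt_header_name ("cp/parser.c")} yields
@file{gt-cp-parser.h}.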
696
697 For language frontends, there is another file that needs to be included
698 somewhere. It will be called @file{gtype-@var{lang}.h}, where
699 @var{lang} is the name of the subdirectory the language is contained in.
700
Plugins can add additional root tables.  Run the @code{gengtype}
utility in plugin mode as @code{gengtype -P pluginout.h @var{source-dir}
@var{file-list} @var{plugin*.c}} with your plugin files
@var{plugin*.c} using @code{GTY} to generate the @file{pluginout.h} file.
The GCC build tree must be present in that mode.
706
707
708 @node Invoking the garbage collector
709 @section How to invoke the garbage collector
710 @cindex garbage collector, invocation
711 @findex ggc_collect
712
The GCC garbage collector GGC is only invoked explicitly.  In contrast
with many other garbage collectors, it is not implicitly invoked by
allocation routines when a lot of memory has been consumed.  So the
only way to have GGC reclaim storage is to call the @code{ggc_collect}
function explicitly.  This call is an expensive operation, as it may
have to scan the entire heap.  Beware that, unlike many other garbage
collectors, GGC does not follow local variables (on the GCC call
stack) during such an invocation: you should reference all your data
from static or external @code{GTY}-ed variables, and it is advisable to
call @code{ggc_collect} with a shallow call stack.  GGC is an exact
mark-and-sweep garbage collector (so it does not scan the call stack
for pointers).  In practice GCC passes don't often call
@code{ggc_collect} themselves, because it is called by the pass manager
between passes.
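
The consequence for local variables can be demonstrated with a toy
collector.  The following standalone model is not GGC: it is a
deliberately tiny mark-and-sweep over an explicit list of registered
roots, with hypothetical names throughout, meant only to show that an
object referenced solely from a local variable is reclaimed.

@verbatim
#include <cstddef>
#include <vector>

struct obj_model
{
  obj_model *ref;   /* one outgoing pointer */
  bool marked;
  bool freed;
};

/* Every object the toy collector manages.  */
static std::vector<obj_model *> heap_model;
/* Addresses of the registered global roots; locals are never here.  */
static std::vector<obj_model **> roots_model;

/* Mark O and everything reachable from it through REF.  */
static void
mark_from (obj_model *o)
{
  for (; o != nullptr && !o->marked; o = o->ref)
    o->marked = true;
}

/* Mark from the registered roots only, then reclaim whatever was not
   reached.  Returns the number of objects reclaimed.  */
static std::size_t
collect_model ()
{
  for (obj_model **r : roots_model)
    mark_from (*r);
  std::size_t reclaimed = 0;
  for (obj_model *o : heap_model)
    {
      if (!o->marked && !o->freed)
        {
          o->freed = true;
          ++reclaimed;
        }
      o->marked = false;
    }
  return reclaimed;
}
@end verbatim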
726
727 At the time of the @code{ggc_collect} call all pointers in the GC-marked
728 structures must be valid or @code{NULL}. In practice this means that
729 there should not be uninitialized pointer fields in the structures even
730 if your code never reads or writes those fields at a particular
731 instance. One way to ensure this is to use cleared versions of
732 allocators unless all the fields are initialized manually immediately
733 after allocation.
734
735 @node Troubleshooting
736 @section Troubleshooting the garbage collector
737 @cindex garbage collector, troubleshooting
738
739 With the current garbage collector implementation, most issues should
740 show up as GCC compilation errors. Some of the most commonly
741 encountered issues are described below.
742
743 @itemize @bullet
@item Gengtype does not produce allocators for a @code{GTY}-marked type.
Gengtype checks if there is at least one possible path from GC roots to
at least one instance of each type before outputting allocators.  If
there is no such path, the @code{GTY} markers will be ignored and no
allocators will be output.  Solve this by making sure that there exists
at least one such path.  If creating it is infeasible or raises a ``code
smell'', consider whether you really must use GC for allocating such a
type.
751
752 @item Link-time errors about undefined @code{gt_ggc_r_foo_bar} and
753 similarly-named symbols. Check if your @file{foo_bar} source file has
754 @code{#include "gt-foo_bar.h"} as its very last line.
755
756 @end itemize