]> git.ipfire.org Git - thirdparty/binutils-gdb.git/blame - gdb/doc/gdbint.texinfo
* config/sh/tm-sh.h (BELIEVE_PCC_PROMOTION): Define, so that
[thirdparty/binutils-gdb.git] / gdb / doc / gdbint.texinfo
CommitLineData
ca714d03 1\input texinfo
7f09f15f 2@setfilename gdbint.info
b7becc8f
RP
3
4@ifinfo
5@format
6START-INFO-DIR-ENTRY
b517f124 7* Gdb-Internals: (gdbint). The GNU debugger's internals.
b7becc8f
RP
8END-INFO-DIR-ENTRY
9@end format
10@end ifinfo
11
ca714d03
RP
12@ifinfo
13This file documents the internals of the GNU debugger GDB.
f222d23d 14
18b1e896 15Copyright 1990-1999 Free Software Foundation, Inc.
3cee93ac
CF
16Contributed by Cygnus Solutions. Written by John Gilmore.
17Second Edition by Stan Shebs.
cfddbd02 18
3cee93ac
CF
19Permission is granted to make and distribute verbatim copies of this
20manual provided the copyright notice and this permission notice are
21preserved on all copies.
ca714d03
RP
22
23@ignore
24Permission is granted to process this file through Tex and print the
3cee93ac
CF
25results, provided the printed document carries copying permission notice
26identical to this one except for the removal of this paragraph (this
27paragraph not being relevant to the printed manual).
ca714d03
RP
28
29@end ignore
30Permission is granted to copy or distribute modified versions of this
31manual under the terms of the GPL (for which purpose this text may be
32regarded as a program in the language TeX).
33@end ifinfo
34
7f09f15f 35@setchapternewpage off
ca714d03 36@settitle GDB Internals
3cee93ac 37
ca714d03 38@titlepage
3cee93ac 39@title{GDB Internals}
ca714d03
RP
40@subtitle{A guide to the internals of the GNU debugger}
41@author John Gilmore
3cee93ac
CF
42@author Cygnus Solutions
43@author Second Edition:
44@author Stan Shebs
45@author Cygnus Solutions
ca714d03
RP
46@page
47@tex
48\def\$#1${{#1}} % Kluge: collect RCS revision info without $...$
49\xdef\manvers{\$Revision$} % For use in headers, footers too
50{\parskip=0pt
3cee93ac 51\hfill Cygnus Solutions\par
ca714d03
RP
52\hfill \manvers\par
53\hfill \TeX{}info \texinfoversion\par
54}
55@end tex
56
57@vskip 0pt plus 1filll
18b1e896 58Copyright @copyright{} 1990-1999 Free Software Foundation, Inc.
ca714d03
RP
59
60Permission is granted to make and distribute verbatim copies of
61this manual provided the copyright notice and this permission notice
62are preserved on all copies.
63
64@end titlepage
65
b517f124 66@node Top
c1cd5aec
JK
67@c Perhaps this should be the title of the document (but only for info,
68@c not for TeX). Existing GNU manuals seem inconsistent on this point.
69@top Scope of this Document
70
3cee93ac
CF
71This document documents the internals of the GNU debugger, GDB. It
72includes description of GDB's key algorithms and operations, as well
73as the mechanisms that adapt GDB to specific hosts and targets.
493cf018 74
ca714d03 75@menu
3cee93ac
CF
76* Requirements::
77* Overall Structure::
78* Algorithms::
79* User Interface::
80* Symbol Handling::
81* Language Support::
82* Host Definition::
83* Target Architecture Definition::
84* Target Vector Definition::
85* Native Debugging::
86* Support Libraries::
87* Coding::
88* Porting GDB::
89* Hints::
ca714d03
RP
90@end menu
91
3cee93ac 92@node Requirements
cfddbd02 93
3cee93ac 94@chapter Requirements
cfddbd02 95
3cee93ac
CF
96Before diving into the internals, you should understand the formal
97requirements and other expectations for GDB. Although some of these may
98seem obvious, there have been proposals for GDB that have run counter to
99these requirements.
a5e7f259 100
3cee93ac
CF
101First of all, GDB is a debugger. It's not designed to be a front panel
102for embedded systems. It's not a text editor. It's not a shell. It's
103not a programming environment.
a5e7f259 104
3cee93ac
CF
105GDB is an interactive tool. Although a batch mode is available, GDB's
106primary role is to interact with a human programmer.
a5e7f259 107
3cee93ac
CF
108GDB should be responsive to the user. A programmer hot on the trail of
109a nasty bug, and operating under a looming deadline, is going to be very
110impatient of everything, including the response time to debugger
111commands.
a5e7f259 112
3cee93ac
CF
113GDB should be relatively permissive, such as for expressions. While the
114compiler should be picky (or have the option to be made picky), since
115source code lives for a long time usually, the programmer doing
116debugging shouldn't be spending time figuring out to mollify the
117debugger.
a5e7f259 118
3cee93ac
CF
119GDB will be called upon to deal with really large programs. Executable
120sizes of 50 to 100 megabytes occur regularly, and we've heard reports of
121programs approaching 1 gigabyte in size.
a5e7f259 122
3cee93ac
CF
123GDB should be able to run everywhere. No other debugger is available
124for even half as many configurations as GDB supports.
a5e7f259 125
a5e7f259 126
3cee93ac 127@node Overall Structure
a5e7f259 128
3cee93ac 129@chapter Overall Structure
a5e7f259 130
3cee93ac
CF
131GDB consists of three major subsystems: user interface, symbol handling
132(the ``symbol side''), and target system handling (the ``target side'').
a5e7f259 133
3cee93ac
CF
134Ther user interface consists of several actual interfaces, plus
135supporting code.
a5e7f259 136
3cee93ac
CF
137The symbol side consists of object file readers, debugging info
138interpreters, symbol table management, source language expression
139parsing, type and value printing.
a5e7f259 140
3cee93ac
CF
141The target side consists of execution control, stack frame analysis, and
142physical target manipulation.
a5e7f259 143
3cee93ac
CF
144The target side/symbol side division is not formal, and there are a
145number of exceptions. For instance, core file support involves symbolic
146elements (the basic core file reader is in BFD) and target elements (it
147supplies the contents of memory and the values of registers). Instead,
148this division is useful for understanding how the minor subsystems
149should fit together.
a5e7f259 150
3cee93ac
CF
151@section The Symbol Side
152
153The symbolic side of GDB can be thought of as ``everything you can do in
154GDB without having a live program running''. For instance, you can look
155at the types of variables, and evaluate many kinds of expressions.
156
157@section The Target Side
cfddbd02 158
3cee93ac
CF
159The target side of GDB is the ``bits and bytes manipulator''. Although
160it may make reference to symbolic info here and there, most of the
161target side will run with only a stripped executable available -- or
162even no executable at all, in remote debugging cases.
cfddbd02 163
3cee93ac
CF
164Operations such as disassembly, stack frame crawls, and register
165display, are able to work with no symbolic info at all. In some cases,
166such as disassembly, GDB will use symbolic info to present addresses
167relative to symbols rather than as raw numbers, but it will work either
168way.
d98259f8 169
3cee93ac 170@section Configurations
d98259f8 171
b7becc8f
RP
172@dfn{Host} refers to attributes of the system where GDB runs.
173@dfn{Target} refers to the system where the program being debugged
3cee93ac
CF
174executes. In most cases they are the same machine, in which case a
175third type of @dfn{Native} attributes come into play.
cfddbd02 176
97f3cb72
RP
177Defines and include files needed to build on the host are host support.
178Examples are tty support, system defined types, host byte order, host
179float format.
cfddbd02 180
fd3d2e1d
JG
181Defines and information needed to handle the target format are target
182dependent. Examples are the stack frame format, instruction set,
183breakpoint instruction, registers, and how to set up and tear down the stack
184to call a function.
185
186Information that is only needed when the host and target are the same,
187is native dependent. One example is Unix child process support; if the
188host and target are not the same, doing a fork to start the target
189process is a bad idea. The various macros needed for finding the
3cee93ac
CF
190registers in the @code{upage}, running @code{ptrace}, and such are all
191in the native-dependent files.
fd3d2e1d 192
3cee93ac
CF
193Another example of native-dependent code is support for features that
194are really part of the target environment, but which require
195@code{#include} files that are only available on the host system. Core
196file handling and @code{setjmp} handling are two common cases.
fd3d2e1d 197
3cee93ac
CF
198When you want to make GDB work ``native'' on a particular machine, you
199have to include all three kinds of information.
fd3d2e1d 200
7f27984e 201
3cee93ac 202@node Algorithms
7f27984e 203
3cee93ac 204@chapter Algorithms
bbb5013f 205
3cee93ac
CF
206GDB uses a number of debugging-specific algorithms. They are often not
207very complicated, but get lost in the thicket of special cases and
208real-world issues. This chapter describes the basic algorithms and
209mentions some of the specific target definitions that they use.
bbb5013f 210
3cee93ac 211@section Frames
bbb5013f 212
3cee93ac
CF
213A frame is a construct that GDB uses to keep track of calling and called
214functions.
97f3cb72 215
3cee93ac
CF
216@code{FRAME_FP} in the machine description has no meaning to the
217machine-independent part of GDB, except that it is used when setting up
218a new frame from scratch, as follows:
bbb5013f 219
97f3cb72 220@example
3cee93ac 221 create_new_frame (read_register (FP_REGNUM), read_pc ()));
97f3cb72 222@end example
bbb5013f 223
3cee93ac
CF
224Other than that, all the meaning imparted to @code{FP_REGNUM} is
225imparted by the machine-dependent code. So, @code{FP_REGNUM} can have
226any value that is convenient for the code that creates new frames.
227(@code{create_new_frame} calls @code{INIT_EXTRA_FRAME_INFO} if it is
228defined; that is where you should use the @code{FP_REGNUM} value, if
229your frames are nonstandard.)
230
231Given a GDB frame, define @code{FRAME_CHAIN} to determine the address of
232the calling function's frame. This will be used to create a new GDB
233frame struct, and then @code{INIT_EXTRA_FRAME_INFO} and
234@code{INIT_FRAME_PC} will be called for the new frame.
235
236@section Breakpoint Handling
237
238In general, a breakpoint is a user-designated location in the program
239where the user wants to regain control if program execution ever reaches
240that location.
241
242There are two main ways to implement breakpoints; either as ``hardware''
243breakpoints or as ``software'' breakpoints.
244
245Hardware breakpoints are sometimes available as a builtin debugging
246features with some chips. Typically these work by having dedicated
247register into which the breakpoint address may be stored. If the PC
248ever matches a value in a breakpoint registers, the CPU raises an
249exception and reports it to GDB. Another possibility is when an
250emulator is in use; many emulators include circuitry that watches the
251address lines coming out from the processor, and force it to stop if the
252address matches a breakpoint's address. A third possibility is that the
253target already has the ability to do breakpoints somehow; for instance,
254a ROM monitor may do its own software breakpoints. So although these
255are not literally ``hardware breakpoints'', from GDB's point of view
256they work the same; GDB need not do nothing more than set the breakpoint
257and wait for something to happen.
258
259Since they depend on hardware resources, hardware breakpoints may be
260limited in number; when the user asks for more, GDB will start trying to
261set software breakpoints.
262
263Software breakpoints require GDB to do somewhat more work. The basic
264theory is that GDB will replace a program instruction a trap, illegal
265divide, or some other instruction that will cause an exception, and then
266when it's encountered, GDB will take the exception and stop the program.
267When the user says to continue, GDB will restore the original
268instruction, single-step, re-insert the trap, and continue on.
269
270Since it literally overwrites the program being tested, the program area
271must be writeable, so this technique won't work on programs in ROM. It
272can also distort the behavior of programs that examine themselves,
273although the situation would be highly unusual.
274
275Also, the software breakpoint instruction should be the smallest size of
276instruction, so it doesn't overwrite an instruction that might be a jump
277target, and cause disaster when the program jumps into the middle of the
278breakpoint instruction. (Strictly speaking, the breakpoint must be no
279larger than the smallest interval between instructions that may be jump
280targets; perhaps there is an architecture where only even-numbered
281instructions may jumped to.) Note that it's possible for an instruction
282set not to have any instructions usable for a software breakpoint,
283although in practice only the ARC has failed to define such an
284instruction.
285
286The basic definition of the software breakpoint is the macro
287@code{BREAKPOINT}.
288
289Basic breakpoint object handling is in @file{breakpoint.c}. However,
290much of the interesting breakpoint action is in @file{infrun.c}.
291
292@section Single Stepping
293
294@section Signal Handling
295
296@section Thread Handling
297
298@section Inferior Function Calls
299
300@section Longjmp Support
7f27984e 301
3cee93ac
CF
302GDB has support for figuring out that the target is doing a
303@code{longjmp} and for stopping at the target of the jump, if we are
304stepping. This is done with a few specialized internal breakpoints,
305which are visible in the @code{maint info breakpoint} command.
7f27984e 306
3cee93ac
CF
307To make this work, you need to define a macro called
308@code{GET_LONGJMP_TARGET}, which will examine the @code{jmp_buf}
309structure and extract the longjmp target address. Since @code{jmp_buf}
310is target specific, you will need to define it in the appropriate
311@file{tm-@var{xyz}.h} file. Look in @file{tm-sun4os4.h} and
312@file{sparc-tdep.c} for examples of how to do this.
7f27984e 313
3cee93ac 314@node User Interface
7f27984e 315
3cee93ac 316@chapter User Interface
46bc46eb 317
3cee93ac
CF
318GDB has several user interfaces. Although the command-line interface
319is the most common and most familiar, there are others.
46bc46eb 320
3cee93ac 321@section Command Interpreter
aeb62c7b 322
3cee93ac
CF
323The command interpreter in GDB is fairly simple. It is designed to
324allow for the set of commands to be augmented dynamically, and also
325has a recursive subcommand capability, where the first argument to
326a command may itself direct a lookup on a different command list.
aeb62c7b 327
3cee93ac
CF
328For instance, the @code{set} command just starts a lookup on the
329@code{setlist} command list, while @code{set thread} recurses
330to the @code{set_thread_cmd_list}.
46bc46eb 331
3cee93ac
CF
332To add commands in general, use @code{add_cmd}. @code{add_com} adds to
333the main command list, and should be used for those commands. The usual
334place to add commands is in the @code{_initialize_@var{xyz}} routines at the
335ends of most source files.
fd3d2e1d 336
3cee93ac 337@section Console Printing
46bc46eb 338
3cee93ac 339@section TUI
46bc46eb 340
3cee93ac 341@section libgdb
fd3d2e1d 342
3cee93ac
CF
343@code{libgdb} was an abortive project of years ago. The theory was to
344provide an API to GDB's functionality.
fd3d2e1d 345
3cee93ac 346@node Symbol Handling
fd3d2e1d 347
3cee93ac 348@chapter Symbol Handling
fd3d2e1d 349
3cee93ac
CF
350Symbols are a key part of GDB's operation. Symbols include variables,
351functions, and types.
352
353@section Symbol Reading
fd3d2e1d 354
3cee93ac
CF
355GDB reads symbols from ``symbol files''. The usual symbol file is the
356file containing the program which GDB is debugging. GDB can be directed
357to use a different file for symbols (with the @code{symbol-file}
358command), and it can also read more symbols via the ``add-file'' and
359``load'' commands, or while reading symbols from shared libraries.
fd3d2e1d 360
3cee93ac
CF
361Symbol files are initially opened by code in @file{symfile.c} using the
362BFD library. BFD identifies the type of the file by examining its
363header. @code{symfile_init} then uses this identification to locate a
364set of symbol-reading functions.
fd3d2e1d 365
3cee93ac
CF
366Symbol reading modules identify themselves to GDB by calling
367@code{add_symtab_fns} during their module initialization. The argument
368to @code{add_symtab_fns} is a @code{struct sym_fns} which contains the
369name (or name prefix) of the symbol format, the length of the prefix,
370and pointers to four functions. These functions are called at various
371times to process symbol-files whose identification matches the specified
372prefix.
373
374The functions supplied by each module are:
fd3d2e1d 375
3cee93ac
CF
376@table @code
377@item @var{xyz}_symfile_init(struct sym_fns *sf)
fd3d2e1d 378
3cee93ac
CF
379Called from @code{symbol_file_add} when we are about to read a new
380symbol file. This function should clean up any internal state (possibly
381resulting from half-read previous files, for example) and prepare to
382read a new symbol file. Note that the symbol file which we are reading
383might be a new "main" symbol file, or might be a secondary symbol file
384whose symbols are being added to the existing symbol table.
385
386The argument to @code{@var{xyz}_symfile_init} is a newly allocated
387@code{struct sym_fns} whose @code{bfd} field contains the BFD for the
388new symbol file being read. Its @code{private} field has been zeroed,
389and can be modified as desired. Typically, a struct of private
390information will be @code{malloc}'d, and a pointer to it will be placed
391in the @code{private} field.
392
393There is no result from @code{@var{xyz}_symfile_init}, but it can call
394@code{error} if it detects an unavoidable problem.
fd3d2e1d 395
3cee93ac 396@item @var{xyz}_new_init()
fd3d2e1d 397
3cee93ac
CF
398Called from @code{symbol_file_add} when discarding existing symbols.
399This function need only handle the symbol-reading module's internal
400state; the symbol table data structures visible to the rest of GDB will
401be discarded by @code{symbol_file_add}. It has no arguments and no
402result. It may be called after @code{@var{xyz}_symfile_init}, if a new
403symbol table is being read, or may be called alone if all symbols are
404simply being discarded.
46bc46eb 405
3cee93ac 406@item @var{xyz}_symfile_read(struct sym_fns *sf, CORE_ADDR addr, int mainline)
fd3d2e1d 407
3cee93ac
CF
408Called from @code{symbol_file_add} to actually read the symbols from a
409symbol-file into a set of psymtabs or symtabs.
fd3d2e1d 410
3cee93ac
CF
411@code{sf} points to the struct sym_fns originally passed to
412@code{@var{xyz}_sym_init} for possible initialization. @code{addr} is
413the offset between the file's specified start address and its true
414address in memory. @code{mainline} is 1 if this is the main symbol
415table being read, and 0 if a secondary symbol file (e.g. shared library
416or dynamically loaded file) is being read.@refill
fd3d2e1d
JG
417@end table
418
3cee93ac
CF
419In addition, if a symbol-reading module creates psymtabs when
420@var{xyz}_symfile_read is called, these psymtabs will contain a pointer
421to a function @code{@var{xyz}_psymtab_to_symtab}, which can be called
422from any point in the GDB symbol-handling code.
fd3d2e1d 423
3cee93ac
CF
424@table @code
425@item @var{xyz}_psymtab_to_symtab (struct partial_symtab *pst)
426
427Called from @code{psymtab_to_symtab} (or the PSYMTAB_TO_SYMTAB macro) if
428the psymtab has not already been read in and had its @code{pst->symtab}
429pointer set. The argument is the psymtab to be fleshed-out into a
430symtab. Upon return, pst->readin should have been set to 1, and
431pst->symtab should contain a pointer to the new corresponding symtab, or
432zero if there were no symbols in that part of the symbol file.
433@end table
fd3d2e1d 434
3cee93ac 435@section Partial Symbol Tables
fd3d2e1d 436
3cee93ac 437GDB has three types of symbol tables.
fd3d2e1d 438
3cee93ac 439@itemize @bullet
fd3d2e1d 440
3cee93ac
CF
441@item full symbol tables (symtabs). These contain the main information
442about symbols and addresses.
fd3d2e1d 443
3cee93ac
CF
444@item partial symbol tables (psymtabs). These contain enough
445information to know when to read the corresponding part of the full
446symbol table.
46bc46eb 447
3cee93ac
CF
448@item minimal symbol tables (msymtabs). These contain information
449gleaned from non-debugging symbols.
46bc46eb 450
3cee93ac 451@end itemize
ca714d03 452
3cee93ac 453This section describes partial symbol tables.
46bc46eb 454
3cee93ac
CF
455A psymtab is constructed by doing a very quick pass over an executable
456file's debugging information. Small amounts of information are
457extracted -- enough to identify which parts of the symbol table will
458need to be re-read and fully digested later, when the user needs the
459information. The speed of this pass causes GDB to start up very
460quickly. Later, as the detailed rereading occurs, it occurs in small
461pieces, at various times, and the delay therefrom is mostly invisible to
462the user.
463@c (@xref{Symbol Reading}.)
97f3cb72 464
3cee93ac
CF
465The symbols that show up in a file's psymtab should be, roughly, those
466visible to the debugger's user when the program is not running code from
467that file. These include external symbols and types, static symbols and
468types, and enum values declared at file scope.
1dbe1ef7 469
3cee93ac
CF
470The psymtab also contains the range of instruction addresses that the
471full symbol table would represent.
97f3cb72 472
3cee93ac
CF
473The idea is that there are only two ways for the user (or much of the
474code in the debugger) to reference a symbol:
d98259f8 475
3cee93ac 476@itemize @bullet
97f3cb72 477
3cee93ac
CF
478@item by its address
479(e.g. execution stops at some address which is inside a function in this
480file). The address will be noticed to be in the range of this psymtab,
481and the full symtab will be read in. @code{find_pc_function},
482@code{find_pc_line}, and other @code{find_pc_@dots{}} functions handle
483this.
1dbe1ef7 484
3cee93ac
CF
485@item by its name
486(e.g. the user asks to print a variable, or set a breakpoint on a
487function). Global names and file-scope names will be found in the
488psymtab, which will cause the symtab to be pulled in. Local names will
489have to be qualified by a global name, or a file-scope name, in which
490case we will have already read in the symtab as we evaluated the
491qualifier. Or, a local symbol can be referenced when we are "in" a
492local scope, in which case the first case applies. @code{lookup_symbol}
493does most of the work here.
aeb62c7b 494
3cee93ac 495@end itemize
aeb62c7b 496
3cee93ac
CF
497The only reason that psymtabs exist is to cause a symtab to be read in
498at the right moment. Any symbol that can be elided from a psymtab,
499while still causing that to happen, should not appear in it. Since
500psymtabs don't have the idea of scope, you can't put local symbols in
501them anyway. Psymtabs don't have the idea of the type of a symbol,
502either, so types need not appear, unless they will be referenced by
503name.
aeb62c7b 504
3cee93ac
CF
505It is a bug for GDB to behave one way when only a psymtab has been read,
506and another way if the corresponding symtab has been read in. Such bugs
507are typically caused by a psymtab that does not contain all the visible
508symbols, or which has the wrong instruction address ranges.
aeb62c7b 509
3cee93ac
CF
510The psymtab for a particular section of a symbol-file (objfile) could be
511thrown away after the symtab has been read in. The symtab should always
512be searched before the psymtab, so the psymtab will never be used (in a
513bug-free environment). Currently, psymtabs are allocated on an obstack,
514and all the psymbols themselves are allocated in a pair of large arrays
515on an obstack, so there is little to be gained by trying to free them
516unless you want to do a lot more work.
aeb62c7b 517
3cee93ac 518@section Types
aeb62c7b 519
3cee93ac 520Fundamental Types (e.g., FT_VOID, FT_BOOLEAN).
1dbe1ef7 521
3cee93ac
CF
522These are the fundamental types that GDB uses internally. Fundamental
523types from the various debugging formats (stabs, ELF, etc) are mapped
524into one of these. They are basically a union of all fundamental types
525that gdb knows about for all the languages that GDB knows about.
1dbe1ef7 526
3cee93ac 527Type Codes (e.g., TYPE_CODE_PTR, TYPE_CODE_ARRAY).
6e1c67d2 528
3cee93ac
CF
529Each time GDB builds an internal type, it marks it with one of these
530types. The type may be a fundamental type, such as TYPE_CODE_INT, or a
531derived type, such as TYPE_CODE_PTR which is a pointer to another type.
532Typically, several FT_* types map to one TYPE_CODE_* type, and are
533distinguished by other members of the type struct, such as whether the
534type is signed or unsigned, and how many bits it uses.
535
536Builtin Types (e.g., builtin_type_void, builtin_type_char).
537
538These are instances of type structs that roughly correspond to
539fundamental types and are created as global types for GDB to use for
540various ugly historical reasons. We eventually want to eliminate these.
541Note for example that builtin_type_int initialized in gdbtypes.c is
542basically the same as a TYPE_CODE_INT type that is initialized in
543c-lang.c for an FT_INTEGER fundamental type. The difference is that the
544builtin_type is not associated with any particular objfile, and only one
545instance exists, while c-lang.c builds as many TYPE_CODE_INT types as
546needed, with each one associated with some particular objfile.
547
548@section Object File Formats
549
550@subsection a.out
551
552The @file{a.out} format is the original file format for Unix. It
553consists of three sections: text, data, and bss, which are for program
554code, initialized data, and uninitialized data, respectively.
555
556The @file{a.out} format is so simple that it doesn't have any reserved
557place for debugging information. (Hey, the original Unix hackers used
558@file{adb}, which is a machine-language debugger.) The only debugging
9ddf9aa9
AC
559format for @file{a.out} is stabs, which is encoded as a set of normal
560symbols with distinctive attributes.
561
562The basic @file{a.out} reader is in @file{dbxread.c}.
3cee93ac
CF
563
564@subsection COFF
565
566The COFF format was introduced with System V Release 3 (SVR3) Unix.
567COFF files may have multiple sections, each prefixed by a header. The
568number of sections is limited.
569
570The COFF specification includes support for debugging. Although this
571was a step forward, the debugging information was woefully limited. For
572instance, it was not possible to represent code that came from an
573included file.
574
9ddf9aa9
AC
575The COFF reader is in @file{coffread.c}.
576
3cee93ac
CF
577@subsection ECOFF
578
9ddf9aa9
AC
579ECOFF is an extended COFF originally introduced for Mips and Alpha
580workstations.
581
582The basic ECOFF reader is in @file{mipsread.c}.
583
3cee93ac
CF
584@subsection XCOFF
585
586The IBM RS/6000 running AIX uses an object file format called XCOFF.
587The COFF sections, symbols, and line numbers are used, but debugging
588symbols are dbx-style stabs whose strings are located in the
589@samp{.debug} section (rather than the string table). For more
590information, see @xref{Top,,,stabs,The Stabs Debugging Format}.
591
9ddf9aa9
AC
592The shared library scheme has a clean interface for figuring out what
593shared libraries are in use, but the catch is that everything which
3cee93ac
CF
594refers to addresses (symbol tables and breakpoints at least) needs to be
595relocated for both shared libraries and the main executable. At least
596using the standard mechanism this can only be done once the program has
597been run (or the core file has been read).
598
599@subsection PE
600
601Windows 95 and NT use the PE (Portable Executable) format for their
9ddf9aa9
AC
602executables. PE is basically COFF with additional headers.
603
604While BFD includes special PE support, GDB needs only the basic
605COFF reader.
3cee93ac
CF
606
607@subsection ELF
608
609The ELF format came with System V Release 4 (SVR4) Unix. ELF is similar
610to COFF in being organized into a number of sections, but it removes
611many of COFF's limitations.
612
9ddf9aa9
AC
613The basic ELF reader is in @file{elfread.c}.
614
3cee93ac
CF
615@subsection SOM
616
9ddf9aa9
AC
617SOM is HP's object file and debug format (not to be confused with IBM's
618SOM, which is a cross-language ABI).
619
620The SOM reader is in @file{hpread.c}.
621
622@subsection Other File Formats
623
624Other file formats that have been supported by GDB include Netware
625Loadable Modules (@file{nlmread.c}.
626
3cee93ac
CF
627@section Debugging File Formats
628
9ddf9aa9
AC
629This section describes characteristics of debugging information that
630are independent of the object file format.
631
3cee93ac
CF
632@subsection stabs
633
9ddf9aa9
AC
634@code{stabs} started out as special symbols within the @code{a.out}
635format. Since then, it has been encapsulated into other file
636formats, such as COFF and ELF.
637
638While @file{dbxread.c} does some of the basic stab processing,
639including for encapsulated versions, @file{stabsread.c} does
640the real work.
641
3cee93ac
CF
642@subsection COFF
643
9ddf9aa9
AC
644The basic COFF definition includes debugging information. The level
645of support is minimal and non-extensible, and is not often used.
646
647@subsection Mips debug (Third Eye)
648
649ECOFF includes a definition of a special debug format.
650
651The file @file{mdebugread.c} implements reading for this format.
652
3cee93ac
CF
653@subsection DWARF 1
654
9ddf9aa9
AC
655DWARF 1 is a debugging format that was originally designed to be
656used with ELF in SVR4 systems.
657
658@c CHILL_PRODUCER
659@c GCC_PRODUCER
660@c GPLUS_PRODUCER
661@c LCC_PRODUCER
662@c If defined, these are the producer strings in a DWARF 1 file. All of
663@c these have reasonable defaults already.
664
665The DWARF 1 reader is in @file{dwarfread.c}.
666
3cee93ac
CF
667@subsection DWARF 2
668
9ddf9aa9
AC
669DWARF 2 is an improved but incompatible version of DWARF 1.
670
671The DWARF 2 reader is in @file{dwarf2read.c}.
672
3cee93ac
CF
673@subsection SOM
674
9ddf9aa9
AC
675Like COFF, the SOM definition includes debugging information.
676
3cee93ac
CF
677@section Adding a New Symbol Reader to GDB
678
679If you are using an existing object file format (a.out, COFF, ELF, etc),
680there is probably little to be done.
681
682If you need to add a new object file format, you must first add it to
683BFD. This is beyond the scope of this document.
684
685You must then arrange for the BFD code to provide access to the
686debugging symbols. Generally GDB will have to call swapping routines
687from BFD and a few other BFD internal routines to locate the debugging
688information. As much as possible, GDB should not depend on the BFD
689internal data structures.
690
691For some targets (e.g., COFF), there is a special transfer vector used
692to call swapping routines, since the external data structures on various
693platforms have different sizes and layouts. Specialized routines that
6e1c67d2
JG
694will only ever be implemented by one object file format may be called
695directly. This interface should be described in a file
3cee93ac 696@file{bfd/libxyz.h}, which is included by GDB.
6e1c67d2 697
97f3cb72 698
3cee93ac
CF
699@node Language Support
700
701@chapter Language Support
97f3cb72 702
3cee93ac
CF
703GDB's language support is mainly driven by the symbol reader, although
704it is possible for the user to set the source language manually.
97f3cb72 705
3cee93ac
CF
706GDB chooses the source language by looking at the extension of the file
707recorded in the debug info; @code{.c} means C, @code{.f} means Fortran,
708etc. It may also use a special-purpose language identifier if the debug
709format supports it, such as DWARF.
710
711@section Adding a Source Language to GDB
712
713To add other languages to GDB's expression parser, follow the following
714steps:
97f3cb72
RP
715
716@table @emph
717@item Create the expression parser.
718
3cee93ac
CF
719This should reside in a file @file{@var{lang}-exp.y}. Routines for
720building parsed expressions into a @samp{union exp_element} list are in
721@file{parse.c}.
97f3cb72 722
aeb62c7b
JG
723Since we can't depend upon everyone having Bison, and YACC produces
724parsers that define a bunch of global names, the following lines
3cee93ac
CF
725@emph{must} be included at the top of the YACC parser, to prevent the
726various parsers from defining the same global names:
d98259f8 727
d98259f8 728@example
97f3cb72 729#define yyparse @var{lang}_parse
aeb62c7b 730#define yylex @var{lang}_lex
97f3cb72
RP
731#define yyerror @var{lang}_error
732#define yylval @var{lang}_lval
733#define yychar @var{lang}_char
734#define yydebug @var{lang}_debug
735#define yypact @var{lang}_pact
736#define yyr1 @var{lang}_r1
737#define yyr2 @var{lang}_r2
738#define yydef @var{lang}_def
739#define yychk @var{lang}_chk
740#define yypgo @var{lang}_pgo
741#define yyact @var{lang}_act
742#define yyexca @var{lang}_exca
aeb62c7b
JG
743#define yyerrflag @var{lang}_errflag
744#define yynerrs @var{lang}_nerrs
d98259f8 745@end example
d98259f8 746
b7becc8f
RP
747At the bottom of your parser, define a @code{struct language_defn} and
748initialize it with the right values for your language. Define an
749@code{initialize_@var{lang}} routine and have it call
750@samp{add_language(@var{lang}_language_defn)} to tell the rest of GDB
751that your language exists. You'll need some other supporting variables
752and functions, which will be used via pointers from your
753@code{@var{lang}_language_defn}. See the declaration of @code{struct
754language_defn} in @file{language.h}, and the other @file{*-exp.y} files,
755for more information.
756
97f3cb72
RP
757@item Add any evaluation routines, if necessary
758
759If you need new opcodes (that represent the operations of the language),
760add them to the enumerated type in @file{expression.h}. Add support
761code for these operations in @code{eval.c:evaluate_subexp()}. Add cases
762for new opcodes in two functions from @file{parse.c}:
763@code{prefixify_subexp()} and @code{length_of_subexp()}. These compute
764the number of @code{exp_element}s that a given operation takes up.
765
766@item Update some existing code
767
768Add an enumerated identifier for your language to the enumerated type
769@code{enum language} in @file{defs.h}.
770
3cee93ac
CF
771Update the routines in @file{language.c} so your language is included.
772These routines include type predicates and such, which (in some cases)
773are language dependent. If your language does not appear in the switch
97f3cb72
RP
774statement, an error is reported.
775
776Also included in @file{language.c} is the code that updates the variable
777@code{current_language}, and the routines that translate the
778@code{language_@var{lang}} enumerated identifier into a printable
779string.
780
3cee93ac
CF
781Update the function @code{_initialize_language} to include your
782language. This function picks the default language upon startup, so is
783dependent upon which languages that GDB is built for.
97f3cb72 784
b7becc8f
RP
785Update @code{allocate_symtab} in @file{symfile.c} and/or symbol-reading
786code so that the language of each symtab (source file) is set properly.
787This is used to determine the language to use at each stack frame level.
788Currently, the language is set based upon the extension of the source
789file. If the language can be better inferred from the symbol
790information, please set the language of the symtab in the symbol-reading
791code.
97f3cb72
RP
792
793Add helper code to @code{expprint.c:print_subexp()} to handle any new
794expression opcodes you have added to @file{expression.h}. Also, add the
795printed representations of your operators to @code{op_print_tab}.
796
797@item Add a place of call
798
799Add a call to @code{@var{lang}_parse()} and @code{@var{lang}_error} in
800@code{parse.c:parse_exp_1()}.
801
802@item Use macros to trim code
803
804The user has the option of building GDB for some or all of the
805languages. If the user decides to build GDB for the language
806@var{lang}, then every file dependent on @file{language.h} will have the
807macro @code{_LANG_@var{lang}} defined in it. Use @code{#ifdef}s to
808leave out large routines that the user won't need if he or she is not
809using your language.
810
811Note that you do not need to do this in your YACC parser, since if GDB
812is not build for @var{lang}, then @file{@var{lang}-exp.tab.o} (the
813compiled form of your parser) is not linked into GDB at all.
814
815See the file @file{configure.in} for how GDB is configured for different
816languages.
817
818@item Edit @file{Makefile.in}
819
820Add dependencies in @file{Makefile.in}. Make sure you update the macro
821variables such as @code{HFILES} and @code{OBJS}, otherwise your code may
822not get linked in, or, worse yet, it may not get @code{tar}red into the
823distribution!
3cee93ac 824
97f3cb72
RP
825@end table
826
827
3cee93ac 828@node Host Definition
97f3cb72 829
3cee93ac 830@chapter Host Definition
97f3cb72 831
3cee93ac
CF
832With the advent of autoconf, it's rarely necessary to have host
833definition machinery anymore.
b7becc8f 834
3cee93ac 835@section Adding a New Host
97f3cb72 836
3cee93ac
CF
837Most of GDB's host configuration support happens via autoconf. It
838should be rare to need new host-specific definitions. GDB still uses
839the host-specific definitions and files listed below, but these mostly
840exist for historical reasons, and should eventually disappear.
97f3cb72 841
3cee93ac 842Several files control GDB's configuration for host systems:
97f3cb72 843
3cee93ac 844@table @file
97f3cb72 845
3cee93ac
CF
846@item gdb/config/@var{arch}/@var{xyz}.mh
847Specifies Makefile fragments needed when hosting on machine @var{xyz}.
848In particular, this lists the required machine-dependent object files,
849by defining @samp{XDEPFILES=@dots{}}. Also specifies the header file
850which describes host @var{xyz}, by defining @code{XM_FILE=
851xm-@var{xyz}.h}. You can also define @code{CC}, @code{SYSV_DEFINE},
852@code{XM_CFLAGS}, @code{XM_ADD_FILES}, @code{XM_CLIBS}, @code{XM_CDEPS},
853etc.; see @file{Makefile.in}.
97f3cb72 854
3cee93ac
CF
855@item gdb/config/@var{arch}/xm-@var{xyz}.h
856(@file{xm.h} is a link to this file, created by configure). Contains C
857macro definitions describing the host system environment, such as byte
858order, host C compiler and library.
97f3cb72 859
3cee93ac
CF
860@item gdb/@var{xyz}-xdep.c
861Contains any miscellaneous C code required for this machine as a host.
862On most machines it doesn't exist at all. If it does exist, put
863@file{@var{xyz}-xdep.o} into the @code{XDEPFILES} line in
864@file{gdb/config/@var{arch}/@var{xyz}.mh}.
493cf018 865
3cee93ac 866@end table
493cf018 867
3cee93ac 868@subheading Generic Host Support Files
493cf018 869
3cee93ac
CF
870There are some ``generic'' versions of routines that can be used by
871various systems. These can be customized in various ways by macros
872defined in your @file{xm-@var{xyz}.h} file. If these routines work for
873the @var{xyz} host, you can just include the generic file's name (with
874@samp{.o}, not @samp{.c}) in @code{XDEPFILES}.
493cf018 875
3cee93ac
CF
876Otherwise, if your machine needs custom support routines, you will need
877to write routines that perform the same functions as the generic file.
878Put them into @code{@var{xyz}-xdep.c}, and put @code{@var{xyz}-xdep.o}
879into @code{XDEPFILES}.
493cf018 880
3cee93ac 881@table @file
493cf018 882
3cee93ac
CF
883@item ser-unix.c
884This contains serial line support for Unix systems. This is always
885included, via the makefile variable @code{SER_HARDWIRE}; override this
886variable in the @file{.mh} file to avoid it.
493cf018 887
3cee93ac
CF
888@item ser-go32.c
889This contains serial line support for 32-bit programs running under DOS,
890using the GO32 execution environment.
493cf018 891
3cee93ac
CF
892@item ser-tcp.c
893This contains generic TCP support using sockets.
493cf018 894
3cee93ac 895@end table
493cf018 896
3cee93ac 897@section Host Conditionals
493cf018 898
3cee93ac
CF
899When GDB is configured and compiled, various macros are defined or left
900undefined, to control compilation based on the attributes of the host
901system. These macros and their meanings (or if the meaning is not
902documented here, then one of the source files where they are used is
903indicated) are:
493cf018 904
3cee93ac 905@table @code
493cf018 906
3cee93ac
CF
907@item GDBINIT_FILENAME
908The default name of GDB's initialization file (normally @file{.gdbinit}).
3a8bc841 909
3cee93ac
CF
910@item MEM_FNS_DECLARED
911Your host config file defines this if it includes declarations of
912@code{memcpy} and @code{memset}. Define this to avoid conflicts between
913the native include files and the declarations in @file{defs.h}.
3a8bc841 914
3cee93ac
CF
915@item NO_SYS_FILE
916Define this if your system does not have a @code{<sys/file.h>}.
3a8bc841 917
3cee93ac
CF
918@item SIGWINCH_HANDLER
919If your host defines @code{SIGWINCH}, you can define this to be the name
920of a function to be called if @code{SIGWINCH} is received.
3a8bc841 921
3cee93ac
CF
922@item SIGWINCH_HANDLER_BODY
923Define this to expand into code that will define the function named by
924the expansion of @code{SIGWINCH_HANDLER}.
3a8bc841 925
3cee93ac
CF
926@item ALIGN_STACK_ON_STARTUP
927Define this if your system is of a sort that will crash in
928@code{tgetent} if the stack happens not to be longword-aligned when
929@code{main} is called. This is a rare situation, but is known to occur
930on several different types of systems.
3a8bc841 931
3cee93ac
CF
932@item CRLF_SOURCE_FILES
933Define this if host files use @code{\r\n} rather than @code{\n} as a
934line terminator. This will cause source file listings to omit @code{\r}
935characters when printing and it will allow \r\n line endings of files
936which are "sourced" by gdb. It must be possible to open files in binary
937mode using @code{O_BINARY} or, for fopen, @code{"rb"}.
3a8bc841 938
3cee93ac
CF
939@item DEFAULT_PROMPT
940The default value of the prompt string (normally @code{"(gdb) "}).
7f09f15f 941
3cee93ac
CF
942@item DEV_TTY
943The name of the generic TTY device, defaults to @code{"/dev/tty"}.
7f09f15f 944
3cee93ac
CF
945@item FCLOSE_PROVIDED
946Define this if the system declares @code{fclose} in the headers included
947in @code{defs.h}. This isn't needed unless your compiler is unusually
948anal.
7f09f15f 949
3cee93ac
CF
950@item FOPEN_RB
951Define this if binary files are opened the same way as text files.
7f09f15f 952
3cee93ac
CF
953@item GETENV_PROVIDED
954Define this if the system declares @code{getenv} in its headers included
955in @code{defs.h}. This isn't needed unless your compiler is unusually
956anal.
7f09f15f 957
3cee93ac
CF
958@item HAVE_MMAP
959In some cases, use the system call @code{mmap} for reading symbol
960tables. For some machines this allows for sharing and quick updates.
7f09f15f 961
3cee93ac
CF
962@item HAVE_SIGSETMASK
963Define this if the host system has job control, but does not define
964@code{sigsetmask()}. Currently, this is only true of the RS/6000.
7f09f15f 965
3cee93ac
CF
966@item HAVE_TERMIO
967Define this if the host system has @code{termio.h}.
7f09f15f 968
3cee93ac
CF
969@item HOST_BYTE_ORDER
970The ordering of bytes in the host. This must be defined to be either
971@code{BIG_ENDIAN} or @code{LITTLE_ENDIAN}.
eb752e4e 972
3cee93ac
CF
973@item INT_MAX
974@item INT_MIN
975@item LONG_MAX
976@item UINT_MAX
977@item ULONG_MAX
978Values for host-side constants.
eb752e4e 979
3cee93ac
CF
980@item ISATTY
981Substitute for isatty, if not available.
eb752e4e 982
3cee93ac
CF
983@item LONGEST
984This is the longest integer type available on the host. If not defined,
985it will default to @code{long long} or @code{long}, depending on
986@code{CC_HAS_LONG_LONG}.
eb752e4e 987
3cee93ac
CF
988@item CC_HAS_LONG_LONG
989Define this if the host C compiler supports ``long long''. This is set
990by the configure script.
eb752e4e 991
3cee93ac
CF
992@item PRINTF_HAS_LONG_LONG
993Define this if the host can handle printing of long long integers via
994the printf format directive ``ll''. This is set by the configure script.
eb752e4e 995
3cee93ac
CF
996@item HAVE_LONG_DOUBLE
997Define this if the host C compiler supports ``long double''. This is
998set by the configure script.
eb752e4e 999
3cee93ac
CF
1000@item PRINTF_HAS_LONG_DOUBLE
1001Define this if the host can handle printing of long double float-point
1002numbers via the printf format directive ``Lg''. This is set by the
1003configure script.
eb752e4e 1004
3cee93ac
CF
1005@item SCANF_HAS_LONG_DOUBLE
1006Define this if the host can handle the parsing of long double
1007float-point numbers via the scanf format directive directive
1008``Lg''. This is set by the configure script.
eb752e4e 1009
3cee93ac
CF
1010@item LSEEK_NOT_LINEAR
1011Define this if @code{lseek (n)} does not necessarily move to byte number
1012@code{n} in the file. This is only used when reading source files. It
1013is normally faster to define @code{CRLF_SOURCE_FILES} when possible.
eb752e4e 1014
3cee93ac
CF
1015@item L_SET
1016This macro is used as the argument to lseek (or, most commonly,
1017bfd_seek). FIXME, should be replaced by SEEK_SET instead, which is the
1018POSIX equivalent.
eb752e4e 1019
3cee93ac
CF
1020@item MAINTENANCE_CMDS
1021If the value of this is 1, then a number of optional maintenance
1022commands are compiled in.
eb752e4e 1023
3cee93ac
CF
1024@item MALLOC_INCOMPATIBLE
1025Define this if the system's prototype for @code{malloc} differs from the
1026@sc{ANSI} definition.
eb752e4e 1027
3cee93ac
CF
1028@item MMAP_BASE_ADDRESS
1029When using HAVE_MMAP, the first mapping should go at this address.
eb752e4e 1030
3cee93ac
CF
1031@item MMAP_INCREMENT
1032when using HAVE_MMAP, this is the increment between mappings.
7f09f15f 1033
3cee93ac
CF
1034@item NEED_POSIX_SETPGID
1035Define this to use the POSIX version of @code{setpgid} to determine
1036whether job control is available.
7f09f15f 1037
3cee93ac
CF
1038@item NORETURN
1039If defined, this should be one or more tokens, such as @code{volatile},
1040that can be used in both the declaration and definition of functions to
1041indicate that they never return. The default is already set correctly
1042if compiling with GCC. This will almost never need to be defined.
7f09f15f 1043
3cee93ac
CF
1044@item ATTR_NORETURN
1045If defined, this should be one or more tokens, such as
1046@code{__attribute__ ((noreturn))}, that can be used in the declarations
1047of functions to indicate that they never return. The default is already
1048set correctly if compiling with GCC. This will almost never need to be
1049defined.
7f09f15f 1050
3cee93ac
CF
1051@item USE_MMALLOC
1052GDB will use the @code{mmalloc} library for memory allocation for symbol
1053reading if this symbol is defined. Be careful defining it since there
1054are systems on which @code{mmalloc} does not work for some reason. One
1055example is the DECstation, where its RPC library can't cope with our
1056redefinition of @code{malloc} to call @code{mmalloc}. When defining
1057@code{USE_MMALLOC}, you will also have to set @code{MMALLOC} in the
1058Makefile, to point to the mmalloc library. This define is set when you
1059configure with --with-mmalloc.
7f09f15f 1060
3cee93ac
CF
1061@item NO_MMCHECK
1062Define this if you are using @code{mmalloc}, but don't want the overhead
1063of checking the heap with @code{mmcheck}. Note that on some systems,
1064the C runtime makes calls to malloc prior to calling @code{main}, and if
1065@code{free} is ever called with these pointers after calling
1066@code{mmcheck} to enable checking, a memory corruption abort is certain
1067to occur. These systems can still use mmalloc, but must define
1068NO_MMCHECK.
f8f37439 1069
3cee93ac
CF
1070@item MMCHECK_FORCE
1071Define this to 1 if the C runtime allocates memory prior to
1072@code{mmcheck} being called, but that memory is never freed so we don't
1073have to worry about it triggering a memory corruption abort. The
1074default is 0, which means that @code{mmcheck} will only install the heap
1075checking functions if there has not yet been any memory allocation
1076calls, and if it fails to install the functions, gdb will issue a
1077warning. This is currently defined if you configure using
1078--with-mmalloc.
7f09f15f 1079
3cee93ac
CF
1080@item NO_SIGINTERRUPT
1081Define this to indicate that siginterrupt() is not available.
7f09f15f 1082
3cee93ac
CF
1083@item R_OK
1084Define if this is not in a system .h file.
7f09f15f 1085
3cee93ac
CF
1086@item SEEK_CUR
1087@item SEEK_SET
1088Define these to appropriate value for the system lseek(), if not already
1089defined.
7f09f15f 1090
3cee93ac
CF
1091@item STOP_SIGNAL
1092This is the signal for stopping GDB. Defaults to SIGTSTP. (Only
1093redefined for the Convex.)
7f09f15f 1094
3cee93ac
CF
1095@item USE_O_NOCTTY
1096Define this if the interior's tty should be opened with the O_NOCTTY
1097flag. (FIXME: This should be a native-only flag, but @file{inflow.c} is
1098always linked in.)
7f09f15f 1099
3cee93ac
CF
1100@item USG
1101Means that System V (prior to SVR4) include files are in use. (FIXME:
1102This symbol is abused in @file{infrun.c}, @file{regex.c},
1103@file{remote-nindy.c}, and @file{utils.c} for other things, at the
1104moment.)
7f09f15f 1105
3cee93ac
CF
1106@item lint
1107Define this to help placate lint in some situations.
7f09f15f 1108
3cee93ac
CF
1109@item volatile
1110Define this to override the defaults of @code{__volatile__} or
1111@code{/**/}.
7f09f15f 1112
3cee93ac 1113@end table
7f09f15f 1114
7f09f15f 1115
3cee93ac 1116@node Target Architecture Definition
7f09f15f 1117
3cee93ac 1118@chapter Target Architecture Definition
7f09f15f 1119
3cee93ac
CF
1120GDB's target architecture defines what sort of machine-language programs
1121GDB can work with, and how it works with them.
edbf28ce 1122
3cee93ac
CF
1123At present, the target architecture definition consists of a number of C
1124macros.
edbf28ce 1125
3cee93ac 1126@section Registers and Memory
edbf28ce 1127
3cee93ac
CF
1128GDB's model of the target machine is rather simple. GDB assumes the
1129machine includes a bank of registers and a block of memory. Each
1130register may have a different size.
edbf28ce 1131
3cee93ac
CF
1132GDB does not have a magical way to match up with the compiler's idea of
1133which registers are which; however, it is critical that they do match up
1134accurately. The only way to make this work is to get accurate
1135information about the order that the compiler uses, and to reflect that
9ddf9aa9 1136in the @code{REGISTER_NAME} and related macros.
edbf28ce 1137
3cee93ac 1138GDB can handle big-endian, little-endian, and bi-endian architectures.
00db1549 1139
3cee93ac 1140@section Frame Interpretation
00db1549 1141
3cee93ac 1142@section Inferior Call Setup
00db1549 1143
3cee93ac 1144@section Compiler Characteristics
00db1549 1145
3cee93ac 1146@section Target Conditionals
00db1549 1147
3cee93ac
CF
1148This section describes the macros that you can use to define the target
1149machine.
00db1549 1150
3cee93ac 1151@table @code
00db1549 1152
3cee93ac
CF
1153@item ADDITIONAL_OPTIONS
1154@item ADDITIONAL_OPTION_CASES
1155@item ADDITIONAL_OPTION_HANDLER
1156@item ADDITIONAL_OPTION_HELP
1157These are a set of macros that allow the addition of additional command
1158line options to GDB. They are currently used only for the unsupported
1159i960 Nindy target, and should not be used in any other configuration.
00db1549 1160
3cee93ac
CF
1161@item ADDR_BITS_REMOVE (addr)
1162If a raw machine address includes any bits that are not really part of
1163the address, then define this macro to expand into an expression that
1164zeros those bits in @var{addr}. For example, the two low-order bits of
1165a Motorola 88K address may be used by some kernels for their own
1166purposes, since addresses must always be 4-byte aligned, and so are of
1167no use for addressing. Those bits should be filtered out with an
1168expression such as @code{((addr) & ~3)}.
d3d6d0ff 1169
3cee93ac
CF
1170@item BEFORE_MAIN_LOOP_HOOK
1171Define this to expand into any code that you want to execute before the
1172main loop starts. Although this is not, strictly speaking, a target
1173conditional, that is how it is currently being used. Note that if a
1174configuration were to define it one way for a host and a different way
1175for the target, GDB will probably not compile, let alone run correctly.
1176This is currently used only for the unsupported i960 Nindy target, and
1177should not be used in any other configuration.
d3d6d0ff 1178
3cee93ac
CF
1179@item BELIEVE_PCC_PROMOTION
1180Define if the compiler promotes a short or char parameter to an int, but
1181still reports the parameter as its original type, rather than the
1182promoted type.
d3d6d0ff 1183
3cee93ac
CF
1184@item BELIEVE_PCC_PROMOTION_TYPE
1185Define this if GDB should believe the type of a short argument when
1186compiled by pcc, but look within a full int space to get its value.
1187Only defined for Sun-3 at present.
968720bf 1188
3cee93ac
CF
1189@item BITS_BIG_ENDIAN
1190Define this if the numbering of bits in the targets does *not* match the
1191endianness of the target byte order. A value of 1 means that the bits
1192are numbered in a big-endian order, 0 means little-endian.
968720bf 1193
3cee93ac
CF
1194@item BREAKPOINT
1195This is the character array initializer for the bit pattern to put into
1196memory where a breakpoint is set. Although it's common to use a trap
1197instruction for a breakpoint, it's not required; for instance, the bit
1198pattern could be an invalid instruction. The breakpoint must be no
1199longer than the shortest instruction of the architecture.
968720bf 1200
3cee93ac
CF
1201@item BIG_BREAKPOINT
1202@item LITTLE_BREAKPOINT
1203Similar to BREAKPOINT, but used for bi-endian targets.
968720bf 1204
afcad54a
AC
1205@item REMOTE_BREAKPOINT
1206@item LITTLE_REMOTE_BREAKPOINT
1207@item BIG_REMOTE_BREAKPOINT
1208Similar to BREAKPOINT, but used for remote targets.
1209
1210@item BREAKPOINT_FROM_PC (pcptr, lenptr)
1211
1212Use the program counter to determine the contents and size of a
1213breakpoint instruction. It returns a pointer to a string of bytes that
1214encode a breakpoint instruction, stores the length of the string to
1215*lenptr, and adjusts pc (if necessary) to point to the actual memory
1216location where the breakpoint should be inserted.
1217
1218Although it is common to use a trap instruction for a breakpoint, it's
1219not required; for instance, the bit pattern could be an invalid
1220instruction. The breakpoint must be no longer than the shortest
1221instruction of the architecture.
1222
1223Replaces all the other BREAKPOINTs.
1224
3cee93ac
CF
1225@item CALL_DUMMY
1226valops.c
1227@item CALL_DUMMY_LOCATION
1228inferior.h
1229@item CALL_DUMMY_STACK_ADJUST
1230valops.c
968720bf 1231
3cee93ac
CF
1232@item CANNOT_FETCH_REGISTER (regno)
1233A C expression that should be nonzero if @var{regno} cannot be fetched
1234from an inferior process. This is only relevant if
1235@code{FETCH_INFERIOR_REGISTERS} is not defined.
968720bf 1236
3cee93ac
CF
1237@item CANNOT_STORE_REGISTER (regno)
1238A C expression that should be nonzero if @var{regno} should not be
1239written to the target. This is often the case for program counters,
1240status words, and other special registers. If this is not defined, GDB
1241will assume that all registers may be written.
968720bf 1242
3cee93ac
CF
1243@item DO_DEFERRED_STORES
1244@item CLEAR_DEFERRED_STORES
1245Define this to execute any deferred stores of registers into the inferior,
1246and to cancel any deferred stores.
968720bf 1247
3cee93ac 1248Currently only implemented correctly for native Sparc configurations?
2a20c602 1249
3cee93ac
CF
1250@item CPLUS_MARKER
1251Define this to expand into the character that G++ uses to distinguish
1252compiler-generated identifiers from programmer-specified identifiers.
1253By default, this expands into @code{'$'}. Most System V targets should
1254define this to @code{'.'}.
2a20c602 1255
3cee93ac
CF
1256@item DBX_PARM_SYMBOL_CLASS
1257Hook for the @code{SYMBOL_CLASS} of a parameter when decoding DBX symbol
1258information. In the i960, parameters can be stored as locals or as
1259args, depending on the type of the debug record.
2a20c602 1260
3cee93ac
CF
1261@item DECR_PC_AFTER_BREAK
1262Define this to be the amount by which to decrement the PC after the
1263program encounters a breakpoint. This is often the number of bytes in
1264BREAKPOINT, though not always. For most targets this value will be 0.
2a20c602 1265
3cee93ac
CF
1266@item DECR_PC_AFTER_HW_BREAK
1267Similarly, for hardware breakpoints.
2a20c602 1268
3cee93ac
CF
1269@item DISABLE_UNSETTABLE_BREAK addr
1270If defined, this should evaluate to 1 if @var{addr} is in a shared
1271library in which breakpoints cannot be set and so should be disabled.
2a20c602 1272
3cee93ac
CF
1273@item DO_REGISTERS_INFO
1274If defined, use this to print the value of a register or all registers.
2a20c602 1275
3cee93ac
CF
1276@item END_OF_TEXT_DEFAULT
1277This is an expression that should designate the end of the text section
1278(? FIXME ?)
2a20c602 1279
3cee93ac
CF
1280@item EXTRACT_RETURN_VALUE(type,regbuf,valbuf)
1281Define this to extract a function's return value of type @var{type} from
1282the raw register state @var{regbuf} and copy that, in virtual format,
1283into @var{valbuf}.
1284
1285@item EXTRACT_STRUCT_VALUE_ADDRESS(regbuf)
1286Define this to extract from an array @var{regbuf} containing the (raw)
1287register state, the address in which a function should return its
1288structure value, as a CORE_ADDR (or an expression that can be used as
1289one).
2a20c602 1290
3cee93ac
CF
1291@item FLOAT_INFO
1292If defined, then the `info float' command will print information about
1293the processor's floating point unit.
a5e7f259 1294
3cee93ac
CF
1295@item FP_REGNUM
1296The number of the frame pointer register.
a5e7f259 1297
3cee93ac
CF
1298@item FRAMELESS_FUNCTION_INVOCATION(fi, frameless)
1299Define this to set the variable @var{frameless} to 1 if the function
1300invocation represented by @var{fi} does not have a stack frame
1301associated with it. Otherwise set it to 0.
a5e7f259 1302
3cee93ac
CF
1303@item FRAME_ARGS_ADDRESS_CORRECT
1304stack.c
a5e7f259 1305
3cee93ac
CF
1306@item FRAME_CHAIN(frame)
1307Given @var{frame}, return a pointer to the calling frame.
a5e7f259 1308
3cee93ac
CF
1309@item FRAME_CHAIN_COMBINE(chain,frame)
1310Define this to take the frame chain pointer and the frame's nominal
1311address and produce the nominal address of the caller's frame.
1312Presently only defined for HP PA.
a5e7f259 1313
3cee93ac 1314@item FRAME_CHAIN_VALID(chain,thisframe)
b6960094 1315
3cee93ac 1316Define this to be an expression that returns zero if the given frame is
b6960094
AC
1317an outermost frame, with no caller, and nonzero otherwise. Three common
1318definitions are available. @code{default_frame_chain_valid} (the
1319default) is nonzero if the chain pointer is nonzero and given frame's PC
1320is not inside the startup file (such as @file{crt0.o}).
1321@code{alternate_frame_chain_valid} is nonzero if the chain pointer is
1322nonzero and the given frame's PC is not in @code{main()} or a known
3cee93ac 1323entry point function (such as @code{_start()}).
2a20c602 1324
86a51f41
AC
1325@item FRAME_INIT_SAVED_REGS(frame)
1326See @file{frame.h}. Determines the address of all registers in the
1327current stack frame storing each in @code{frame->saved_regs}. Space for
1328@code{frame->saved_regs} shall be allocated by
1329@code{FRAME_INIT_SAVED_REGS} using either
1330@code{frame_saved_regs_zalloc} or @code{frame_obstack_alloc}.
1331
1332@var{FRAME_FIND_SAVED_REGS} and @var{EXTRA_FRAME_INFO} are deprecated.
2a20c602 1333
3cee93ac
CF
1334@item FRAME_NUM_ARGS (val, fi)
1335For the frame described by @var{fi}, set @var{val} to the number of arguments
1336that are being passed.
2a20c602 1337
3cee93ac
CF
1338@item FRAME_SAVED_PC(frame)
1339Given @var{frame}, return the pc saved there. That is, the return
1340address.
2a20c602 1341
3cee93ac
CF
1342@item FUNCTION_EPILOGUE_SIZE
1343For some COFF targets, the @code{x_sym.x_misc.x_fsize} field of the
1344function end symbol is 0. For such targets, you must define
1345@code{FUNCTION_EPILOGUE_SIZE} to expand into the standard size of a
1346function's epilogue.
2a20c602 1347
3cee93ac
CF
1348@item GCC_COMPILED_FLAG_SYMBOL
1349@item GCC2_COMPILED_FLAG_SYMBOL
1350If defined, these are the names of the symbols that GDB will look for to
1351detect that GCC compiled the file. The default symbols are
1352@code{gcc_compiled.} and @code{gcc2_compiled.}, respectively. (Currently
1353only defined for the Delta 68.)
2a20c602 1354
3cee93ac
CF
1355@item GDB_TARGET_IS_HPPA
1356This determines whether horrible kludge code in dbxread.c and
1357partial-stab.h is used to mangle multiple-symbol-table files from
1358HPPA's. This should all be ripped out, and a scheme like elfread.c
1359used.
2a20c602 1360
3cee93ac
CF
1361@item GDB_TARGET_IS_MACH386
1362@item GDB_TARGET_IS_SUN3
1363@item GDB_TARGET_IS_SUN386
1364Kludges that should go away.
2a20c602 1365
3cee93ac
CF
1366@item GET_LONGJMP_TARGET
1367For most machines, this is a target-dependent parameter. On the
1368DECstation and the Iris, this is a native-dependent parameter, since
1369<setjmp.h> is needed to define it.
2a20c602 1370
3cee93ac
CF
1371This macro determines the target PC address that longjmp() will jump to,
1372assuming that we have just stopped at a longjmp breakpoint. It takes a
1373CORE_ADDR * as argument, and stores the target PC value through this
1374pointer. It examines the current state of the machine as needed.
2a20c602 1375
3cee93ac
CF
1376@item GET_SAVED_REGISTER
1377Define this if you need to supply your own definition for the function
1378@code{get_saved_register}. Currently this is only done for the a29k.
2a20c602 1379
3cee93ac
CF
1380@item HAVE_REGISTER_WINDOWS
1381Define this if the target has register windows.
1382@item REGISTER_IN_WINDOW_P (regnum)
1383Define this to be an expression that is 1 if the given register is in
1384the window.
2a20c602 1385
3cee93ac
CF
1386@item IBM6000_TARGET
1387Shows that we are configured for an IBM RS/6000 target. This
1388conditional should be eliminated (FIXME) and replaced by
1389feature-specific macros. It was introduced in haste and we are
1390repenting at leisure.
2a20c602 1391
3cee93ac
CF
1392@item IEEE_FLOAT
1393Define this if the target system uses IEEE-format floating point numbers.
a5e7f259 1394
86a51f41
AC
1395@item INIT_EXTRA_FRAME_INFO (fromleaf, frame)
1396If additional information about the frame is required this should be
1397stored in @code{frame->extra_info}. Space for @code{frame->extra_info}
1398is allocated using @code{frame_obstack_alloc}.
493cf018 1399
3cee93ac
CF
1400@item INIT_FRAME_PC (fromleaf, prev)
1401This is a C statement that sets the pc of the frame pointed to by
1402@var{prev}. [By default...]
493cf018 1403
3a0c96a9
AC
1404@item INNER_THAN (lhs,rhs)
1405Returns non-zero if stack address @var{lhs} is inner than (nearer to the
1406stack top) stack address @var{rhs}. Define this as @code{lhs < rhs} if
1407the target's stack grows downward in memory, or @code{lhs > rsh} if the
1408stack grows upward.
493cf018 1409
3cee93ac
CF
1410@item IN_SIGTRAMP (pc, name)
1411Define this to return true if the given @var{pc} and/or @var{name}
1412indicates that the current function is a sigtramp.
cdc647da 1413
3cee93ac
CF
1414@item SIGTRAMP_START (pc)
1415@item SIGTRAMP_END (pc)
1416Define these to be the start and end address of the sigtramp for the
1417given @var{pc}. On machines where the address is just a compile time
1418constant, the macro expansion will typically just ignore the supplied
1419@var{pc}.
cdc647da 1420
9ddf9aa9 1421@item IN_SOLIB_CALL_TRAMPOLINE pc name
3cee93ac
CF
1422Define this to evaluate to nonzero if the program is stopped in the
1423trampoline that connects to a shared library.
cdc647da 1424
9ddf9aa9
AC
1425@item IN_SOLIB_RETURN_TRAMPOLINE pc name
1426Define this to evaluate to nonzero if the program is stopped in the
1427trampoline that returns from a shared library.
1428
3cee93ac
CF
1429@item IS_TRAPPED_INTERNALVAR (name)
1430This is an ugly hook to allow the specification of special actions that
1431should occur as a side-effect of setting the value of a variable
1432internal to GDB. Currently only used by the h8500. Note that this
1433could be either a host or target conditional.
54109914 1434
3cee93ac
CF
1435@item NEED_TEXT_START_END
1436Define this if GDB should determine the start and end addresses of the
1437text section. (Seems dubious.)
54109914 1438
3cee93ac
CF
1439@item NO_HIF_SUPPORT
1440(Specific to the a29k.)
cdc647da 1441
02331869
AC
1442@item SOFTWARE_SINGLE_STEP_P
1443Define this as 1 if the target does not have a hardware single-step
1444mechanism. The macro @code{SOFTWARE_SINGLE_STEP} must also be defined.
1445
1446@item SOFTWARE_SINGLE_STEP(signal,insert_breapoints_p)
1447A function that inserts or removes (dependant on
1448@var{insert_breapoints_p}) breakpoints at each possible destinations of
1449the next instruction. See @code{sparc-tdep.c} and @code{rs6000-tdep.c}
3cee93ac 1450for examples.
cdc647da 1451
3cee93ac
CF
1452@item PCC_SOL_BROKEN
1453(Used only in the Convex target.)
2a426d31 1454
3cee93ac
CF
1455@item PC_IN_CALL_DUMMY
1456inferior.h
2a426d31 1457
3cee93ac
CF
1458@item PC_LOAD_SEGMENT
1459If defined, print information about the load segment for the program
1460counter. (Defined only for the RS/6000.)
cdc647da 1461
3cee93ac
CF
1462@item PC_REGNUM
1463If the program counter is kept in a register, then define this macro to
1464be the number of that register. This need be defined only if
1465@code{TARGET_WRITE_PC} is not defined.
cdc647da 1466
3cee93ac
CF
1467@item NPC_REGNUM
1468The number of the ``next program counter'' register, if defined.
54109914 1469
3cee93ac
CF
1470@item NNPC_REGNUM
1471The number of the ``next next program counter'' register, if defined.
1472Currently, this is only defined for the Motorola 88K.
54109914 1473
3cee93ac
CF
1474@item PRINT_REGISTER_HOOK (regno)
1475If defined, this must be a function that prints the contents of the
1476given register to standard output.
54109914 1477
3cee93ac
CF
1478@item PRINT_TYPELESS_INTEGER
1479This is an obscure substitute for @code{print_longest} that seems to
1480have been defined for the Convex target.
54109914 1481
3cee93ac
CF
1482@item PROCESS_LINENUMBER_HOOK
1483A hook defined for XCOFF reading.
54109914 1484
3cee93ac
CF
1485@item PROLOGUE_FIRSTLINE_OVERLAP
1486(Only used in unsupported Convex configuration.)
54109914 1487
3cee93ac
CF
1488@item PS_REGNUM
1489If defined, this is the number of the processor status register. (This
1490definition is only used in generic code when parsing "$ps".)
54109914 1491
3cee93ac
CF
1492@item POP_FRAME
1493Used in @samp{call_function_by_hand} to remove an artificial stack
1494frame.
cdc647da 1495
3cee93ac
CF
1496@item PUSH_ARGUMENTS (nargs, args, sp, struct_return, struct_addr)
1497Define this to push arguments onto the stack for inferior function call.
cdc647da 1498
3cee93ac
CF
1499@item PUSH_DUMMY_FRAME
1500Used in @samp{call_function_by_hand} to create an artificial stack frame.
54109914 1501
3cee93ac
CF
1502@item REGISTER_BYTES
1503The total amount of space needed to store GDB's copy of the machine's
1504register state.
54109914 1505
9ddf9aa9
AC
1506@item REGISTER_NAME(i)
1507Return the name of register @var{i} as a string. May return @var{NULL}
1508or @var{NUL} to indicate that register @var{i} is not valid.
beb773f3 1509
3cee93ac
CF
1510@item REG_STRUCT_HAS_ADDR (gcc_p, type)
1511Define this to return 1 if the given type will be passed by pointer
1512rather than directly.
beb773f3 1513
3cee93ac
CF
1514@item SDB_REG_TO_REGNUM
1515Define this to convert sdb register numbers into GDB regnums. If not
1516defined, no conversion will be done.
beb773f3 1517
3cee93ac
CF
1518@item SHIFT_INST_REGS
1519(Only used for m88k targets.)
cdc647da 1520
3cee93ac
CF
1521@item SKIP_PROLOGUE (pc)
1522A C statement that advances the @var{pc} across any function entry
1523prologue instructions so as to reach ``real'' code.
cdc647da 1524
3cee93ac
CF
1525@item SKIP_PROLOGUE_FRAMELESS_P
1526A C statement that should behave similarly, but that can stop as soon as
1527the function is known to have a frame. If not defined,
1528@code{SKIP_PROLOGUE} will be used instead.
cdc647da 1529
3cee93ac
CF
1530@item SKIP_TRAMPOLINE_CODE (pc)
1531If the target machine has trampoline code that sits between callers and
1532the functions being called, then define this macro to return a new PC
1533that is at the start of the real function.
cdc647da 1534
3cee93ac
CF
1535@item SP_REGNUM
1536Define this to be the number of the register that serves as the stack
1537pointer.
beb773f3 1538
3cee93ac
CF
1539@item STAB_REG_TO_REGNUM
1540Define this to convert stab register numbers (as gotten from `r'
1541declarations) into GDB regnums. If not defined, no conversion will be
1542done.
beb773f3 1543
3cee93ac
CF
1544@item STACK_ALIGN (addr)
1545Define this to adjust the address to the alignment required for the
1546processor's stack.
beb773f3 1547
3cee93ac
CF
1548@item STEP_SKIPS_DELAY (addr)
1549Define this to return true if the address is of an instruction with a
1550delay slot. If a breakpoint has been placed in the instruction's delay
1551slot, GDB will single-step over that instruction before resuming
1552normally. Currently only defined for the Mips.
2a426d31 1553
3cee93ac
CF
1554@item STORE_RETURN_VALUE (type, valbuf)
1555A C expression that stores a function return value of type @var{type},
1556where @var{valbuf} is the address of the value to be stored.
2a426d31 1557
3cee93ac
CF
1558@item SUN_FIXED_LBRAC_BUG
1559(Used only for Sun-3 and Sun-4 targets.)
2a426d31 1560
3cee93ac
CF
1561@item SYMBOL_RELOADING_DEFAULT
1562The default value of the `symbol-reloading' variable. (Never defined in
1563current sources.)
beb773f3 1564
18b1e896
SS
1565@item TARGET_BYTE_ORDER_DEFAULT
1566The ordering of bytes in the target. This must be either
1567@code{BIG_ENDIAN} or @code{LITTLE_ENDIAN}. This macro replaces
1568@var{TARGET_BYTE_ORDER} which is deprecated.
1569
1570@item TARGET_BYTE_ORDER_SELECTABLE_P
1571Non-zero if the target has both @code{BIG_ENDIAN} and
1572@code{LITTLE_ENDIAN} variants. This macro replaces
1573@var{TARGET_BYTE_ORDER_SELECTABLE} which is deprecated.
2a426d31 1574
3cee93ac
CF
1575@item TARGET_CHAR_BIT
1576Number of bits in a char; defaults to 8.
cdc647da 1577
3cee93ac
CF
1578@item TARGET_COMPLEX_BIT
1579Number of bits in a complex number; defaults to @code{2 * TARGET_FLOAT_BIT}.
cdc647da 1580
3cee93ac
CF
1581@item TARGET_DOUBLE_BIT
1582Number of bits in a double float; defaults to @code{8 * TARGET_CHAR_BIT}.
cdc647da 1583
3cee93ac
CF
1584@item TARGET_DOUBLE_COMPLEX_BIT
1585Number of bits in a double complex; defaults to @code{2 * TARGET_DOUBLE_BIT}.
cdc647da 1586
3cee93ac
CF
1587@item TARGET_FLOAT_BIT
1588Number of bits in a float; defaults to @code{4 * TARGET_CHAR_BIT}.
cdc647da 1589
3cee93ac
CF
1590@item TARGET_INT_BIT
1591Number of bits in an integer; defaults to @code{4 * TARGET_CHAR_BIT}.
cdc647da 1592
3cee93ac
CF
1593@item TARGET_LONG_BIT
1594Number of bits in a long integer; defaults to @code{4 * TARGET_CHAR_BIT}.
cdc647da 1595
3cee93ac
CF
1596@item TARGET_LONG_DOUBLE_BIT
1597Number of bits in a long double float;
1598defaults to @code{2 * TARGET_DOUBLE_BIT}.
beb773f3 1599
3cee93ac
CF
1600@item TARGET_LONG_LONG_BIT
1601Number of bits in a long long integer; defaults to @code{2 * TARGET_LONG_BIT}.
cdc647da 1602
3cee93ac
CF
1603@item TARGET_PTR_BIT
1604Number of bits in a pointer; defaults to @code{TARGET_INT_BIT}.
cdc647da 1605
3cee93ac
CF
1606@item TARGET_SHORT_BIT
1607Number of bits in a short integer; defaults to @code{2 * TARGET_CHAR_BIT}.
cdc647da 1608
3cee93ac
CF
1609@item TARGET_READ_PC
1610@item TARGET_WRITE_PC (val, pid)
1611@item TARGET_READ_SP
1612@item TARGET_WRITE_SP
1613@item TARGET_READ_FP
1614@item TARGET_WRITE_FP
1615These change the behavior of @code{read_pc}, @code{write_pc},
1616@code{read_sp}, @code{write_sp}, @code{read_fp} and @code{write_fp}.
1617For most targets, these may be left undefined. GDB will call the read
1618and write register functions with the relevant @code{_REGNUM} argument.
54109914 1619
3cee93ac
CF
1620These macros are useful when a target keeps one of these registers in a
1621hard to get at place; for example, part in a segment register and part
1622in an ordinary register.
cdc647da 1623
02331869
AC
1624@item TARGET_VIRTUAL_FRAME_POINTER(pc,regp,offsetp)
1625Returns a @code{(register, offset)} pair representing the virtual
1626frame pointer in use at the code address @code{"pc"}. If virtual
1627frame pointers are not used, a default definition simply returns
1628@code{FP_REGNUM}, with an offset of zero.
1629
3cee93ac
CF
1630@item USE_STRUCT_CONVENTION (gcc_p, type)
1631If defined, this must be an expression that is nonzero if a value of the
1632given @var{type} being returned from a function must have space
1633allocated for it on the stack. @var{gcc_p} is true if the function
1634being considered is known to have been compiled by GCC; this is helpful
1635for systems where GCC is known to use different calling convention than
1636other compilers.
cdc647da 1637
3cee93ac
CF
1638@item VARIABLES_INSIDE_BLOCK (desc, gcc_p)
1639For dbx-style debugging information, if the compiler puts variable
1640declarations inside LBRAC/RBRAC blocks, this should be defined to be
1641nonzero. @var{desc} is the value of @code{n_desc} from the
1642@code{N_RBRAC} symbol, and @var{gcc_p} is true if GDB has noticed the
1643presence of either the @code{GCC_COMPILED_SYMBOL} or the
1644@code{GCC2_COMPILED_SYMBOL}. By default, this is 0.
cdc647da 1645
3cee93ac
CF
1646@item OS9K_VARIABLES_INSIDE_BLOCK (desc, gcc_p)
1647Similarly, for OS/9000. Defaults to 1.
cdc647da 1648
3cee93ac 1649@end table
beb773f3 1650
3cee93ac 1651Motorola M68K target conditionals.
beb773f3 1652
3cee93ac 1653@table @code
beb773f3 1654
3cee93ac
CF
1655@item BPT_VECTOR
1656Define this to be the 4-bit location of the breakpoint trap vector. If
1657not defined, it will default to @code{0xf}.
beb773f3 1658
3cee93ac
CF
1659@item REMOTE_BPT_VECTOR
1660Defaults to @code{1}.
54109914 1661
3cee93ac 1662@end table
54109914 1663
3cee93ac 1664@section Adding a New Target
54109914 1665
3cee93ac 1666The following files define a target to GDB:
54109914 1667
3cee93ac 1668@table @file
54109914 1669
3cee93ac
CF
1670@item gdb/config/@var{arch}/@var{ttt}.mt
1671Contains a Makefile fragment specific to this target. Specifies what
1672object files are needed for target @var{ttt}, by defining
1673@samp{TDEPFILES=@dots{}}. Also specifies the header file which
1674describes @var{ttt}, by defining @samp{TM_FILE= tm-@var{ttt}.h}. You
1675can also define @samp{TM_CFLAGS}, @samp{TM_CLIBS}, @samp{TM_CDEPS}, but
1676these are now deprecated and may go away in future versions of GDB.
54109914 1677
3cee93ac
CF
1678@item gdb/config/@var{arch}/tm-@var{ttt}.h
1679(@file{tm.h} is a link to this file, created by configure). Contains
1680macro definitions about the target machine's registers, stack frame
1681format and instructions.
54109914 1682
3cee93ac
CF
1683@item gdb/@var{ttt}-tdep.c
1684Contains any miscellaneous code required for this target machine. On
1685some machines it doesn't exist at all. Sometimes the macros in
1686@file{tm-@var{ttt}.h} become very complicated, so they are implemented
1687as functions here instead, and the macro is simply defined to call the
1688function. This is vastly preferable, since it is easier to understand
1689and debug.
beb773f3 1690
3cee93ac
CF
1691@item gdb/config/@var{arch}/tm-@var{arch}.h
1692This often exists to describe the basic layout of the target machine's
1693processor chip (registers, stack, etc). If used, it is included by
1694@file{tm-@var{ttt}.h}. It can be shared among many targets that use the
1695same processor.
7f494564 1696
3cee93ac
CF
1697@item gdb/@var{arch}-tdep.c
1698Similarly, there are often common subroutines that are shared by all
1699target machines that use this particular architecture.
493cf018 1700
7f494564
SS
1701@end table
1702
3cee93ac
CF
1703If you are adding a new operating system for an existing CPU chip, add a
1704@file{config/tm-@var{os}.h} file that describes the operating system
1705facilities that are unusual (extra symbol table info; the breakpoint
1706instruction needed; etc). Then write a @file{@var{arch}/tm-@var{os}.h}
1707that just @code{#include}s @file{tm-@var{arch}.h} and
1708@file{config/tm-@var{os}.h}.
493cf018 1709
493cf018 1710
3cee93ac 1711@node Target Vector Definition
493cf018 1712
3cee93ac 1713@chapter Target Vector Definition
cdc647da 1714
3cee93ac
CF
1715The target vector defines the interface between GDB's abstract handling
1716of target systems, and the nitty-gritty code that actually exercises
1717control over a process or a serial port. GDB includes some 30-40
1718different target vectors; however, each configuration of GDB includes
1719only a few of them.
cdc647da 1720
3cee93ac 1721@section File Targets
cdc647da 1722
3cee93ac 1723Both executables and core files have target vectors.
beb773f3 1724
3cee93ac 1725@section Standard Protocol and Remote Stubs
beb773f3 1726
3cee93ac
CF
1727GDB's file @file{remote.c} talks a serial protocol to code that runs in
1728the target system. GDB provides several sample ``stubs'' that can be
1729integrated into target programs or operating systems for this purpose;
1730they are named @file{*-stub.c}.
54109914 1731
3cee93ac
CF
1732The GDB user's manual describes how to put such a stub into your target
1733code. What follows is a discussion of integrating the SPARC stub into a
1734complicated operating system (rather than a simple program), by Stu
1735Grossman, the author of this stub.
54109914 1736
3cee93ac
CF
1737The trap handling code in the stub assumes the following upon entry to
1738trap_low:
beb773f3 1739
3cee93ac 1740@enumerate
beb773f3 1741
3cee93ac 1742@item %l1 and %l2 contain pc and npc respectively at the time of the trap
54109914 1743
3cee93ac 1744@item traps are disabled
beb773f3 1745
3cee93ac 1746@item you are in the correct trap window
beb773f3 1747
3cee93ac 1748@end enumerate
beb773f3 1749
3cee93ac
CF
1750As long as your trap handler can guarantee those conditions, then there
1751is no reason why you shouldn't be able to `share' traps with the stub.
1752The stub has no requirement that it be jumped to directly from the
1753hardware trap vector. That is why it calls @code{exceptionHandler()},
1754which is provided by the external environment. For instance, this could
1755setup the hardware traps to actually execute code which calls the stub
1756first, and then transfers to its own trap handler.
54109914 1757
3cee93ac
CF
1758For the most point, there probably won't be much of an issue with
1759`sharing' traps, as the traps we use are usually not used by the kernel,
1760and often indicate unrecoverable error conditions. Anyway, this is all
1761controlled by a table, and is trivial to modify. The most important
1762trap for us is for @code{ta 1}. Without that, we can't single step or
1763do breakpoints. Everything else is unnecessary for the proper operation
1764of the debugger/stub.
54109914 1765
3cee93ac
CF
1766From reading the stub, it's probably not obvious how breakpoints work.
1767They are simply done by deposit/examine operations from GDB.
54109914 1768
3cee93ac 1769@section ROM Monitor Interface
54109914 1770
3cee93ac 1771@section Custom Protocols
54109914 1772
3cee93ac 1773@section Transport Layer
54109914 1774
3cee93ac 1775@section Builtin Simulator
54109914 1776
54109914 1777
3cee93ac 1778@node Native Debugging
54109914 1779
3cee93ac 1780@chapter Native Debugging
cdc647da 1781
3cee93ac 1782Several files control GDB's configuration for native support:
cdc647da 1783
3cee93ac 1784@table @file
beb773f3 1785
3cee93ac
CF
1786@item gdb/config/@var{arch}/@var{xyz}.mh
1787Specifies Makefile fragments needed when hosting @emph{or native} on
1788machine @var{xyz}. In particular, this lists the required
1789native-dependent object files, by defining @samp{NATDEPFILES=@dots{}}.
1790Also specifies the header file which describes native support on
1791@var{xyz}, by defining @samp{NAT_FILE= nm-@var{xyz}.h}. You can also
1792define @samp{NAT_CFLAGS}, @samp{NAT_ADD_FILES}, @samp{NAT_CLIBS},
1793@samp{NAT_CDEPS}, etc.; see @file{Makefile.in}.
beb773f3 1794
3cee93ac
CF
1795@item gdb/config/@var{arch}/nm-@var{xyz}.h
1796(@file{nm.h} is a link to this file, created by configure). Contains C
1797macro definitions describing the native system environment, such as
1798child process control and core file support.
beb773f3 1799
3cee93ac
CF
1800@item gdb/@var{xyz}-nat.c
1801Contains any miscellaneous C code required for this native support of
1802this machine. On some machines it doesn't exist at all.
cdc647da 1803
3cee93ac 1804@end table
cdc647da 1805
3cee93ac
CF
1806There are some ``generic'' versions of routines that can be used by
1807various systems. These can be customized in various ways by macros
1808defined in your @file{nm-@var{xyz}.h} file. If these routines work for
1809the @var{xyz} host, you can just include the generic file's name (with
1810@samp{.o}, not @samp{.c}) in @code{NATDEPFILES}.
cdc647da 1811
3cee93ac
CF
1812Otherwise, if your machine needs custom support routines, you will need
1813to write routines that perform the same functions as the generic file.
1814Put them into @code{@var{xyz}-nat.c}, and put @code{@var{xyz}-nat.o}
1815into @code{NATDEPFILES}.
cdc647da 1816
3cee93ac 1817@table @file
cdc647da 1818
3cee93ac
CF
1819@item inftarg.c
1820This contains the @emph{target_ops vector} that supports Unix child
1821processes on systems which use ptrace and wait to control the child.
cdc647da 1822
3cee93ac
CF
1823@item procfs.c
1824This contains the @emph{target_ops vector} that supports Unix child
1825processes on systems which use /proc to control the child.
54109914 1826
3cee93ac
CF
1827@item fork-child.c
1828This does the low-level grunge that uses Unix system calls to do a "fork
1829and exec" to start up a child process.
cdc647da 1830
3cee93ac
CF
1831@item infptrace.c
1832This is the low level interface to inferior processes for systems using
1833the Unix @code{ptrace} call in a vanilla way.
cdc647da 1834
3cee93ac 1835@end table
cdc647da 1836
3cee93ac 1837@section Native core file Support
c3bbca3a 1838
3cee93ac 1839@table @file
cdc647da 1840
3cee93ac
CF
1841@item core-aout.c::fetch_core_registers()
1842Support for reading registers out of a core file. This routine calls
1843@code{register_addr()}, see below. Now that BFD is used to read core
1844files, virtually all machines should use @code{core-aout.c}, and should
1845just provide @code{fetch_core_registers} in @code{@var{xyz}-nat.c} (or
1846@code{REGISTER_U_ADDR} in @code{nm-@var{xyz}.h}).
54109914 1847
3cee93ac
CF
1848@item core-aout.c::register_addr()
1849If your @code{nm-@var{xyz}.h} file defines the macro
1850@code{REGISTER_U_ADDR(addr, blockend, regno)}, it should be defined to
1851set @code{addr} to the offset within the @samp{user} struct of GDB
1852register number @code{regno}. @code{blockend} is the offset within the
1853``upage'' of @code{u.u_ar0}. If @code{REGISTER_U_ADDR} is defined,
1854@file{core-aout.c} will define the @code{register_addr()} function and
1855use the macro in it. If you do not define @code{REGISTER_U_ADDR}, but
1856you are using the standard @code{fetch_core_registers()}, you will need
1857to define your own version of @code{register_addr()}, put it into your
1858@code{@var{xyz}-nat.c} file, and be sure @code{@var{xyz}-nat.o} is in
1859the @code{NATDEPFILES} list. If you have your own
1860@code{fetch_core_registers()}, you may not need a separate
1861@code{register_addr()}. Many custom @code{fetch_core_registers()}
1862implementations simply locate the registers themselves.@refill
54109914 1863
3cee93ac 1864@end table
54109914 1865
3cee93ac
CF
1866When making GDB run native on a new operating system, to make it
1867possible to debug core files, you will need to either write specific
1868code for parsing your OS's core files, or customize
1869@file{bfd/trad-core.c}. First, use whatever @code{#include} files your
1870machine uses to define the struct of registers that is accessible
1871(possibly in the u-area) in a core file (rather than
1872@file{machine/reg.h}), and an include file that defines whatever header
1873exists on a core file (e.g. the u-area or a @samp{struct core}). Then
1874modify @code{trad_unix_core_file_p()} to use these values to set up the
1875section information for the data segment, stack segment, any other
1876segments in the core file (perhaps shared library contents or control
1877information), ``registers'' segment, and if there are two discontiguous
1878sets of registers (e.g. integer and float), the ``reg2'' segment. This
1879section information basically delimits areas in the core file in a
1880standard way, which the section-reading routines in BFD know how to seek
1881around in.
cdc647da 1882
3cee93ac
CF
1883Then back in GDB, you need a matching routine called
1884@code{fetch_core_registers()}. If you can use the generic one, it's in
1885@file{core-aout.c}; if not, it's in your @file{@var{xyz}-nat.c} file.
1886It will be passed a char pointer to the entire ``registers'' segment,
1887its length, and a zero; or a char pointer to the entire ``regs2''
1888segment, its length, and a 2. The routine should suck out the supplied
1889register values and install them into GDB's ``registers'' array.
cdc647da 1890
3cee93ac
CF
1891If your system uses @file{/proc} to control processes, and uses ELF
1892format core files, then you may be able to use the same routines for
1893reading the registers out of processes and out of core files.
54109914 1894
3cee93ac 1895@section ptrace
beb773f3 1896
3cee93ac 1897@section /proc
beb773f3 1898
3cee93ac 1899@section win32
beb773f3 1900
3cee93ac 1901@section shared libraries
beb773f3 1902
3cee93ac 1903@section Native Conditionals
beb773f3 1904
3cee93ac
CF
1905When GDB is configured and compiled, various macros are defined or left
1906undefined, to control compilation when the host and target systems are
1907the same. These macros should be defined (or left undefined) in
1908@file{nm-@var{system}.h}.
54109914 1909
3cee93ac 1910@table @code
54109914 1911
3cee93ac
CF
1912@item ATTACH_DETACH
1913If defined, then GDB will include support for the @code{attach} and
1914@code{detach} commands.
54109914 1915
3cee93ac
CF
1916@item CHILD_PREPARE_TO_STORE
1917If the machine stores all registers at once in the child process, then
1918define this to ensure that all values are correct. This usually entails
1919a read from the child.
54109914 1920
3cee93ac
CF
1921[Note that this is incorrectly defined in @file{xm-@var{system}.h} files
1922currently.]
54109914 1923
3cee93ac
CF
1924@item FETCH_INFERIOR_REGISTERS
1925Define this if the native-dependent code will provide its own routines
1926@code{fetch_inferior_registers} and @code{store_inferior_registers} in
1927@file{@var{HOST}-nat.c}. If this symbol is @emph{not} defined, and
1928@file{infptrace.c} is included in this configuration, the default
1929routines in @file{infptrace.c} are used for these functions.
54109914 1930
3cee93ac
CF
1931@item FILES_INFO_HOOK
1932(Only defined for Convex.)
beb773f3 1933
3cee93ac
CF
1934@item FP0_REGNUM
1935This macro is normally defined to be the number of the first floating
1936point register, if the machine has such registers. As such, it would
1937appear only in target-specific code. However, /proc support uses this
1938to decide whether floats are in use on this target.
beb773f3 1939
3cee93ac
CF
1940@item GET_LONGJMP_TARGET
1941For most machines, this is a target-dependent parameter. On the
1942DECstation and the Iris, this is a native-dependent parameter, since
1943<setjmp.h> is needed to define it.
beb773f3 1944
3cee93ac
CF
1945This macro determines the target PC address that longjmp() will jump to,
1946assuming that we have just stopped at a longjmp breakpoint. It takes a
1947CORE_ADDR * as argument, and stores the target PC value through this
1948pointer. It examines the current state of the machine as needed.
beb773f3 1949
3cee93ac
CF
1950@item KERNEL_U_ADDR
1951Define this to the address of the @code{u} structure (the ``user
1952struct'', also known as the ``u-page'') in kernel virtual memory. GDB
1953needs to know this so that it can subtract this address from absolute
1954addresses in the upage, that are obtained via ptrace or from core files.
1955On systems that don't need this value, set it to zero.
beb773f3 1956
3cee93ac
CF
1957@item KERNEL_U_ADDR_BSD
1958Define this to cause GDB to determine the address of @code{u} at
1959runtime, by using Berkeley-style @code{nlist} on the kernel's image in
1960the root directory.
beb773f3 1961
3cee93ac
CF
1962@item KERNEL_U_ADDR_HPUX
1963Define this to cause GDB to determine the address of @code{u} at
1964runtime, by using HP-style @code{nlist} on the kernel's image in the
1965root directory.
54109914 1966
3cee93ac
CF
1967@item ONE_PROCESS_WRITETEXT
1968Define this to be able to, when a breakpoint insertion fails, warn the
1969user that another process may be running with the same executable.
54109914 1970
3cee93ac
CF
1971@item PROC_NAME_FMT
1972Defines the format for the name of a @file{/proc} device. Should be
1973defined in @file{nm.h} @emph{only} in order to override the default
1974definition in @file{procfs.c}.
54109914 1975
3cee93ac
CF
1976@item PTRACE_FP_BUG
1977mach386-xdep.c
54109914 1978
3cee93ac
CF
1979@item PTRACE_ARG3_TYPE
1980The type of the third argument to the @code{ptrace} system call, if it
1981exists and is different from @code{int}.
cdc647da 1982
3cee93ac
CF
1983@item REGISTER_U_ADDR
1984Defines the offset of the registers in the ``u area''.
cdc647da 1985
493cf018 1986@item SHELL_COMMAND_CONCAT
3cee93ac
CF
1987If defined, is a string to prefix on the shell command used to start the
1988inferior.
1989
493cf018 1990@item SHELL_FILE
3cee93ac
CF
1991If defined, this is the name of the shell to use to run the inferior.
1992Defaults to @code{"/bin/sh"}.
cdc647da 1993
3cee93ac
CF
1994@item SOLIB_ADD (filename, from_tty, targ)
1995Define this to expand into an expression that will cause the symbols in
1996@var{filename} to be added to GDB's symbol table.
cdc647da 1997
3cee93ac
CF
1998@item SOLIB_CREATE_INFERIOR_HOOK
1999Define this to expand into any shared-library-relocation code that you
2000want to be run just after the child process has been forked.
cdc647da 2001
3cee93ac
CF
2002@item START_INFERIOR_TRAPS_EXPECTED
2003When starting an inferior, GDB normally expects to trap twice; once when
2004the shell execs, and once when the program itself execs. If the actual
2005number of traps is something other than 2, then define this macro to
2006expand into the number expected.
beb773f3 2007
3cee93ac
CF
2008@item SVR4_SHARED_LIBS
2009Define this to indicate that SVR4-style shared libraries are in use.
cdc647da 2010
3cee93ac
CF
2011@item USE_PROC_FS
2012This determines whether small routines in @file{*-tdep.c}, which
2013translate register values between GDB's internal representation and the
2014/proc representation, are compiled.
cdc647da 2015
3cee93ac
CF
2016@item U_REGS_OFFSET
2017This is the offset of the registers in the upage. It need only be
2018defined if the generic ptrace register access routines in
2019@file{infptrace.c} are being used (that is, @file{infptrace.c} is
2020configured in, and @code{FETCH_INFERIOR_REGISTERS} is not defined). If
2021the default value from @file{infptrace.c} is good enough, leave it
2022undefined.
cdc647da 2023
3cee93ac
CF
2024The default value means that u.u_ar0 @emph{points to} the location of
2025the registers. I'm guessing that @code{#define U_REGS_OFFSET 0} means
2026that u.u_ar0 @emph{is} the location of the registers.
cdc647da 2027
3cee93ac
CF
2028@item CLEAR_SOLIB
2029objfiles.c
cdc647da 2030
3cee93ac
CF
2031@item DEBUG_PTRACE
2032Define this to debug ptrace calls.
cdc647da 2033
3cee93ac 2034@end table
cdc647da 2035
cdc647da 2036
3cee93ac 2037@node Support Libraries
cdc647da 2038
3cee93ac 2039@chapter Support Libraries
cdc647da 2040
3cee93ac 2041@section BFD
cdc647da 2042
3cee93ac 2043BFD provides support for GDB in several ways:
cdc647da 2044
3cee93ac 2045@table @emph
cdc647da 2046
3cee93ac
CF
2047@item identifying executable and core files
2048BFD will identify a variety of file types, including a.out, coff, and
2049several variants thereof, as well as several kinds of core files.
cdc647da 2050
3cee93ac
CF
2051@item access to sections of files
2052BFD parses the file headers to determine the names, virtual addresses,
2053sizes, and file locations of all the various named sections in files
2054(such as the text section or the data section). GDB simply calls BFD to
2055read or write section X at byte offset Y for length Z.
cdc647da 2056
3cee93ac
CF
2057@item specialized core file support
2058BFD provides routines to determine the failing command name stored in a
2059core file, the signal with which the program failed, and whether a core
2060file matches (i.e. could be a core dump of) a particular executable
2061file.
cdc647da 2062
3cee93ac
CF
2063@item locating the symbol information
2064GDB uses an internal interface of BFD to determine where to find the
2065symbol information in an executable file or symbol-file. GDB itself
2066handles the reading of symbols, since BFD does not ``understand'' debug
2067symbols, but GDB uses BFD's cached information to find the symbols,
2068string table, etc.
cdc647da 2069
3cee93ac 2070@end table
238ffce0 2071
3cee93ac 2072@section opcodes
238ffce0 2073
3cee93ac
CF
2074The opcodes library provides GDB's disassembler. (It's a separate
2075library because it's also used in binutils, for @file{objdump}).
beb773f3 2076
3cee93ac 2077@section readline
beb773f3 2078
3cee93ac
CF
2079@section mmalloc
2080
2081@section libiberty
2082
2083@section gnu-regex
2084
2085Regex conditionals.
2086
2087@table @code
2088
2089@item C_ALLOCA
2090
2091@item NFAILURES
2092
2093@item RE_NREGS
2094
2095@item SIGN_EXTEND_CHAR
beb773f3 2096
3cee93ac 2097@item SWITCH_ENUM_BUG
cdc647da 2098
3cee93ac
CF
2099@item SYNTAX_TABLE
2100
2101@item Sword
2102
2103@item sparc
cdc647da 2104
493cf018
JG
2105@end table
2106
3cee93ac
CF
2107@section include
2108
2109@node Coding
2110
2111@chapter Coding
2112
2113This chapter covers topics that are lower-level than the major
2114algorithms of GDB.
2115
2116@section Cleanups
2117
2118Cleanups are a structured way to deal with things that need to be done
2119later. When your code does something (like @code{malloc} some memory,
2120or open a file) that needs to be undone later (e.g. free the memory or
2121close the file), it can make a cleanup. The cleanup will be done at
2122some future point: when the command is finished, when an error occurs,
2123or when your code decides it's time to do cleanups.
2124
2125You can also discard cleanups, that is, throw them away without doing
2126what they say. This is only done if you ask that it be done.
2127
2128Syntax:
beb773f3 2129
54109914 2130@table @code
beb773f3 2131
3cee93ac
CF
2132@item struct cleanup *@var{old_chain};
2133Declare a variable which will hold a cleanup chain handle.
beb773f3 2134
3cee93ac
CF
2135@item @var{old_chain} = make_cleanup (@var{function}, @var{arg});
2136Make a cleanup which will cause @var{function} to be called with
2137@var{arg} (a @code{char *}) later. The result, @var{old_chain}, is a
2138handle that can be passed to @code{do_cleanups} or
2139@code{discard_cleanups} later. Unless you are going to call
2140@code{do_cleanups} or @code{discard_cleanups} yourself, you can ignore
2141the result from @code{make_cleanup}.
2142
2143@item do_cleanups (@var{old_chain});
2144Perform all cleanups done since @code{make_cleanup} returned
2145@var{old_chain}. E.g.:
2146@example
2147make_cleanup (a, 0);
2148old = make_cleanup (b, 0);
2149do_cleanups (old);
2150@end example
2151@noindent
2152will call @code{b()} but will not call @code{a()}. The cleanup that
2153calls @code{a()} will remain in the cleanup chain, and will be done
2154later unless otherwise discarded.@refill
2155
2156@item discard_cleanups (@var{old_chain});
2157Same as @code{do_cleanups} except that it just removes the cleanups from
2158the chain and does not call the specified functions.
beb773f3
SS
2159
2160@end table
2161
3cee93ac
CF
2162Some functions, e.g. @code{fputs_filtered()} or @code{error()}, specify
2163that they ``should not be called when cleanups are not in place''. This
2164means that any actions you need to reverse in the case of an error or
2165interruption must be on the cleanup chain before you call these
2166functions, since they might never return to your code (they
2167@samp{longjmp} instead).
2168
2169@section Wrapping Output Lines
2170
2171Output that goes through @code{printf_filtered} or @code{fputs_filtered}
2172or @code{fputs_demangled} needs only to have calls to @code{wrap_here}
2173added in places that would be good breaking points. The utility
2174routines will take care of actually wrapping if the line width is
2175exceeded.
2176
2177The argument to @code{wrap_here} is an indentation string which is
2178printed @emph{only} if the line breaks there. This argument is saved
2179away and used later. It must remain valid until the next call to
2180@code{wrap_here} or until a newline has been printed through the
2181@code{*_filtered} functions. Don't pass in a local variable and then
2182return!
2183
2184It is usually best to call @code{wrap_here()} after printing a comma or
2185space. If you call it before printing a space, make sure that your
2186indentation properly accounts for the leading space that will print if
2187the line wraps there.
2188
2189Any function or set of functions that produce filtered output must
2190finish by printing a newline, to flush the wrap buffer, before switching
2191to unfiltered (``@code{printf}'') output. Symbol reading routines that
2192print warnings are a good example.
2193
18b1e896 2194@section GDB Coding Standards
3cee93ac
CF
2195
2196GDB follows the GNU coding standards, as described in
2197@file{etc/standards.texi}. This file is also available for anonymous
18b1e896
SS
2198FTP from GNU archive sites. GDB takes a strict interpretation of the
2199standard; in general, when the GNU standard recommends a practice but
2200does not require it, GDB requires it.
2201
2202GDB follows an additional set of coding standards specific to GDB,
2203as described in the following sections.
2204
2205You can configure with @samp{--enable-build-warnings} to get GCC to
2206check on a number of these rules. GDB sources ought not to engender any
2207complaints, unless they are caused by bogus host systems. (The exact
2208set of enabled warnings is currently @samp{-Wall -Wpointer-arith
2209-Wstrict-prototypes -Wmissing-prototypes -Wmissing-declarations}.
2210
2211@subsection Formatting
2212
2213The standard GNU recommendations for formatting must be followed
2214strictly.
2215
2216Note that while in a definition, the function's name must be in column
2217zero, in a function declaration, the name must be on the same line as
2218the return type.
968720bf 2219
18b1e896
SS
2220In addition, there must be a space between a function or macro name and
2221the opening parenthesis of its argument list (except for macro
2222definitions, as required by C). There must not be a space after an open
2223paren/bracket or before a close paren/bracket.
2224
2225While additional whitespace is generally helpful for reading, do not use
2226more than one blank line to separate blocks, and avoid adding whitespace
2227after the end of a program line (as of 1/99, some 600 lines had whitespace
2228after the semicolon). Excess whitespace causes difficulties for diff and
2229patch.
2230
2231@subsection Comments
2232
2233The standard GNU requirements on comments must be followed strictly.
2234
2235Block comments must appear in the following form, with no `/*'- or
2236'*/'-only lines, and no leading `*':
968720bf 2237
3cee93ac 2238@example @code
18b1e896
SS
2239/* Wait for control to return from inferior to debugger. If inferior
2240 gets a signal, we may decide to start it up again instead of
2241 returning. That is why there is a loop in this function. When
2242 this function actually returns it means the inferior should be left
2243 stopped and GDB should read more commands. */
2244@end example
2245
2246(Note that this format is encouraged by Emacs; tabbing for a multi-line
2247comment works correctly, and M-Q fills the block consistently.)
2248
2249Put a blank line between the block comments preceding function or
2250variable definitions, and the definition itself.
2251
2252In general, put function-body comments on lines by themselves, rather
2253than trying to fit them into the 20 characters left at the end of a
2254line, since either the comment or the code will inevitably get longer
2255than will fit, and then somebody will have to move it anyhow.
2256
2257@subsection C Usage
2258
2259Code must not depend on the sizes of C data types, the format of the
2260host's floating point numbers, the alignment of anything, or the order
2261of evaluation of expressions.
2262
2263Use functions freely. There are only a handful of compute-bound areas
2264in GDB that might be affected by the overhead of a function call, mainly
2265in symbol reading. Most of GDB's performance is limited by the target
2266interface (whether serial line or system call).
2267
2268However, use functions with moderation. A thousand one-line functions
2269are just as hard to understand as one thousand-line function.
2270
2271@subsection Function Prototypes
2272
2273Prototypes must be used to @emph{declare} functions but never to
2274@emph{define} them. Prototypes for GDB functions must include both the
2275argument type and name, with the name matching that used in the actual
2276function definition.
2277
2278For the sake of compatibility with pre-ANSI compilers, define prototypes
2279with the @code{PARAMS} macro:
2280
2281@example @code
2282extern int memory_remove_breakpoint PARAMS ((CORE_ADDR addr,
2283 char *contents_cache));
3cee93ac 2284@end example
cdc647da 2285
3cee93ac
CF
2286Note the double parentheses around the parameter types. This allows an
2287arbitrary number of parameters to be described, without freaking out the
2288C preprocessor. When the function has no parameters, it should be
2289described like:
cdc647da 2290
3cee93ac 2291@example @code
18b1e896 2292extern void noprocess PARAMS ((void));
3cee93ac 2293@end example
54109914 2294
3cee93ac
CF
2295The @code{PARAMS} macro expands to its argument in ANSI C, or to a
2296simple @code{()} in traditional C.
54109914 2297
3cee93ac 2298All external functions should have a @code{PARAMS} declaration in a
18b1e896
SS
2299header file that callers include, except for @code{_initialize_*}
2300functions, which must be external so that @file{init.c} construction
2301works, but shouldn't be visible to random source files.
cdc647da 2302
18b1e896
SS
2303All static functions must be declared in a block near the top of the
2304source file.
c3bbca3a 2305
18b1e896 2306@subsection Clean Design
cdc647da 2307
3cee93ac
CF
2308In addition to getting the syntax right, there's the little question of
2309semantics. Some things are done in certain ways in GDB because long
2310experience has shown that the more obvious ways caused various kinds of
2311trouble.
cdc647da 2312
3cee93ac
CF
2313You can't assume the byte order of anything that comes from a target
2314(including @var{value}s, object files, and instructions). Such things
2315must be byte-swapped using @code{SWAP_TARGET_AND_HOST} in GDB, or one of
2316the swap routines defined in @file{bfd.h}, such as @code{bfd_get_32}.
cdc647da 2317
3cee93ac
CF
2318You can't assume that you know what interface is being used to talk to
2319the target system. All references to the target must go through the
2320current @code{target_ops} vector.
cdc647da 2321
3cee93ac
CF
2322You can't assume that the host and target machines are the same machine
2323(except in the ``native'' support modules). In particular, you can't
2324assume that the target machine's header files will be available on the
2325host machine. Target code must bring along its own header files --
2326written from scratch or explicitly donated by their owner, to avoid
2327copyright problems.
cdc647da 2328
3cee93ac
CF
2329Insertion of new @code{#ifdef}'s will be frowned upon. It's much better
2330to write the code portably than to conditionalize it for various
2331systems.
beb773f3 2332
3cee93ac
CF
2333New @code{#ifdef}'s which test for specific compilers or manufacturers
2334or operating systems are unacceptable. All @code{#ifdef}'s should test
2335for features. The information about which configurations contain which
2336features should be segregated into the configuration files. Experience
2337has proven far too often that a feature unique to one particular system
2338often creeps into other systems; and that a conditional based on some
2339predefined macro for your current system will become worthless over
2340time, as new versions of your system come out that behave differently
2341with regard to this feature.
cdc647da 2342
3cee93ac
CF
2343Adding code that handles specific architectures, operating systems,
2344target interfaces, or hosts, is not acceptable in generic code. If a
2345hook is needed at that point, invent a generic hook and define it for
2346your configuration, with something like:
54109914 2347
3cee93ac
CF
2348@example
2349#ifdef WRANGLE_SIGNALS
2350 WRANGLE_SIGNALS (signo);
2351#endif
2352@end example
cdc647da 2353
3cee93ac
CF
2354In your host, target, or native configuration file, as appropriate,
2355define @code{WRANGLE_SIGNALS} to do the machine-dependent thing. Take a
2356bit of care in defining the hook, so that it can be used by other ports
2357in the future, if they need a hook in the same place.
968720bf 2358
3cee93ac
CF
2359If the hook is not defined, the code should do whatever "most" machines
2360want. Using @code{#ifdef}, as above, is the preferred way to do this,
2361but sometimes that gets convoluted, in which case use
cdc647da 2362
3cee93ac
CF
2363@example
2364#ifndef SPECIAL_FOO_HANDLING
2365#define SPECIAL_FOO_HANDLING(pc, sp) (0)
2366#endif
2367@end example
54109914 2368
3cee93ac 2369where the macro is used or in an appropriate header file.
54109914 2370
3cee93ac
CF
2371Whether to include a @dfn{small} hook, a hook around the exact pieces of
2372code which are system-dependent, or whether to replace a whole function
2373with a hook depends on the case. A good example of this dilemma can be
2374found in @code{get_saved_register}. All machines that GDB 2.8 ran on
2375just needed the @code{FRAME_FIND_SAVED_REGS} hook to find the saved
2376registers. Then the SPARC and Pyramid came along, and
2377@code{HAVE_REGISTER_WINDOWS} and @code{REGISTER_IN_WINDOW_P} were
2378introduced. Then the 29k and 88k required the @code{GET_SAVED_REGISTER}
2379hook. The first three are examples of small hooks; the latter replaces
2380a whole function. In this specific case, it is useful to have both
2381kinds; it would be a bad idea to replace all the uses of the small hooks
2382with @code{GET_SAVED_REGISTER}, since that would result in much
2383duplicated code. Other times, duplicating a few lines of code here or
2384there is much cleaner than introducing a large number of small hooks.
968720bf 2385
3cee93ac
CF
2386Another way to generalize GDB along a particular interface is with an
2387attribute struct. For example, GDB has been generalized to handle
2388multiple kinds of remote interfaces -- not by #ifdef's everywhere, but
2389by defining the "target_ops" structure and having a current target (as
2390well as a stack of targets below it, for memory references). Whenever
2391something needs to be done that depends on which remote interface we are
2392using, a flag in the current target_ops structure is tested (e.g.
2393`target_has_stack'), or a function is called through a pointer in the
2394current target_ops structure. In this way, when a new remote interface
2395is added, only one module needs to be touched -- the one that actually
2396implements the new remote interface. Other examples of
2397attribute-structs are BFD access to multiple kinds of object file
2398formats, or GDB's access to multiple source languages.
2399
2400Please avoid duplicating code. For example, in GDB 3.x all the code
2401interfacing between @code{ptrace} and the rest of GDB was duplicated in
2402@file{*-dep.c}, and so changing something was very painful. In GDB 4.x,
2403these have all been consolidated into @file{infptrace.c}.
2404@file{infptrace.c} can deal with variations between systems the same way
2405any system-independent file would (hooks, #if defined, etc.), and
2406machines which are radically different don't need to use infptrace.c at
2407all.
2408
3cee93ac
CF
2409
2410@node Porting GDB
2411
2412@chapter Porting GDB
2413
2414Most of the work in making GDB compile on a new machine is in specifying
2415the configuration of the machine. This is done in a dizzying variety of
2416header files and configuration scripts, which we hope to make more
2417sensible soon. Let's say your new host is called an @var{xyz} (e.g.
2418@samp{sun4}), and its full three-part configuration name is
2419@code{@var{arch}-@var{xvend}-@var{xos}} (e.g. @samp{sparc-sun-sunos4}).
2420In particular:
2421
2422In the top level directory, edit @file{config.sub} and add @var{arch},
2423@var{xvend}, and @var{xos} to the lists of supported architectures,
2424vendors, and operating systems near the bottom of the file. Also, add
2425@var{xyz} as an alias that maps to
2426@code{@var{arch}-@var{xvend}-@var{xos}}. You can test your changes by
2427running
2428
2429@example
2430./config.sub @var{xyz}
2431@end example
2432@noindent
2433and
2434@example
2435./config.sub @code{@var{arch}-@var{xvend}-@var{xos}}
2436@end example
2437@noindent
2438which should both respond with @code{@var{arch}-@var{xvend}-@var{xos}}
2439and no error messages.
2440
2441You need to port BFD, if that hasn't been done already. Porting BFD is
2442beyond the scope of this manual.
2443
2444To configure GDB itself, edit @file{gdb/configure.host} to recognize
2445your system and set @code{gdb_host} to @var{xyz}, and (unless your
2446desired target is already available) also edit @file{gdb/configure.tgt},
2447setting @code{gdb_target} to something appropriate (for instance,
2448@var{xyz}).
2449
2450Finally, you'll need to specify and define GDB's host-, native-, and
2451target-dependent @file{.h} and @file{.c} files used for your
2452configuration.
2453
2454@section Configuring GDB for Release
2455
2456From the top level directory (containing @file{gdb}, @file{bfd},
2457@file{libiberty}, and so on):
2458@example
2459make -f Makefile.in gdb.tar.gz
2460@end example
2461
2462This will properly configure, clean, rebuild any files that are
2463distributed pre-built (e.g. @file{c-exp.tab.c} or @file{refcard.ps}),
2464and will then make a tarfile. (If the top level directory has already
2465been configured, you can just do @code{make gdb.tar.gz} instead.)
2466
2467This procedure requires:
2468@itemize @bullet
2469@item symbolic links
2470@item @code{makeinfo} (texinfo2 level)
2471@item @TeX{}
2472@item @code{dvips}
2473@item @code{yacc} or @code{bison}
2474@end itemize
2475@noindent
2476@dots{} and the usual slew of utilities (@code{sed}, @code{tar}, etc.).
2477
2478@subheading TEMPORARY RELEASE PROCEDURE FOR DOCUMENTATION
2479
2480@file{gdb.texinfo} is currently marked up using the texinfo-2 macros,
2481which are not yet a default for anything (but we have to start using
2482them sometime).
2483
2484For making paper, the only thing this implies is the right generation of
2485@file{texinfo.tex} needs to be included in the distribution.
2486
2487For making info files, however, rather than duplicating the texinfo2
2488distribution, generate @file{gdb-all.texinfo} locally, and include the
2489files @file{gdb.info*} in the distribution. Note the plural;
2490@code{makeinfo} will split the document into one overall file and five
2491or so included files.
2492
2493@node Hints
2494
2495@chapter Hints
2496
2497Check the @file{README} file, it often has useful information that does not
2498appear anywhere else in the directory.
2499
2500@menu
2501* Getting Started:: Getting started working on GDB
2502* Debugging GDB:: Debugging GDB with itself
2503@end menu
2504
2505@node Getting Started,,, Hints
2506
2507@section Getting Started
2508
2509GDB is a large and complicated program, and if you first starting to
2510work on it, it can be hard to know where to start. Fortunately, if you
2511know how to go about it, there are ways to figure out what is going on.
2512
2513This manual, the GDB Internals manual, has information which applies
2514generally to many parts of GDB.
2515
2516Information about particular functions or data structures are located in
2517comments with those functions or data structures. If you run across a
2518function or a global variable which does not have a comment correctly
2519explaining what is does, this can be thought of as a bug in GDB; feel
2520free to submit a bug report, with a suggested comment if you can figure
2521out what the comment should say. If you find a comment which is
2522actually wrong, be especially sure to report that.
2523
2524Comments explaining the function of macros defined in host, target, or
2525native dependent files can be in several places. Sometimes they are
2526repeated every place the macro is defined. Sometimes they are where the
2527macro is used. Sometimes there is a header file which supplies a
2528default definition of the macro, and the comment is there. This manual
2529also documents all the available macros.
2530@c (@pxref{Host Conditionals}, @pxref{Target
2531@c Conditionals}, @pxref{Native Conditionals}, and @pxref{Obsolete
2532@c Conditionals})
2533
2534Start with the header files. Once you some idea of how GDB's internal
2535symbol tables are stored (see @file{symtab.h}, @file{gdbtypes.h}), you
2536will find it much easier to understand the code which uses and creates
2537those symbol tables.
2538
2539You may wish to process the information you are getting somehow, to
2540enhance your understanding of it. Summarize it, translate it to another
2541language, add some (perhaps trivial or non-useful) feature to GDB, use
2542the code to predict what a test case would do and write the test case
2543and verify your prediction, etc. If you are reading code and your eyes
2544are starting to glaze over, this is a sign you need to use a more active
2545approach.
2546
2547Once you have a part of GDB to start with, you can find more
2548specifically the part you are looking for by stepping through each
2549function with the @code{next} command. Do not use @code{step} or you
2550will quickly get distracted; when the function you are stepping through
2551calls another function try only to get a big-picture understanding
2552(perhaps using the comment at the beginning of the function being
2553called) of what it does. This way you can identify which of the
2554functions being called by the function you are stepping through is the
2555one which you are interested in. You may need to examine the data
2556structures generated at each stage, with reference to the comments in
2557the header files explaining what the data structures are supposed to
2558look like.
2559
2560Of course, this same technique can be used if you are just reading the
2561code, rather than actually stepping through it. The same general
2562principle applies---when the code you are looking at calls something
2563else, just try to understand generally what the code being called does,
2564rather than worrying about all its details.
2565
2566A good place to start when tracking down some particular area is with a
2567command which invokes that feature. Suppose you want to know how
2568single-stepping works. As a GDB user, you know that the @code{step}
2569command invokes single-stepping. The command is invoked via command
2570tables (see @file{command.h}); by convention the function which actually
2571performs the command is formed by taking the name of the command and
2572adding @samp{_command}, or in the case of an @code{info} subcommand,
2573@samp{_info}. For example, the @code{step} command invokes the
2574@code{step_command} function and the @code{info display} command invokes
2575@code{display_info}. When this convention is not followed, you might
2576have to use @code{grep} or @kbd{M-x tags-search} in emacs, or run GDB on
2577itself and set a breakpoint in @code{execute_command}.
2578
2579If all of the above fail, it may be appropriate to ask for information
2580on @code{bug-gdb}. But @emph{never} post a generic question like ``I was
2581wondering if anyone could give me some tips about understanding
2582GDB''---if we had some magic secret we would put it in this manual.
2583Suggestions for improving the manual are always welcome, of course.
2584
2585@node Debugging GDB,,,Hints
2586
2587@section Debugging GDB with itself
2588
2589If GDB is limping on your machine, this is the preferred way to get it
2590fully functional. Be warned that in some ancient Unix systems, like
2591Ultrix 4.2, a program can't be running in one process while it is being
2592debugged in another. Rather than typing the command @code{@w{./gdb
2593./gdb}}, which works on Suns and such, you can copy @file{gdb} to
2594@file{gdb2} and then type @code{@w{./gdb ./gdb2}}.
2595
2596When you run GDB in the GDB source directory, it will read a
2597@file{.gdbinit} file that sets up some simple things to make debugging
2598gdb easier. The @code{info} command, when executed without a subcommand
2599in a GDB being debugged by gdb, will pop you back up to the top level
2600gdb. See @file{.gdbinit} for details.
2601
2602If you use emacs, you will probably want to do a @code{make TAGS} after
2603you configure your distribution; this will put the machine dependent
2604routines for your local machine where they will be accessed first by
2605@kbd{M-.}
2606
2607Also, make sure that you've either compiled GDB with your local cc, or
2608have run @code{fixincludes} if you are compiling with gcc.
2609
2610@section Submitting Patches
2611
2612Thanks for thinking of offering your changes back to the community of
2613GDB users. In general we like to get well designed enhancements.
2614Thanks also for checking in advance about the best way to transfer the
2615changes.
2616
2617The GDB maintainers will only install ``cleanly designed'' patches. You
2618may not always agree on what is clean design.
2619@c @pxref{Coding Style}, @pxref{Clean Design}.
2620
2621If the maintainers don't have time to put the patch in when it arrives,
2622or if there is any question about a patch, it goes into a large queue
2623with everyone else's patches and bug reports.
2624
2625The legal issue is that to incorporate substantial changes requires a
2626copyright assignment from you and/or your employer, granting ownership
2627of the changes to the Free Software Foundation. You can get the
2628standard document for doing this by sending mail to
2629@code{gnu@@prep.ai.mit.edu} and asking for it. I recommend that people
2630write in "All programs owned by the Free Software Foundation" as "NAME
2631OF PROGRAM", so that changes in many programs (not just GDB, but GAS,
2632Emacs, GCC, etc) can be contributed with only one piece of legalese
2633pushed through the bureacracy and filed with the FSF. I can't start
2634merging changes until this paperwork is received by the FSF (their
2635rules, which I follow since I maintain it for them).
2636
2637Technically, the easiest way to receive changes is to receive each
2638feature as a small context diff or unidiff, suitable for "patch".
2639Each message sent to me should include the changes to C code and
2640header files for a single feature, plus ChangeLog entries for each
2641directory where files were modified, and diffs for any changes needed
2642to the manuals (gdb/doc/gdb.texi or gdb/doc/gdbint.texi). If there
2643are a lot of changes for a single feature, they can be split down
2644into multiple messages.
2645
2646In this way, if I read and like the feature, I can add it to the
2647sources with a single patch command, do some testing, and check it in.
2648If you leave out the ChangeLog, I have to write one. If you leave
2649out the doc, I have to puzzle out what needs documenting. Etc.
2650
2651The reason to send each change in a separate message is that I will
2652not install some of the changes. They'll be returned to you with
2653questions or comments. If I'm doing my job, my message back to you
2654will say what you have to fix in order to make the change acceptable.
2655The reason to have separate messages for separate features is so
2656that other changes (which I @emph{am} willing to accept) can be installed
2657while one or more changes are being reworked. If multiple features
2658are sent in a single message, I tend to not put in the effort to sort
2659out the acceptable changes from the unacceptable, so none of the
2660features get installed until all are acceptable.
2661
2662If this sounds painful or authoritarian, well, it is. But I get a lot
2663of bug reports and a lot of patches, and most of them don't get
2664installed because I don't have the time to finish the job that the bug
2665reporter or the contributor could have done. Patches that arrive
2666complete, working, and well designed, tend to get installed on the day
2667they arrive. The others go into a queue and get installed if and when
2668I scan back over the queue -- which can literally take months
2669sometimes. It's in both our interests to make patch installation easy
2670-- you get your changes installed, and I make some forward progress on
2671GDB in a normal 12-hour day (instead of them having to wait until I
2672have a 14-hour or 16-hour day to spend cleaning up patches before I
2673can install them).
2674
2675Please send patches directly to the GDB maintainers at
2676@code{gdb-patches@@cygnus.com}.
2677
2678@section Obsolete Conditionals
b517f124
JG
2679
2680Fragments of old code in GDB sometimes reference or set the following
3cee93ac
CF
2681configuration macros. They should not be used by new code, and old uses
2682should be removed as those parts of the debugger are otherwise touched.
b517f124
JG
2683
2684@table @code
54109914 2685
b517f124
JG
2686@item STACK_END_ADDR
2687This macro used to define where the end of the stack appeared, for use
2688in interpreting core file formats that don't record this address in the
2689core file itself. This information is now configured in BFD, and GDB
2690gets the info portably from there. The values in GDB's configuration
2691files should be moved into BFD configuration files (if needed there),
2692and deleted from all of GDB's config files.
2693
2694Any @file{@var{foo}-xdep.c} file that references STACK_END_ADDR
2695is so old that it has never been converted to use BFD. Now that's old!
54109914 2696
3cee93ac
CF
2697@item PYRAMID_CONTROL_FRAME_DEBUGGING
2698pyr-xdep.c
2699@item PYRAMID_CORE
2700pyr-xdep.c
2701@item PYRAMID_PTRACE
2702pyr-xdep.c
9729ef22 2703
3cee93ac
CF
2704@item REG_STACK_SEGMENT
2705exec.c
9729ef22 2706
3cee93ac 2707@end table
9729ef22 2708
9729ef22 2709
ca714d03
RP
2710@contents
2711@bye