]> git.ipfire.org Git - thirdparty/glibc.git/blame - manual/startup.texi
Remove _BSD_SOURCE and _SVID_SOURCE.
[thirdparty/glibc.git] / manual / startup.texi
CommitLineData
17c389fc 1@node Program Basics, Processes, Signal Handling, Top
7a68c94a 2@c %MENU% Writing the beginning and end of your program
17c389fc 3@chapter The Basic Program/System Interface
28f540f4
RM
4
5@cindex process
17c389fc
UD
6@cindex program
7@cindex address space
8@cindex thread of control
28f540f4
RM
9@dfn{Processes} are the primitive units for allocation of system
10resources. Each process has its own address space and (usually) one
11thread of control. A process executes a program; you can have multiple
12processes executing the same program, but each process has its own copy
13of the program within its own address space and executes it
17c389fc
UD
14independently of the other copies. Though it may have multiple threads
15of control within the same program and a program may be composed of
16multiple logically separate modules, a process always executes exactly
17one program.
18
19Note that we are using a specific definition of ``program'' for the
20purposes of this manual, which corresponds to a common definition in the
21context of Unix system. In popular usage, ``program'' enjoys a much
22broader definition; it can refer for example to a system's kernel, an
23editor macro, a complex package of software, or a discrete section of
24code executing within a process.
25
26Writing the program is what this manual is all about. This chapter
27explains the most basic interface between your program and the system
28that runs, or calls, it. This includes passing of parameters (arguments
29and environment) from the system, requesting basic services from the
30system, and telling the system the program is done.
31
32A program starts another program with the @code{exec} family of system calls.
33This chapter looks at program startup from the execee's point of view. To
88197030 34see the event from the execor's point of view, see @ref{Executing a File}.
28f540f4
RM
35
36@menu
c7683a6d 37* Program Arguments:: Parsing your program's command-line arguments
17c389fc 38* Environment Variables:: Less direct parameters affecting your program
c7683a6d 39* Auxiliary Vector:: Least direct parameters affecting your program
17c389fc
UD
40* System Calls:: Requesting service from the system
41* Program Termination:: Telling the system you're done; return status
28f540f4
RM
42@end menu
43
481e3524 44@node Program Arguments, Environment Variables, , Program Basics
28f540f4
RM
45@section Program Arguments
46@cindex program arguments
47@cindex command line arguments
48@cindex arguments, to program
49
50@cindex program startup
51@cindex startup of program
52@cindex invocation of program
53@cindex @code{main} function
54@findex main
55The system starts a C program by calling the function @code{main}. It
56is up to you to write a function named @code{main}---otherwise, you
57won't even be able to link your program without errors.
58
f65fd747 59In @w{ISO C} you can define @code{main} either to take no arguments, or to
28f540f4
RM
60take two arguments that represent the command line arguments to the
61program, like this:
62
63@smallexample
64int main (int @var{argc}, char *@var{argv}[])
65@end smallexample
66
67@cindex argc (program argument count)
68@cindex argv (program argument vector)
69The command line arguments are the whitespace-separated tokens given in
70the shell command used to invoke the program; thus, in @samp{cat foo
71bar}, the arguments are @samp{foo} and @samp{bar}. The only way a
72program can look at its command line arguments is via the arguments of
73@code{main}. If @code{main} doesn't take arguments, then you cannot get
74at the command line.
75
76The value of the @var{argc} argument is the number of command line
77arguments. The @var{argv} argument is a vector of C strings; its
78elements are the individual command line argument strings. The file
79name of the program being run is also included in the vector as the
80first element; the value of @var{argc} counts this element. A null
81pointer always follows the last element: @code{@var{argv}[@var{argc}]}
82is this null pointer.
83
84For the command @samp{cat foo bar}, @var{argc} is 3 and @var{argv} has
85three elements, @code{"cat"}, @code{"foo"} and @code{"bar"}.
86
28f540f4
RM
87In Unix systems you can define @code{main} a third way, using three arguments:
88
89@smallexample
85857f93 90int main (int @var{argc}, char *@var{argv}[], char *@var{envp}[])
28f540f4
RM
91@end smallexample
92
93The first two arguments are just the same. The third argument
17c389fc 94@var{envp} gives the program's environment; it is the same as the value
28f540f4
RM
95of @code{environ}. @xref{Environment Variables}. POSIX.1 does not
96allow this three-argument form, so to be portable it is best to write
97@code{main} to take two arguments, and use the value of @code{environ}.
98
99@menu
2064087b 100* Argument Syntax:: By convention, options start with a hyphen.
b0de3e9e 101* Parsing Program Arguments:: Ways to parse program options and arguments.
28f540f4
RM
102@end menu
103
052b6a6c 104@node Argument Syntax, Parsing Program Arguments, , Program Arguments
28f540f4
RM
105@subsection Program Argument Syntax Conventions
106@cindex program argument syntax
107@cindex syntax, for program arguments
108@cindex command argument syntax
109
110POSIX recommends these conventions for command line arguments.
b0de3e9e
UD
111@code{getopt} (@pxref{Getopt}) and @code{argp_parse} (@pxref{Argp}) make
112it easy to implement them.
28f540f4
RM
113
114@itemize @bullet
115@item
116Arguments are options if they begin with a hyphen delimiter (@samp{-}).
117
118@item
119Multiple options may follow a hyphen delimiter in a single token if
120the options do not take arguments. Thus, @samp{-abc} is equivalent to
121@samp{-a -b -c}.
122
123@item
124Option names are single alphanumeric characters (as for @code{isalnum};
8b7fb588 125@pxref{Classification of Characters}).
28f540f4
RM
126
127@item
128Certain options require an argument. For example, the @samp{-o} command
129of the @code{ld} command requires an argument---an output file name.
130
131@item
132An option and its argument may or may not appear as separate tokens. (In
133other words, the whitespace separating them is optional.) Thus,
134@w{@samp{-o foo}} and @samp{-ofoo} are equivalent.
135
136@item
137Options typically precede other non-option arguments.
138
1f77f049
JM
139The implementations of @code{getopt} and @code{argp_parse} in @theglibc{}
140normally make it appear as if all the option arguments were
b0de3e9e
UD
141specified before all the non-option arguments for the purposes of
142parsing, even if the user of your program intermixed option and
143non-option arguments. They do this by reordering the elements of the
144@var{argv} array. This behavior is nonstandard; if you want to suppress
145it, define the @code{_POSIX_OPTION_ORDER} environment variable.
146@xref{Standard Environment}.
28f540f4
RM
147
148@item
149The argument @samp{--} terminates all options; any following arguments
150are treated as non-option arguments, even if they begin with a hyphen.
151
152@item
153A token consisting of a single hyphen character is interpreted as an
154ordinary non-option argument. By convention, it is used to specify
155input from or output to the standard input and output streams.
156
157@item
158Options may be supplied in any order, or appear multiple times. The
159interpretation is left up to the particular application program.
160@end itemize
161
162@cindex long-named options
163GNU adds @dfn{long options} to these conventions. Long options consist
164of @samp{--} followed by a name made of alphanumeric characters and
165dashes. Option names are typically one to three words long, with
166hyphens to separate words. Users can abbreviate the option names as
167long as the abbreviations are unique.
168
169To specify an argument for a long option, write
170@samp{--@var{name}=@var{value}}. This syntax enables a long option to
171accept an argument that is itself optional.
172
a7a93d50 173Eventually, @gnusystems{} will provide completion for long option names
28f540f4
RM
174in the shell.
175
052b6a6c 176@node Parsing Program Arguments, , Argument Syntax, Program Arguments
b0de3e9e
UD
177@subsection Parsing Program Arguments
178
28f540f4
RM
179@cindex program arguments, parsing
180@cindex command arguments, parsing
181@cindex parsing program arguments
b0de3e9e
UD
182If the syntax for the command line arguments to your program is simple
183enough, you can simply pick the arguments off from @var{argv} by hand.
184But unless your program takes a fixed number of arguments, or all of the
185arguments are interpreted in the same way (as file names, for example),
186you are usually better off using @code{getopt} (@pxref{Getopt}) or
187@code{argp_parse} (@pxref{Argp}) to do the parsing.
28f540f4 188
b0de3e9e
UD
189@code{getopt} is more standard (the short-option only version of it is a
190part of the POSIX standard), but using @code{argp_parse} is often
191easier, both for very simple and very complex option structures, because
192it does more of the dirty work for you.
28f540f4 193
b0de3e9e
UD
194@menu
195* Getopt:: Parsing program options using @code{getopt}.
196* Argp:: Parsing program options using @code{argp_parse}.
197* Suboptions:: Some programs need more detailed options.
198* Suboptions Example:: This shows how it could be done for @code{mount}.
199@end menu
28f540f4 200
b0de3e9e
UD
201@c Getopt and argp start at the @section level so that there's
202@c enough room for their internal hierarchy (mostly a problem with
203@c argp). -Miles
28f540f4 204
b0de3e9e
UD
205@include getopt.texi
206@include argp.texi
28f540f4 207
b0de3e9e
UD
208@node Suboptions, Suboptions Example, Argp, Parsing Program Arguments
209@c This is a @section so that it's at the same level as getopt and argp
052b6a6c 210@subsubsection Parsing of Suboptions
2064087b
RM
211
212Having a single level of options is sometimes not enough. There might
213be too many options which have to be available or a set of options is
214closely related.
215
216For this case some programs use suboptions. One of the most prominent
217programs is certainly @code{mount}(8). The @code{-o} option take one
218argument which itself is a comma separated list of options. To ease the
219programming of code like this the function @code{getsubopt} is
220available.
221
222@comment stdlib.h
8ded91fb 223@deftypefun int getsubopt (char **@var{optionp}, char *const *@var{tokens}, char **@var{valuep})
663b02d7
AO
224@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
225@c getsubopt ok
226@c strchrnul dup ok
227@c memchr dup ok
228@c strncmp dup ok
2064087b
RM
229
230The @var{optionp} parameter must be a pointer to a variable containing
231the address of the string to process. When the function returns the
232reference is updated to point to the next suboption or to the
233terminating @samp{\0} character if there is no more suboption available.
234
235The @var{tokens} parameter references an array of strings containing the
236known suboptions. All strings must be @samp{\0} terminated and to mark
237the end a null pointer must be stored. When @code{getsubopt} finds a
238possible legal suboption it compares it with all strings available in
239the @var{tokens} array and returns the index in the string as the
240indicator.
241
242In case the suboption has an associated value introduced by a @samp{=}
243character, a pointer to the value is returned in @var{valuep}. The
244string is @samp{\0} terminated. If no argument is available
245@var{valuep} is set to the null pointer. By doing this the caller can
246check whether a necessary value is given or whether no unexpected value
247is present.
248
249In case the next suboption in the string is not mentioned in the
250@var{tokens} array the starting address of the suboption including a
251possible value is returned in @var{valuep} and the return value of the
252function is @samp{-1}.
253@end deftypefun
254
b0de3e9e 255@node Suboptions Example, , Suboptions, Parsing Program Arguments
2064087b
RM
256@subsection Parsing of Suboptions Example
257
258The code which might appear in the @code{mount}(8) program is a perfect
259example of the use of @code{getsubopt}:
260
261@smallexample
262@include subopt.c.texi
263@end smallexample
264
265
481e3524 266@node Environment Variables, Auxiliary Vector, Program Arguments, Program Basics
28f540f4
RM
267@section Environment Variables
268
269@cindex environment variable
270When a program is executed, it receives information about the context in
271which it was invoked in two ways. The first mechanism uses the
272@var{argv} and @var{argc} arguments to its @code{main} function, and is
273discussed in @ref{Program Arguments}. The second mechanism uses
274@dfn{environment variables} and is discussed in this section.
275
276The @var{argv} mechanism is typically used to pass command-line
277arguments specific to the particular program being invoked. The
278environment, on the other hand, keeps track of information that is
279shared by many programs, changes infrequently, and that is less
280frequently used.
281
282The environment variables discussed in this section are the same
283environment variables that you set using assignments and the
284@code{export} command in the shell. Programs executed from the shell
285inherit all of the environment variables from the shell.
286@c !!! xref to right part of bash manual when it exists
287
288@cindex environment
289Standard environment variables are used for information about the user's
290home directory, terminal type, current locale, and so on; you can define
291additional variables for other purposes. The set of all environment
292variables that have values is collectively known as the
293@dfn{environment}.
294
295Names of environment variables are case-sensitive and must not contain
296the character @samp{=}. System-defined environment variables are
297invariably uppercase.
298
299The values of environment variables can be anything that can be
300represented as a string. A value must not contain an embedded null
301character, since this is assumed to terminate the string.
302
303
304@menu
2064087b 305* Environment Access:: How to get and set the values of
40a55d20 306 environment variables.
2064087b 307* Standard Environment:: These environment variables have
40a55d20 308 standard interpretations.
28f540f4
RM
309@end menu
310
311@node Environment Access
312@subsection Environment Access
313@cindex environment access
314@cindex environment representation
315
316The value of an environment variable can be accessed with the
317@code{getenv} function. This is declared in the header file
84b3fd84 318@file{stdlib.h}.
28f540f4
RM
319@pindex stdlib.h
320
84b3fd84
FW
321Libraries should use @code{secure_getenv} instead of @code{getenv}, so
322that they do not accidentally use untrusted environment variables.
323Modifications of environment variables are not allowed in
324multi-threaded programs. The @code{getenv} and @code{secure_getenv}
325functions can be safely used in multi-threaded programs.
326
28f540f4 327@comment stdlib.h
f65fd747 328@comment ISO
28f540f4 329@deftypefun {char *} getenv (const char *@var{name})
663b02d7
AO
330@safety{@prelim{}@mtsafe{@mtsenv{}}@assafe{}@acsafe{}}
331@c Unguarded access to __environ.
28f540f4
RM
332This function returns a string that is the value of the environment
333variable @var{name}. You must not modify this string. In some non-Unix
1f77f049 334systems not using @theglibc{}, it might be overwritten by subsequent
28f540f4
RM
335calls to @code{getenv} (but not by any other library function). If the
336environment variable @var{name} is not defined, the value is a null
337pointer.
338@end deftypefun
339
84b3fd84
FW
340@comment stdlib.h
341@comment GNU
342@deftypefun {char *} secure_getenv (const char *@var{name})
663b02d7
AO
343@safety{@prelim{}@mtsafe{@mtsenv{}}@assafe{}@acsafe{}}
344@c Calls getenv unless secure mode is enabled.
84b3fd84
FW
345This function is similar to @code{getenv}, but it returns a null
346pointer if the environment is untrusted. This happens when the
347program file has SUID or SGID bits set. General-purpose libraries
348should always prefer this function over @code{getenv} to avoid
349vulnerabilities if the library is referenced from a SUID/SGID program.
350
351This function is a GNU extension.
352@end deftypefun
353
28f540f4
RM
354
355@comment stdlib.h
356@comment SVID
14d9bd50 357@deftypefun int putenv (char *@var{string})
663b02d7
AO
358@safety{@prelim{}@mtunsafe{@mtasuconst{:@mtsenv{}}}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@acucorrupt{} @aculock{} @acsmem{}}}
359@c putenv @mtasuconst:@mtsenv @ascuheap @asulock @acucorrupt @aculock @acsmem
360@c strchr dup ok
361@c strndup dup @ascuheap @acsmem
362@c add_to_environ dup @mtasuconst:@mtsenv @ascuheap @asulock @acucorrupt @aculock @acsmem
363@c free dup @ascuheap @acsmem
364@c unsetenv dup @mtasuconst:@mtsenv @asulock @aculock
28f540f4
RM
365The @code{putenv} function adds or removes definitions from the environment.
366If the @var{string} is of the form @samp{@var{name}=@var{value}}, the
367definition is added to the environment. Otherwise, the @var{string} is
368interpreted as the name of an environment variable, and any definition
369for this variable in the environment is removed.
370
6ab0fbfc
AJ
371If the function is successful it returns @code{0}. Otherwise the return
372value is nonzero and @code{errno} is set to indicate the error.
373
66f8fa9b
UD
374The difference to the @code{setenv} function is that the exact string
375given as the parameter @var{string} is put into the environment. If the
376user should change the string after the @code{putenv} call this will
663b02d7
AO
377reflect automatically in the environment. This also requires that
378@var{string} not be an automatic variable whose scope is left before the
d364e525
UD
379variable is removed from the environment. The same applies of course to
380dynamically allocated variables which are freed later.
66f8fa9b 381
c941736c
JM
382This function is part of the extended Unix interface. You should define
383@var{_XOPEN_SOURCE} before including any header.
40a55d20
UD
384@end deftypefun
385
386
387@comment stdlib.h
388@comment BSD
389@deftypefun int setenv (const char *@var{name}, const char *@var{value}, int @var{replace})
663b02d7
AO
390@safety{@prelim{}@mtunsafe{@mtasuconst{:@mtsenv{}}}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@acucorrupt{} @aculock{} @acsmem{}}}
391@c setenv @mtasuconst:@mtsenv @ascuheap @asulock @acucorrupt @aculock @acsmem
392@c add_to_environ @mtasuconst:@mtsenv @ascuheap @asulock @acucorrupt @aculock @acsmem
393@c strlen dup ok
394@c libc_lock_lock @asulock @aculock
395@c strncmp dup ok
396@c realloc dup @ascuheap @acsmem
397@c libc_lock_unlock @aculock
398@c malloc dup @ascuheap @acsmem
399@c free dup @ascuheap @acsmem
400@c mempcpy dup ok
401@c memcpy dup ok
402@c KNOWN_VALUE ok
403@c tfind(strcmp) [no @mtsrace guarded access]
404@c strcmp dup ok
405@c STORE_VALUE @ascuheap @acucorrupt @acsmem
406@c tsearch(strcmp) @ascuheap @acucorrupt @acsmem [no @mtsrace or @asucorrupt guarded access makes for mtsafe and @asulock]
407@c strcmp dup ok
40a55d20
UD
408The @code{setenv} function can be used to add a new definition to the
409environment. The entry with the name @var{name} is replaced by the
410value @samp{@var{name}=@var{value}}. Please note that this is also true
66f8fa9b
UD
411if @var{value} is the empty string. To do this a new string is created
412and the strings @var{name} and @var{value} are copied. A null pointer
413for the @var{value} parameter is illegal. If the environment already
414contains an entry with key @var{name} the @var{replace} parameter
415controls the action. If replace is zero, nothing happens. Otherwise
416the old entry is replaced by the new one.
40a55d20
UD
417
418Please note that you cannot remove an entry completely using this function.
419
6ab0fbfc
AJ
420If the function is successful it returns @code{0}. Otherwise the
421environment is unchanged and the return value is @code{-1} and
422@code{errno} is set.
423
0423ee17
UD
424This function was originally part of the BSD library but is now part of
425the Unix standard.
40a55d20
UD
426@end deftypefun
427
428@comment stdlib.h
429@comment BSD
0423ee17 430@deftypefun int unsetenv (const char *@var{name})
663b02d7
AO
431@safety{@prelim{}@mtunsafe{@mtasuconst{:@mtsenv{}}}@asunsafe{@asulock{}}@acunsafe{@aculock{}}}
432@c unsetenv @mtasuconst:@mtsenv @asulock @aculock
433@c strchr dup ok
434@c strlen dup ok
435@c libc_lock_lock @asulock @aculock
436@c strncmp dup ok
437@c libc_lock_unlock @aculock
40a55d20
UD
438Using this function one can remove an entry completely from the
439environment. If the environment contains an entry with the key
440@var{name} this whole entry is removed. A call to this function is
441equivalent to a call to @code{putenv} when the @var{value} part of the
442string is empty.
443
0423ee17
UD
444The function return @code{-1} if @var{name} is a null pointer, points to
445an empty string, or points to a string containing a @code{=} character.
446It returns @code{0} if the call succeeded.
447
b912ca11 448This function was originally part of the BSD library but is now part of
0423ee17 449the Unix standard. The BSD version had no return value, though.
40a55d20
UD
450@end deftypefun
451
452There is one more function to modify the whole environment. This
453function is said to be used in the POSIX.9 (POSIX bindings for Fortran
45477) and so one should expect it did made it into POSIX.1. But this
455never happened. But we still provide this function as a GNU extension
456to enable writing standard compliant Fortran environments.
457
458@comment stdlib.h
459@comment GNU
460@deftypefun int clearenv (void)
663b02d7
AO
461@safety{@prelim{}@mtunsafe{@mtasuconst{:@mtsenv{}}}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@aculock{} @acsmem{}}}
462@c clearenv @mtasuconst:@mtsenv @ascuheap @asulock @aculock @acsmem
463@c libc_lock_lock @asulock @aculock
464@c free dup @ascuheap @acsmem
465@c libc_lock_unlock @aculock
40a55d20
UD
466The @code{clearenv} function removes all entries from the environment.
467Using @code{putenv} and @code{setenv} new entries can be added again
468later.
469
470If the function is successful it returns @code{0}. Otherwise the return
471value is nonzero.
28f540f4
RM
472@end deftypefun
473
28f540f4
RM
474
475You can deal directly with the underlying representation of environment
476objects to add more variables to the environment (for example, to
8b7fb588
UD
477communicate with another program you are about to execute;
478@pxref{Executing a File}).
28f540f4
RM
479
480@comment unistd.h
481@comment POSIX.1
482@deftypevar {char **} environ
483The environment is represented as an array of strings. Each string is
484of the format @samp{@var{name}=@var{value}}. The order in which
485strings appear in the environment is not significant, but the same
486@var{name} must not appear more than once. The last element of the
487array is a null pointer.
488
489This variable is declared in the header file @file{unistd.h}.
490
491If you just want to get the value of an environment variable, use
492@code{getenv}.
493@end deftypevar
494
a7a93d50 495Unix systems, and @gnusystems{}, pass the initial value of
28f540f4
RM
496@code{environ} as the third argument to @code{main}.
497@xref{Program Arguments}.
498
499@node Standard Environment
500@subsection Standard Environment Variables
501@cindex standard environment variables
502
503These environment variables have standard meanings. This doesn't mean
504that they are always present in the environment; but if these variables
505@emph{are} present, they have these meanings. You shouldn't try to use
506these environment variable names for some other purpose.
507
508@comment Extra blank lines make it look better.
509@table @code
510@item HOME
838e5ffe 511@cindex @code{HOME} environment variable
28f540f4
RM
512@cindex home directory
513
514This is a string representing the user's @dfn{home directory}, or
515initial default working directory.
516
517The user can set @code{HOME} to any value.
518If you need to make sure to obtain the proper home directory
519for a particular user, you should not use @code{HOME}; instead,
520look up the user's name in the user database (@pxref{User Database}).
521
522For most purposes, it is better to use @code{HOME}, precisely because
523this lets the user specify the value.
524
525@c !!! also USER
526@item LOGNAME
838e5ffe 527@cindex @code{LOGNAME} environment variable
28f540f4
RM
528
529This is the name that the user used to log in. Since the value in the
530environment can be tweaked arbitrarily, this is not a reliable way to
17c389fc 531identify the user who is running a program; a function like
28f540f4
RM
532@code{getlogin} (@pxref{Who Logged In}) is better for that purpose.
533
534For most purposes, it is better to use @code{LOGNAME}, precisely because
535this lets the user specify the value.
536
537@item PATH
838e5ffe 538@cindex @code{PATH} environment variable
28f540f4
RM
539
540A @dfn{path} is a sequence of directory names which is used for
541searching for a file. The variable @code{PATH} holds a path used
542for searching for programs to be run.
543
544The @code{execlp} and @code{execvp} functions (@pxref{Executing a File})
545use this environment variable, as do many shells and other utilities
546which are implemented in terms of those functions.
547
548The syntax of a path is a sequence of directory names separated by
1b82a4a8 549colons. An empty string instead of a directory name stands for the
28f540f4
RM
550current directory (@pxref{Working Directory}).
551
552A typical value for this environment variable might be a string like:
553
554@smallexample
555:/bin:/etc:/usr/bin:/usr/new/X11:/usr/new:/usr/local/bin
556@end smallexample
557
558This means that if the user tries to execute a program named @code{foo},
559the system will look for files named @file{foo}, @file{/bin/foo},
560@file{/etc/foo}, and so on. The first of these files that exists is
561the one that is executed.
562
563@c !!! also TERMCAP
564@item TERM
838e5ffe 565@cindex @code{TERM} environment variable
28f540f4
RM
566
567This specifies the kind of terminal that is receiving program output.
568Some programs can make use of this information to take advantage of
569special escape sequences or terminal modes supported by particular kinds
570of terminals. Many programs which use the termcap library
571(@pxref{Finding a Terminal Description,Find,,termcap,The Termcap Library
572Manual}) use the @code{TERM} environment variable, for example.
573
574@item TZ
838e5ffe 575@cindex @code{TZ} environment variable
28f540f4
RM
576
577This specifies the time zone. @xref{TZ Variable}, for information about
578the format of this string and how it is used.
579
580@item LANG
838e5ffe 581@cindex @code{LANG} environment variable
28f540f4
RM
582
583This specifies the default locale to use for attribute categories where
584neither @code{LC_ALL} nor the specific environment variable for that
585category is set. @xref{Locales}, for more information about
586locales.
587
588@ignore
589@c I doubt this really exists
590@item LC_ALL
838e5ffe 591@cindex @code{LC_ALL} environment variable
28f540f4
RM
592
593This is similar to the @code{LANG} environment variable. However, its
594value takes precedence over any values provided for the individual
595attribute category environment variables, or for the @code{LANG}
596environment variable.
597@end ignore
598
40a55d20 599@item LC_ALL
838e5ffe 600@cindex @code{LC_ALL} environment variable
40a55d20
UD
601
602If this environment variable is set it overrides the selection for all
603the locales done using the other @code{LC_*} environment variables. The
604value of the other @code{LC_*} environment variables is simply ignored
605in this case.
606
28f540f4 607@item LC_COLLATE
838e5ffe 608@cindex @code{LC_COLLATE} environment variable
28f540f4
RM
609
610This specifies what locale to use for string sorting.
611
612@item LC_CTYPE
838e5ffe 613@cindex @code{LC_CTYPE} environment variable
28f540f4
RM
614
615This specifies what locale to use for character sets and character
616classification.
617
40a55d20 618@item LC_MESSAGES
838e5ffe 619@cindex @code{LC_MESSAGES} environment variable
40a55d20
UD
620
621This specifies what locale to use for printing messages and to parse
f2ea0f5b 622responses.
40a55d20 623
28f540f4 624@item LC_MONETARY
838e5ffe 625@cindex @code{LC_MONETARY} environment variable
28f540f4
RM
626
627This specifies what locale to use for formatting monetary values.
628
629@item LC_NUMERIC
838e5ffe 630@cindex @code{LC_NUMERIC} environment variable
28f540f4
RM
631
632This specifies what locale to use for formatting numbers.
633
634@item LC_TIME
838e5ffe 635@cindex @code{LC_TIME} environment variable
28f540f4
RM
636
637This specifies what locale to use for formatting date/time values.
638
40a55d20 639@item NLSPATH
838e5ffe 640@cindex @code{NLSPATH} environment variable
40a55d20
UD
641
642This specifies the directories in which the @code{catopen} function
643looks for message translation catalogs.
644
28f540f4 645@item _POSIX_OPTION_ORDER
838e5ffe 646@cindex @code{_POSIX_OPTION_ORDER} environment variable.
28f540f4
RM
647
648If this environment variable is defined, it suppresses the usual
b0de3e9e
UD
649reordering of command line arguments by @code{getopt} and
650@code{argp_parse}. @xref{Argument Syntax}.
28f540f4
RM
651
652@c !!! GNU also has COREFILE, CORESERVER, EXECSERVERS
653@end table
654
c7683a6d
RH
655@node Auxiliary Vector
656@section Auxiliary Vector
657@cindex auxiliary vector
658
659When a program is executed, it receives information from the operating
660system about the environment in which it is operating. The form of this
661information is a table of key-value pairs, where the keys are from the
662set of @samp{AT_} values in @file{elf.h}. Some of the data is provided
663by the kernel for libc consumption, and may be obtained by ordinary
664interfaces, such as @code{sysconf}. However, on a platform-by-platform
665basis there may be information that is not available any other way.
666
667@subsection Definition of @code{getauxval}
668@comment sys/auxv.h
669@deftypefun {unsigned long int} getauxval (unsigned long int @var{type})
663b02d7
AO
670@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
671@c Reads from hwcap or iterates over constant auxv.
c7683a6d
RH
672This function is used to inquire about the entries in the auxiliary
673vector. The @var{type} argument should be one of the @samp{AT_} symbols
674defined in @file{elf.h}. If a matching entry is found, the value is
b9ab448f
BM
675returned; if the entry is not found, zero is returned and @code{errno} is
676set to @code{ENOENT}.
c7683a6d
RH
677@end deftypefun
678
679For some platforms, the key @code{AT_HWCAP} is the easiest way to inquire
680about any instruction set extensions available at runtime. In this case,
681there will (of necessity) be a platform-specific set of @samp{HWCAP_}
682values masked together that describe the capabilities of the cpu on which
683the program is being executed.
684
17c389fc
UD
685@node System Calls
686@section System Calls
687
688@cindex system call
689A system call is a request for service that a program makes of the
690kernel. The service is generally something that only the kernel has
691the privilege to do, such as doing I/O. Programmers don't normally
692need to be concerned with system calls because there are functions in
1f77f049 693@theglibc{} to do virtually everything that system calls do.
17c389fc 694These functions work by making system calls themselves. For example,
d364e525 695there is a system call that changes the permissions of a file, but
1f77f049
JM
696you don't need to know about it because you can just use @theglibc{}'s
697@code{chmod} function.
17c389fc
UD
698
699@cindex kernel call
700System calls are sometimes called kernel calls.
701
702However, there are times when you want to make a system call explicitly,
1f77f049 703and for that, @theglibc{} provides the @code{syscall} function.
17c389fc
UD
704@code{syscall} is harder to use and less portable than functions like
705@code{chmod}, but easier and more portable than coding the system call
706in assembler instructions.
707
708@code{syscall} is most useful when you are working with a system call
1f77f049 709which is special to your system or is newer than @theglibc{} you
17c389fc
UD
710are using. @code{syscall} is implemented in an entirely generic way;
711the function does not know anything about what a particular system
712call does or even if it is valid.
713
714The description of @code{syscall} in this section assumes a certain
1f77f049
JM
715protocol for system calls on the various platforms on which @theglibc{}
716runs. That protocol is not defined by any strong authority, but
17c389fc
UD
717we won't describe it here either because anyone who is coding
718@code{syscall} probably won't accept anything less than kernel and C
719library source code as a specification of the interface between them
720anyway.
721
722
723@code{syscall} is declared in @file{unistd.h}.
724
725@comment unistd.h
726@comment ???
cc6e48bc 727@deftypefun {long int} syscall (long int @var{sysno}, @dots{})
663b02d7 728@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
17c389fc
UD
729
730@code{syscall} performs a generic system call.
731
732@cindex system call number
733@var{sysno} is the system call number. Each kind of system call is
734identified by a number. Macros for all the possible system call numbers
735are defined in @file{sys/syscall.h}
736
737The remaining arguments are the arguments for the system call, in
738order, and their meanings depend on the kind of system call. Each kind
739of system call has a definite number of arguments, from zero to five.
740If you code more arguments than the system call takes, the extra ones to
741the right are ignored.
742
743The return value is the return value from the system call, unless the
744system call failed. In that case, @code{syscall} returns @code{-1} and
745sets @code{errno} to an error code that the system call returned. Note
746that system calls do not return @code{-1} when they succeed.
747@cindex errno
748
749If you specify an invalid @var{sysno}, @code{syscall} returns @code{-1}
750with @code{errno} = @code{ENOSYS}.
751
752Example:
753
754@smallexample
755
756#include <unistd.h>
757#include <sys/syscall.h>
758#include <errno.h>
759
95fdc6a0 760@dots{}
17c389fc
UD
761
762int rc;
763
764rc = syscall(SYS_chmod, "/etc/passwd", 0444);
765
d364e525 766if (rc == -1)
17c389fc
UD
767 fprintf(stderr, "chmod failed, errno = %d\n", errno);
768
769@end smallexample
770
771This, if all the compatibility stars are aligned, is equivalent to the
772following preferable code:
773
774@smallexample
775
776#include <sys/types.h>
777#include <sys/stat.h>
778#include <errno.h>
779
95fdc6a0 780@dots{}
17c389fc
UD
781
782int rc;
783
784rc = chmod("/etc/passwd", 0444);
785if (rc == -1)
786 fprintf(stderr, "chmod failed, errno = %d\n", errno);
787
788@end smallexample
789
790@end deftypefun
791
792
28f540f4
RM
793@node Program Termination
794@section Program Termination
795@cindex program termination
796@cindex process termination
797
798@cindex exit status value
799The usual way for a program to terminate is simply for its @code{main}
800function to return. The @dfn{exit status value} returned from the
801@code{main} function is used to report information back to the process's
802parent process or shell.
803
804A program can also terminate normally by calling the @code{exit}
805function.
806
807In addition, programs can be terminated by signals; this is discussed in
808more detail in @ref{Signal Handling}. The @code{abort} function causes
809a signal that kills the program.
810
811@menu
812* Normal Termination:: If a program calls @code{exit}, a
813 process terminates normally.
1b82a4a8
RM
814* Exit Status:: The @code{exit status} provides information
815 about why the process terminated.
28f540f4 816* Cleanups on Exit:: A process can run its own cleanup
1b82a4a8 817 functions upon normal termination.
28f540f4 818* Aborting a Program:: The @code{abort} function causes
1b82a4a8 819 abnormal program termination.
28f540f4
RM
820* Termination Internals:: What happens when a process terminates.
821@end menu
822
823@node Normal Termination
824@subsection Normal Termination
825
17c389fc
UD
826A process terminates normally when its program signals it is done by
827calling @code{exit}. Returning from @code{main} is equivalent to
828calling @code{exit}, and the value that @code{main} returns is used as
829the argument to @code{exit}.
28f540f4
RM
830
831@comment stdlib.h
f65fd747 832@comment ISO
28f540f4 833@deftypefun void exit (int @var{status})
663b02d7
AO
834@safety{@prelim{}@mtunsafe{@mtasurace{:exit}}@asunsafe{@asucorrupt{}}@acunsafe{@acucorrupt{} @aculock{}}}
835@c Access to the atexit/on_exit list, the libc_atexit hook and tls dtors
836@c is not guarded. Streams must be flushed, and that triggers the usual
837@c AS and AC issues with streams.
17c389fc
UD
838The @code{exit} function tells the system that the program is done, which
839causes it to terminate the process.
840
841@var{status} is the program's exit status, which becomes part of the
842process' termination status. This function does not return.
28f540f4
RM
843@end deftypefun
844
845Normal termination causes the following actions:
846
847@enumerate
1b82a4a8 848@item
28f540f4
RM
849Functions that were registered with the @code{atexit} or @code{on_exit}
850functions are called in the reverse order of their registration. This
851mechanism allows your application to specify its own ``cleanup'' actions
852to be performed at program termination. Typically, this is used to do
853things like saving program state information in a file, or unlocking
854locks in shared data bases.
855
1b82a4a8 856@item
28f540f4
RM
857All open streams are closed, writing out any buffered output data. See
858@ref{Closing Streams}. In addition, temporary files opened
859with the @code{tmpfile} function are removed; see @ref{Temporary Files}.
860
1b82a4a8 861@item
28f540f4
RM
862@code{_exit} is called, terminating the program. @xref{Termination Internals}.
863@end enumerate
864
865@node Exit Status
866@subsection Exit Status
867@cindex exit status
868
869When a program exits, it can return to the parent process a small
870amount of information about the cause of termination, using the
871@dfn{exit status}. This is a value between 0 and 255 that the exiting
872process passes as an argument to @code{exit}.
873
874Normally you should use the exit status to report very broad information
875about success or failure. You can't provide a lot of detail about the
876reasons for the failure, and most parent processes would not want much
877detail anyway.
878
879There are conventions for what sorts of status values certain programs
880should return. The most common convention is simply 0 for success and 1
881for failure. Programs that perform comparison use a different
882convention: they use status 1 to indicate a mismatch, and status 2 to
883indicate an inability to compare. Your program should follow an
884existing convention if an existing convention makes sense for it.
885
886A general convention reserves status values 128 and up for special
887purposes. In particular, the value 128 is used to indicate failure to
888execute another program in a subprocess. This convention is not
889universally obeyed, but it is a good idea to follow it in your programs.
890
891@strong{Warning:} Don't try to use the number of errors as the exit
892status. This is actually not very useful; a parent process would
893generally not care how many errors occurred. Worse than that, it does
894not work, because the status value is truncated to eight bits.
895Thus, if the program tried to report 256 errors, the parent would
896receive a report of 0 errors---that is, success.
897
898For the same reason, it does not work to use the value of @code{errno}
899as the exit status---these can exceed 255.
900
901@strong{Portability note:} Some non-POSIX systems use different
902conventions for exit status values. For greater portability, you can
903use the macros @code{EXIT_SUCCESS} and @code{EXIT_FAILURE} for the
904conventional status value for success and failure, respectively. They
905are declared in the file @file{stdlib.h}.
906@pindex stdlib.h
907
908@comment stdlib.h
f65fd747 909@comment ISO
28f540f4
RM
910@deftypevr Macro int EXIT_SUCCESS
911This macro can be used with the @code{exit} function to indicate
912successful program completion.
913
914On POSIX systems, the value of this macro is @code{0}. On other
915systems, the value might be some other (possibly non-constant) integer
916expression.
917@end deftypevr
918
919@comment stdlib.h
f65fd747 920@comment ISO
28f540f4
RM
921@deftypevr Macro int EXIT_FAILURE
922This macro can be used with the @code{exit} function to indicate
923unsuccessful program completion in a general sense.
924
925On POSIX systems, the value of this macro is @code{1}. On other
926systems, the value might be some other (possibly non-constant) integer
f166d865 927expression. Other nonzero status values also indicate failures. Certain
28f540f4
RM
928programs use different nonzero status values to indicate particular
929kinds of "non-success". For example, @code{diff} uses status value
930@code{1} to mean that the files are different, and @code{2} or more to
931mean that there was difficulty in opening the files.
932@end deftypevr
933
17c389fc 934Don't confuse a program's exit status with a process' termination status.
3835c55f 935There are lots of ways a process can terminate besides having its program
17c389fc 936finish. In the event that the process termination @emph{is} caused by program
11bf311e 937termination (i.e., @code{exit}), though, the program's exit status becomes
17c389fc
UD
938part of the process' termination status.
939
28f540f4
RM
940@node Cleanups on Exit
941@subsection Cleanups on Exit
942
943Your program can arrange to run its own cleanup functions if normal
944termination happens. If you are writing a library for use in various
945application programs, then it is unreliable to insist that all
946applications call the library's cleanup functions explicitly before
947exiting. It is much more robust to make the cleanup invisible to the
948application, by setting up a cleanup function in the library itself
949using @code{atexit} or @code{on_exit}.
950
951@comment stdlib.h
f65fd747 952@comment ISO
28f540f4 953@deftypefun int atexit (void (*@var{function}) (void))
663b02d7
AO
954@safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@aculock{} @acsmem{}}}
955@c atexit @ascuheap @asulock @aculock @acsmem
956@c cxa_atexit @ascuheap @asulock @aculock @acsmem
957@c __internal_atexit @ascuheap @asulock @aculock @acsmem
958@c __new_exitfn @ascuheap @asulock @aculock @acsmem
959@c __libc_lock_lock @asulock @aculock
960@c calloc dup @ascuheap @acsmem
961@c __libc_lock_unlock @aculock
962@c atomic_write_barrier dup ok
28f540f4
RM
963The @code{atexit} function registers the function @var{function} to be
964called at normal program termination. The @var{function} is called with
965no arguments.
966
967The return value from @code{atexit} is zero on success and nonzero if
1b82a4a8 968the function cannot be registered.
28f540f4
RM
969@end deftypefun
970
971@comment stdlib.h
972@comment SunOS
973@deftypefun int on_exit (void (*@var{function})(int @var{status}, void *@var{arg}), void *@var{arg})
663b02d7
AO
974@safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@aculock{} @acsmem{}}}
975@c on_exit @ascuheap @asulock @aculock @acsmem
976@c new_exitfn dup @ascuheap @asulock @aculock @acsmem
977@c atomic_write_barrier dup ok
28f540f4
RM
978This function is a somewhat more powerful variant of @code{atexit}. It
979accepts two arguments, a function @var{function} and an arbitrary
980pointer @var{arg}. At normal program termination, the @var{function} is
981called with two arguments: the @var{status} value passed to @code{exit},
982and the @var{arg}.
983
1f77f049 984This function is included in @theglibc{} only for compatibility
28f540f4
RM
985for SunOS, and may not be supported by other implementations.
986@end deftypefun
987
988Here's a trivial program that illustrates the use of @code{exit} and
989@code{atexit}:
990
991@smallexample
992@include atexit.c.texi
993@end smallexample
994
995@noindent
996When this program is executed, it just prints the message and exits.
997
998@node Aborting a Program
999@subsection Aborting a Program
1000@cindex aborting a program
1001
1002You can abort your program using the @code{abort} function. The prototype
1003for this function is in @file{stdlib.h}.
1004@pindex stdlib.h
1005
1006@comment stdlib.h
f65fd747 1007@comment ISO
28f540f4 1008@deftypefun void abort (void)
663b02d7
AO
1009@safety{@prelim{}@mtsafe{}@asunsafe{@asucorrupt{}}@acunsafe{@aculock{} @acucorrupt{}}}
1010@c The implementation takes a recursive lock and attempts to support
1011@c calls from signal handlers, but if we're in the middle of flushing or
1012@c using streams, we may encounter them in inconsistent states.
28f540f4
RM
1013The @code{abort} function causes abnormal program termination. This
1014does not execute cleanup functions registered with @code{atexit} or
1015@code{on_exit}.
1016
1017This function actually terminates the process by raising a
1018@code{SIGABRT} signal, and your program can include a handler to
1019intercept this signal; see @ref{Signal Handling}.
1020@end deftypefun
1021
1022@c Put in by rms. Don't remove.
1023@cartouche
1024@strong{Future Change Warning:} Proposed Federal censorship regulations
1025may prohibit us from giving you information about the possibility of
1026calling this function. We would be required to say that this is not an
1027acceptable way of terminating a program.
1028@end cartouche
1029
1030@node Termination Internals
1031@subsection Termination Internals
1032
1033The @code{_exit} function is the primitive used for process termination
1034by @code{exit}. It is declared in the header file @file{unistd.h}.
1035@pindex unistd.h
1036
1037@comment unistd.h
1038@comment POSIX.1
1039@deftypefun void _exit (int @var{status})
663b02d7
AO
1040@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1041@c Direct syscall (exit_group or exit); calls __task_terminate on hurd,
1042@c and abort in the generic posix implementation.
28f540f4
RM
1043The @code{_exit} function is the primitive for causing a process to
1044terminate with status @var{status}. Calling this function does not
1045execute cleanup functions registered with @code{atexit} or
1046@code{on_exit}.
1047@end deftypefun
1048
e518937a
UD
1049@comment stdlib.h
1050@comment ISO
1051@deftypefun void _Exit (int @var{status})
663b02d7
AO
1052@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1053@c Alias for _exit.
e518937a
UD
1054The @code{_Exit} function is the @w{ISO C} equivalent to @code{_exit}.
1055The @w{ISO C} committee members were not sure whether the definitions of
1056@code{_exit} and @code{_Exit} were compatible so they have not used the
1057POSIX name.
1058
ec751a23 1059This function was introduced in @w{ISO C99} and is declared in
e518937a
UD
1060@file{stdlib.h}.
1061@end deftypefun
1062
17c389fc
UD
1063When a process terminates for any reason---either because the program
1064terminates, or as a result of a signal---the
28f540f4
RM
1065following things happen:
1066
1067@itemize @bullet
1068@item
1069All open file descriptors in the process are closed. @xref{Low-Level I/O}.
1070Note that streams are not flushed automatically when the process
8b7fb588 1071terminates; see @ref{I/O on Streams}.
28f540f4
RM
1072
1073@item
17c389fc
UD
1074A process exit status is saved to be reported back to the parent process
1075via @code{wait} or @code{waitpid}; see @ref{Process Completion}. If the
1076program exited, this status includes as its low-order 8 bits the program
1077exit status.
1078
28f540f4
RM
1079
1080@item
1081Any child processes of the process being terminated are assigned a new
1082parent process. (On most systems, including GNU, this is the @code{init}
1083process, with process ID 1.)
1084
1085@item
1086A @code{SIGCHLD} signal is sent to the parent process.
1087
1088@item
1089If the process is a session leader that has a controlling terminal, then
1090a @code{SIGHUP} signal is sent to each process in the foreground job,
1091and the controlling terminal is disassociated from that session.
1092@xref{Job Control}.
1093
1094@item
1095If termination of a process causes a process group to become orphaned,
1096and any member of that process group is stopped, then a @code{SIGHUP}
1097signal and a @code{SIGCONT} signal are sent to each process in the
1098group. @xref{Job Control}.
1099@end itemize