]> git.ipfire.org Git - thirdparty/glibc.git/blob - manual/lang.texi
manual: Replace summary.awk with summary.pl.
[thirdparty/glibc.git] / manual / lang.texi
1 @c This node must have no pointers.
2 @node Language Features
3 @c @node Language Features, Library Summary, , Top
4 @c %MENU% C language features provided by the library
5 @appendix C Language Facilities in the Library
6
7 Some of the facilities implemented by the C library really should be
8 thought of as parts of the C language itself. These facilities ought to
9 be documented in the C Language Manual, not in the library manual; but
10 since we don't have the language manual yet, and documentation for these
11 features has been written, we are publishing it here.
12
13 @menu
14 * Consistency Checking:: Using @code{assert} to abort if
15 something ``impossible'' happens.
16 * Variadic Functions:: Defining functions with varying numbers
17 of args.
18 * Null Pointer Constant:: The macro @code{NULL}.
19 * Important Data Types:: Data types for object sizes.
20 * Data Type Measurements:: Parameters of data type representations.
21 @end menu
22
23 @node Consistency Checking
24 @section Explicitly Checking Internal Consistency
25 @cindex consistency checking
26 @cindex impossible events
27 @cindex assertions
28
29 When you're writing a program, it's often a good idea to put in checks
30 at strategic places for ``impossible'' errors or violations of basic
31 assumptions. These kinds of checks are helpful in debugging problems
32 with the interfaces between different parts of the program, for example.
33
34 @pindex assert.h
35 The @code{assert} macro, defined in the header file @file{assert.h},
36 provides a convenient way to abort the program while printing a message
37 about where in the program the error was detected.
38
39 @vindex NDEBUG
40 Once you think your program is debugged, you can disable the error
41 checks performed by the @code{assert} macro by recompiling with the
42 macro @code{NDEBUG} defined. This means you don't actually have to
43 change the program source code to disable these checks.
44
45 But disabling these consistency checks is undesirable unless they make
46 the program significantly slower. All else being equal, more error
47 checking is good no matter who is running the program. A wise user
48 would rather have a program crash, visibly, than have it return nonsense
49 without indicating anything might be wrong.
50
51 @deftypefn Macro void assert (int @var{expression})
52 @standards{ISO, assert.h}
53 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{} @asucorrupt{}}@acunsafe{@acsmem{} @aculock{} @acucorrupt{}}}
54 @c assert_fail_base calls asprintf, and fflushes stderr.
55 Verify the programmer's belief that @var{expression} is nonzero at
56 this point in the program.
57
58 If @code{NDEBUG} is not defined, @code{assert} tests the value of
59 @var{expression}. If it is false (zero), @code{assert} aborts the
60 program (@pxref{Aborting a Program}) after printing a message of the
61 form:
62
63 @smallexample
64 @file{@var{file}}:@var{linenum}: @var{function}: Assertion `@var{expression}' failed.
65 @end smallexample
66
67 @noindent
68 on the standard error stream @code{stderr} (@pxref{Standard Streams}).
69 The filename and line number are taken from the C preprocessor macros
70 @code{__FILE__} and @code{__LINE__} and specify where the call to
71 @code{assert} was made. When using the GNU C compiler, the name of
72 the function which calls @code{assert} is taken from the built-in
73 variable @code{__PRETTY_FUNCTION__}; with older compilers, the function
74 name and following colon are omitted.
75
76 If the preprocessor macro @code{NDEBUG} is defined before
77 @file{assert.h} is included, the @code{assert} macro is defined to do
78 absolutely nothing.
79
80 @strong{Warning:} Even the argument expression @var{expression} is not
81 evaluated if @code{NDEBUG} is in effect. So never use @code{assert}
82 with arguments that involve side effects. For example, @code{assert
83 (++i > 0);} is a bad idea, because @code{i} will not be incremented if
84 @code{NDEBUG} is defined.
85 @end deftypefn
86
87 Sometimes the ``impossible'' condition you want to check for is an error
88 return from an operating system function. Then it is useful to display
89 not only where the program crashes, but also what error was returned.
90 The @code{assert_perror} macro makes this easy.
91
92 @deftypefn Macro void assert_perror (int @var{errnum})
93 @standards{GNU, assert.h}
94 @safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{} @asucorrupt{}}@acunsafe{@acsmem{} @aculock{} @acucorrupt{}}}
95 @c assert_fail_base calls asprintf, and fflushes stderr.
96 Similar to @code{assert}, but verifies that @var{errnum} is zero.
97
98 If @code{NDEBUG} is not defined, @code{assert_perror} tests the value of
99 @var{errnum}. If it is nonzero, @code{assert_perror} aborts the program
100 after printing a message of the form:
101
102 @smallexample
103 @file{@var{file}}:@var{linenum}: @var{function}: @var{error text}
104 @end smallexample
105
106 @noindent
107 on the standard error stream. The file name, line number, and function
108 name are as for @code{assert}. The error text is the result of
109 @w{@code{strerror (@var{errnum})}}. @xref{Error Messages}.
110
111 Like @code{assert}, if @code{NDEBUG} is defined before @file{assert.h}
112 is included, the @code{assert_perror} macro does absolutely nothing. It
113 does not evaluate the argument, so @var{errnum} should not have any side
114 effects. It is best for @var{errnum} to be just a simple variable
115 reference; often it will be @code{errno}.
116
117 This macro is a GNU extension.
118 @end deftypefn
119
120 @strong{Usage note:} The @code{assert} facility is designed for
121 detecting @emph{internal inconsistency}; it is not suitable for
122 reporting invalid input or improper usage by the @emph{user} of the
123 program.
124
125 The information in the diagnostic messages printed by the @code{assert}
126 and @code{assert_perror} macro is intended to help you, the programmer,
127 track down the cause of a bug, but is not really useful for telling a user
128 of your program why his or her input was invalid or why a command could not
129 be carried out. What's more, your program should not abort when given
130 invalid input, as @code{assert} would do---it should exit with nonzero
131 status (@pxref{Exit Status}) after printing its error messages, or perhaps
132 read another command or move on to the next input file.
133
134 @xref{Error Messages}, for information on printing error messages for
135 problems that @emph{do not} represent bugs in the program.
136
137
138 @node Variadic Functions
139 @section Variadic Functions
140 @cindex variable number of arguments
141 @cindex variadic functions
142 @cindex optional arguments
143
144 @w{ISO C} defines a syntax for declaring a function to take a variable
145 number or type of arguments. (Such functions are referred to as
146 @dfn{varargs functions} or @dfn{variadic functions}.) However, the
147 language itself provides no mechanism for such functions to access their
148 non-required arguments; instead, you use the variable arguments macros
149 defined in @file{stdarg.h}.
150
151 This section describes how to declare variadic functions, how to write
152 them, and how to call them properly.
153
154 @strong{Compatibility Note:} Many older C dialects provide a similar,
155 but incompatible, mechanism for defining functions with variable numbers
156 of arguments, using @file{varargs.h}.
157
158 @menu
159 * Why Variadic:: Reasons for making functions take
160 variable arguments.
161 * How Variadic:: How to define and call variadic functions.
162 * Variadic Example:: A complete example.
163 @end menu
164
165 @node Why Variadic
166 @subsection Why Variadic Functions are Used
167
168 Ordinary C functions take a fixed number of arguments. When you define
169 a function, you specify the data type for each argument. Every call to
170 the function should supply the expected number of arguments, with types
171 that can be converted to the specified ones. Thus, if the function
172 @samp{foo} is declared with @code{int foo (int, char *);} then you must
173 call it with two arguments, a number (any kind will do) and a string
174 pointer.
175
176 But some functions perform operations that can meaningfully accept an
177 unlimited number of arguments.
178
179 In some cases a function can handle any number of values by operating on
180 all of them as a block. For example, consider a function that allocates
181 a one-dimensional array with @code{malloc} to hold a specified set of
182 values. This operation makes sense for any number of values, as long as
183 the length of the array corresponds to that number. Without facilities
184 for variable arguments, you would have to define a separate function for
185 each possible array size.
186
187 The library function @code{printf} (@pxref{Formatted Output}) is an
188 example of another class of function where variable arguments are
189 useful. This function prints its arguments (which can vary in type as
190 well as number) under the control of a format template string.
191
192 These are good reasons to define a @dfn{variadic} function which can
193 handle as many arguments as the caller chooses to pass.
194
195 Some functions such as @code{open} take a fixed set of arguments, but
196 occasionally ignore the last few. Strict adherence to @w{ISO C} requires
197 these functions to be defined as variadic; in practice, however, the GNU
198 C compiler and most other C compilers let you define such a function to
199 take a fixed set of arguments---the most it can ever use---and then only
200 @emph{declare} the function as variadic (or not declare its arguments
201 at all!).
202
203 @node How Variadic
204 @subsection How Variadic Functions are Defined and Used
205
206 Defining and using a variadic function involves three steps:
207
208 @itemize @bullet
209 @item
210 @emph{Define} the function as variadic, using an ellipsis
211 (@samp{@dots{}}) in the argument list, and using special macros to
212 access the variable arguments. @xref{Receiving Arguments}.
213
214 @item
215 @emph{Declare} the function as variadic, using a prototype with an
216 ellipsis (@samp{@dots{}}), in all the files which call it.
217 @xref{Variadic Prototypes}.
218
219 @item
220 @emph{Call} the function by writing the fixed arguments followed by the
221 additional variable arguments. @xref{Calling Variadics}.
222 @end itemize
223
224 @menu
225 * Variadic Prototypes:: How to make a prototype for a function
226 with variable arguments.
227 * Receiving Arguments:: Steps you must follow to access the
228 optional argument values.
229 * How Many Arguments:: How to decide whether there are more arguments.
230 * Calling Variadics:: Things you need to know about calling
231 variable arguments functions.
232 * Argument Macros:: Detailed specification of the macros
233 for accessing variable arguments.
234 @end menu
235
236 @node Variadic Prototypes
237 @subsubsection Syntax for Variable Arguments
238 @cindex function prototypes (variadic)
239 @cindex prototypes for variadic functions
240 @cindex variadic function prototypes
241
242 A function that accepts a variable number of arguments must be declared
243 with a prototype that says so. You write the fixed arguments as usual,
244 and then tack on @samp{@dots{}} to indicate the possibility of
245 additional arguments. The syntax of @w{ISO C} requires at least one fixed
246 argument before the @samp{@dots{}}. For example,
247
248 @smallexample
249 int
250 func (const char *a, int b, @dots{})
251 @{
252 @dots{}
253 @}
254 @end smallexample
255
256 @noindent
257 defines a function @code{func} which returns an @code{int} and takes two
258 required arguments, a @code{const char *} and an @code{int}. These are
259 followed by any number of anonymous arguments.
260
261 @strong{Portability note:} For some C compilers, the last required
262 argument must not be declared @code{register} in the function
263 definition. Furthermore, this argument's type must be
264 @dfn{self-promoting}: that is, the default promotions must not change
265 its type. This rules out array and function types, as well as
266 @code{float}, @code{char} (whether signed or not) and @w{@code{short int}}
267 (whether signed or not). This is actually an @w{ISO C} requirement.
268
269 @node Receiving Arguments
270 @subsubsection Receiving the Argument Values
271 @cindex variadic function argument access
272 @cindex arguments (variadic functions)
273
274 Ordinary fixed arguments have individual names, and you can use these
275 names to access their values. But optional arguments have no
276 names---nothing but @samp{@dots{}}. How can you access them?
277
278 @pindex stdarg.h
279 The only way to access them is sequentially, in the order they were
280 written, and you must use special macros from @file{stdarg.h} in the
281 following three step process:
282
283 @enumerate
284 @item
285 You initialize an argument pointer variable of type @code{va_list} using
286 @code{va_start}. The argument pointer when initialized points to the
287 first optional argument.
288
289 @item
290 You access the optional arguments by successive calls to @code{va_arg}.
291 The first call to @code{va_arg} gives you the first optional argument,
292 the next call gives you the second, and so on.
293
294 You can stop at any time if you wish to ignore any remaining optional
295 arguments. It is perfectly all right for a function to access fewer
296 arguments than were supplied in the call, but you will get garbage
297 values if you try to access too many arguments.
298
299 @item
300 You indicate that you are finished with the argument pointer variable by
301 calling @code{va_end}.
302
303 (In practice, with most C compilers, calling @code{va_end} does nothing.
304 This is always true in the GNU C compiler. But you might as well call
305 @code{va_end} just in case your program is someday compiled with a peculiar
306 compiler.)
307 @end enumerate
308
309 @xref{Argument Macros}, for the full definitions of @code{va_start},
310 @code{va_arg} and @code{va_end}.
311
312 Steps 1 and 3 must be performed in the function that accepts the
313 optional arguments. However, you can pass the @code{va_list} variable
314 as an argument to another function and perform all or part of step 2
315 there.
316
317 You can perform the entire sequence of three steps multiple times
318 within a single function invocation. If you want to ignore the optional
319 arguments, you can do these steps zero times.
320
321 You can have more than one argument pointer variable if you like. You
322 can initialize each variable with @code{va_start} when you wish, and
323 then you can fetch arguments with each argument pointer as you wish.
324 Each argument pointer variable will sequence through the same set of
325 argument values, but at its own pace.
326
327 @strong{Portability note:} With some compilers, once you pass an
328 argument pointer value to a subroutine, you must not keep using the same
329 argument pointer value after that subroutine returns. For full
330 portability, you should just pass it to @code{va_end}. This is actually
331 an @w{ISO C} requirement, but most ANSI C compilers work happily
332 regardless.
333
334 @node How Many Arguments
335 @subsubsection How Many Arguments Were Supplied
336 @cindex number of arguments passed
337 @cindex how many arguments
338 @cindex arguments, how many
339
340 There is no general way for a function to determine the number and type
341 of the optional arguments it was called with. So whoever designs the
342 function typically designs a convention for the caller to specify the number
343 and type of arguments. It is up to you to define an appropriate calling
344 convention for each variadic function, and write all calls accordingly.
345
346 One kind of calling convention is to pass the number of optional
347 arguments as one of the fixed arguments. This convention works provided
348 all of the optional arguments are of the same type.
349
350 A similar alternative is to have one of the required arguments be a bit
351 mask, with a bit for each possible purpose for which an optional
352 argument might be supplied. You would test the bits in a predefined
353 sequence; if the bit is set, fetch the value of the next argument,
354 otherwise use a default value.
355
356 A required argument can be used as a pattern to specify both the number
357 and types of the optional arguments. The format string argument to
358 @code{printf} is one example of this (@pxref{Formatted Output Functions}).
359
360 Another possibility is to pass an ``end marker'' value as the last
361 optional argument. For example, for a function that manipulates an
362 arbitrary number of pointer arguments, a null pointer might indicate the
363 end of the argument list. (This assumes that a null pointer isn't
364 otherwise meaningful to the function.) The @code{execl} function works
365 in just this way; see @ref{Executing a File}.
366
367
368 @node Calling Variadics
369 @subsubsection Calling Variadic Functions
370 @cindex variadic functions, calling
371 @cindex calling variadic functions
372 @cindex declaring variadic functions
373
374 You don't have to do anything special to call a variadic function.
375 Just put the arguments (required arguments, followed by optional ones)
376 inside parentheses, separated by commas, as usual. But you must declare
377 the function with a prototype and know how the argument values are converted.
378
379 In principle, functions that are @emph{defined} to be variadic must also
380 be @emph{declared} to be variadic using a function prototype whenever
381 you call them. (@xref{Variadic Prototypes}, for how.) This is because
382 some C compilers use a different calling convention to pass the same set
383 of argument values to a function depending on whether that function
384 takes variable arguments or fixed arguments.
385
386 In practice, the GNU C compiler always passes a given set of argument
387 types in the same way regardless of whether they are optional or
388 required. So, as long as the argument types are self-promoting, you can
389 safely omit declaring them. Usually it is a good idea to declare the
390 argument types for variadic functions, and indeed for all functions.
391 But there are a few functions which it is extremely convenient not to
392 have to declare as variadic---for example, @code{open} and
393 @code{printf}.
394
395 @cindex default argument promotions
396 @cindex argument promotion
397 Since the prototype doesn't specify types for optional arguments, in a
398 call to a variadic function the @dfn{default argument promotions} are
399 performed on the optional argument values. This means the objects of
400 type @code{char} or @w{@code{short int}} (whether signed or not) are
401 promoted to either @code{int} or @w{@code{unsigned int}}, as
402 appropriate; and that objects of type @code{float} are promoted to type
403 @code{double}. So, if the caller passes a @code{char} as an optional
404 argument, it is promoted to an @code{int}, and the function can access
405 it with @code{va_arg (@var{ap}, int)}.
406
407 Conversion of the required arguments is controlled by the function
408 prototype in the usual way: the argument expression is converted to the
409 declared argument type as if it were being assigned to a variable of
410 that type.
411
412 @node Argument Macros
413 @subsubsection Argument Access Macros
414
415 Here are descriptions of the macros used to retrieve variable arguments.
416 These macros are defined in the header file @file{stdarg.h}.
417 @pindex stdarg.h
418
419 @deftp {Data Type} va_list
420 @standards{ISO, stdarg.h}
421 The type @code{va_list} is used for argument pointer variables.
422 @end deftp
423
424 @deftypefn {Macro} void va_start (va_list @var{ap}, @var{last-required})
425 @standards{ISO, stdarg.h}
426 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
427 @c This is no longer provided by glibc, but rather by the compiler.
428 This macro initializes the argument pointer variable @var{ap} to point
429 to the first of the optional arguments of the current function;
430 @var{last-required} must be the last required argument to the function.
431 @end deftypefn
432
433 @deftypefn {Macro} @var{type} va_arg (va_list @var{ap}, @var{type})
434 @standards{ISO, stdarg.h}
435 @safety{@prelim{}@mtsafe{@mtsrace{:ap}}@assafe{}@acunsafe{@acucorrupt{}}}
436 @c This is no longer provided by glibc, but rather by the compiler.
437 @c Unlike the other va_ macros, that either start/end the lifetime of
438 @c the va_list object or don't modify it, this one modifies ap, and it
439 @c may leave it in a partially updated state.
440 The @code{va_arg} macro returns the value of the next optional argument,
441 and modifies the value of @var{ap} to point to the subsequent argument.
442 Thus, successive uses of @code{va_arg} return successive optional
443 arguments.
444
445 The type of the value returned by @code{va_arg} is @var{type} as
446 specified in the call. @var{type} must be a self-promoting type (not
447 @code{char} or @code{short int} or @code{float}) that matches the type
448 of the actual argument.
449 @end deftypefn
450
451 @deftypefn {Macro} void va_end (va_list @var{ap})
452 @standards{ISO, stdarg.h}
453 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
454 @c This is no longer provided by glibc, but rather by the compiler.
455 This ends the use of @var{ap}. After a @code{va_end} call, further
456 @code{va_arg} calls with the same @var{ap} may not work. You should invoke
457 @code{va_end} before returning from the function in which @code{va_start}
458 was invoked with the same @var{ap} argument.
459
460 In @theglibc{}, @code{va_end} does nothing, and you need not ever
461 use it except for reasons of portability.
462 @refill
463 @end deftypefn
464
465 Sometimes it is necessary to parse the list of parameters more than once
466 or one wants to remember a certain position in the parameter list. To
467 do this, one will have to make a copy of the current value of the
468 argument. But @code{va_list} is an opaque type and one cannot necessarily
469 assign the value of one variable of type @code{va_list} to another variable
470 of the same type.
471
472 @deftypefn {Macro} void va_copy (va_list @var{dest}, va_list @var{src})
473 @deftypefnx {Macro} void __va_copy (va_list @var{dest}, va_list @var{src})
474 @standardsx{va_copy, ISO, stdarg.h}
475 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
476 @c This is no longer provided by glibc, but rather by the compiler.
477 The @code{va_copy} macro allows copying of objects of type
478 @code{va_list} even if this is not an integral type. The argument pointer
479 in @var{dest} is initialized to point to the same argument as the
480 pointer in @var{src}.
481
482 This macro was added in ISO C99. When building for strict conformance
483 to ISO C90 (@samp{gcc -ansi}), it is not available. The macro
484 @code{__va_copy} is available as a GNU extension in any standards
485 mode; before GCC 3.0, it was the only macro for this functionality.
486 @end deftypefn
487
488 If you want to use @code{va_copy} and be portable to pre-C99 systems,
489 you should always be prepared for the
490 possibility that this macro will not be available. On architectures where a
491 simple assignment is invalid, hopefully @code{va_copy} @emph{will} be available,
492 so one should always write something like this if concerned about
493 pre-C99 portability:
494
495 @smallexample
496 @{
497 va_list ap, save;
498 @dots{}
499 #ifdef va_copy
500 va_copy (save, ap);
501 #else
502 save = ap;
503 #endif
504 @dots{}
505 @}
506 @end smallexample
507
508
509 @node Variadic Example
510 @subsection Example of a Variadic Function
511
512 Here is a complete sample function that accepts a variable number of
513 arguments. The first argument to the function is the count of remaining
514 arguments, which are added up and the result returned. While trivial,
515 this function is sufficient to illustrate how to use the variable
516 arguments facility.
517
518 @comment Yes, this example has been tested.
519 @smallexample
520 @include add.c.texi
521 @end smallexample
522
523 @node Null Pointer Constant
524 @section Null Pointer Constant
525 @cindex null pointer constant
526
527 The null pointer constant is guaranteed not to point to any real object.
528 You can assign it to any pointer variable since it has type @code{void
529 *}. The preferred way to write a null pointer constant is with
530 @code{NULL}.
531
532 @deftypevr Macro {void *} NULL
533 @standards{ISO, stddef.h}
534 This is a null pointer constant.
535 @end deftypevr
536
537 You can also use @code{0} or @code{(void *)0} as a null pointer
538 constant, but using @code{NULL} is cleaner because it makes the purpose
539 of the constant more evident.
540
541 If you use the null pointer constant as a function argument, then for
542 complete portability you should make sure that the function has a
543 prototype declaration. Otherwise, if the target machine has two
544 different pointer representations, the compiler won't know which
545 representation to use for that argument. You can avoid the problem by
546 explicitly casting the constant to the proper pointer type, but we
547 recommend instead adding a prototype for the function you are calling.
548
549 @node Important Data Types
550 @section Important Data Types
551
552 The result of subtracting two pointers in C is always an integer, but the
553 precise data type varies from C compiler to C compiler. Likewise, the
554 data type of the result of @code{sizeof} also varies between compilers.
555 ISO C defines standard aliases for these two types, so you can refer to
556 them in a portable fashion. They are defined in the header file
557 @file{stddef.h}.
558 @pindex stddef.h
559
560 @deftp {Data Type} ptrdiff_t
561 @standards{ISO, stddef.h}
562 This is the signed integer type of the result of subtracting two
563 pointers. For example, with the declaration @code{char *p1, *p2;}, the
564 expression @code{p2 - p1} is of type @code{ptrdiff_t}. This will
565 probably be one of the standard signed integer types (@w{@code{short
566 int}}, @code{int} or @w{@code{long int}}), but might be a nonstandard
567 type that exists only for this purpose.
568 @end deftp
569
570 @deftp {Data Type} size_t
571 @standards{ISO, stddef.h}
572 This is an unsigned integer type used to represent the sizes of objects.
573 The result of the @code{sizeof} operator is of this type, and functions
574 such as @code{malloc} (@pxref{Unconstrained Allocation}) and
575 @code{memcpy} (@pxref{Copying Strings and Arrays}) accept arguments of
576 this type to specify object sizes. On systems using @theglibc{}, this
577 will be @w{@code{unsigned int}} or @w{@code{unsigned long int}}.
578
579 @strong{Usage Note:} @code{size_t} is the preferred way to declare any
580 arguments or variables that hold the size of an object.
581 @end deftp
582
583 @strong{Compatibility Note:} Implementations of C before the advent of
584 @w{ISO C} generally used @code{unsigned int} for representing object sizes
585 and @code{int} for pointer subtraction results. They did not
586 necessarily define either @code{size_t} or @code{ptrdiff_t}. Unix
587 systems did define @code{size_t}, in @file{sys/types.h}, but the
588 definition was usually a signed type.
589
590 @node Data Type Measurements
591 @section Data Type Measurements
592
593 Most of the time, if you choose the proper C data type for each object
594 in your program, you need not be concerned with just how it is
595 represented or how many bits it uses. When you do need such
596 information, the C language itself does not provide a way to get it.
597 The header files @file{limits.h} and @file{float.h} contain macros
598 which give you this information in full detail.
599
600 @menu
601 * Width of Type:: How many bits does an integer type hold?
602 * Range of Type:: What are the largest and smallest values
603 that an integer type can hold?
604 * Floating Type Macros:: Parameters that measure the floating point types.
605 * Structure Measurement:: Getting measurements on structure types.
606 @end menu
607
608 @node Width of Type
609 @subsection Computing the Width of an Integer Data Type
610 @cindex integer type width
611 @cindex width of integer type
612 @cindex type measurements, integer
613
614 The most common reason that a program needs to know how many bits are in
615 an integer type is for using an array of @code{long int} as a bit vector.
616 You can access the bit at index @var{n} with
617
618 @smallexample
619 vector[@var{n} / LONGBITS] & (1 << (@var{n} % LONGBITS))
620 @end smallexample
621
622 @noindent
623 provided you define @code{LONGBITS} as the number of bits in a
624 @code{long int}.
625
626 @pindex limits.h
627 There is no operator in the C language that can give you the number of
628 bits in an integer data type. But you can compute it from the macro
629 @code{CHAR_BIT}, defined in the header file @file{limits.h}.
630
631 @table @code
632 @item CHAR_BIT
633 @standards{ISO, limits.h}
634 This is the number of bits in a @code{char}---eight, on most systems.
635 The value has type @code{int}.
636
637 You can compute the number of bits in any data type @var{type} like
638 this:
639
640 @smallexample
641 sizeof (@var{type}) * CHAR_BIT
642 @end smallexample
643 @end table
644
645 That expression includes padding bits as well as value and sign bits.
646 On all systems supported by @theglibc{}, standard integer types other
647 than @code{_Bool} do not have any padding bits. TS 18661-1:2014
648 defines additional macros for the width of integer types (the number
649 of value and sign bits); these macros can also be used in @code{#if}
650 preprocessor directives, whereas @code{sizeof} cannot. The following
651 macros are defined in @file{limits.h}.
652
653 @vtable @code
654 @item CHAR_WIDTH
655 @itemx SCHAR_WIDTH
656 @itemx UCHAR_WIDTH
657 @itemx SHRT_WIDTH
658 @itemx USHRT_WIDTH
659 @itemx INT_WIDTH
660 @itemx UINT_WIDTH
661 @itemx LONG_WIDTH
662 @itemx ULONG_WIDTH
663 @itemx LLONG_WIDTH
664 @itemx ULLONG_WIDTH
665 @standards{ISO, limits.h}
666
667 These are the widths of the types @code{char}, @code{signed char},
668 @code{unsigned char}, @code{short int}, @code{unsigned short int},
669 @code{int}, @code{unsigned int}, @code{long int}, @code{unsigned long
670 int}, @code{long long int} and @code{unsigned long long int},
671 respectively.
672 @end vtable
673
674 Further such macros are defined in @file{stdint.h}. Apart from those
675 for types specified by width (@pxref{Integers}), the following are
676 defined.
677
678 @vtable @code
679 @item INTPTR_WIDTH
680 @itemx UINTPTR_WIDTH
681 @itemx PTRDIFF_WIDTH
682 @itemx SIG_ATOMIC_WIDTH
683 @itemx SIZE_WIDTH
684 @itemx WCHAR_WIDTH
685 @itemx WINT_WIDTH
686 @standards{ISO, stdint.h}
687
688 These are the widths of the types @code{intptr_t}, @code{uintptr_t},
689 @code{ptrdiff_t}, @code{sig_atomic_t}, @code{size_t}, @code{wchar_t}
690 and @code{wint_t}, respectively.
691 @end vtable
692
693 @node Range of Type
694 @subsection Range of an Integer Type
695 @cindex integer type range
696 @cindex range of integer type
697 @cindex limits, integer types
698
699 Suppose you need to store an integer value which can range from zero to
700 one million. Which is the smallest type you can use? There is no
701 general rule; it depends on the C compiler and target machine. You can
702 use the @samp{MIN} and @samp{MAX} macros in @file{limits.h} to determine
703 which type will work.
704
705 Each signed integer type has a pair of macros which give the smallest
706 and largest values that it can hold. Each unsigned integer type has one
707 such macro, for the maximum value; the minimum value is, of course,
708 zero.
709
710 The values of these macros are all integer constant expressions. The
711 @samp{MAX} and @samp{MIN} macros for @code{char} and @w{@code{short
712 int}} types have values of type @code{int}. The @samp{MAX} and
713 @samp{MIN} macros for the other types have values of the same type
714 described by the macro---thus, @code{ULONG_MAX} has type
715 @w{@code{unsigned long int}}.
716
717 @comment Extra blank lines make it look better.
718 @vtable @code
719 @item SCHAR_MIN
720 @standards{ISO, limits.h}
721
722 This is the minimum value that can be represented by a @w{@code{signed char}}.
723
724 @item SCHAR_MAX
725 @itemx UCHAR_MAX
726 @standards{ISO, limits.h}
727
728 These are the maximum values that can be represented by a
729 @w{@code{signed char}} and @w{@code{unsigned char}}, respectively.
730
731 @item CHAR_MIN
732 @standards{ISO, limits.h}
733
734 This is the minimum value that can be represented by a @code{char}.
735 It's equal to @code{SCHAR_MIN} if @code{char} is signed, or zero
736 otherwise.
737
738 @item CHAR_MAX
739 @standards{ISO, limits.h}
740
741 This is the maximum value that can be represented by a @code{char}.
742 It's equal to @code{SCHAR_MAX} if @code{char} is signed, or
743 @code{UCHAR_MAX} otherwise.
744
745 @item SHRT_MIN
746 @standards{ISO, limits.h}
747
748 This is the minimum value that can be represented by a @w{@code{signed
749 short int}}. On most machines that @theglibc{} runs on,
750 @code{short} integers are 16-bit quantities.
751
752 @item SHRT_MAX
753 @itemx USHRT_MAX
754 @standards{ISO, limits.h}
755
756 These are the maximum values that can be represented by a
757 @w{@code{signed short int}} and @w{@code{unsigned short int}},
758 respectively.
759
760 @item INT_MIN
761 @standards{ISO, limits.h}
762
763 This is the minimum value that can be represented by a @w{@code{signed
764 int}}. On most machines that @theglibc{} runs on, an @code{int} is
765 a 32-bit quantity.
766
767 @item INT_MAX
768 @itemx UINT_MAX
769 @standards{ISO, limits.h}
770
771 These are the maximum values that can be represented by, respectively,
772 the type @w{@code{signed int}} and the type @w{@code{unsigned int}}.
773
774 @item LONG_MIN
775 @standards{ISO, limits.h}
776
777 This is the minimum value that can be represented by a @w{@code{signed
778 long int}}. On most machines that @theglibc{} runs on, @code{long}
779 integers are 32-bit quantities, the same size as @code{int}.
780
781 @item LONG_MAX
782 @itemx ULONG_MAX
783 @standards{ISO, limits.h}
784
785 These are the maximum values that can be represented by a
786 @w{@code{signed long int}} and @code{unsigned long int}, respectively.
787
788 @item LLONG_MIN
789 @standards{ISO, limits.h}
790
791 This is the minimum value that can be represented by a @w{@code{signed
792 long long int}}. On most machines that @theglibc{} runs on,
793 @w{@code{long long}} integers are 64-bit quantities.
794
795 @item LLONG_MAX
796 @itemx ULLONG_MAX
797 @standards{ISO, limits.h}
798
799 These are the maximum values that can be represented by a @code{signed
800 long long int} and @code{unsigned long long int}, respectively.
801
802 @item LONG_LONG_MIN
803 @itemx LONG_LONG_MAX
804 @itemx ULONG_LONG_MAX
805 @standards{GNU, limits.h}
806 These are obsolete names for @code{LLONG_MIN}, @code{LLONG_MAX}, and
807 @code{ULLONG_MAX}. They are only available if @code{_GNU_SOURCE} is
808 defined (@pxref{Feature Test Macros}). In GCC versions prior to 3.0,
809 these were the only names available.
810
811 @item WCHAR_MAX
812 @standards{GNU, limits.h}
813
814 This is the maximum value that can be represented by a @code{wchar_t}.
815 @xref{Extended Char Intro}.
816 @end vtable
817
818 The header file @file{limits.h} also defines some additional constants
819 that parameterize various operating system and file system limits. These
820 constants are described in @ref{System Configuration}.
821
822 @node Floating Type Macros
823 @subsection Floating Type Macros
824 @cindex floating type measurements
825 @cindex measurements of floating types
826 @cindex type measurements, floating
827 @cindex limits, floating types
828
829 The specific representation of floating point numbers varies from
830 machine to machine. Because floating point numbers are represented
831 internally as approximate quantities, algorithms for manipulating
832 floating point data often need to take account of the precise details of
833 the machine's floating point representation.
834
835 Some of the functions in the C library itself need this information; for
836 example, the algorithms for printing and reading floating point numbers
837 (@pxref{I/O on Streams}) and for calculating trigonometric and
838 irrational functions (@pxref{Mathematics}) use it to avoid round-off
839 error and loss of accuracy. User programs that implement numerical
840 analysis techniques also often need this information in order to
841 minimize or compute error bounds.
842
843 The header file @file{float.h} describes the format used by your
844 machine.
845
846 @menu
847 * Floating Point Concepts:: Definitions of terminology.
848 * Floating Point Parameters:: Details of specific macros.
849 * IEEE Floating Point:: The measurements for one common
850 representation.
851 @end menu
852
853 @node Floating Point Concepts
854 @subsubsection Floating Point Representation Concepts
855
856 This section introduces the terminology for describing floating point
857 representations.
858
859 You are probably already familiar with most of these concepts in terms
860 of scientific or exponential notation for floating point numbers. For
861 example, the number @code{123456.0} could be expressed in exponential
862 notation as @code{1.23456e+05}, a shorthand notation indicating that the
863 mantissa @code{1.23456} is multiplied by the base @code{10} raised to
864 power @code{5}.
865
866 More formally, the internal representation of a floating point number
867 can be characterized in terms of the following parameters:
868
869 @itemize @bullet
870 @item
871 @cindex sign (of floating point number)
872 The @dfn{sign} is either @code{-1} or @code{1}.
873
874 @item
875 @cindex base (of floating point number)
876 @cindex radix (of floating point number)
877 The @dfn{base} or @dfn{radix} for exponentiation, an integer greater
878 than @code{1}. This is a constant for a particular representation.
879
880 @item
881 @cindex exponent (of floating point number)
882 The @dfn{exponent} to which the base is raised. The upper and lower
883 bounds of the exponent value are constants for a particular
884 representation.
885
886 @cindex bias (of floating point number exponent)
887 Sometimes, in the actual bits representing the floating point number,
888 the exponent is @dfn{biased} by adding a constant to it, to make it
889 always be represented as an unsigned quantity. This is only important
890 if you have some reason to pick apart the bit fields making up the
891 floating point number by hand, which is something for which @theglibc{}
892 provides no support. So this is ignored in the discussion that
893 follows.
894
895 @item
896 @cindex mantissa (of floating point number)
897 @cindex significand (of floating point number)
898 The @dfn{mantissa} or @dfn{significand} is an unsigned integer which is a
899 part of each floating point number.
900
901 @item
902 @cindex precision (of floating point number)
903 The @dfn{precision} of the mantissa. If the base of the representation
904 is @var{b}, then the precision is the number of base-@var{b} digits in
905 the mantissa. This is a constant for a particular representation.
906
907 @cindex hidden bit (of floating point number mantissa)
908 Many floating point representations have an implicit @dfn{hidden bit} in
909 the mantissa. This is a bit which is present virtually in the mantissa,
910 but not stored in memory because its value is always 1 in a normalized
911 number. The precision figure (see above) includes any hidden bits.
912
913 Again, @theglibc{} provides no facilities for dealing with such
914 low-level aspects of the representation.
915 @end itemize
916
917 The mantissa of a floating point number represents an implicit fraction
918 whose denominator is the base raised to the power of the precision. Since
919 the largest representable mantissa is one less than this denominator, the
920 value of the fraction is always strictly less than @code{1}. The
921 mathematical value of a floating point number is then the product of this
922 fraction, the sign, and the base raised to the exponent.
923
924 @cindex normalized floating point number
925 We say that the floating point number is @dfn{normalized} if the
926 fraction is at least @code{1/@var{b}}, where @var{b} is the base. In
927 other words, the mantissa would be too large to fit if it were
928 multiplied by the base. Non-normalized numbers are sometimes called
929 @dfn{denormal}; they contain less precision than the representation
930 normally can hold.
931
932 If the number is not normalized, then you can subtract @code{1} from the
933 exponent while multiplying the mantissa by the base, and get another
934 floating point number with the same value. @dfn{Normalization} consists
935 of doing this repeatedly until the number is normalized. Two distinct
936 normalized floating point numbers cannot be equal in value.
937
938 (There is an exception to this rule: if the mantissa is zero, it is
939 considered normalized. Another exception happens on certain machines
940 where the exponent is as small as the representation can hold. Then
941 it is impossible to subtract @code{1} from the exponent, so a number
942 may be normalized even if its fraction is less than @code{1/@var{b}}.)
943
944 @node Floating Point Parameters
945 @subsubsection Floating Point Parameters
946
947 @pindex float.h
948 These macro definitions can be accessed by including the header file
949 @file{float.h} in your program.
950
951 Macro names starting with @samp{FLT_} refer to the @code{float} type,
952 while names beginning with @samp{DBL_} refer to the @code{double} type
953 and names beginning with @samp{LDBL_} refer to the @code{long double}
954 type. (If GCC does not support @code{long double} as a distinct data
955 type on a target machine then the values for the @samp{LDBL_} constants
956 are equal to the corresponding constants for the @code{double} type.)
957
958 Of these macros, only @code{FLT_RADIX} is guaranteed to be a constant
959 expression. The other macros listed here cannot be reliably used in
960 places that require constant expressions, such as @samp{#if}
961 preprocessing directives or in the dimensions of static arrays.
962
963 Although the @w{ISO C} standard specifies minimum and maximum values for
964 most of these parameters, the GNU C implementation uses whatever values
965 describe the floating point representation of the target machine. So in
966 principle GNU C actually satisfies the @w{ISO C} requirements only if the
967 target machine is suitable. In practice, all the machines currently
968 supported are suitable.
969
970 @vtable @code
971 @item FLT_ROUNDS
972 @standards{ISO, float.h}
973 This value characterizes the rounding mode for floating point addition.
974 The following values indicate standard rounding modes:
975
976 @need 750
977
978 @table @code
979 @item -1
980 The mode is indeterminable.
981 @item 0
982 Rounding is towards zero.
983 @item 1
984 Rounding is to the nearest number.
985 @item 2
986 Rounding is towards positive infinity.
987 @item 3
988 Rounding is towards negative infinity.
989 @end table
990
991 @noindent
992 Any other value represents a machine-dependent nonstandard rounding
993 mode.
994
995 On most machines, the value is @code{1}, in accordance with the IEEE
996 standard for floating point.
997
998 Here is a table showing how certain values round for each possible value
999 of @code{FLT_ROUNDS}, if the other aspects of the representation match
1000 the IEEE single-precision standard.
1001
1002 @smallexample
1003 0 1 2 3
1004 1.00000003 1.0 1.0 1.00000012 1.0
1005 1.00000007 1.0 1.00000012 1.00000012 1.0
1006 -1.00000003 -1.0 -1.0 -1.0 -1.00000012
1007 -1.00000007 -1.0 -1.00000012 -1.0 -1.00000012
1008 @end smallexample
1009
1010 @item FLT_RADIX
1011 @standards{ISO, float.h}
1012 This is the value of the base, or radix, of the exponent representation.
1013 This is guaranteed to be a constant expression, unlike the other macros
1014 described in this section. The value is 2 on all machines we know of
1015 except the IBM 360 and derivatives.
1016
1017 @item FLT_MANT_DIG
1018 @standards{ISO, float.h}
1019 This is the number of base-@code{FLT_RADIX} digits in the floating point
1020 mantissa for the @code{float} data type. The following expression
1021 yields @code{1.0} (even though mathematically it should not) due to the
1022 limited number of mantissa digits:
1023
1024 @smallexample
1025 float radix = FLT_RADIX;
1026
1027 1.0f + 1.0f / radix / radix / @dots{} / radix
1028 @end smallexample
1029
1030 @noindent
1031 where @code{radix} appears @code{FLT_MANT_DIG} times.
1032
1033 @item DBL_MANT_DIG
1034 @itemx LDBL_MANT_DIG
1035 @standardsx{DBL_MANT_DIG, ISO, float.h}
1036 This is the number of base-@code{FLT_RADIX} digits in the floating point
1037 mantissa for the data types @code{double} and @code{long double},
1038 respectively.
1039
1040 @comment Extra blank lines make it look better.
1041 @item FLT_DIG
1042 @standards{ISO, float.h}
1043
1044 This is the number of decimal digits of precision for the @code{float}
1045 data type. Technically, if @var{p} and @var{b} are the precision and
1046 base (respectively) for the representation, then the decimal precision
1047 @var{q} is the maximum number of decimal digits such that any floating
1048 point number with @var{q} base 10 digits can be rounded to a floating
1049 point number with @var{p} base @var{b} digits and back again, without
1050 change to the @var{q} decimal digits.
1051
1052 The value of this macro is supposed to be at least @code{6}, to satisfy
1053 @w{ISO C}.
1054
1055 @item DBL_DIG
1056 @itemx LDBL_DIG
1057 @standardsx{DBL_DIG, ISO, float.h}
1058
1059 These are similar to @code{FLT_DIG}, but for the data types
1060 @code{double} and @code{long double}, respectively. The values of these
1061 macros are supposed to be at least @code{10}.
1062
1063 @item FLT_MIN_EXP
1064 @standards{ISO, float.h}
1065 This is the smallest possible exponent value for type @code{float}.
1066 More precisely, it is the minimum negative integer such that the value
1067 @code{FLT_RADIX} raised to this power minus 1 can be represented as a
1068 normalized floating point number of type @code{float}.
1069
1070 @item DBL_MIN_EXP
1071 @itemx LDBL_MIN_EXP
1072 @standardsx{DBL_MIN_EXP, ISO, float.h}
1073
1074 These are similar to @code{FLT_MIN_EXP}, but for the data types
1075 @code{double} and @code{long double}, respectively.
1076
1077 @item FLT_MIN_10_EXP
1078 @standards{ISO, float.h}
1079 This is the minimum negative integer such that @code{10} raised to this
1080 power minus 1 can be represented as a normalized floating point number
1081 of type @code{float}. This is supposed to be @code{-37} or even less.
1082
1083 @item DBL_MIN_10_EXP
1084 @itemx LDBL_MIN_10_EXP
1085 @standardsx{DBL_MIN_10_EXP, ISO, float.h}
1086 These are similar to @code{FLT_MIN_10_EXP}, but for the data types
1087 @code{double} and @code{long double}, respectively.
1088
1089 @item FLT_MAX_EXP
1090 @standards{ISO, float.h}
1091 This is the largest possible exponent value for type @code{float}. More
1092 precisely, this is the maximum positive integer such that value
1093 @code{FLT_RADIX} raised to this power minus 1 can be represented as a
1094 floating point number of type @code{float}.
1095
1096 @item DBL_MAX_EXP
1097 @itemx LDBL_MAX_EXP
1098 @standardsx{DBL_MAX_EXP, ISO, float.h}
1099 These are similar to @code{FLT_MAX_EXP}, but for the data types
1100 @code{double} and @code{long double}, respectively.
1101
1102 @item FLT_MAX_10_EXP
1103 @standards{ISO, float.h}
1104 This is the maximum positive integer such that @code{10} raised to this
1105 power minus 1 can be represented as a normalized floating point number
1106 of type @code{float}. This is supposed to be at least @code{37}.
1107
1108 @item DBL_MAX_10_EXP
1109 @itemx LDBL_MAX_10_EXP
1110 @standardsx{DBL_MAX_10_EXP, ISO, float.h}
1111 These are similar to @code{FLT_MAX_10_EXP}, but for the data types
1112 @code{double} and @code{long double}, respectively.
1113
1114 @item FLT_MAX
1115 @standards{ISO, float.h}
1116
1117 The value of this macro is the maximum number representable in type
1118 @code{float}. It is supposed to be at least @code{1E+37}. The value
1119 has type @code{float}.
1120
1121 The smallest representable number is @code{- FLT_MAX}.
1122
1123 @item DBL_MAX
1124 @itemx LDBL_MAX
1125 @standardsx{DBL_MAX, ISO, float.h}
1126
1127 These are similar to @code{FLT_MAX}, but for the data types
1128 @code{double} and @code{long double}, respectively. The type of the
1129 macro's value is the same as the type it describes.
1130
1131 @item FLT_MIN
1132 @standards{ISO, float.h}
1133
1134 The value of this macro is the minimum normalized positive floating
1135 point number that is representable in type @code{float}. It is supposed
1136 to be no more than @code{1E-37}.
1137
1138 @item DBL_MIN
1139 @itemx LDBL_MIN
1140 @standardsx{DBL_MIN, ISO, float.h}
1141
1142 These are similar to @code{FLT_MIN}, but for the data types
1143 @code{double} and @code{long double}, respectively. The type of the
1144 macro's value is the same as the type it describes.
1145
1146 @item FLT_EPSILON
1147 @standards{ISO, float.h}
1148
1149 This is the difference between 1 and the smallest floating point
1150 number of type @code{float} that is greater than 1. It's supposed to
1151 be no greater than @code{1E-5}.
1152
1153 @item DBL_EPSILON
1154 @itemx LDBL_EPSILON
1155 @standardsx{DBL_EPSILON, ISO, float.h}
1156
1157 These are similar to @code{FLT_EPSILON}, but for the data types
1158 @code{double} and @code{long double}, respectively. The type of the
1159 macro's value is the same as the type it describes. The values are not
1160 supposed to be greater than @code{1E-9}.
1161 @end vtable
1162
1163 @node IEEE Floating Point
1164 @subsubsection IEEE Floating Point
1165 @cindex IEEE floating point representation
1166 @cindex floating point, IEEE
1167
1168 Here is an example showing how the floating type measurements come out
1169 for the most common floating point representation, specified by the
1170 @cite{IEEE Standard for Binary Floating Point Arithmetic (ANSI/IEEE Std
1171 754-1985)}. Nearly all computers designed since the 1980s use this
1172 format.
1173
1174 The IEEE single-precision float representation uses a base of 2. There
1175 is a sign bit, a mantissa with 23 bits plus one hidden bit (so the total
1176 precision is 24 base-2 digits), and an 8-bit exponent that can represent
1177 values in the range -125 to 128, inclusive.
1178
1179 So, for an implementation that uses this representation for the
1180 @code{float} data type, appropriate values for the corresponding
1181 parameters are:
1182
1183 @smallexample
1184 FLT_RADIX 2
1185 FLT_MANT_DIG 24
1186 FLT_DIG 6
1187 FLT_MIN_EXP -125
1188 FLT_MIN_10_EXP -37
1189 FLT_MAX_EXP 128
1190 FLT_MAX_10_EXP +38
1191 FLT_MIN 1.17549435E-38F
1192 FLT_MAX 3.40282347E+38F
1193 FLT_EPSILON 1.19209290E-07F
1194 @end smallexample
1195
1196 Here are the values for the @code{double} data type:
1197
1198 @smallexample
1199 DBL_MANT_DIG 53
1200 DBL_DIG 15
1201 DBL_MIN_EXP -1021
1202 DBL_MIN_10_EXP -307
1203 DBL_MAX_EXP 1024
1204 DBL_MAX_10_EXP 308
1205 DBL_MAX 1.7976931348623157E+308
1206 DBL_MIN 2.2250738585072014E-308
1207 DBL_EPSILON 2.2204460492503131E-016
1208 @end smallexample
1209
1210 @node Structure Measurement
1211 @subsection Structure Field Offset Measurement
1212
1213 You can use @code{offsetof} to measure the location within a structure
1214 type of a particular structure member.
1215
1216 @deftypefn {Macro} size_t offsetof (@var{type}, @var{member})
1217 @standards{ISO, stddef.h}
1218 @safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1219 @c This is no longer provided by glibc, but rather by the compiler.
1220 This expands to an integer constant expression that is the offset of the
1221 structure member named @var{member} in the structure type @var{type}.
1222 For example, @code{offsetof (struct s, elem)} is the offset, in bytes,
1223 of the member @code{elem} in a @code{struct s}.
1224
1225 This macro won't work if @var{member} is a bit field; you get an error
1226 from the C compiler in that case.
1227 @end deftypefn