]> git.ipfire.org Git - thirdparty/systemd.git/blob - docs/CODING_STYLE.md
CODING_STYLE: split out section about destructors
[thirdparty/systemd.git] / docs / CODING_STYLE.md
1 ---
2 title: Coding Style
3 ---
4
5 # Coding Style
6
7 ## Formatting
8
9 - 8ch indent, no tabs, except for files in `man/` which are 2ch indent, and
10 still no tabs, and shell scripts, which are 4ch indent, and no tabs either.
11
12 - We prefer `/* comments */` over `// comments` in code you commit,
13 please. This way `// comments` are left for developers to use for local,
14 temporary commenting of code for debug purposes (i.e. uncommittable stuff),
15 making such comments easily discernible from explanatory, documenting code
16 comments (i.e. committable stuff).
17
18 - Don't break code lines too eagerly. We do **not** force line breaks at 80ch,
19 all of today's screens should be much larger than that. But then again, don't
20 overdo it, ~109ch should be enough really. The `.editorconfig`, `.vimrc` and
21 `.dir-locals.el` files contained in the repository will set this limit up for
22 you automatically, if you let them (as well as a few other things). Please
23 note that emacs loads `.dir-locals.el` automatically, but vim needs to be
24 configured to load `.vimrc`, see that file for instructions.
25
26 - Try to write this:
27
28 ```c
29 void foo() {
30 }
31 ```
32
33 instead of this:
34
35 ```c
36 void foo()
37 {
38 }
39 ```
40
41 - Single-line `if` blocks should not be enclosed in `{}`. Write this:
42
43 ```c
44 if (foobar)
45 waldo();
46 ```
47
48 instead of this:
49
50 ```c
51 if (foobar) {
52 waldo();
53 }
54 ```
55
56 - Do not write `foo ()`, write `foo()`.
57
58 - Preferably allocate local variables on the top of the block:
59
60 ```c
61 {
62 int a, b;
63
64 a = 5;
65 b = a;
66 }
67 ```
68
69 ## Other
70
71 - Variables and functions **must** be static, unless they have a
72 prototype, and are supposed to be exported.
73
74 - structs in `PascalCase` (with exceptions, such as public API structs),
75 variables and functions in `snake_case`.
76
77 - To minimize strict aliasing violations, we prefer unions over casting.
78
79 - Do not issue NSS requests (that includes user name and host name
80 lookups) from PID 1 as this might trigger deadlocks when those
81 lookups involve synchronously talking to services that we would need
82 to start up.
83
84 - Do not synchronously talk to any other service from PID 1, due to
85 risk of deadlocks.
86
87 - Be exceptionally careful when formatting and parsing floating point
88 numbers. Their syntax is locale dependent (i.e. `5.000` in en_US is
89 generally understood as 5, while in de_DE as 5000.).
90
91 - Do not mix function invocations with variable definitions in one
92 line. Wrong:
93
94 ```c
95 {
96 int a = foobar();
97 uint64_t x = 7;
98 }
99 ```
100
101 Right:
102
103 ```c
104 {
105 int a;
106 uint64_t x = 7;
107
108 a = foobar();
109 }
110 ```
111
112 - Use `goto` for cleaning up, and only use it for that. i.e. you may
113 only jump to the end of a function, and little else. Never jump
114 backwards!
115
116 - Public API calls (i.e. functions exported by our shared libraries)
117 must be marked `_public_` and need to be prefixed with `sd_`. No
118 other functions should be prefixed like that.
119
120 - For every function you add, think about whether it is a "logging"
121 function or a "non-logging" function. "Logging" functions do logging
122 on their own, "non-logging" function never log on their own and
123 expect their callers to log. All functions in "library" code,
124 i.e. in `src/shared/` and suchlike must be "non-logging". Every time a
125 "logging" function calls a "non-logging" function, it should log
126 about the resulting errors. If a "logging" function calls another
127 "logging" function, then it should not generate log messages, so
128 that log messages are not generated twice for the same errors.
129
130 - If possible, do a combined log & return operation:
131
132 ```c
133 r = operation(...);
134 if (r < 0)
135 return log_(error|warning|notice|...)_errno(r, "Failed to ...: %m");
136 ```
137
138 If the error value is "synthetic", i.e. it was not received from
139 the called function, use `SYNTHETIC_ERRNO` wrapper to tell the logging
140 system to not log the errno value, but still return it:
141
142 ```c
143 n = read(..., s, sizeof s);
144 if (n != sizeof s)
145 return log_error_errno(SYNTHETIC_ERRNO(EIO), "Failed to read ...");
146 ```
147
148 - Avoid static variables, except for caches and very few other
149 cases. Think about thread-safety! While most of our code is never
150 used in threaded environments, at least the library code should make
151 sure it works correctly in them. Instead of doing a lot of locking
152 for that, we tend to prefer using TLS to do per-thread caching (which
153 only works for small, fixed-size cache objects), or we disable
154 caching for any thread that is not the main thread. Use
155 `is_main_thread()` to detect whether the calling thread is the main
156 thread.
157
158 - Do not write functions that clobber call-by-reference variables on
159 failure. Use temporary variables for these cases and change the
160 passed in variables only on success.
161
162
163 - Instead of using `memzero()`/`memset()` to initialize structs allocated
164 on the stack, please try to use c99 structure initializers. It's
165 short, prettier and actually even faster at execution. Hence:
166
167 ```c
168 struct foobar t = {
169 .foo = 7,
170 .bar = "bazz",
171 };
172 ```
173
174 instead of:
175
176 ```c
177 struct foobar t;
178 zero(t);
179 t.foo = 7;
180 t.bar = "bazz";
181 ```
182
183 - The order in which header files are included doesn't matter too
184 much. systemd-internal headers must not rely on an include order, so
185 it is safe to include them in any order possible.
186 However, to not clutter global includes, and to make sure internal
187 definitions will not affect global headers, please always include the
188 headers of external components first (these are all headers enclosed
189 in <>), followed by our own exported headers (usually everything
190 that's prefixed by `sd-`), and then followed by internal headers.
191 Furthermore, in all three groups, order all includes alphabetically
192 so duplicate includes can easily be detected.
193
194 - To implement an endless loop, use `for (;;)` rather than `while (1)`.
195 The latter is a bit ugly anyway, since you probably really
196 meant `while (true)`. To avoid the discussion what the right
197 always-true expression for an infinite while loop is, our
198 recommendation is to simply write it without any such expression by
199 using `for (;;)`.
200
201 - Avoid leaving long-running child processes around, i.e. `fork()`s that
202 are not followed quickly by an `execv()` in the child. Resource
203 management is unclear in this case, and memory CoW will result in
204 unexpected penalties in the parent much, much later on.
205
206 - Don't block execution for arbitrary amounts of time using `usleep()`
207 or a similar call, unless you really know what you do. Just "giving
208 something some time", or so is a lazy excuse. Always wait for the
209 proper event, instead of doing time-based poll loops.
210
211 - To determine the length of a constant string `"foo"`, don't bother with
212 `sizeof("foo")-1`, please use `strlen()` instead (both gcc and clang optimize
213 the call away for fixed strings). The only exception is when declaring an
214 array. In that case use STRLEN, which evaluates to a static constant and
215 doesn't force the compiler to create a VLA.
216
217 - Please avoid using global variables as much as you can. And if you
218 do use them make sure they are static at least, instead of
219 exported. Especially in library-like code it is important to avoid
220 global variables. Why are global variables bad? They usually hinder
221 generic reusability of code (since they break in threaded programs,
222 and usually would require locking there), and as the code using them
223 has side-effects make programs non-transparent. That said, there are
224 many cases where they explicitly make a lot of sense, and are OK to
225 use. For example, the log level and target in `log.c` is stored in a
226 global variable, and that's OK and probably expected by most. Also
227 in many cases we cache data in global variables. If you add more
228 caches like this, please be careful however, and think about
229 threading. Only use static variables if you are sure that
230 thread-safety doesn't matter in your case. Alternatively, consider
231 using TLS, which is pretty easy to use with gcc's `thread_local`
232 concept. It's also OK to store data that is inherently global in
233 global variables, for example data parsed from command lines, see
234 below.
235
236 - When exposing public C APIs, be careful what function parameters you make
237 `const`. For example, a parameter taking a context object should probably not
238 be `const`, even if you are writing an otherwise read-only accessor function
239 for it. The reason is that making it `const` fixates the contract that your
240 call won't alter the object ever, as part of the API. However, that's often
241 quite a promise, given that this even prohibits object-internal caching or
242 lazy initialization of object variables. Moreover, it's usually not too useful
243 for client applications. Hence, please be careful and avoid `const` on object
244 parameters, unless you are very sure `const` is appropriate.
245
246 - Make sure to enforce limits on every user controllable resource. If the user
247 can allocate resources in your code, your code must enforce some form of
248 limits after which it will refuse operation. It's fine if it is hard-coded (at
249 least initially), but it needs to be there. This is particularly important
250 for objects that unprivileged users may allocate, but also matters for
251 everything else any user may allocated.
252
253 - You might wonder what kind of common code belongs in `src/shared/` and what
254 belongs in `src/basic/`. The split is like this: anything that is used to
255 implement the public shared object we provide (sd-bus, sd-login, sd-id128,
256 nss-systemd, nss-mymachines, nss-resolve, nss-myhostname, pam_systemd), must
257 be located in `src/basic` (those objects are not allowed to link to
258 libsystemd-shared.so). Conversely, anything which is shared between multiple
259 components and does not need to be in `src/basic/`, should be in
260 `src/shared/`.
261
262 To summarize:
263
264 `src/basic/`
265 - may be used by all code in the tree
266 - may not use any code outside of `src/basic/`
267
268 `src/libsystemd/`
269 - may be used by all code in the tree, except for code in `src/basic/`
270 - may not use any code outside of `src/basic/`, `src/libsystemd/`
271
272 `src/shared/`
273 - may be used by all code in the tree, except for code in `src/basic/`,
274 `src/libsystemd/`, `src/nss-*`, `src/login/pam_systemd.*`, and files under
275 `src/journal/` that end up in `libjournal-client.a` convenience library.
276 - may not use any code outside of `src/basic/`, `src/libsystemd/`, `src/shared/`
277
278 - Our focus is on the GNU libc (glibc), not any other libcs. If other libcs are
279 incompatible with glibc it's on them. However, if there are equivalent POSIX
280 and Linux/GNU-specific APIs, we generally prefer the POSIX APIs. If there
281 aren't, we are happy to use GNU or Linux APIs, and expect non-GNU
282 implementations of libc to catch up with glibc.
283
284 - Whenever installing a signal handler, make sure to set `SA_RESTART` for it, so
285 that interrupted system calls are automatically restarted, and we minimize
286 hassles with handling `EINTR` (in particular as `EINTR` handling is pretty broken
287 on Linux).
288
289 - When applying C-style unescaping as well as specifier expansion on the same
290 string, always apply the C-style unescaping fist, followed by the specifier
291 expansion. When doing the reverse, make sure to escape `%` in specifier-style
292 first (i.e. `%` → `%%`), and then do C-style escaping where necessary.
293
294 ## Destructors
295
296 - The destructors always deregister the object from the next bigger object, not
297 the other way around.
298
299 - For robustness reasons, destructors should be able to destruct
300 half-initialized objects, too.
301
302 - When you define a destructor or `unref()` call for an object, please accept a
303 `NULL` object and simply treat this as NOP. This is similar to how libc
304 `free()` works, which accepts `NULL` pointers and becomes a NOP for them. By
305 following this scheme a lot of `if` checks can be removed before invoking
306 your destructor, which makes the code substantially more readable and robust.
307
308 - Related to this: when you define a destructor or `unref()` call for an
309 object, please make it return the same type it takes and always return `NULL`
310 from it. This allows writing code like this:
311
312 ```c
313 p = foobar_unref(p);
314 ```
315
316 which will always work regardless if `p` is initialized or not,x and
317 guarantees that `p` is `NULL` afterwards, all in just one line.
318
319 ## Error Handling
320
321 - Error codes are returned as negative `Exxx`. e.g. `return -EINVAL`. There are
322 some exceptions: for constructors, it is OK to return `NULL` on OOM. For
323 lookup functions, `NULL` is fine too for "not found".
324
325 Be strict with this. When you write a function that can fail due to more than
326 one cause, it *really* should have an `int` as the return value for the error
327 code.
328
329 - Do not bother with error checking whether writing to stdout/stderr worked.
330
331 - Do not log errors from "library" code, only do so from "main program"
332 code. (With one exception: it is OK to log with DEBUG level from any code,
333 with the exception of maybe inner loops).
334
335 - In public API calls, you **must** validate all your input arguments for
336 programming error with `assert_return()` and return a sensible return
337 code. In all other calls, it is recommended to check for programming errors
338 with a more brutal `assert()`. We are more forgiving to public users than for
339 ourselves! Note that `assert()` and `assert_return()` really only should be
340 used for detecting programming errors, not for runtime errors. `assert()` and
341 `assert_return()` by usage of `_likely_()` inform the compiler that he should
342 not expect these checks to fail, and they inform fellow programmers about the
343 expected validity and range of parameters.
344
345 - When you invoke certain calls like `unlink()`, or `mkdir_p()` and you know it
346 is safe to ignore the error it might return (because a later call would
347 detect the failure anyway, or because the error is in an error path and you
348 thus couldn't do anything about it anyway), then make this clear by casting
349 the invocation explicitly to `(void)`. Code checks like Coverity understand
350 that, and will not complain about ignored error codes. Hence, please use
351 this:
352
353 ```c
354 (void) unlink("/foo/bar/baz");
355 ```
356
357 instead of just this:
358
359 ```c
360 unlink("/foo/bar/baz");
361 ```
362
363 Don't cast function calls to `(void)` that return no error
364 conditions. Specifically, the various `xyz_unref()` calls that return a
365 `NULL` object shouldn't be cast to `(void)`, since not using the return value
366 does not hide any errors.
367
368 - When returning a return code from `main()`, please preferably use
369 `EXIT_FAILURE` and `EXIT_SUCCESS` as defined by libc.
370
371 ## Memory Allocation
372
373 - Always check OOM. There is no excuse. In program code, you can use
374 `log_oom()` for then printing a short message, but not in "library" code.
375
376 - Avoid fixed-size string buffers, unless you really know the maximum size and
377 that maximum size is small. They are a source of errors, since they possibly
378 result in truncated strings. It is often nicer to use dynamic memory,
379 `alloca()` or VLAs. If you do allocate fixed-size strings on the stack, then
380 it is probably only OK if you either use a maximum size such as `LINE_MAX`,
381 or count in detail the maximum size a string can have. (`DECIMAL_STR_MAX` and
382 `DECIMAL_STR_WIDTH` macros are your friends for this!)
383
384 Or in other words, if you use `char buf[256]` then you are likely doing
385 something wrong!
386
387 - Make use of `_cleanup_free_` and friends. It makes your code much nicer to
388 read (and shorter)!
389
390 - Use `alloca()`, but never forget that it is not OK to invoke `alloca()`
391 within a loop or within function call parameters. `alloca()` memory is
392 released at the end of a function, and not at the end of a `{}` block. Thus,
393 if you invoke it in a loop, you keep increasing the stack pointer without
394 ever releasing memory again. (VLAs have better behavior in this case, so
395 consider using them as an alternative.) Regarding not using `alloca()`
396 within function parameters, see the BUGS section of the `alloca(3)` man page.
397
398 - If you want to concatenate two or more strings, consider using `strjoina()`
399 or `strjoin()` rather than `asprintf()`, as the latter is a lot slower. This
400 matters particularly in inner loops (but note that `strjoina()` cannot be
401 used there).
402
403 ## Types
404
405 - Think about the types you use. If a value cannot sensibly be negative, do not
406 use `int`, but use `unsigned`.
407
408 - Use `char` only for actual characters. Use `uint8_t` or `int8_t` when you
409 actually mean a byte-sized signed or unsigned integers. When referring to a
410 generic byte, we generally prefer the unsigned variant `uint8_t`. Do not use
411 types based on `short`. They *never* make sense. Use `int`, `long`, `long
412 long`, all in unsigned and signed fashion, and the fixed-size types
413 `uint8_t`, `uint16_t`, `uint32_t`, `uint64_t`, `int8_t`, `int16_t`, `int32_t`
414 and so on, as well as `size_t`, but nothing else. Do not use kernel types
415 like `u32` and so on, leave that to the kernel.
416
417 - Stay uniform. For example, always use `usec_t` for time values. Do not mix
418 `usec` and `msec`, and `usec` and whatnot.
419
420 - Never use the `off_t` type, and particularly avoid it in public APIs. It's
421 really weirdly defined, as it usually is 64-bit and we don't support it any
422 other way, but it could in theory also be 32-bit. Which one it is depends on
423 a compiler switch chosen by the compiled program, which hence corrupts APIs
424 using it unless they can also follow the program's choice. Moreover, in
425 systemd we should parse values the same way on all architectures and cannot
426 expose `off_t` values over D-Bus. To avoid any confusion regarding conversion
427 and ABIs, always use simply `uint64_t` directly.
428
429 - Unless you allocate an array, `double` is always a better choice than
430 `float`. Processors speak `double` natively anyway, so there is no speed
431 benefit, and on calls like `printf()` `float`s get promoted to `double`s
432 anyway, so there is no point.
433
434 - Use the bool type for booleans, not integers. One exception: in public
435 headers (i.e those in `src/systemd/sd-*.h`) use integers after all, as `bool`
436 is C99 and in our public APIs we try to stick to C89 (with a few extension).
437
438 ## File Descriptors
439
440 - When you allocate a file descriptor, it should be made `O_CLOEXEC` right from
441 the beginning, as none of our files should leak to forked binaries by
442 default. Hence, whenever you open a file, `O_CLOEXEC` must be specified,
443 right from the beginning. This also applies to sockets. Effectively, this
444 means that all invocations to:
445
446 - `open()` must get `O_CLOEXEC` passed,
447 - `socket()` and `socketpair()` must get `SOCK_CLOEXEC` passed,
448 - `recvmsg()` must get `MSG_CMSG_CLOEXEC` set,
449 - `F_DUPFD_CLOEXEC` should be used instead of `F_DUPFD`, and so on,
450 - invocations of `fopen()` should take `e`.
451
452 - It's a good idea to use `O_NONBLOCK` when opening 'foreign' regular files,
453 i.e. file system objects that are supposed to be regular files whose paths
454 where specified by the user and hence might actually refer to other types of
455 file system objects. This is a good idea so that we don't end up blocking on
456 'strange' file nodes, for example if the user pointed us to a FIFO or device
457 node which may block when opening. Moreover even for actual regular files
458 `O_NONBLOCK` has a benefit: it bypasses any mandatory lock that might be in
459 effect on the regular file. If in doubt consider turning off `O_NONBLOCK`
460 again after opening.
461
462 ## Command Line
463
464 - If you parse a command line, and want to store the parsed parameters in
465 global variables, please consider prefixing their names with `arg_`. We have
466 been following this naming rule in most of our tools, and we should continue
467 to do so, as it makes it easy to identify command line parameter variables,
468 and makes it clear why it is OK that they are global variables.
469
470 - Command line option parsing:
471 - Do not print full `help()` on error, be specific about the error.
472 - Do not print messages to stdout on error.
473 - Do not POSIX_ME_HARDER unless necessary, i.e. avoid `+` in option string.
474
475 ## Referencing Concepts
476
477 - When referring to a configuration file option in the documentation and such,
478 please always suffix it with `=`, to indicate that it is a configuration file
479 setting.
480
481 - When referring to a command line option in the documentation and such, please
482 always prefix with `--` or `-` (as appropriate), to indicate that it is a
483 command line option.
484
485 - When referring to a file system path that is a directory, please always
486 suffix it with `/`, to indicate that it is a directory, not a regular file
487 (or other file system object).
488
489 ## Functions to Avoid
490
491 - Use `memzero()` or even better `zero()` instead of `memset(..., 0, ...)`
492
493 - Please use `streq()` and `strneq()` instead of `strcmp()`, `strncmp()` where
494 applicable (i.e. wherever you just care about equality/inequality, not about
495 the sorting order).
496
497 - Never use `strtol()`, `atoi()` and similar calls. Use `safe_atoli()`,
498 `safe_atou32()` and suchlike instead. They are much nicer to use in most
499 cases and correctly check for parsing errors.
500
501 - `htonl()`/`ntohl()` and `htons()`/`ntohs()` are weird. Please use `htobe32()`
502 and `htobe16()` instead, it's much more descriptive, and actually says what
503 really is happening, after all `htonl()` and `htons()` don't operate on
504 `long`s and `short`s as their name would suggest, but on `uint32_t` and
505 `uint16_t`. Also, "network byte order" is just a weird name for "big endian",
506 hence we might want to call it "big endian" right-away.
507
508 - Please never use `dup()`. Use `fcntl(fd, F_DUPFD_CLOEXEC, 3)` instead. For
509 two reason: first, you want `O_CLOEXEC` set on the new `fd` (see
510 above). Second, `dup()` will happily duplicate your `fd` as 0, 1, 2,
511 i.e. stdin, stdout, stderr, should those `fd`s be closed. Given the special
512 semantics of those `fd`s, it's probably a good idea to avoid
513 them. `F_DUPFD_CLOEXEC` with `3` as parameter avoids them.
514
515 - Don't use `fgets()`, it's too hard to properly handle errors such as overly
516 long lines. Use `read_line()` instead, which is our own function that handles
517 this much nicer.
518
519 - Don't invoke `exit()`, ever. It is not replacement for proper error
520 handling. Please escalate errors up your call chain, and use normal `return`
521 to exit from the main function of a process. If you `fork()`ed off a child
522 process, please use `_exit()` instead of `exit()`, so that the exit handlers
523 are not run.
524
525 - We never use the POSIX version of `basename()` (which glibc defines it in
526 `libgen.h`), only the GNU version (which glibc defines in `string.h`). The
527 only reason to include `libgen.h` is because `dirname()` is needed. Every
528 time you need that please immediately undefine `basename()`, and add a
529 comment about it, so that no code ever ends up using the POSIX version!
530
531 # Committing to git
532
533 - Commit message subject lines should be prefixed with an appropriate component
534 name of some kind. For example "journal: ", "nspawn: " and so on.
535
536 - Do not use "Signed-Off-By:" in your commit messages. That's a kernel thing we
537 don't do in the systemd project.