1 Bugfixes:
2
3 * Many manager configuration settings that are only applicable to user
4 manager or system manager can be always set. It would be better to reject
5 them when parsing config.
6
7 * Jun 01 09:43:02 krowka systemd[1]: Unit user@1000.service has alias user@.service.
8 Jun 01 09:43:02 krowka systemd[1]: Unit user@6.service has alias user@.service.
9 Jun 01 09:43:02 krowka systemd[1]: Unit user-runtime-dir@6.service has alias user-runtime-dir@.service.
10
11 External:
12
13 * Fedora: add an rpmlint check that verifies that all unit files in the RPM are listed in %systemd_post macros.
14
15 * dbus:
16 - natively watch for dbus-*.service symlinks (PENDING)
17 - teach dbus to activate all services it finds in /etc/systemd/services/org-*.service
18
19 * kernel: add device_type = "fb", "fbcon" to class "graphics"
20
21 * /usr/bin/service should actually show the new command line
22
23 * fedora: suggest auto-restart on failure, but not on success and not on coredump. also, ask people to think about changing the start limit logic. Also point people to RestartPreventExitStatus=, SuccessExitStatus=
24
25 * neither pkexec nor sudo initialize environ[] from the PAM environment?
26
27 * fedora: update policy to declare access mode and ownership of unit files to root:root 0644, and add an rpmlint check for it
28
29 * register catalog database signature as file magic
30
31 * zsh shell completion:
32 - <command> <verb> -<TAB> should complete options, but currently does not
33 - systemctl add-wants,add-requires
34 - systemctl reboot --boot-loader-entry=
35
36 * systemctl status should know about 'systemd-analyze calendar ... --iterations='
37 * If timer has just OnInactiveSec=..., it should fire after a specified time
38 after being started.
39
40 * write blog stories about:
41 - hwdb: what belongs into it, lsusb
42 - enabling dbus services
43 - how to make changes to sysctl and sysfs attributes
44 - remote access
45 - how to pass throw-away units to systemd, or dynamically change properties of existing units
46 - testing with Harald's awesome test kit
47 - auto-restart
48 - how to develop against journal browsing APIs
49 - the journal HTTP iface
50 - non-cgroup resource management
51 - dynamic resource management with cgroups
52 - refreshed, longer mission statement
53 - calendar time events
54 - init=/bin/sh vs. "emergency" mode, vs. "rescue" mode, vs. "multi-user" mode, vs. "graphical" mode, and the debug shell
55 - how to create your own target
56 - instantiated apache, dovecot and so on
57 - hooking a script into various stages of shutdown/early boot
58
59 Regularly:
60
61 * look for close() vs. close_nointr() vs. close_nointr_nofail()
62
63 * check for strerror(r) instead of strerror(-r)
64
65 * pahole
66
67 * set_put(), hashmap_put() return values check. i.e. == 0 does not free()!
68
69 * use secure_getenv() instead of getenv() where appropriate
70
71 * link up selected blog stories from man pages and unit files Documentation= fields
72
73 Janitorial Clean-ups:
74
75 * rework mount.c and swap.c to follow proper state enumeration/deserialization
76 semantics, like we do for device.c now
77
78 * get rid of prefix_roota() and similar, only use chase_symlinks() and related
79 calls instead.
80
81 * get rid of basename() and replace by path_extract_filename()
82
83 * Replace our fstype_is_network() with a call to libmount's mnt_fstype_is_netfs()?
84 Having two lists is not nice, but maybe it's now worth making a dependency on
85 libmount for something so trivial.
86
87 Deprecations and removals:
88
89 * Remove any support for booting without /usr pre-mounted in the initrd entirely.
90 Update INITRD_INTERFACE.md accordingly.
91
92 * 2019-10 – Remove POINTINGSTICK_CONST_ACCEL references from the hwdb, see #9573
93
94 * remove cgroupsv1 support EOY 2023. As per
95 https://lists.freedesktop.org/archives/systemd-devel/2022-July/048120.html
96 and then rework cgroupsv2 support around fds, i.e. keep one fd per active
97 unit around, and always operate on that, instead of cgroup fs paths.
98
99 * drop support for kernels that lack ambient capabilities support (i.e. make
100 4.3 new baseline). Then drop support for "!!" modifier for ExecStart= which
101 is only supported for such old kernels.
102
103 * drop support for kernels lacking memfd_create() (i.e. make 3.17 new
104 baseline), then drop all pipe() based fallbacks.
105
106 * drop support for getrandom()-less kernels. (GRND_INSECURE means once kernel
107 5.6 becomes our baseline). See
108 https://github.com/systemd/systemd/pull/24101#issuecomment-1193966468 for
109 details. Maybe before that: add taint-flags/warn about kernels that lack
110 getrandom()/environments where it is blocked.
111
112 * drop support for LOOP_CONFIGURE-less loopback block devices, once kernel
113 baseline is 5.8.
114
115 * drop fd_is_mount_point() fallback mess once we can rely on
116 STATX_ATTR_MOUNT_ROOT to exist i.e. kernel baseline 5.8
117
118 * rework our PID tracking in services and so on, to be strictly based on pidfd,
119 once kernel baseline is 5.13.
120
121 * H2 2023: remove support for unmerged-usr
122
123 * Remove /dev/mem ACPI FPDT parsing when /sys/firmware/acpi/fpdt is ubiquitous.
124 That requires distros to enable CONFIG_ACPI_FPDT, and have kernels v5.12 for
125 x86 and v6.2 for arm.
126
127 * Once baseline is 4.13, remove support for INTERFACE_OLD= checks in "udevadm
128 trigger"'s waiting logic, since we can then rely on uuid-tagged uevents
129
130 Features:
131
132 * landlock: lock down RuntimeDirectory= via landlock, so that services lose
133 ability to write anywhere else below /run/. Similar for
134 StateDirectory=. Benefit would be clear delegation via unit files: services
135 get the directories they get, and nothing else even if they wanted to.
136
137 * landlock: for unprivileged systemd (i.e. systemd --user), use landlock to
138 implement ProtectSystem=, ProtectHome= and so on. Landlock does not require
139 privs, and we can implement pretty similar behaviour. Also, maybe add a mode
140 where ProtectSystem= combined with an explicit PrivateMounts=no could request
141 similar behaviour for system services, too.
142
143 * Add systemd-mount@.service which is instantiated for a block device and
144 invokes systemd-mount and exits. This is then useful to use in
145 ENV{SYSTEMD_WANTS} in udev rules, and a bit prettier than using RUN+=
146
147 * sd-journal puts a limit on parallel journal files to view at once. journald
148 should probably honour that same limit (JOURNAL_FILES_MAX) when vacuuming to
149 ensure we never generate more files than we can actually view.
150
151 * in order to make binding to PCR 4 realistic:
152 - generate one keypair "U" and store it in a tpm2 nvindex.
153 - Generate another keypair "P" and store it in a second tpm2 nvindex.
154 - allocate a persistent counter object "C" in the tpm2
155 - Enroll all user objects (i.e. luks volumes, creds, …) to a tpm2 policy
156 signed by U.
157 - Lock both U and P down with a tpm2 policy signed by P (yes, P can only be
158 used if a signature by P itself can be provided)
159 - For regular reboots generate a signature for a restrictive PCR4 + counter C
160 based policy with key P. Place signature in EFI var, so it can be found on
161 next boot
162 - For reboots where a firmware update is expected generate a signature with a
163 more open policy against just counter C. Place signature in same EFI var.
164 - Increase C whenever switching between these two signature types.
165 - During early boot, use the signature from the EFI var to unlock U and P.
166 Use it to generate a signature for unlocking user objects given the current
167 PCR 4 value, store that away into /run somewhere, for use during the whole
168 later boot.
169 - When booting up automatically update the mentioned efi var so that it
170 contains the restrictive signature. But also generate a signature ahead of
171 time that could be used in case during the current boot we later detect we might
172 need to reboot for a firmware update. Store that in /run somewhere, so that
173 it can be placed in the EFI var, if needed.
174
175 * repart/gpt-auto/DDIs: maybe introduce a concept of "extension" partitions,
176 that have a new type uuid and can "extend" earlier partitions, to work around
177 the fact that systemd-repart can only grow the last partition defined. During
178 activation we'd simply set up a dm-linear mapping to merge them again. A
179 partition that is to be extended would just set a bit in the partition flags
180 field to indicate that there's another extension partition to look for. The
181 identifying UUID of the extension partition would be hashed in counter mode
182 from the uuid of the original partition it extends. Inspiration for this is
183 the "dynamic partitions" concept of new Android. This would be a minimalistic
184 concept of a volume manager, with the extents it manages being exposed as GPT
185 partitions. If a partition is extended multiple times they should probably
186 grow exponentially in size to ensure O(log(n)) time for finding them on
187 access.
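
The UUID derivation and the exponential growth described above could be pictured roughly as follows. This is only a sketch: the choice of SHA256 over the base UUID plus a little-endian counter, the "each extension doubles the total size" reading of "grow exponentially", and the names `extension_uuid` and `extensions_needed` are assumptions made for illustration, not something this item specifies.

```python
import hashlib
import uuid


def extension_uuid(base: uuid.UUID, counter: int) -> uuid.UUID:
    """Derive the UUID of the counter-th extension of partition `base`.

    Assumption: SHA256 over the 16 UUID bytes followed by a 64-bit
    little-endian counter, truncated to 16 bytes.
    """
    data = base.bytes + counter.to_bytes(8, "little")
    return uuid.UUID(bytes=hashlib.sha256(data).digest()[:16])


def extensions_needed(base_size: int, target_size: int) -> int:
    """Count extensions needed to reach target_size.

    Assumption: every extension is as large as everything before it,
    so the total size doubles with each extension.
    """
    if base_size <= 0:
        raise ValueError("base_size must be positive")
    count, total = 0, base_size
    while total < target_size:
        total *= 2
        count += 1
    return count
```

With doubling, the number of extension partitions to look up grows only logarithmically with the total size, which is the property the item asks for.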
188
189 * split out execute.c into new "systemd-executor" binary. Then make PID 1 fork
190 that off via vfork(), and then let that executor do the hard work. Ultimately
191 the executor then gets replaced by the real binary sooner or later. Reason:
192 currently the intermediary "stub" process is a CoW trap that doubles memory
193 usage of PID 1 on each service start. Also, strictly speaking we are not
194 allowed to do NSS from the stub process yet we do anyway. Next steps would
195 then be maybe use CLONE_INTO_CGROUP for the executor, given that we don't
196 need glibc anymore in the stub process then. Then, switch nspawn to just be a
197 frontend for this too, so that we have two ways into the executor: via unit
198 files/dbus/varlink through PID1 and via cmdline/OCI through nspawn.
199
200 * sd-stub: detect if we are running with uefi console output on serial, and if so
201 automatically add console= to kernel cmdline matching the same port.
202
203 * add a utility that can be used with the kernel's
204 CONFIG_STATIC_USERMODEHELPER_PATH and then handles them within pid1 so that
205 security, resource management and cgroup settings can be enforced properly
206 for all umh processes.
207
208 * systemd-shutdown: keep sending sd_notify() status updates immediately before
209 going down, in particular include the "reboot param" string.
210
211 * homed: when resizing an fs don't sync identity beforehand, as there might simply
212 not be enough disk space for that. try to be defensive and sync only after
213 resize.
214
215 * homed: if for some reason the partition ended up being much smaller than
216 whole disk, recover from that, and grow it again.
217
218 * in journald, write out a recognizable log record whenever the system clock is
219 changed ("stepped"), and in timesyncd whenever we acquire an NTP fix
220 ("slewing"). Then, in journalctl for each boot time we come across, find
221 these records, and use the structured info they include to display
222 "corrected" wallclock time, as calculated from the monotonic timestamp in the
223 log record, adjusted by the delta declared in the structured log record.
224
225 * in journald: whenever we start a new journal file because the boot ID
226 changed, let's generate a recognizable log record containing info about old
227 and new ID. Then, when displaying log stream in journalctl look for these
228 records, to be able to order them.
229
230 * timesyncd: when saving/restoring clock try to take boot time into account.
231 Specifically, along with the saved clock, store the current boot ID. When
232 starting, check if the boot id matches. If so, don't do anything (we are on
233 the same boot and clock just kept running anyway). If not, then read
234 CLOCK_BOOTTIME (which started at boot), and add it to the saved clock
235 timestamp, to compensate for the time we spent booting. If EFI timestamps are
236 available, also include that in the calculation. With this we'll then only
237 miss the time spent during shutdown after timesync stopped and before the
238 system actually reset.
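
The restore logic described above can be sketched as a small pure function. The function name, the parameter names and the microsecond units are assumptions chosen for illustration; reading the boot ID, CLOCK_BOOTTIME and any EFI timestamps is left to the caller.

```python
from typing import Optional


def clock_to_restore(saved_clock_usec: int,
                     saved_boot_id: str,
                     current_boot_id: str,
                     boottime_usec: int,
                     efi_usec: int = 0) -> Optional[int]:
    """Return the wallclock value to set, or None to leave the clock alone.

    Same boot ID: the clock kept running, so nothing needs to be done.
    Different boot ID: add the time spent booting (CLOCK_BOOTTIME, plus
    firmware/loader time from EFI timestamps if available) to the saved
    clock.
    """
    if saved_boot_id == current_boot_id:
        return None
    return saved_clock_usec + boottime_usec + efi_usec
```

As the item notes, what remains unaccounted for is the time between timesyncd stopping and the system actually resetting.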
239
240 * systemd-stub: maybe store a "boot counter" in the ESP, and pass it down to
241 userspace to allow ordering boots (for example in journalctl). The counter
242 would be monotonically increased on every boot.
243
244 * systemd-sysext: for sysext DDIs picked up via EFI stub, set much stricter
245 image policy by default
246
247 * systemd-dissect: maybe add "--attach" and "--detach" verbs which
248 synchronously attach a DDI to a loopback device but not actually mount it.
249
250 * pam_systemd_home: add module parameter to control whether to accept
251 only password or only pkcs11/fido2 auth, and then use this to hook nicely
252 into two of the three PAM stacks gdm provides.
253 See discussion at https://github.com/authselect/authselect/pull/311
254
255 * sd-boot: make boot loader spec type #1 accept http urls in "linux"
256 lines. Then, do the uefi http dance to download kernels and boot them. This
257 is then useful for network boot, by embedding a cpio with type #1 snippets
258 in sd-boot, which reference remote kernels.
259
260 * fix systemd-gpt-auto-generator in case a UKI is spawned from XBOOTLDR without
261 sd-boot. In that case LoaderDevicePartUUID will point to the XBOOTLDR, and we
262 should then derive the root disk from that, and then the ESP/XBOOTLDR from
263 that. Right now we will only mount ESP if it matches LoaderDevicePartUUID
264 which isn't quite the same.
265
266 * maybe prohibit setuid() to the nobody user, to lock things down, via seccomp.
267 the nobody user is not a user any code should run under, ever, as that user would
268 possibly get a lot of access to resources it really shouldn't be getting
269 access to due to the userns + nfs semantics of the user. Alternatively: use
270 the seccomp log action, and allow it.
271
272 * sd-boot: add a new PE section .bls or so that carries a cpio with additional
273 boot loader entries (both type1 and type2). Then when initializing, find this
274 section, iterate through it and populate menu with it. cpio is simple enough
275 to make a parser for this reasonably robust. use same path structures as in
276 the ESP. Similarly, add one for signature key drop-ins.
277
278 * sd-boot: also allow passing in the cpio as in the previous item via SMBIOS
279
280 * add a new EFI tool "sd-fetch" or so. It looks in a PE section ".url" for an
281 URL, then downloads the file from it using UEFI HTTP APIs, and executes it.
282 Usecase: provide a minimal ESP with sd-boot and a couple of these sd-fetch
283 binaries in place of UKIs, and download them on-the-fly.
284
285 * bootctl: warn if ESP is mounted world-readable (and in particular the seed).
286
287 * maybe: systemd-loop-generator that sets up loopback devices if requested via kernel
288 cmdline. usecase: include encrypted/verity root fs in UKI.
289
290 * systemd-gpt-auto-generator: add kernel cmdline option to override block
291 device to dissect. also support dissecting a regular file. usecase: include
292 encrypted/verity root fs in UKI.
293
294 * sd-stub: add ".bootcfg" section for kernel bootconfig data (as per
295 https://docs.kernel.org/admin-guide/bootconfig.html)
296
297 * tpm2: add (optional) support for generating a local signing key from PCR 15
298 state. use private key part to sign PCR 7+14 policies. stash signatures for
299 expected PCR7+14 policies in EFI var. use public key part in disk encryption.
300 generate new sigs whenever db/dbx/mok/mokx gets updated. that way we can
301 securely bind against SecureBoot/shim state, without having to reenroll
302 everything on each update (but we still have to generate one sig on each
303 update, but that should be robust/idempotent). needs rollback protection, as
304 usual.
305
306 * Lennart: big blog story about DDIs
307
308 * Lennart: big blog story about building initrds
309
310 * Lennart: big blog story about "why systemd-boot"
311
312 * bpf: see if we can use BPF to solve the syslog message cgroup source problem:
313 one idea would be to patch source sockaddr of all AF_UNIX/SOCK_DGRAM to
314 implicitly contain the source cgroup id. Another idea would be to patch
315 sendto()/connect()/sendmsg() sockaddr on-the-fly to use a different target
316 sockaddr.
317
318 * bpf: see if we can address opportunistic inode sharing of immutable fs images
319 with BPF. i.e. if bpf gives us power to hook into openat() and return a
320 different inode than the one requested, which however has the same contents,
321 then we can use that to implement opportunistic inode sharing among DDIs:
322 make all DDIs ship xattr on all reg files with a SHA256 hash. Then, also
323 dictate that DDIs should come with a top-level subdir where all reg files are
324 linked into by their SHA256 sum. Then, whenever an inode is opened with the
325 xattr set, check bpf table to find dirs with hashes for other prior DDIs and
326 try to use inode from there.
327
328 * extend the verity signature partition to permit multiple signatures for the
329 same root hash, so that people can sign a single image with multiple keys.
330
331 * consider adding a new partition type, just for /opt/ for usage in system
332 extensions
333
334 * gpt-auto-discovery: also use the pkcs7 signature stuff, and pass signature to
335 kernel. So far we only did this for the various --image= switches, but not
336 for the root fs or /usr/.
337
338 * dissection policy should enforce that unlocking can only take place by
339 certain means, i.e. only via pw, only via tpm2, or only via fido, or a
340 combination thereof.
341
342 * make the systemd-repart "seed" value provisionable via credentials, so that
343 confidential computing environments can set it and deterministically
344 enforce the uuids for partitions created, so that they can calculate PCR 15
345 ahead of time.
346
347 * systemd-repart: also derive the volume key from the seed value, for the
348 aforementioned purpose.
349
350 * in the initrd: derive the default machine ID to pass to the host PID 1 via
351 $machine_id from the same seed credential.
352
353 * Add systemd-sysupdate-initrd.service or so that runs systemd-sysupdate in the
354 initrd to bootstrap the initrd to populate the initial partitions. Some things
355 to figure out:
356 - Should it run on firstboot or on every boot?
357 - If run on every boot, should it use the sysupdate config from the host on
358 subsequent boots?
359
360 * hook up journald with TPMs? measure new journal records to the TPM in regular
361 intervals, validate the journal against current TPM state with that. (taking
362 inspiration from IMA log)
363
364 * provide an API to apps to encrypt/decrypt credentials. usecase: allow
365 bluez bluetooth daemon to pass pairings to initrd that way, without shelling
366 out to our tools.
367
368 * revisit default PCR bindings in cryptenroll and systemd-creds. Currently they
369 use PCR 7 which should contain secureboot state db/dbx. Which sounded like a
370 safe bet, given that it should change only on policy changes, and not
371 software updates. But that's wrong. Recent fwupd (rightfully) contains code
372 for updating the dbx denylist. This means even without any active policy
373 change PCR 7 might change. Hence, better idea might be in systemd-creds to
374 default to PCR 15 at least if sd-stub is used (i.e. bind to system identity),
375 and in cryptsetup simply the empty list? Also, PCR 14 almost certainly should
376 be included as much as PCR 7 (as it contains shim's policy, which is
377 certainly as relevant as PCR 7 on many systems)
378
379 * To mimic the new tpm2-measure-pcr= crypttab option add the same to veritytab
380 (measuring the root hash) and integritytab (measuring the HMAC key if one is
381 used)
382
383 * We should start measuring all services, containers, and system extensions we
384 activate. probably into PCR 13. i.e. add --tpm2-measure-pcr= or so to
385 systemd-nspawn, and MeasurePCR= to unit files. Should contain a measurement
386 of the activated configuration and the image that is being activated (in case
387 verity is used, hash of the root hash).
388
389 * whenever we measure something into a TPM PCR from userspace, write a record in
390 TCG's "Canonical Event Log" format to some file, so that we can reason about
391 how PCR values we manage came to
392 be. https://trustedcomputinggroup.org/resource/canonical-event-log-format/
393
394 * bootspec: permit graceful "update" from type #2 to type #1. If both a type #1
395 and a type #2 entry exist under otherwise the exact same name, then use the
396 type #1 entry, and ignore the type #2 entry. This way, people can "upgrade"
397 from the UKI with all parameters baked in to a Type #1 .conf file with manual
398 parametrization, if needed. This matches our usual rule that admin config
399 should win over vendor defaults.
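
The proposed precedence rule amounts to a simple "lowest type number wins per name" selection. A minimal sketch, assuming entries are given as hypothetical (name, type) pairs rather than whatever structure bootspec really uses:

```python
from typing import Dict, Iterable, Tuple


def effective_entries(entries: Iterable[Tuple[str, int]]) -> Dict[str, int]:
    """Map each entry name to the type that should be used.

    `entries` holds (name, entry_type) pairs with entry_type 1 or 2.
    If both types exist under the same name, type #1 wins and the
    type #2 entry is ignored.
    """
    chosen: Dict[str, int] = {}
    for name, entry_type in entries:
        if name not in chosen or entry_type < chosen[name]:
            chosen[name] = entry_type
    return chosen
```

For example, given a type #2 and a type #1 entry both named "fedora-38", only the type #1 entry would be shown.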
400
401 * sd-stub: optionally allow users to configure manual kernel command line even
402 in SecureBoot by authenticating it via shim's APIs, integrating with MOK and
403 similar: instead of authenticating just PE code shim should be capable of
404 authenticating any kind of data for us, including files containing kernel
405 command lines.
406
407 * write a "search path" spec, that documents the prefixes to search in
408 (i.e. the usual /etc/, /run/, /usr/lib/ dance, potentially /usr/etc/), how to
409 sort found entries, how masking works and overriding.
410
411 * automatic boot assessment: add one more default success check that just waits
412 for a bit after boot, and blesses the boot if the system stayed up that long.
413
414 * implement concept of "versioned" resources inside a dir, and write a spec for
415 it. Make all tools in systemd, in particular
416 RootImage=/RootDirectory=/--image=/--directory= implement this. Idea:
417 directories ending in ".v/" indicate a directory with versioned resources in
418 them. Versioned resources inside a .v dir are always named in the pattern
419 <prefix>_<version>[+<tries-left>[-<tries-done>]].<suffix>
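
A parser for the naming pattern above might look like the following sketch. It assumes the caller already knows the expected prefix and suffix (so that dots inside the version are unambiguous) and that a version never contains "+" or "/"; the names `Versioned` and `parse_versioned` are made up for this example, and version comparison is deliberately left out.

```python
import re
from typing import NamedTuple, Optional


class Versioned(NamedTuple):
    version: str
    tries_left: Optional[int]
    tries_done: Optional[int]


def parse_versioned(name: str, prefix: str, suffix: str) -> Optional[Versioned]:
    """Parse <prefix>_<version>[+<tries-left>[-<tries-done>]].<suffix>.

    Returns None if `name` does not follow the pattern.
    """
    pattern = (
        re.escape(prefix)
        + r"_(?P<version>[^+/]+?)"
        + r"(?:\+(?P<left>[0-9]+)(?:-(?P<done>[0-9]+))?)?"
        + re.escape("." + suffix)
    )
    m = re.fullmatch(pattern, name)
    if m is None:
        return None
    left = int(m.group("left")) if m.group("left") is not None else None
    done = int(m.group("done")) if m.group("done") is not None else None
    return Versioned(m.group("version"), left, done)
```

A tool would run this over every file in a ".v/" directory, drop the non-matching names, and then pick the newest version among the rest.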
420
421 * add support for using this .v/ logic on the root fs itself: in the initrd,
422 after mounting the rootfs, look for root-<arch>.v/ in the root fs, and then
423 apply the logic, moving the switch root logic there.
424
425 * systemd-repart: add support for generating ISO9660 images
426
427 * systemd-repart: in addition to the existing "factory reset" mode (which
428 simply empties existing partitions marked for that). add a mode where
429 partitions marked for it are entirely removed. Usecase: remove secondary OS
430 copy, and redundant partitions entirely, and recreate them anew.
431
432 * systemd-boot: maybe add support for collapsing menu entries of the same OS
433 into one item that can be opened (like in a "tree view" UI element) or
434 collapsed. If only a single OS is installed, disable this mode, but if
435 multiple OSes are installed might make sense to default to it, so that user
436 is not immediately bombarded with a multitude of Linux kernel versions but
437 only one for each OS.
438
439 * systemd-repart: if the GPT *disk* UUID (i.e. the one global for the entire
440 disk) is set to all FFFFF then use this as trigger for factory reset, in
441 addition to the existing mechanisms via EFI variables and kernel command
442 line. Benefit: works also on non-EFI systems, and can be requested on one
443 boot, for the next.
444
445 * figure out a sane way when building UKIs how to extract SBAT data from inner
446 kernel, extend it with component info, and add to outer kernel.
447
448 * systemd-sysupdate: make transport pluggable, so people can plug casync or
449 similar behind it, instead of http.
450
451 * systemd-tmpfiles: add concept for conditionalizing lines on factory reset
452 boot, or on first boot.
453
454 * in UKIs: add way to define allowlist of additional words that can be added to
455 the kernel cmdline even in SecureBoot mode
456
457 * we probably need .pcrpkeyrd or so as additional PE section in UKIs,
458 which contains a separate public key for PCR values that only apply in the
459 initrd, i.e. in the boot phase "enter-initrd". Then, consumers in userspace
460 can easily bind resources to just the initrd. Similarly, maybe one more for
461 "enter-initrd:leave-initrd" for resources that shall be accessible only
462 before unprivileged user code is allowed. (we only need this for .pcrpkey,
463 not for .pcrsig, since the latter is a list of signatures anyway). With that,
464 when you enroll a LUKS volume or similar, pick either the .pcrpkey (for
465 coverage through all phases of the boot, but excluding shutdown), the
466 .pcrpkeyrd (for coverage in the initrd only) and .pcrpkeybt (for coverage
467 until users are allowed to log in).
468
469 * Once the root fs LUKS volume key is measured into PCR 15, default to binding
470 credentials to PCR 15 in "systemd-creds"
471
472 * add support for asymmetric LUKS2 TPM based encryption. i.e. allow preparing
473 an encrypted image on some host given a public key belonging to a specific
474 other host, so that only hosts possessing the private key in the TPM2 chip
475 can decrypt the volume key and activate the volume. Usecase: systemd-syscfg
476 for a central orchestrator to generate syscfg images securely that can only
477 be activated on one specific host (which can be used for installing a bunch
478 of creds in /etc/credstore/ for example). Extending on this: allow binding
479 LUKS2 TPM based encryption also to the TPM2 internal clock. Net result:
480 prepare a syscfg image that can only be activated on a specific host that
481 runs a specific software in a specific time window. syscfg would be
482 automatically invalidated outside of it.
483
484 * maybe add a "systemd-report" tool, that generates a TPM2-backed "report" of
485 current system state, i.e. a combination of PCR information, local system
486 time and TPM clock, running services, recent high-priority log
487 messages/coredumps, system load/PSI, signed by the local TPM chip, to form an
488 enhanced remote attestation quote. Usecase: a simple orchestrator could use
489 this: have the report tool upload these reports every 3min somewhere. Then
490 have the orchestrator collect these reports centrally over a 3min time
491 window, and use them to determine which node should now start/stop what,
492 and generate a small syscfg for each node, that uses Uphold= to pin services
493 on each node. The syscfg would be encrypted using the asymmetric encryption
494 proposed above, so that it can only be activated on the specific host, if the
495 software is in a good state, and within a specific time frame. Then run a
496 loop on each node that sends report to orchestrator and then sysupdate to
497 update syscfg. Orchestrator would be stateless, i.e. operate on desired
498 config and collected reports in the last 3min time window only, and thus can
499 be trivially scaled up since all instances of the orchestrator should come to
500 the same conclusions given the same inputs of reports/desired workload info.
501 Could also be used to deliver Wireguard secrets to clients, thus
502 permitting zero-trust networking: secrets are rolled over via syscfg updates,
503 and via the time window TPM logic invalidated if node doesn't keep itself
504 updated, or becomes corrupted in some way.
505
506 * in the initrd, once the rootfs encryption key has been measured to PCR 15,
507 derive default machine ID to use from it, and pass it to host PID 1.
508
509 * tree-wide: convert as much as possible over to use sd_event_set_signal_exit(), instead
510 of manually hooking into SIGINT/SIGTERM
511
512 * tree-wide: convert as much as possible over to SD_EVENT_SIGNAL_PROCMASK
513 instead of manual blocking.
514
515 * sd-boot: for each installed OS, grey out older entries (i.e. all but the
516 newest), to indicate they are obsolete
517
518 * automatically propagate LUKS password credential into cryptsetup from host
519 (i.e. SMBIOS type #11, …), so that one can unlock LUKS via VM hypervisor
520 supplied password.
521
522 * add ability to path_is_valid() to classify paths that refer to a dir from
523 those which may refer to anything, and use that in various places to filter
524 early. i.e. stuff ending in "/", "/." and "/.." definitely refers to a
525 directory, and paths ending that way can be refused early in many contexts.
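
The syntactic check described above is small enough to sketch directly. The function name is hypothetical, and the sketch only looks at the string, never at the file system:

```python
def path_must_be_dir(path: str) -> bool:
    """True if `path` can only refer to a directory, judging by its ending.

    Paths ending in "/", "/." or "/.." always name a directory; anything
    else may refer to any kind of inode.
    """
    return path.endswith(("/", "/.", "/.."))
```

A caller that expects a regular file could refuse such paths before ever opening them.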
526
527 * systemd-measure: allow operating with PEM certificates in addition to PEM
528 public keys when signing PCR values. SecureBoot and our Verity signatures
529 operate with certificates already, hence I guess we should also just deal for
530 convenience with certificates for the PCR stuff too.
531
532 * systemd-measure: add --pcrpkey-auto as an alternative to --pcrpkey=, where it
533 would just use the same public key specified with --public-key= (or the one
534 automatically derived from --private-key=).
535
536 * push people to use ".sysext.raw" as suffix for sysext DDIs (DDI =
537 discoverable disk images, i.e. the new name for gpt disk images following the
538 discoverable disk spec). [Also: just ".sysext/" for directory-based sysext]
539
540 * Add "purpose" flag to partition flags in discoverable partition spec that
541 indicate if partition is intended for sysext, for portable service, for
542 booting and so on. Then, when dissecting DDI allow specifying a purpose to
543 use as additional search condition. Usecase: images that combine a sysext
544 partition with a portable service partition in one.
545
546 * On boot, auto-generate an asymmetric key pair from the TPM,
547 and use it for validating DDIs and credentials. Maybe upload it to the kernel
548 keyring, so that the kernel does this validation for us for verity and kernel
549 modules
550
551 * for systemd-syscfg: add a tool that can generate suitable DDIs with verity +
552 sig using squashfs-tools-ng's library. Maybe just systemd-repart called under
553 a new name with a built-in config?
554
555 * gpt-auto: generate mount units that reference partitions via
556 /dev/disk/by-diskseq/… so that they can't be swapped out behind our back.
557
558 * lock down acceptable encrypted credentials at boot, via simple allowlist,
559 maybe on kernel command line:
560 systemd.import_encrypted_creds=foobar.waldo,tmpfiles.extra to protect locked
561 down kernels from credentials generated on the host with a weak kernel
562
563 * Add support for extra verity configuration options to systemd-repart (FEC,
564 hash type, etc)
565
566 * chase_symlinks(): take inspiration from path_extract_filename() and return
567 O_DIRECTORY if input path contains trailing slash.
568
569 * chase_symlinks(): refuse resolution if trailing slash is specified on input,
570 but final node is not a directory
571
572 * chase_symlinks(): add new flag that simply refuses all symlink use in a path,
573 then use that for accessing XBOOTLDR/ESP
574
575 * document in boot loader spec that symlinks in XBOOTLDR/ESP are not OK even if
576 non-VFAT fs is used.
577
578 * measure credentials picked up from SMBIOS to some suitable PCR
579
580 * measure GPT and LUKS headers somewhere when we use them (i.e. in
581 systemd-gpt-auto-generator/systemd-repart and in systemd-cryptsetup?)
582
583 * pick up creds from EFI vars
584
585 * sd-boot: we probably should include all BootXY EFI variable defined boot
586 entries in our menu, and then suppress ourselves. Benefit: instant
587 compatibility with all other OSes which register things there, in particular
588 on other disks. Always boot into them via NextBoot EFI variable, to not
589 affect PCR values.
590
591 * systemd-measure tool:
592 - pre-calculate PCR 12 (command line) + PCR 13 (sysext) the same way we can precalculate PCR 11
593
594 * in sd-boot: load EFI drivers from a new PE section. That way, one can have a
595 "supercharged" sd-boot binary, that could carry ext4 drivers built-in.
596
597 * sd-bus: document that sd_bus_process() only returns messages that none of the
598 filters/handlers installed on the connection took possession of.
599
600 * sd-device: add an API for acquiring list of child devices, given a device
601 object (i.e. all child dirents that are dirs or symlinks to dirs)
602
603 * sd-device: maybe pin the sysfs dir with an fd, during the entire runtime of
604 an sd_device, then always work based on that.
605
606 * add small wrapper around qemu that implements sd_notify/AF_VSOCK + machined and
607 maybe some other stuff and boots it
608
609 * maybe add new flags to gpt partition tables for rootfs and usrfs indicating
610 purpose, i.e. whether something is supposed to be bootable in a VM, on
611 baremetal, on an nspawn-style container, if it is a portable service image,
612 or a sysext for initrd, for host os, or for portable container. Then hook
613 portabled/… up to udev to watch block devices coming up with the flags set, and
614 use it.
615
616 * sd-boot should look for information what to boot in SMBIOS, too, so that VM
617 managers can tell sd-boot what to boot into and suchlike
618
619 * add "systemd-sysext identify" verb, that you can point on any file in /usr/
620 and that determines from which overlayfs layer it originates, which image, and with
621 what it was signed.
622
623 * journald: generate recognizable log events whenever we shutdown journald
624 cleanly, and when we migrate run → var. This way tools can verify that a
625 previous boot terminated cleanly, because either of these two messages must
626 be safely written to disk, then.
627
628 * systemd-creds: extend encryption logic to support asymmetric
629 encryption/authentication. Idea: add new verb "systemd-creds public-key"
630 which generates a priv/pub key pair on the TPM2 and stores the priv key
631 locally in /var. It then outputs a certificate for the pub part to stdout.
632 This can then be copied/taken elsewhere, and can be used for encrypting creds
633 that only the host on its specific hw can decrypt. Then, support a drop-in
634 dir with certificates that can be used to authenticate credentials. Flow of
635 operations is then this: build image with owner certificate, then after
636 boot up issue "systemd-creds public-key" to acquire pubkey of the machine.
637 Then, when passing data to the machine, sign with privkey belonging to one of
638 the dropped in certs and encrypted with machine pubkey, and pass to machine.
639 Machine is then able to authenticate you, and confidentiality is guaranteed.
640
641 * building on top of the above, the pub/priv key pair generated on the TPM2
642 should probably also be one you can use to get a remote attestation quote.
643
644 * Process credentials in:
645 • networkd/udevd: add a way to define additional .link, .network, .netdev files
646 via the credentials logic.
647 • fstab-generator: allow defining additional fstab-like mounts via
648 credentials (similar: crypttab-generator, verity-generator,
649 integrity-generator)
650 • getty-generator: allow defining additional getty instances via a credential
651 • run-generator: allow defining additional commands to run via a credential
652 • resolved: allow defining additional /etc/hosts entries via a credential (it
653 might make sense to then synthesize a new combined /etc/hosts file in /run
654 and bind mount it on /etc/hosts for other clients that want to read it).
655 • repart: allow defining additional partitions via credential
656 • timesyncd: pick NTP server info from credential
657 • portabled: read a credential "portable.extra" or so, that takes a list of
658 file system paths to enable on start.
659 • make systemd-fstab-generator look for a system credential encoding root= or
660 usr=
661 • systemd-homed: when initializing, look for a credential
662 systemd.homed.register or so with JSON user records to automatically
663 register if not registered yet. Usecase: deploy a system, and add an
664 account one can directly log into.
665 • initialize machine ID from systemd credential picked up from the ESP via
666 sd-stub, so that machine ID is stable even on systems where unified kernels
667 are used, and hence kernel cmdline cannot be modified locally
668 • in gpt-auto-generator: check partition uuids against such uuids supplied via
669 sd-stub credentials. That way, we can support parallel OS installations with
670 pre-built kernels.
671
672 * define a JSON format for units, separating out unit definitions from unit
673 runtime state. Then, expose it:
674
675 1. Add Describe() method to Unit D-Bus object that returns a JSON object
676 about the unit.
677 2. Expose this natively via Varlink, in similar style
678 3. Use it when invoking binaries (i.e. make PID 1 fork off systemd-executor
679 binary which reads the JSON definition and runs it), to address the cow
680 trap issue and the fact that NSS is actually forbidden in
681 forked-but-not-exec'ed children
682 4. Add varlink API to run transient units based on provided JSON definitions
683
684 * Add SUPPORT_END_URL= field to os-release with more *actionable* information
685 what to do if support ended
686
687 * pam_systemd: on interactive logins, maybe show SUPPORT_END information at
688 login time, à la motd
689
690 * sd-boot: instead of unconditionally deriving the ESP to search boot loader
691 spec entries in from the paths of sd-boot binary, let's optionally allow it
692 to be configured on sd-boot cmdline + efi var. Usecase: embed sd-boot in the
693 UEFI firmware (for example, ovmf supports that via qemu cmdline option), and
694 use it to load stuff from the ESP.
695
696 * mount /var/ from initrd, so that we can apply sysext and stuff before the
697 initrd transition. Specifically:
698 1. There should be a var= kernel cmdline option, matching root= and usr=
699 2. systemd-gpt-auto-generator should auto-mount /var if it finds it on disk
700 3. mount.x-initrd mount option in fstab should be implied for /var
701
702 * implement varlink introspection
703
704 * we should probably drop all use of prefix_roota() and friends, and use
705 chase_symlinks() instead
706
707 * make persistent restarts easier by adding a new setting OpenPersistentFile=
708 or so, which allows opening one or more files that are "persistent" across
709 service restarts, hot reboot, cold reboots (depending on configuration): the
710 files are created empty on first invocation, and on subsequent invocations
711 the files are reopened. The files would be backed by tmpfs, pmem or /var
712 depending on desired level of persistency.
713
714 * sd-event: add ability to "chain" event sources. Specifically, add a call
715 sd_event_source_chain(x, y), which will automatically enable event source y
716 in oneshot mode once x is triggered. Use case: in src/core/mount.c implement
717 the /proc/self/mountinfo rescan on SIGCHLD with this: whenever a SIGCHLD is
718 seen, trigger the rescan defer event source automatically, and allow it to be
719 dispatched *before* the SIGCHLD is handled (based on priorities). Benefit:
720 dispatch order is strictly controlled by priorities again. (next step: chain
721 event sources to the ratelimit being over)
722
723 * if we fork off a service with StandardOutput=journal, and it forks off a
724 subprocess that quickly dies, we might not be able to identify the cgroup it
725 comes from, but we can still derive that from the stdin socket its output
726 came from. We apparently don't do that right now.
727
728 * add ability to set hostname with suffix derived from machine id at boot
729
730 * add PR_SET_DUMPABLE service setting
731
732 * homed/userdb: maybe define a "companion" dir for home directories where apps
733 can safely put privileged stuff in. Would not be writable by the user, but
734 still conceptually belong to the user. Would be included in user's quota if
735 possible, even if files are not owned by UID of user. Usecase: container
736 images that are owned by arbitrary UIDs, and are owned/managed by the users, but
737 are not directly belonging to the user's UID. Goal: we shouldn't place more
738 privileged dirs inside of unprivileged dirs, and thus containers really
739 should not be placed inside of traditional UNIX home dirs (which are owned by
740 users themselves) but somewhere else, that is separate, but still close
741 by. Inform user code about path to this companion dir via env var, so that
742 container managers find it. the ~/.identity file is also a candidate for a
743 file to move there, since it is managed by privileged code (i.e. homed) and
744 not unprivileged code.
745
746 * given that /etc/ssh/ssh_config.d/ is a thing now, ship a drop-in for that
747 that hooks up userdbctl ssh-key stuff.
748
749 * maybe add support for binding and connecting AF_UNIX sockets in the file
750 system outside of the 108ch limit. When connecting, open O_PATH fd to socket
751 inode first, then connect to /proc/self/fd/XYZ. When binding, create symlink
752 to target dir in /tmp, and bind through it.
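
A minimal Python sketch of the two tricks described above, assuming Linux with /proc mounted. The helper names connect_long_unix_path() and bind_long_unix_path() are hypothetical and only illustrate the idea; they are not existing systemd APIs.

```python
import os
import socket
import tempfile


def connect_long_unix_path(path: str) -> socket.socket:
    """Connect to an AF_UNIX socket whose path may exceed the sun_path limit."""
    # O_PATH pins the socket inode without actually opening it.
    fd = os.open(path, os.O_PATH | os.O_CLOEXEC)
    sock = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    try:
        # The /proc/self/fd/ path is short; the kernel follows it to the inode.
        sock.connect(f"/proc/self/fd/{fd}")
    except OSError:
        sock.close()
        raise
    finally:
        os.close(fd)
    return sock


def bind_long_unix_path(path: str) -> socket.socket:
    """Bind an AF_UNIX socket in a deep directory via a short symlink in /tmp."""
    dirname, name = os.path.split(os.path.abspath(path))
    tmpdir = tempfile.mkdtemp(dir="/tmp")
    link = os.path.join(tmpdir, "d")
    sock = socket.socket(socket.AF_UNIX, socket.SOCK_STREAM)
    try:
        os.symlink(dirname, link)
        # Only the short path through the symlink has to fit into sun_path.
        sock.bind(os.path.join(link, name))
    except OSError:
        sock.close()
        raise
    finally:
        if os.path.islink(link):
            os.unlink(link)
        os.rmdir(tmpdir)
    return sock
```

The socket file name itself still has to be short enough to fit after the symlink prefix; only the directory part is shortened.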
753
754 * add a proper concept of a "developer" mode, i.e. where cryptographic
755 protections of the root OS are weakened after interactive confirmation, to
756 allow hackers to run their own stuff. idea: allow entering developer mode
757 only via explicit choice in boot menu: i.e. add explicit boot menu item for
758 it. When developer mode is entered, generate a key pair in the TPM2, and add
759 the public part of it automatically to keychain of valid code signature keys
760 on subsequent boots. Then provide a tool to sign code with the key in the
761 TPM2. Ensure that boot menu item is the only way to enter developer mode, by
762 binding it to locality/PCRs so that keys cannot be generated otherwise.
763
764 * services: add support for cryptographically unlocking per-service directories
765 via TPM2. Specifically, for StateDirectory= (and related dirs) use fscrypt to
766 set up the directory so that it can only be accessed if host and app are in
767 order.
768
769 * TPM2: extend unlock policy to protect against version downgrades in signed
770 policies: policy probably must take some nvram based generation counter into
771 account that can only monotonically increase and can be used to invalidate
772 old PCR signatures. Otherwise people could downgrade to old signed PCR sets
773 whenever they want.
774
775 * update HACKING.md to suggest developing systemd with the ideas from:
776 https://0pointer.net/blog/testing-my-system-code-in-usr-without-modifying-usr.html
777 https://0pointer.net/blog/running-an-container-off-the-host-usr.html
778
779 * add a clear concept how the initrd can make up credentials on their own to
780 pass to the system when transitioning into the host OS. usecase: things like
781 cloud-init/ignition and similar can parameterize the host with data they
782 acquire.
783
784 * sd-event: combat wd reuse in inotify code: keep a set of removed watch
785 descriptors, and clear this set piecemeal when we see the IN_IGNORED event
786 for it, or when read() returns EAGAIN or on IN_Q_OVERFLOW. Then, whenever we
787 see an inotify wd event check against this set, and if it is contained ignore
788 the event. (to be fully correct this would have to count the occurrences, in
789 case the same wd is reused multiple times before we start processing
790 IN_IGNORED again)
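
The bookkeeping described above, including the per-wd occurrence count, could look roughly like the following Python sketch. The class RemovedWatchTracker and its method names are hypothetical; the two mask constants carry the values from <sys/inotify.h>.

```python
from collections import Counter

# Values as defined in <sys/inotify.h>.
IN_Q_OVERFLOW = 0x4000
IN_IGNORED = 0x8000


class RemovedWatchTracker:
    """Tracks removed watch descriptors so stale queued events can be ignored."""

    def __init__(self) -> None:
        # Counts how often each wd was removed without IN_IGNORED seen yet.
        self._removed = Counter()

    def watch_removed(self, wd: int) -> None:
        """Call right after removing the watch for wd."""
        self._removed[wd] += 1

    def queue_drained(self) -> None:
        """Call when read() returned EAGAIN: no stale event can be pending."""
        self._removed.clear()

    def should_dispatch(self, wd: int, mask: int) -> bool:
        """Return False if the event belongs to an already removed watch."""
        if mask & IN_Q_OVERFLOW:
            # Events were lost, including possibly IN_IGNORED: start over.
            self._removed.clear()
            return True
        if self._removed.get(wd, 0) > 0:
            if mask & IN_IGNORED:
                self._removed[wd] -= 1
                if self._removed[wd] == 0:
                    del self._removed[wd]
            return False
        return True
```

Counting instead of keeping a plain set covers the case where the same wd is removed and reused several times before the first IN_IGNORED is processed.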
791
792 * systemd-fstab-generator: support additional mount specifications via kernel
793 cmdline. Usecase: invoke a VM, and mount a host homedir into it via
794 virtio-fs.
795
796 * for vendor-built signed initrds:
797 - make sysext run in the initrd
798 - sysext should pick up sysext images from /.extra/ in the initrd, and insist
799 on verification if in secureboot mode
800 - kernel-install should be able to install pre-built unified kernel images in
801 type #2 drop-in dir in the ESP.
802 - kernel-install should be able to install encrypted creds automatically for
803 machine id, root pw, rootfs uuid, resume partition uuid, and place next to
804 EFI kernel, for sd-stub to pick them up. These creds should be locked to
805 the TPM, and bind to the right PCR the kernel is measured to.
806 - kernel-install should be able to pick up initrd sysexts automatically and
807 place them next to EFI kernel, for sd-stub to pick them up.
808 - systemd-fstab-generator should look for rootfs device to mount in creds
809 - pid 1 should look for machine ID in creds
810 - systemd-resume-generator should look for resume partition uuid in creds
811 - sd-stub: automatically pick up microcode from ESP (/loader/microcode/*)
812 and synthesize initrd from it, and measure it. Signing is not necessary, as
813 microcode does that on its own. Pass as first initrd to kernel.
814
815 * Maybe extend the service protocol to support handling of some specific SIGRT
816 signal for setting service log level, that carries the level via the
817 sigqueue() data parameter. Enable this via unit file setting.
818
819 * firstboot: maybe just default to C.UTF-8 locale if nothing is set, so that we
820 don't query this unnecessarily in entirely uninitialized
821 containers. (i.e. containers with empty /etc).
822
823 * sd_notify/vsock: maybe support binding to AF_VSOCK in Type=notify services,
824 then passing $NOTIFY_SOCKET and $NOTIFY_GUESTCID with PID1's cid (typically
825 fixed to "2", i.e. the official host cid) and the expected guest cid, for the
826 two sides of the channel. The latter env var could then be used in an
827 appropriate qemu cmdline. That way qemu payloads could talk sd_notify()
828 directly to host service manager.
829
830 * maybe write a tool that binds an AF_VSOCK socket, then invokes qemu,
831 extending the command line to enable vsock on the VM, and using fw_cfg to
832 configure socket address.
833
834 * sd-boot: add menu item for shutdown? or hotkey?
835
836 * sd-device has an API to create an sd_device object from a device id, but has
837 no api to query the device id
838
839 * sd-device should return the devnum type (i.e. 'b' or 'c') via some API for an
840 sd_device object, so that data passed into sd_device_new_from_devnum() can
841 also be queried.
842
843 * sd-event: optionally, if per-event source rate limit is hit, downgrade
844 priority, but leave enabled, and once ratelimit window is over, upgrade
845 priority again. That way we can combat event source starvation without
846 stopping processing events from one source entirely.
847
848 * sd-event: similar to existing inotify support add fanotify support (given
849 that apparently new features in this area are only going to be added to the
850 latter).
851
852 * sd-event: add 1st class event source for clock changes
853
854 * sd-event: add 1st class event source for timezone changes
855
856 * support uefi/http boots with sd-boot: instead of looking for dropin files in
857 /loader/entries/ dir, look for a file /loader/entries/SHA256SUMS and use that
858 as directory manifest. The file would be a standard directory listing as
859 generated by GNU sha256sum.
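
To illustrate the manifest idea, here is a small Python sketch that parses sha256sum-style output ("<hex digest>", a space, then a space or "*", then the file name) and checks a downloaded entry against it. The function names are hypothetical, and the escaped-line variant sha256sum emits for unusual file names is deliberately not handled.

```python
import hashlib
import re

# One manifest line: 64 hex digits, a space, a mode character, the file name.
_LINE = re.compile(r"^([0-9a-f]{64}) [ *](.+)$")


def parse_manifest(text: str) -> dict:
    """Map each file name listed in the manifest to its SHA256 hex digest."""
    entries = {}
    for line in text.splitlines():
        if not line:
            continue
        match = _LINE.match(line)
        if match is None:
            raise ValueError(f"malformed manifest line: {line!r}")
        digest, name = match.groups()
        entries[name] = digest
    return entries


def verify_entry(entries: dict, name: str, data: bytes) -> bool:
    """Check downloaded data for name against the digest from the manifest."""
    return entries.get(name) == hashlib.sha256(data).hexdigest()
```

The keys of the returned mapping serve as the directory listing, and the digests allow each fetched entry to be validated before use.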
860
861 * sd-boot: maybe add support for embedding the various auxiliary resources we
862 look for right in the sd-boot binary. i.e. take inspiration from sd-stub
863 logic: allow combining sd-boot via objcopy with kernels to enumerate, .conf
864 files, drivers, keys to enroll and so on. Then, add whatever we find that way
865 to the menu. Usecase: allow building a single PE image you can boot into via
866 UEFI HTTP boot.
867
868 * maybe add a new UEFI stub binary "sd-http". It works similar to sd-stub, but
869 all it does is download a file from a http server, and execute it, after
870 optionally checking its hash sum. idea would be: combine this "sd-http" stub
871 binary with some minimal info about an URL + hash sum, plus .osrel data, and
872 drop it into the unified kernel dir in the ESP. And bam you have something
873 that is tiny, feels a lot like a unified kernel, but all it does is chainload
874 the real kernel. benefit: downloading these stubs would be tiny and quick,
875 hence cheap for enumeration.
876
877 * sysext: measure all activated sysext into a TPM PCR
878
879 * maybe add a "syscfg" concept, that is almost entirely identical to "sysext",
880 but operates on /etc/ instead of /usr/ and /opt/. Use case would be: trusted,
881 authenticated, atomic, additive configuration management primitive: drop in a
882 configuration bundle, and activate it, so that it is instantly visible,
883 comprehensively.
884
885 * systemd-dissect: show available versions inside of a disk image, i.e. if
886 multiple versions are around of the same resource, show which ones. (in other
887 words: show partition labels).
888
889 * maybe add a generator that reads /proc/cmdline, looks for
890 systemd.pull-raw-portable=, systemd.pull-raw-sysext= and similar switches
891 that take an URL as parameter. It then generates service units for
892 systemd-pull calls that download these URLs if not installed yet. usecase:
893 invoke a VM or nspawn container in a way it automatically deploys/runs these
894 images as OS payloads. i.e. have a generic OS image you can point to any
895 payload you like, which is then downloaded, securely verified and run.
896
897 * improve scope units to support creation by pidfd instead of by PID
898
899 * deprecate cgroupsv1 further (print log message at boot)
900
901 * systemd-dissect: add --cat switch for dumping files such as /etc/os-release
902
903 * per-service sandboxing option: ProtectIds=. If used, will overmount
904 /etc/machine-id and /proc/sys/kernel/random/boot_id with synthetic files, to
905 make it harder for the service to identify the host. Depending on the user
906 setting it should be fully randomized at invocation time, or a hash of the
907 real thing, keyed by the unit name or so. Of course, there are other ways to
908 get these IDs (e.g. journal) or similar ids (e.g. MAC addresses, DMI ids, CPU
909 ids), so this knob would only be useful in combination with other lockdown
910 options. Particularly useful for portable services, and anything else that
911 uses RootDirectory= or RootImage=. (Might also over-mount
912 /sys/class/dmi/id/*{uuid,serial} with /dev/null).
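
A sketch of the "hash of the real thing, keyed by the unit name" variant, in Python. The function synthetic_machine_id() is hypothetical, and the concrete construction (HMAC-SHA256 truncated to 128 bits, with the UUID v4 bits set) is an assumption chosen for illustration, not a settled design.

```python
import hashlib
import hmac


def synthetic_machine_id(real_machine_id: str, unit_name: str) -> str:
    """Derive a stable per-unit ID that does not reveal the real machine ID."""
    real = bytes.fromhex(real_machine_id.strip())
    if len(real) != 16:
        raise ValueError("machine ID must be 128 bits")
    # Keyed by the unit name, hashing the real ID, as the item suggests.
    digest = hmac.new(unit_name.encode(), real, hashlib.sha256).digest()
    raw = bytearray(digest[:16])
    # Make the result look like a random (v4) UUID.
    raw[6] = (raw[6] & 0x0F) | 0x40
    raw[8] = (raw[8] & 0x3F) | 0x80
    return raw.hex()
```

The same unit always sees the same ID across invocations, while two different units cannot correlate their IDs with each other or with the host's.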
913
914 * journalctl/timesyncd: whenever timesyncd acquires a synchronization from NTP,
915 create a structured log entry that contains boot ID, monotonic clock and
916 realtime clock (I mean, this requires no special work, as these three fields
917 are implicit). Then in journalctl when attempting to display the realtime
918 timestamp of a log entry, first search for the closest later log entry
919 of this kind that has a matching boot id, and convert the monotonic clock
920 timestamp of the entry to the realtime clock using this info. This way we can
921 retroactively correct the wallclock timestamps, in particular for systems
922 without RTC, i.e. where initially wallclock timestamps carry rubbish, until
923 an NTP sync is acquired.
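
The conversion described above boils down to simple arithmetic on the (monotonic, realtime) pair recorded at sync time. A Python sketch, with the hypothetical names SyncPoint and corrected_realtime(), assuming all timestamps are in microseconds:

```python
import bisect
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class SyncPoint:
    """Clock readings of a log entry written when an NTP sync was acquired."""
    boot_id: str
    monotonic_usec: int
    realtime_usec: int


def corrected_realtime(boot_id: str, monotonic_usec: int,
                       sync_points: Iterable[SyncPoint]) -> Optional[int]:
    """Recompute an entry's wallclock time from the closest later sync point."""
    same_boot = sorted((p for p in sync_points if p.boot_id == boot_id),
                       key=lambda p: p.monotonic_usec)
    keys = [p.monotonic_usec for p in same_boot]
    # First sync point of this boot at or after the entry.
    i = bisect.bisect_left(keys, monotonic_usec)
    if i == len(same_boot):
        return None  # no later sync in this boot: nothing to correct with
    point = same_boot[i]
    # The monotonic distance to the sync point is trustworthy; apply it
    # backwards from the sync point's (correct) realtime value.
    return point.realtime_usec - (point.monotonic_usec - monotonic_usec)
```

For example, an entry logged 6 s (monotonic) before the sync is displayed as 6 s before the synced wallclock time, regardless of what the realtime clock claimed when the entry was written.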
924
925 * kernel-install:
926 - add --all switch for rerunning kernel-install for all installed kernels
927 - maybe add env var that shortcuts kernel-install for installers that want to
928 call it at the end only
929
930 * doc: prep a document explaining resolved's internal objects, i.e. Query
931 vs. Question vs. Transaction vs. Stream and so on.
932
933 * doc: prep a document explaining PID 1's internal logic, i.e. transactions,
934 jobs, units
935
936 * bootspec: bring UEFI and userspace enumeration of bootspec entries back into
937 sync, i.e. parse out architecture field in sd-boot (currently only done in
938 userspace)
939
940 * automatically ignore threaded cgroups in cg_xyz().
941
942 * add linker script that implicitly adds symbol for build ID and new coredump
943 json package metadata, and use that when logging
944
945 * Enable RestrictFileSystems= for all our long-running services (similar:
946 RestrictNetworkInterfaces=)
947
948 * Add systemd-analyze security checks for RestrictFileSystems= and
949 RestrictNetworkInterfaces=
950
951 * cryptsetup/homed: implement TOTP authentication backed by TPM2 and its
952 internal clock.
953
954 * man: rework os-release(5), and clearly separate our extension-release.d/ and
955 initrd-release parts, i.e. list explicitly which fields are about what.
956
957 * sysext: before applying a sysext, do a superficial validation run so that
958 things are not rearranged too wildly. I.e. protect against accidental fuckups,
959 such as masking out /usr/lib/ or so. We should probably refuse if existing
960 inodes are replaced by other types of inodes or so.
961
962 * userdb: when synthesizing NSS records, pick "best" password from defined
963 passwords, not just the first. i.e. if there are multiple defined, prefer
964 unlocked over locked and prefer non-empty over empty.
965
966 * maybe add a tool inspired by the GPT auto discovery spec that runs in the
967 initrd and rearranges the rootfs hierarchy via bind mounts, if
968 enabled. Specifically in some top-level dir /@auto/ it will look for
969 dirs/symlinks/subvolumes that are named after their purpose, and optionally
970 encode a version as well as assessment counters, and then mount them into the
971 file system tree to boot into, similar to how we do that for the gpt auto
972 logic. Maybe then bind mount the original root into /.superior or something
973 like that (so that update tools can look there). Further discussion in this
974 thread:
975 https://lists.freedesktop.org/archives/systemd-devel/2021-November/047059.html
976 The GPT dissection logic should automatically enable this tool whenever we
977 detect a specially marked root fs (i.e. introduce a new generic root gpt type
978 for this, that is arch independent). Then also implement this in the image
979 dissection logic, so that nspawn/RootImage= and so on grok it. Maybe make
980 generic enough so that it can also work for ostree arrangements.
981
982 * if a path ending in ".auto.d/" is set for RootDirectory=/RootImage= then do a
983 strverscmp() of everything inside that dir and use that. i.e. implement very
984 simple version control. Also use this in systemd-nspawn --image= and so on.
985
986 * homed: while a home dir is not activated generate slightly different NSS
987 records for it, that reports the home dir as "/" and the shell as some binary
988 provided by us. Then, when an SSH login happens and SSH permits it our binary
989 is invoked. This binary can then talk to homed and activate the homedir if
990 it's not around yet, prompting the user for a password. Once that succeeded
991 we'll switch to the real user record, i.e. home dir and shell, and our tool
992 exec()s the latter. Net effect: ssh'ing into a homed account will just work:
993 we'll neatly prompt for the homedir's password if it's needed. –– Building on
994 this we could take this even further: since this tool will potentially have
995 access to the client's ssh-agent (if ssh-agent forwarding is enabled) we
996 could implement SSH unlocking of a homedir with that: when enrolling a new
997 ssh pubkey in a user record we'd ask the ssh-agent to sign some random value
998 with the privkey, then use that as luks key to unlock the home dir. Will not
999 work for ECDSA keys since their signatures contain a random component, but
1000 will work for RSA and Ed25519 keys.
1001
1002 * add tiny service that decrypts encrypted user records passed via initrd
1003 credential logic and drops them into /run where nss-systemd can pick them up,
1004 similar to /run/host/userdb/. Usecase: drop a root user JSON record there,
1005 and use it in the initrd to log in as root with locally selected password,
1006 for debugging purposes. Other usecase: boot into qemu with regular user
1007 mounted from host. maybe put this in systemd-user-sessions.service?
1008
1009 * drop dependency on libcap, replace by direct syscalls based on
1010 CapabilityQuintet we already have. (This likely allows us to drop libcap
1011 dep in the base OS image)
1012
1013 * sysext: automatically activate sysext images dropped in via new sd-stub
1014 sysext pickup logic. (must insist on verity + signature on those though)
1015
1016 * add concept for "exitrd" as inverse of "initrd", that we can transition to at
1017 shutdown, and has similar security semantics. This should then take the place
1018 of dracut's shutdown logic. Should probably support sysexts too. Care needs
1019 to be taken that the resulting logic ends up in RAM, i.e. is copied out of
1020 on-disk storage.
1021
1022 * userdbd: implement an additional varlink service socket that provides the
1023 host user db in restricted form, then allow this to be bind mounted into
1024 sandboxed environments that want the host database in minimal form. All
1025 records would be stripped of all meta info, except the basic UID/name
1026 info. Then use this in portabled environments that do not use PrivateUsers=1.
1027
1028 * portabled: when extracting unit files and copying to system.attached, if a
1029 .p7s is available in the image, use it to protect the system.attached copy
1030 with fs-verity, so that it cannot be tampered with
1031
1032 * logind: introduce two types of sessions: "heavy" and "light". The former would
1033 be our current sessions. But the latter would be a new type of session that
1034 is mostly the same but does not pull in user@.service or wait for it. Then,
1035 allow configuration which type of session is desired via pam_systemd
1036 parameters, and then make user@.service's session one of these "light" ones.
1037 People could then choose to make FTP sessions and suchlike "light" if they
1038 don't want the service manager to be started for that.
1039
1040 * /etc/veritytab: allow that the roothash column can be specified as fs path
1041 including a path to an AF_UNIX socket, similar to how we do things with the
1042 keys of /etc/crypttab. That way people can store/provide the roothash
1043 externally and provide to us on demand only.
1044
1045 * add high-level lockdown level for GPT dissection logic: e.g. an enum that can
1046 be ANY (to mount anything), TRUSTED (to require that /usr is on signed
1047 verity, but rest doesn't matter), LOCKEDDOWN (to require that everything is
1048 on signed verity, except for ESP), SUPERLOCKDOWN (like LOCKEDDOWN but ESP not
1049 allowed). And then maybe some flavours of that that declare what is expected
1050 from home/srv/var… Then, add a new cmdline flag to all tools that parse such
1051 images, to configure this. Also, add a kernel cmdline option for this, to be
1052 honoured by the gpt auto generator.
1053
1054 Alternative idea: add "systemd.gpt_auto_policy=rhvs" to allow gpt-auto to
1055 only mount root dir, /home/ dir, /var/ and /srv/, but nothing else. And then
1056 minor extension to this, insisting on encryption, for example
1057 "systemd.gpt_auto_policy=r+v+h" to require encryption for root and var but not
1058 for /home/, and similar. Similarly, add --image-dissect-policy= to tools that
1059 take --image=, taking the same short string.
1060
1061 * we probably should extend the root verity hash of the root fs into some PCR
1062 on boot. (i.e. maybe add a veritytab option tpm2-measure=12 or so to measure
1063 it into PCR 12); Similar: we probably should extend the LUKS volume key of
1064 the root fs into some PCR on boot. (i.e. maybe add a crypttab option
1065 tpm2-measure=15 or so to measure it into PCR 15); once both are in place
1066 update gpt-auto-discovery to generate these by default for the partitions it
1067 discovers. Static vendor stuff should probably end up in PCR 12 (i.e. the
1068 verity hash), with local keys in PCR 15 (i.e. the encryption volume
1069 key). That way, we nicely distinguish resources supplied by the OS vendor
1070 (i.e. sysext, root verity) from those inherently local (i.e. encryption key),
1071 which is useful if they shall be signed separately.
1072
1073 * add a "policy" to the dissection logic. i.e. a bit mask what is OK to mount,
1074 what must be read-only, what requires encryption, and what requires
1075 authentication.
1076
1077 * in uefi stub: query firmware regarding which PCR banks are being used, store
1078 that in EFI var. then use this when enrolling TPM2 in cryptsetup to verify
1079 that the selected PCRs actually are used by firmware.
1080
1081 * rework recursive read-only remount to use new mount API
1082
1083 * PAM: pick up authentication token from credentials
1084
1085 * when mounting disk images: if IMAGE_ID/IMAGE_VERSION is set in os-release
1086 data in the image, make sure the image filename actually matches this, so
1087 that images cannot be misused.
1088
1089 * New udev block device symlink names:
1090 /dev/disk/by-parttypelabel/<pttype>-<ptlabel>. Use case: if pt label is used
1091 as partition image version string, this is a safe way to reference a specific
1092 version of a specific partition type, in particular where related partitions
1093 are processed (e.g. verity + rootfs both named "LennartOS_0.7").
1094
1095 * sysupdate:
1096 - add fuzzing to the pattern parser
1097 - support casync as download mechanism
1098 - "systemd-sysupdate update --all" support, that iterates through all components
1099 defined on the host, plus all images installed into /var/lib/machines/,
1100 /var/lib/portable/ and so on.
1101 - figure out what to do about system extensions (i.e. they need to imply an
1102 update component, since otherwise system extensions' sysupdate.d/ files would
1103 override the host's update files.)
1104 - Allow invocation with a single transfer definition, i.e. with
1105 --definitions= pointing to a file rather than a dir.
1106 - add ability to disable implicit decompression of downloaded artifacts,
1107 i.e. a Compress=no option in the transfer definitions
1108
1109 * in sd-id128: also parse UUIDs in RFC4122 URN syntax (i.e. chop off urn:uuid: prefix)
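
What the extended parser would accept can be sketched in Python as follows. parse_id128() is a hypothetical stand-in for the C parsing function, and treating the "urn:uuid:" prefix case-insensitively is an assumption.

```python
import string

_HEX = set(string.hexdigits)
_URN_PREFIX = "urn:uuid:"


def parse_id128(text: str) -> bytes:
    """Parse a 128-bit ID given as plain hex, as a UUID, or as a UUID URN."""
    # Chop off the URN prefix, if present.
    if text[:len(_URN_PREFIX)].lower() == _URN_PREFIX:
        text = text[len(_URN_PREFIX):]
    # Hyphenated UUID form: hyphens must sit at the canonical positions.
    if len(text) == 36:
        if any(text[i] != "-" for i in (8, 13, 18, 23)):
            raise ValueError("misplaced hyphens in UUID")
        text = text.replace("-", "")
    if len(text) != 32 or not all(c in _HEX for c in text):
        raise ValueError("not a 128-bit ID")
    return bytes.fromhex(text)
```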
1110
1111 * DynamicUser= + StateDirectory= → use uid mapping mounts, too, in order to
1112 make dirs appear under right UID.
1113
1114 * systemd-sysext: optionally, run it in initrd already, before transitioning
1115 into host, to open up possibility for services shipped like that.
1116
1117 * maybe add a tool that displays most recent journal logs as QR code to scan
1118 off screen and run it automatically on boot failures, emergency logs and
1119 such. Use DRM APIs directly, see
1120 https://github.com/dvdhrm/docs/blob/master/drm-howto/modeset.c for an example
1121 for doing that.
1122
1123 * introduce /dev/disk/root/* symlinks that allow referencing partitions on the
1124 disk the rootfs is on in a reasonably secure way. (or maybe: add
1125 /dev/gpt-auto-{home,srv,boot,…} similar in style to /dev/gpt-auto-root as we
1126 already have it).
1127
1128 * whenever we receive fds via SCM_RIGHTS make sure none got dropped due to the
1129 reception limit the kernel silently enforces.
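
One way to notice dropped fds is to check for MSG_CTRUNC in the flags recvmsg() returns, which the kernel sets when not all passed fds could be delivered. A minimal Python sketch, with the hypothetical helper recv_fds_checked():

```python
import array
import socket


def recv_fds_checked(sock: socket.socket, max_fds: int, bufsize: int = 1):
    """Receive data plus fds; report whether the kernel dropped any fds."""
    int_size = array.array("i").itemsize
    # CMSG_LEN (not CMSG_SPACE) so that padding cannot admit an extra fd.
    data, ancdata, flags, _ = sock.recvmsg(
        bufsize, socket.CMSG_LEN(max_fds * int_size))
    fds = array.array("i")
    for level, ctype, cdata in ancdata:
        if level == socket.SOL_SOCKET and ctype == socket.SCM_RIGHTS:
            # Ignore a trailing partial int, if any.
            usable = len(cdata) - (len(cdata) % int_size)
            fds.frombytes(cdata[:usable])
    # MSG_CTRUNC means some control data, i.e. fds, did not make it through.
    truncated = bool(flags & socket.MSG_CTRUNC)
    return data, list(fds), truncated
```

A caller that sees truncated set should treat the message as incomplete and close whatever fds did arrive, rather than proceed with a partial set.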
1130
1131 * Add service unit setting ConnectStream= which takes IP addresses and connects to them.
1132
1133 * Similar, Load= which takes literal data in text or base64 format, and puts it
1134 into a memfd, and passes that. This enables some fun stuff, such as embedding
1135 bash scripts in unit files, by combining Load= with ExecStart=/bin/bash
1136 /proc/self/fd/3
1137
1138 * add a ConnectSocket= setting to service unit files, that may reference a
1139 socket unit, and which will connect to the socket defined therein, and pass
1140 the resulting fd to the service program via socket activation proto.
1141
1142 * Add a concept of ListenStream=anonymous to socket units: listen on a socket
1143 that is deleted in the fs. Usecase would be with ConnectSocket= above.
1144
1145 * importd: support image signature verification with PKCS#7 + OpenBSD signify
1146 logic, as alternative to crummy gpg
1147
1148 * add "systemd-analyze debug" + AttachDebugger= in unit files: The former
1149 specifies a command to execute; the latter specifies that an already running
1150 "systemd-analyze debug" instance shall be contacted and execution paused
1151 until it gives an OK. That way, tools like gdb or strace can safely be
1152 invoked on processes forked off PID 1.
1153
1154 * expose MS_NOSYMFOLLOW in various places
1155
1156 * credentials system:
1157 - acquire from EFI variable?
1158 - acquire via ask-password?
1159 - acquire creds via keyring?
1160 - pass creds via keyring?
1161 - pass creds via memfd?
1162 - acquire + decrypt creds from pkcs11?
1163 - make systemd-cryptsetup acquire pw via creds logic
1164 - make PAMName= acquire pw via creds logic
1165 - make macsec/wireguard code in networkd read key via creds logic
1166 - make gatewayd/remote read key via creds logic
1167 - add sd_notify() command for flushing out creds not needed anymore
1168 - make user manager instances create and use a user-specific key (the one in
1169 /var/lib is root-only) and add --user switch to systemd-creds to use it
1170
1171 * add tpm.target or so which is delayed until TPM2 device showed up in case
1172 firmware indicates there is one.
1173
1174 * TPM2: auto-reenroll in cryptsetup, as fallback for hosed firmware upgrades
1175 and such
1176
1177 * introduce a new group to own TPM devices
1178
1179 * cryptsetup: add option for automatically removing empty password slot on boot
1180
1181 * cryptsetup: optionally, when run during boot-up and password is never
1182 entered, and we are on battery power (or so), power off machine again
1183
1184 * cryptsetup: when waiting for FIDO2/PKCS#11 token, tell plymouth that, and
1185 allow plymouth to abort the waiting and enter pw instead
1186
1187 * make cryptsetup lower --iter-time
1188
1189 * cryptsetup: allow encoding key directly in /etc/crypttab, maybe with a
1190 "base64:" prefix. Useful in particular for pkcs11 mode.
1191
1192 * cryptsetup: reimplement the mkswap/mke2fs in cryptsetup-generator to use
1193 systemd-makefs.service instead.
1194
1195 * cryptsetup:
1196 - cryptsetup-generator: allow specification of passwords in crypttab itself
1197 - support rd.luks.allow-discards= kernel cmdline params in cryptsetup generator
1198
1199 * when configuring loopback netif, and it fails due to EPERM, eat up error if
1200 it happens to be set up alright already.
1201
1202 * at boot: check if battery above some threshold, if not power off again after explanation
1203
1204 * userdb: add field for ambient caps, so that a user can have CAP_WAKE_ALARM
1205 for example. And add code that resets ambient caps for all services by
1206 default.
1207
1208 * sd-bus: when connecting to some dbus server socket, set originating AF_UNIX
1209 socket name in abstract namespace to include "description" string, and pick
1210 it up from there in sd_bus_creds logic. i.e. we can use the socket peer
1211 address as conduit for some minimal connection metainfo, and use it to
1212 restore the "description" logic that kdbus used to have.
1213
1214 * systemd-analyze netif that explains predictable interface names (or networkctl)
1215
1216 * Add service setting to run a service within the specified VRF. i.e. do the
1217 equivalent of "ip vrf exec".
1218
1219 * change SwitchRoot() implementation in PID 1 to use pivot_root(".", "."), as
1220 documented in the pivot_root(2) man page, so that we can drop the /oldroot
1221 temporary dir.
1222
1223 * special case some calls of chase_symlinks() to use openat2() internally, so
1224 that the kernel does what we otherwise do.
1225
1226 * add a new flag to chase_symlinks() that stops chasing once the first missing
1227 component is found and then allows the caller to create the rest.
1228
1229 * make use of new glibc 2.32 APIs sigabbrev_np() and strerrorname_np().
1230
1231 * if /usr/bin/swapoff fails due to OOM, log a friendly explanatory message about it
1232
1233 * pid1: Move to tracking of main pid/control pid of units per pidfd
1234
1235 * pid1: support new clone3() fork-into-cgroup feature
1236
1237 * pid1: also remove PID files of a service when the service starts, not just
1238 when it exits
1239
1240 * make us use dynamically fewer deps for containers in general purpose distros:
1241 o turn into dlopen() deps:
1242 - kmod-libs (only when called from PID 1)
1243 - libblkid (only in RootImage= handling in PID 1, but not elsewhere)
1244 - libpam (only when called from PID 1)
1245 - bzip2, xz, lz4 (always — gzip and zstd should probably stay static deps the way they are,
1246 since they are so basic and our defaults)
1247 o move into separate libsystemd-shared-iptables.so
1248 - iptables-libs (only used by nspawn + networkd)
1249
1250 * seccomp: maybe use seccomp_merge() to merge our filters per-arch if we can.
1251 Apparently kernel performance is much better with fewer larger seccomp
1252 filters than with more smaller seccomp filters.
1253
1254 * systemd-path: add ESP and XBOOTLDR path. Add "private" runtime/state/cache dir enum,
1255 mapping to $RUNTIME_DIRECTORY, $STATE_DIRECTORY and such
1256
1257 * seccomp: by default mask x32 ABI system wide on x86-64. it's on its way out
1258
1259 * seccomp: don't install filters for ABIs that are masked anyway for the
1260 specific service
1261
1262 * busctl: maybe expose a verb "ping" for pinging a dbus service to see if it
1263 exists and responds.
1264
1265 * Maybe add a separate GPT partition type to the discoverable partition spec
1266 for "hibernate" partitions, that are exactly like swap partitions but only
1267 activated right before hibernation and thus never used for regular swapping.
1268
1269 * socket units: allow creating a udev monitor socket with ListenDevices= or so,
1270 with matches, then activate app through that passing socket over
1271
1272 * unify on openssl:
1273 - kill gnutls support in resolved
1274 - figure out what to do about libmicrohttpd, which has a hard dependency on
1275 gnutls
1276 - port fsprg over to a dlopen lib, then switch it to openssl
1277
1278 * add growvol and makevol options for /etc/crypttab, similar to
1279 x-systemd.growfs and x-systemd.makefs.
1280
1281 * userdb: allow username prefix searches in varlink API, allow realname and
1282 realname substr searches in varlink API
1283
1284 * userdb: allow uid/gid range checks
1285
1286 * userdb: allow existence checks
1287
1288 * pid1: activation by journal search expression
1289
1290 * when switching root from initrd to host, set the machine_id env var so that
1291 if the host has no machine ID set yet we continue to use the random one the
1292 initrd had set.
1293
1294 * sd-event: add native support for P_ALL waitid() watching, then move PID 1 to
1295 it for reaping assigned but unknown children. This needs some special care
1296 to operate somewhat sensibly in light of priorities: P_ALL will return
1297 arbitrary processes, regardless of the priority we want to watch them with,
1298 hence on each event loop iteration check all processes which we shall watch
1299 with higher prio explicitly, and then watch the entire rest with P_ALL.
1300
1301 * tweak sd-event's child watching: keep a prioq of children to watch and use
1302 waitid() only on the children with the highest priority until one is waitable
1303 and ignore all lower-prio ones from that point on
1304
1305 * maybe introduce xattrs that can be set on the root dir of the root fs
1306 partition that declare the volatility mode to use the image in. Previously I
1307 thought of marking this via GPT partition flags but that's not ideal since
1308 that's outside of the LUKS encryption/verity verification, and we probably
1309 shouldn't operate in a volatile mode unless we got told so from a trusted
1310 source.
1311
1312 * coredump: maybe when coredumping read a new xattr from /proc/$PID/exe that
1313 may be used to mark a whole binary as non-coredumpable. Would fix:
1314 https://bugs.freedesktop.org/show_bug.cgi?id=69447
1315
1316 * teach parse_timestamp() timezones like the calendar spec already knows it
1317
1318 * beef up hibernation to optionally do swapon/swapoff immediately before/after
1319 the hibernation
1320
1321 * beef up s2h to implement a battery watch loop: instead of entering
1322 hibernation unconditionally after coming back from resume make a decision
1323 based on the battery load level: if battery level is above a specific
1324 threshold, go to suspend again, only hibernate if below it. This means we'd
1325 stick to suspend usually, but fall back to hibernation only when battery runs
1326 empty (well, subject to our sampling interval). Related to this, check if we
1327 can make ACPI _BTP (i.e. /sys/class/power_supply/*/alarm) work for us too,
1328 i.e. see if it can wake up machines from suspend, so that we could resume
1329 automatically when the system is low on power and move automatically to
1330 hibernation mode. (see
1331 https://uefi.org/sites/default/files/resources/ACPI%206_2_A_Sept29.pdf
1332 section 10.2.2.8 and
1333 https://docs.microsoft.com/en-us/windows-hardware/design/device-experiences/modern-standby-wake-sources
1334 at the end).
1335
1336 * We should probably replace /etc/rc.d/README with a symlink to doc
1337 content. After all it is constant vendor data.
1338
1339 * maybe add kernel cmdline params: to force random seed crediting
1340
1341 * introduce a new per-process uuid, similar to the boot id, the machine id, the
1342 invocation id, that is derived from process creds, specifically a hashed
1343 combination of AT_RANDOM + getpid() + the starttime from
1344 /proc/self/stat. Then add these ids implicitly when logging. Deriving this
1345 uuid from these three things has the benefit that it can be derived easily
1346 from /proc/$PID/ in a stable, and unique way that changes on both fork() and
1347 exec().
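
A rough sketch of such a derivation, assuming SHA-256 as the hash and a truncation to 128 bits; both choices, and the function names, are assumptions made for illustration. The starttime is read here from /proc/<pid>/stat, where the kernel exposes it as field 22. Fetching the AT_RANDOM bytes is left to the caller:

```python
import hashlib
import uuid


def read_starttime(pid="self"):
    """Return field 22 (starttime, in clock ticks) of /proc/<pid>/stat."""
    with open(f"/proc/{pid}/stat", "rb") as f:
        raw = f.read()
    # Field 2 (comm) may contain spaces and parentheses, so split only
    # after the last ')'. The first field after it is field 3.
    rest = raw[raw.rindex(b")") + 2:].split()
    return int(rest[22 - 3])


def process_uuid(at_random: bytes, pid: int, starttime: int) -> uuid.UUID:
    """Hash the three inputs into a stable 128-bit per-process ID."""
    h = hashlib.sha256()
    h.update(at_random)
    h.update(pid.to_bytes(8, "little"))
    h.update(starttime.to_bytes(8, "little"))
    # version=4 only stamps the UUID version/variant bits onto the hash.
    return uuid.UUID(bytes=h.digest()[:16], version=4)
```

fork() changes the PID and start time, exec() changes AT_RANDOM, so either one yields a different ID while repeated derivations for the same process agree.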
1348
1349 * let's not GC a unit while its ratelimits are still pending
1350
1351 * when killing due to service watchdog timeout maybe detect whether target
1352 process is under ptracing and then log loudly and continue instead.
1353
1354 * make rfkill uaccess controllable by default, i.e. steal rule from
1355 gnome-bluetooth and friends
1356
1357 * make MAINPID= message reception checks even stricter: if service uses User=,
1358 then check sending UID and ignore message if it doesn't match the user or
1359 root.
1360
1361 * maybe trigger a uevent "change" on a device if "systemctl reload xyz.device"
1362 is issued.
1363
1364 * when importing an fs tree with machined, optionally apply userns-rec-chown
1365
1366 * when importing an fs tree with machined, complain if image is not an OS
1367
1368 * Maybe introduce a helper safe_exec() or so, which is to execve() what
1369 safe_fork() is to fork(). And then make it revert the RLIMIT_NOFILE soft limit
1370 to 1K implicitly, unless explicitly opted out.
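
The limit-reverting part could look roughly like the following sketch. The function name and the cap parameter are hypothetical; only the getrlimit()/setrlimit() calls are real APIs:

```python
import resource


def revert_nofile_soft_limit(cap=1024):
    """Set the RLIMIT_NOFILE soft limit to cap, bounded by the hard limit.

    Meant to run in the child right before exec, so that the executed
    program sees the traditional 1K soft limit.
    """
    soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    # RLIM_INFINITY must be special-cased since it may be a negative number.
    new_soft = cap if hard == resource.RLIM_INFINITY else min(cap, hard)
    resource.setrlimit(resource.RLIMIT_NOFILE, (new_soft, hard))
    return new_soft, hard
```

The hard limit is left untouched, so programs that know how to handle many fds can still raise their soft limit again.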
1371
1372 * rework seccomp/nnp logic so that even if User= is used in combination with
1373 a seccomp option we don't have to set NNP. For that, change uid first while
1374 keeping CAP_SYS_ADMIN, then apply seccomp, then drop cap.
1375
1376 * when no locale is configured, default to UEFI's PlatformLang variable
1377
1378 * add a new syscall group "@esoteric" for more esoteric stuff such as bpf() and
1379 userfaultfd() and make systemd-analyze check for it.
1380
1381 * paranoia: whenever we process passwords, call mlock() on the memory
1382 first. i.e. look for all places we use free_and_erasep() and
1383 augment them with mlock(). Also use MADV_DONTDUMP.
1384 Alternatively (preferably?) use memfd_secret().
1385
1386 * Move RestrictAddressFamilies= to the new cgroup create socket
1387
1388 * maybe implicitly attach monotonic+realtime timestamps to outgoing messages in
1389 log.c and sd-journal-send
1390
1391 * optionally: turn on cgroup delegation for per-session scope units
1392
1393 * introduce per-unit (i.e. per-slice, per-service) journal log size limits.
1394
1395 * sd-boot: optionally, show boot menu when previous default boot item has
1396 non-zero "tries done" count
1397
1398 * augment CODE_FILE=, CODE_LINE= with something like CODE_BASE= or so which
1399 contains some identifier for the project, which allows us to include
1400 clickable links to source files generating these log messages. The identifier
1401 could be some abbreviated URL prefix or so (taking inspiration from Go
1402 imports). For example, for systemd we could use
1403 CODE_BASE=github.com/systemd/systemd/blob/98b0b1123cc or so which is
1404 sufficient to build a link by prefixing "http://" and suffixing the
1405 CODE_FILE.
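
Building the link as described is just string concatenation. A small sketch; CODE_BASE is the proposed field and does not exist yet, and the file path in the usage note is only an illustrative value:

```python
def source_link(code_base: str, code_file: str) -> str:
    """Prefix "http://" to CODE_BASE and append CODE_FILE."""
    return "http://" + code_base.rstrip("/") + "/" + code_file.lstrip("/")
```

For example, source_link("github.com/systemd/systemd/blob/98b0b1123cc", "src/core/main.c") gives "http://github.com/systemd/systemd/blob/98b0b1123cc/src/core/main.c".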
1406
1407 * Augment MESSAGE_ID with MESSAGE_BASE, in a similar fashion so that we can
1408 make clickable links from log messages carrying a MESSAGE_ID, that lead to
1409 some explanatory text online.
1410
1411 * maybe extend .path units to expose fanotify() per-mount change events
1412
1413 * When reloading configuration PID 1 should reset all its properties to the
1414 original defaults before calling parse_config()
1415
1416 * hibernate/s2h: make this robust and safe to enable in Fedora by default.
1417 Specifically:
1418
1419 1. add resume_offset support to the resume code (i.e. support swap files
1420 properly)
1421 2. check if swap is on weird storage and refuse if so
1422 3. add auto-detection of hibernation images
1423
1424 * cgroups: use inotify to get notified when somebody else modifies cgroups
1425 owned by us, then log a friendly warning.
1426
1427 * beef up log.c with support for stripping ANSI sequences from strings, so that
1428 it is OK to include them in log strings. This would be particularly useful so
1429 that our log messages could contain clickable links for example for unit
1430 files and suchlike we operate on.
1431
1432 * importd: add ability to download images for portabled + sysext
1433
1434 * add support for "portablectl attach http://foobar.com/waaa.raw" (i.e. importd integration)
1435
1436 * sync dynamic uids/gids between host+portable service (i.e. if DynamicUser=1 is set for a service, make sure that the
1437 selected user is resolvable in the service even if it ships its own /etc/passwd)
1438
1439 * Fix DECIMAL_STR_MAX or DECIMAL_STR_WIDTH. One includes a trailing NUL, the
1440 other doesn't. What a disaster. Probably best to exclude it.
1441
1442 * Check that users of inotify's IN_DELETE_SELF flag are using it properly, as
1443 usually IN_ATTRIB is the right way to watch deleted files, as the former only
1444 fires when a file is actually removed from disk, i.e. the link count drops to
1445 zero and is not open anymore, while the latter happens when a file is
1446 unlinked from any dir.
1447
1448 * port systemctl, busctl, … over to format-table.[ch]'s table formatters
1449
1450 * pid1: lock image configured with RootDirectory=/RootImage= using the usual nspawn semantics while the unit is up
1451
1452 * add --vacuum-xyz options to coredumpctl, matching those journalctl already has.
1453
1454 * introduce Ephemeral= unit file switch, that creates an ephemeral copy of all
1455 files and directories that are left writable for a unit, and which are
1456 removed after the unit goes down again. A bit like --ephemeral for
1457 systemd-nspawn but for system services. If used together with RootImage= this
1458 should reflink the image file itself.
1459
1460 Related: add Ephemeral=<path1> <path2> … which would allow marking
1461 specific paths only like this.
1462
1463 * add CopyFile= or so as unit file setting that may be used to copy files or
1464 directory trees from the host to the service's RootImage= and RootDirectory=
1465 environment. Which we can use for /etc/machine-id and in particular
1466 /etc/resolv.conf. Should be smart and do something useful on read-only
1467 images, for example fall back to read-only bind mounting the file instead.
1468
1469 * show invocation ID in systemd-run output
1470
1471 * bypass SIGTERM state in unit files if KillSignal is SIGKILL
1472
1473 * add proper dbus APIs for the various sd_notify() commands, such as MAINPID=1
1474 and so on, which would mean we could report errors and such.
1475
1476 * introduce DefaultSlice= or so in system.conf that allows changing where we
1477 place our units by default, i.e. change system.slice to something
1478 else. Similar, ManagerSlice= should exist so that PID1's own scope unit could
1479 be moved somewhere else too. Finally machined and logind should get similar
1480 options so that it is possible to move user session scopes and machines to a
1481 different slice too by default. Usecase: people who want to put resources on
1482 the entire system, with the exception of one specific service. See:
1483 https://lists.freedesktop.org/archives/systemd-devel/2018-February/040369.html
1484
1485 * maybe rework get_user_creds() to query the user database if $SHELL is used
1486 for root, but only then.
1487
1488 * be stricter with fds we receive for the fdstore: close them asynchronously
1489
1490 * calendarspec: add support for week numbers and day numbers within a
1491 year. This would allow us to define "bi-weekly" triggers safely.
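
To illustrate what a week-number match could mean, here is a sketch that uses ISO 8601 week numbers and fires on a given weekday in every second week. The function name and its parameters are assumptions; no such calendar spec syntax exists yet:

```python
import datetime


def fires_biweekly(day: datetime.date, weekday: int = 0, parity: int = 0) -> bool:
    """True if day falls on weekday (0 = Monday) in an ISO week whose
    number has the given parity (0 = even weeks, 1 = odd weeks)."""
    _iso_year, iso_week, iso_weekday = day.isocalendar()
    return iso_weekday - 1 == weekday and iso_week % 2 == parity
```

One caveat of plain parity matching: in ISO years with 53 weeks, week 53 is followed directly by week 1, both odd, so the interval shrinks to one week at that year boundary.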
1492
1493 * sd-bus: add vtable flag, that may be used to request client creds implicitly
1494 and asynchronously before dispatching the operation
1495
1496 * sd-bus: parse addresses given in sd_bus_set_addresses immediately and not
1497 only when used. Add unit tests.
1498
1499 * make use of ethtool veth peer info in machined, for automatically finding out
1500 host-side interface pointing to the container.
1501
1502 * add some special mode to LogsDirectory=/StateDirectory=… that allows
1503 declaring these directories without necessarily pulling in deps for them, or
1504 creating them when starting up. That way, we could declare that
1505 systemd-journald writes to /var/log/journal, which could be useful when we
1506 are doing disk usage calculations and so on.
1507
1508 * deprecate RootDirectoryStartOnly= in favour of a new ExecStart= prefix char
1509
1510 * add a new RuntimeDirectoryPreserve= mode that defines a similar lifecycle for
1511 the runtime dir as we maintain for the fdstore: i.e. keep it around as long
1512 as the unit is running or has a job queued.
1513
1514 * support projid-based quota in machinectl for containers
1515
1516 * add a way to lock down cgroup migration: a boolean, which when set for a unit
1517 makes sure the processes in it can never migrate out of it
1518
1519 * blog about fd store and restartable services
1520
1521 * document Environment=SYSTEMD_LOG_LEVEL=debug drop-in in debugging document
1522
1523 * rework ExecOutput and ExecInput enums so that EXEC_OUTPUT_NULL loses its
1524 magic meaning and is no longer upgraded to something else if set explicitly.
1525
1526 * in the long run: permit a system with /etc/machine-id linked to /dev/null, to
1527 make it lose its identity, i.e. be anonymous. For this we'd have to patch
1528 through the whole tree to make all code deal with the case where no machine
1529 ID is available.
1530
1531 * optionally, collect cgroup resource data, and store it in per-unit RRD files,
1532 suitable for processing with rrdtool. Add bus API to access this data, and
1533 possibly implement a CPULoad property based on it.
1534
1535 * beef up pam_systemd to take unit file settings such as cgroups properties as
1536 parameters
1537
1538 * maybe hook up xfs/ext4 quotactl() with services? i.e. automatically manage
1539 the quota of the user indicated in User= via unit file settings, like the
1540 other resource management concepts. Would mix nicely with DynamicUser=1. Or
1541 alternatively, do this with projids, so that we can also cover services
1542 running as root. Quota should probably cover all the special dirs such as
1543 StateDirectory=, LogsDirectory=, CacheDirectory=, as well as RootDirectory= if it
1544 is set, plus the whole disk space any image configured with RootImage=.
1545
1546 * In DynamicUser= mode: before selecting a UID, use disk quota APIs on relevant
1547 disks to see if the UID is already in use.
1548
1549 * expose IO accounting data on the bus, show it in systemd-run --wait and log
1550 about it in the resource log message
1551
1552 * Add AddUser= setting to unit files, similar to DynamicUser=1 which however
1553 creates a static, persistent user rather than a dynamic, transient user. We
1554 can leverage code from sysusers.d for this.
1555
1556 * add some optional flag to ReadWritePaths= and friends, that has the effect
1557 that we create the dir in question when the service is started. Example:
1558
1559 ReadWritePaths=:/var/lib/foobar
1560
1561 * Add ExecMonitor= setting. May be used multiple times. Forks off a process in
1562 the service cgroup, which is supposed to monitor the service, and when it
1563 exits the service is considered failed by its monitor.
1564
1565 * track the per-service PAM process properly (i.e. as an additional control
1566 process), so that it may be queried on the bus and everything.
1567
1568 * add a new "debug" job mode, that is propagated to unit_start() and for
1569 services results in two things: we raise SIGSTOP right before invoking
1570 execve() and turn off watchdog support. Then, use that to implement
1571 "systemd-gdb" for attaching to the start-up of any system service in its
1572 natural habitat.
1573
1574 * gpt-auto logic: support encrypted swap, add kernel cmdline option to force
1575 it, and honour a gpt bit about it, plus maybe a configuration file
1576
1577 * add a percentage syntax for TimeoutStopSec=, e.g. TimeoutStopSec=150%, and
1578 then use that for the setting used in user@.service. It should be understood
1579 relative to the configured default value.
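
A sketch of how such a value could be resolved against the configured default. The function name is hypothetical, and real time span parsing (units such as "min" or "s") is left out on purpose:

```python
def resolve_timeout(value: str, default_sec: float) -> float:
    """Resolve a timeout given either in plain seconds or as a percentage
    of the configured default (e.g. "150%")."""
    value = value.strip()
    if value.endswith("%"):
        percent = float(value[:-1])
        if percent < 0:
            raise ValueError("percentage must not be negative")
        return default_sec * percent / 100.0
    return float(value)
```

With a default of 90 seconds, "150%" resolves to 135 seconds, while a plain "30" stays at 30 seconds.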
1580
1581 * enable LimitMEMLOCK= to take a percentage value relative to physical memory
1582
1583 * Permit masking specific netlink APIs with RestrictAddressFamilies=
1584
1585 * define gpt header bits to select volatility mode
1586
1587 * ProtectClock= (drops CAP_SYS_TIME, adds seccomp filters for settimeofday, adjtimex), sets DeviceAllow on /dev/rtc
1588
1589 * ProtectTracing= (drops CAP_SYS_PTRACE, blocks ptrace syscall, makes /sys/kernel/tracing go away)
1590
1591 * ProtectMount= (drop mount/umount/pivot_root from seccomp, disallow fuse via DeviceAllow, imply MountFlags=slave)
1592
1593 * ProtectKeyRing= to take keyring calls away
1594
1595 * RemoveKeyRing= to remove all keyring entries of the specified user
1596
1597 * ProtectReboot= that masks reboot() and kexec_load() syscalls, prohibits kill
1598 on PID 1 with the relevant signals, and makes relevant files in /sys and
1599 /proc (such as the sysrq stuff) unavailable
1600
1601 * Support ReadWritePaths/ReadOnlyPaths/InaccessiblePaths in systemd --user instances
1602 via the new unprivileged Landlock LSM (https://landlock.io)
1603
1604 * make sure the ratelimit object can deal with USEC_INFINITY as way to turn off things
1605
1606 * in nss-systemd, if we run inside of RootDirectory= with PrivateUsers= set,
1607 find a way to map the User=/Group= of the service to the right name. This way
1608 a user/group for a service only has to exist on the host for the right
1609 mapping to work.
1610
1611 * add bus API for creating unit files in /etc, reusing the code for transient units
1612
1613 * add bus API to remove unit files from /etc
1614
1615 * add bus API to retrieve current unit file contents (i.e. implement "systemctl cat" on the bus only)
1616
1617 * rework fopen_temporary() to make use of open_tmpfile_linkable() (problem: the
1618 kernel doesn't support linkat() that replaces existing files, currently)
1619
1620 * transient units: don't bother with actually setting unit properties, we
1621 reload the unit file anyway
1622
1623 * optionally, also require WATCHDOG=1 notifications during service start-up and shutdown
1624
1625 * cache sd_event_now() result from before the first iteration...
1626
1627 * PID1: find a way how we can reload unit file configuration for
1628 specific units only, without reloading the whole of systemd
1629
1630 * add an explicit parser for LimitRTPRIO= that verifies
1631 the specified range and generates sane error messages for incorrect
1632 specifications.
1633
1634 * when we detect that there are waiting jobs but no running jobs, do something
1635
1636 * PID 1 should send out sd_notify("WATCHDOG=1") messages (for usage in the --user mode, and when run via nspawn)
1637
1638 * there's probably something wrong with having user mounts below /sys,
1639 as we have for debugfs. for example, src/core/mount.c handles mounts
1640 prefixed with /sys generally specially.
1641 https://lists.freedesktop.org/archives/systemd-devel/2015-June/032962.html
1642
1643 * fstab-generator: default to tmpfs-as-root if only usr= is specified on the kernel cmdline
1644
1645 * docs: bring https://www.freedesktop.org/wiki/Software/systemd/MyServiceCantGetRealtime up to date
1646
1647 * add a job mode that will fail if a transaction would mean stopping
1648 running units. Use this in timedated to manage the NTP service
1649 state.
1650 https://lists.freedesktop.org/archives/systemd-devel/2015-April/030229.html
1651
1652 * The udev blkid built-in should expose a property that reflects
1653 whether media was sensed in USB CF/SD card readers. This should then
1654 be used to control SYSTEMD_READY=1/0 so that USB card readers aren't
1655 picked up by systemd unless they contain a medium. This would mirror
1656 the behaviour we already have for CD drives.
1657
1658 * hostnamectl: show root image uuid
1659
1660 * Find a solution for SMACK capabilities stuff:
1661 https://lists.freedesktop.org/archives/systemd-devel/2014-December/026188.html
1662
1663 * synchronize console access with BSD locks:
1664 https://lists.freedesktop.org/archives/systemd-devel/2014-October/024582.html
1665
1666 * as soon as we have sender timestamps, revisit coalescing multiple parallel daemon reloads:
1667 https://lists.freedesktop.org/archives/systemd-devel/2014-December/025862.html
1668
1669 * figure out when we can use the coarse timers
1670
1671 * maybe allow timer units with an empty Units= setting, so that they
1672 can be used for resuming the system but nothing else.
1673
1674 * what to do about udev db binary stability for apps? (raw access is not an option)
1675
1676 * exponential backoff in timesyncd when we cannot reach a server
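
The retry schedule could be sketched as a generator. The initial delay, cap, and factor below are placeholder values, not timesyncd's actual configuration:

```python
import itertools


def backoff_intervals(initial_sec=32.0, max_sec=2048.0, factor=2.0):
    """Yield retry delays that grow by factor until capped at max_sec."""
    delay = initial_sec
    while True:
        yield delay
        delay = min(delay * factor, max_sec)
```

A caller would take the next value after each failed attempt and start a fresh generator once a server answers again.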
1677
1678 * timesyncd: add ugly bus calls to set NTP servers per-interface, for usage by NM
1679
1680 * merge ~/.local/share and ~/.local/lib into one similar to /usr/lib and /usr/share....
1681
1682 * add systemd.abort_on_kill or some other such flag to send SIGABRT instead of SIGKILL
1683 (throughout the codebase, not only PID1)
1684
1685 * drop nss-myhostname in favour of nss-resolve?
1686
1687 * resolved:
1688 - mDNS/DNS-SD
1689 - service registration
1690 - service/domain/types browsing
1691 - avahi compat
1692 - DNS-SD service registration from socket units
1693 - resolved should optionally register additional per-interface LLMNR
1694 names, so that for the container case we can establish the same name
1695 (maybe "host") for referencing the server, everywhere.
1696 - allow clients to request DNSSEC for a single lookup even if DNSSEC is off (?)
1697 - hook up resolved with machined-based address resolution
1698
1699 * refcounting in sd-resolve is borked
1700
1701 * add new gpt type for btrfs volumes
1702
1703 * generator that automatically discovers btrfs subvolumes, identifies their purpose based on some xattr on them.
1704
1705 * a way for container managers to turn off getty starting via $container_headless= or so...
1706
1707 * figure out a nice way how we can let the admin know what child/sibling unit causes cgroup membership for a specific unit
1708
1709 * For timer units: add some mechanisms so that timer units that trigger immediately on boot do not have the services
1710 they run added to the initial transaction and thus confuse Type=idle.
1711
1712 * add bus api to query unit file's X fields.
1713
1714 * gpt-auto-generator:
1715 - Define new partition type for encrypted swap? Support probed LUKS for encrypted swap?
1716 - Make /home automount rather than mount?
1717
1718 * add generator that pulls in systemd-network from containers when
1719 CAP_NET_ADMIN is set, more than the loopback device is defined, even
1720 when it is otherwise off
1721
1722 * MessageQueueMessageSize= (and suchlike) should use parse_iec_size().
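
For reference, a simplified stand-in for IEC size parsing, where suffixes are powers of 1024. This is an illustration only, not the actual parse_iec_size() code, and it covers just a few suffixes and no fractional values:

```python
_IEC_FACTORS = {"": 1, "B": 1, "K": 1024, "M": 1024**2, "G": 1024**3, "T": 1024**4}


def parse_iec_size_sketch(text: str) -> int:
    """Parse strings such as "8192", "4K" or "1M" into a byte count."""
    text = text.strip()
    digits = text.rstrip("BKMGT")
    suffix = text[len(digits):]
    if not (digits.isascii() and digits.isdigit()) or suffix not in _IEC_FACTORS:
        raise ValueError(f"invalid size: {text!r}")
    return int(digits) * _IEC_FACTORS[suffix]
```

With this, a setting value of "4K" would mean 4096 bytes rather than requiring the number to be spelled out.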
1723
1724 * implement Distribute= in socket units to allow running multiple
1725 service instances processing the listening socket, and open this up
1726 for ReusePort=
1727
1728 * cgroups:
1729 - implement per-slice CPUFairScheduling=1 switch
1730 - introduce high-level settings for RT budget, swappiness
1731 - how to reset dynamically changed unit cgroup attributes sanely?
1732 - when reloading configuration, apply new cgroup configuration
1733 - when recursively showing the cgroup hierarchy, optionally also show
1734 the hierarchies of child processes
1735 - add settings for cgroup.max.descendants and cgroup.max.depth,
1736 maybe use them for user@.service
1737
1738 * transient units:
1739 - add field to transient units that indicates whether systemd or somebody else saves/restores its settings, for integration with libvirt
1740
1741 * when we detect low battery and no AC on boot, show pretty splash and refuse boot
1742
1743 * libsystemd-journal, libsystemd-login, libudev: add calls to easily attach these objects to sd-event event loops
1744
1745 * be more careful what we export on the bus as (usec_t) 0 and (usec_t) -1
1746
1747 * rfkill,backlight: we probably should run the load tools inside of the udev rules so that the state is properly initialized by the time other software sees it
1748
1749 * After coming back from hibernation reset hibernation swap partition using the /dev/snapshot ioctl APIs
1750
1751 * If we try to find a unit via a dangling symlink, generate a clean
1752 error. Currently, we just ignore it and read the unit from the search
1753 path anyway.
1754
1755 * refuse boot if /usr/lib/os-release is missing or /etc/machine-id cannot be set up
1756
1757 * man: the documentation of Restart= currently is very misleading and suggests the tools from ExecStartPre= might get restarted.
1758
1759 * load .d/*.conf dropins for device units
1760
1761 * There's currently no way to cancel fsck (used to be possible via C-c or c on the console)
1762
1763 * add option to sockets to avoid activation. Instead just drop packets/connections, see http://cyberelk.net/tim/2012/02/15/portreserve-systemd-solution/
1764
1765 * make sure systemd-ask-password-wall does not shut down systemd-ask-password-console too early
1766
1767 * verify that the AF_UNIX sockets of a service in the fs still exist
1768 when we start a service in order to avoid confusion when a user
1769 assumes starting a service is enough to make it accessible
1770
1771 * Make it possible to set the keymap independently from the font on
1772 the kernel cmdline. Right now setting one resets also the other.
1773
1774 * add a dbus call to generate target from current state
1775
1776 * investigate whether the gnome pty helper should be moved into systemd, to provide cgroup support.
1777
1778 * dot output for --test showing the 'initial transaction'
1779
1780 * be able to specify a forced restart of service A, which service B depends on, in case B
1781 needs to be auto-respawned?
1782
1783 * pid1:
1784 - When logging about multiple units (stopping BoundTo units, conflicts, etc.),
1785 log both units as UNIT=, so that journalctl -u triggers on both.
1786 - generate better errors when people try to set transient properties
1787 that are not supported...
1788 https://lists.freedesktop.org/archives/systemd-devel/2015-February/028076.html
1789 - maybe introduce WantsMountsFor=? Usecase:
1790 https://lists.freedesktop.org/archives/systemd-devel/2015-January/027729.html
1791 - recreate systemd's D-Bus private socket file on SIGUSR2
1792 - move PAM code into its own binary
1793 - when we automatically restart a service, ensure we restart its rdeps, too.
1794 - hide PAM options in fragment parser when compile time disabled
1795 - Support --test based on current system state
1796 - If we show an error about a unit (such as not showing up) and it has no Description string, then show a description string generated from the reverse of unit_name_mangle().
1797 - after deserializing sockets in socket.c we should reapply sockopts and things
1798 - drop PID 1 reloading, only do reexecing (difficult: Reload()
1799 currently is properly synchronous, Reexec() is weird, because we
1800 cannot delay the response properly until we are back, so instead of
1801 being properly synchronous we just keep open the fd and close it
1802 when done. That means clients do not get a successful method reply,
1803 but rather a disconnect on success.)
1804 - when breaking cycles drop sysv services first, then services from /run, then from /etc, then from /usr
1805 - when a bus name of a service disappears from the bus make sure to queue further activation requests
1806 - maybe introduce CoreScheduling=yes/no to optionally set a PR_SCHED_CORE cookie, so that all
1807 processes in a service's cgroup share the same cookie and are guaranteed not to share SMT cores
1808 with other units https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/Documentation/admin-guide/hw-vuln/core-scheduling.rst
1809
1810 * unit files:
1811 - allow port=0 in .socket units
1812 - maybe introduce ExecRestartPre=
1813 - implement Register= switch in .socket units to enable registration
1814 in Avahi, RPC and other socket registration services.
1815 - allow Type=simple with PIDFile=
1816 https://bugzilla.redhat.com/show_bug.cgi?id=723942
1817 - allow writing multiple conditions in unit files on one line
1818 - introduce Type=pid-file
1819 - add a concept of RemainAfterExit= to scope units
1820 - Allow multiple ExecStart= for all Type= settings, so that we can cover rescue.service nicely
1821 - add verification of [Install] section to systemd-analyze verify
1822
1823 * timer units:
1824 - timer units should get the ability to trigger when DST changes
1825 - Modulate timer frequency based on battery state
1826
1827 * add libsystemd-password or so to query passwords during boot using the password agent logic
1828
1829 * clean up date formatting and parsing so that all absolute/relative timestamps we format can also be parsed
1830
1831 * on shutdown: move utmp, wall, audit logic all into PID 1 (or logind?), get rid of systemd-update-utmp-runlevel
1832
1833 * make repeated alt-ctrl-del presses print a dump
1834
1835 * currently x-systemd.timeout is lost in the initrd, since crypttab is copied into dracut, but fstab is not
1836
1837 * add a pam module that passes the hdd passphrase into the PAM stack and then expires it, for usage by gdm auto-login.
1838
1839 * add a pam module that on password changes updates any LUKS slot where the password matches
1840
1841 * test/:
1842 - add unit tests for config_parse_device_allow()
1843
1844 * seems that when we follow symlinks to units we prefer the symlink
1845 destination path over /etc and /usr. We should not do that. Instead
1846 /etc should always override /run+/usr and also any symlink
1847 destination.
1848
1849 * when isolating, try to figure out a way how we implicitly can order
1850 all units we stop before the isolating unit...
1851
1852 * teach ConditionKernelCommandLine= globs or regexes (in order to match foobar={no,0,off})
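
A minimal sketch of the regex variant of this idea, assuming the pattern has to match one whole word of the kernel command line. The function name `cmdline_matches` is hypothetical, and `shlex` only approximates the kernel's own quoting rules.

```python
import re
import shlex


def cmdline_matches(cmdline: str, pattern: str) -> bool:
    """Return True if any word of the command line fully matches the regex."""
    regex = re.compile(pattern)
    # shlex.split() keeps quoted values such as foo="a b" together as one word.
    return any(regex.fullmatch(word) for word in shlex.split(cmdline))
```

With this, a single pattern such as `foobar=(no|0|off)` covers all three spellings, while `xfoobar=0` does not match because the whole word is compared.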
1853
1854 * Make ConditionDirectoryNotEmpty= handle non-absolute paths as a search path, or add
1855 ConditionConfigSearchPathNotEmpty= or different syntax? See the discussion starting at
1856 https://github.com/systemd/systemd/pull/15109#issuecomment-607740136.
1857
1858 * BootLoaderSpec: Define a way how an installer can figure out whether a BLS
1859 compliant boot loader is installed.
1860
1861 * think about requeuing jobs when daemon-reload is issued? usecase:
1862 the initrd issues a reload after fstab from the host is accessible
1863 and we might want to requeue the mounts local-fs acquired through
1864 that automatically.
1865
1866 * systemd-inhibit: make taking delay locks useful: support sending SIGINT or SIGTERM on PrepareForSleep()
1867
1868 * remove any syslog support from log.c — we probably cannot do this before split-off udev is gone for good
1869
1870 * shutdown logging: store to EFI var, and store to USB stick?
1871
1872 * merge unit_kill_common() and unit_kill_context()
1873
1874 * add a dependency on standard-conf.xml and other included files to man pages
1875
1876 * MountFlags=shared acts as MountFlags=slave right now.
1877
1878 * properly handle loop back mounts via fstab, especially with regard to fsck/passno
1879
1880 * initialize the hostname from the fs label of /, if /etc/hostname does not exist?
1881
1882 * sd-bus:
1883 - EBADSLT handling
1884 - GetAllProperties() on a non-existing object does not result in a failure currently
1885 - port to sd-resolve for connecting to TCP dbus servers
1886 - see if we can introduce a new sd_bus_get_owner_machine_id() call to retrieve the machine ID of the machine of the bus itself
1887 - see if we can drop more message validation on the sending side
1888 - add API to clone sd_bus_message objects
1889 - longer term: priority inheritance
1890 - dbus spec updates:
1891 - NameLost/NameAcquired obsolete
1892 - path escaping
1893 - update systemd.special(7) to mention that dbus.socket is only about the compatibility socket now
1894
1895 * sd-event
1896 - allow multiple signal handlers per signal?
1897 - document chaining of signal handler for SIGCHLD and child handlers
1898 - define more intervals where we will shift wakeup intervals around in, 1h, 6h, 24h, ...
1899 - maybe support iouring as backend, so that we allow hooking read and write
1900 operations instead of IO ready events into event loops. See considerations
1901 here:
1902 http://blog.vmsplice.net/2020/07/rethinking-event-loop-integration-for.html
1903
1904 * dbus: when a unit failed to load (i.e. is in UNIT_ERROR state), we
1905 should be able to safely try another attempt when the bus call LoadUnit() is invoked.
1906
1907 * maybe do not install getty@tty1.service symlink in /etc but in /usr?
1908
1909 * print a nicer explanation if people use variable/specifier expansion in ExecStart= for the first word
1910
1911 * mount: turn dependency information from /proc/self/mountinfo into dependency information between systemd units.
1912
1913 * systemd-firstboot: make sure to always use chase_symlinks() before
1914 reading/writing files
1915
1916 * firstboot: make it useful to be run immediately after yum --installroot to set up a machine. (most specifically, make --copy-root-password work even if /etc/passwd already exists)
1917
1918 * EFI:
1919 - honor language efi variables for default language selection (if there are any?)
1920 - honor timezone efi variables for default timezone selection (if there are any?)
1921 - change bootctl to be backed by systemd-bootd to control temporary and persistent default boot goal plus efi variables
1922 * bootctl
1923 - recognize the case when not booted on EFI
1924
1925 * bootctl,sd-boot: actually honour the "architecture" key
1926
1927 * bootctl:
1928 - show whether UEFI audit mode is available
1929 - teach it to prepare an ESP wholesale, i.e. with mkfs.vfat invocation
1930 - teach it to copy in unified kernel images and maybe type #1 boot loader spec entries from host
1931
1932 * kernel-install:
1933 - optionally, support generating type #2 entries instead of type #1, including signing them
1934
1935 * logind:
1936 - logind: optionally, ignore idle-hint logic for autosuspend, block suspend as long as a session is around
1937 - logind: wakelock/opportunistic suspend support
1938 - Add pretty name for seats in logind
1939 - logind: allow showing logout dialog from system?
1940 - add Suspend() bus calls which take timestamps to fix double suspend issues when somebody hits suspend and closes laptop quickly.
1941 - if pam_systemd is invoked by su from a process that is outside of
1942 any session we should probably just become a NOP, since that's
1943 usually not a real user session but just some system code that just
1944 needs setuid().
1945 - logind: make the Suspend()/Hibernate() bus calls wait for
1946 the job to be completed before returning, so that clients can wait
1947 for "systemctl suspend" to finish to know when the suspending is
1948 complete.
1949 - logind: when the power button is pressed briefly, just pop up a
1950 logout dialog. If it is pressed for 1s, do the usual
1951 shutdown. Macs are the inspiration here.
1952 - expose "Locked" property on logind session objects
1953 - maybe allow configuration of the StopTimeout for session scopes
1954 - rename session scope so that it includes the UID. That way
1955 the session scope can be arranged freely in slices and we don't have
1956 to make assumptions about their slice anymore.
1957 - follow PropertiesChanged state more closely, to deal with quick logouts and
1958 relogins
1959 - (optionally?) spawn seat-manager@$SEAT.service whenever a seat shows up that has CanGraphical set
1960 - expose details of boot entries on the bus. In particular, it should be possible
1961 to query the list of boot entry titles that bootctl / sd-boot would show.
1962 Currently we only expose their identifiers.
1963
1964 * move multiseat vid/pid matches from logind udev rule to hwdb
1965
1966 * logind: rework pam_logind to also do a bus call in case of invocation from
1967 user@.service, which returns the XDG_RUNTIME_DIR value, and make this
1968 behaviour selectable via pam module option.
1969
1970 * delay activation of logind until somebody logs in, or when /dev/tty0 pulls it
1971 in or lingering is on (so that containers don't bother with it until PAM is used). also exit-on-idle
1972
1973 * journal:
1974 - consider introducing implicit _TTY= + _PPID= + _EUID= + _EGID= + _FSUID= + _FSGID= fields
1975 - journald: also get thread ID from client, plus thread name
1976 - journal: when waiting for journal additions in the client always sleep at least 1s or so, in order to minimize wakeups
1977 - add API to close/reopen/get fd for journal client fd in libsystemd-journal.
1978 - fall back to /dev/log based logging in libsystemd-journal, if we cannot log natively?
1979 - declare the local journal protocol stable in the wiki interface chart
1980 - sd-journal: speed up sd_journal_get_data() with transparent hash table in bg
1981 - journald: when dropping msgs due to ratelimit make sure to write
1982 "dropped %u messages" not only when we are about to print the next
1983 message that works, but already after a short timeout
1984 - check if we can make journalctl by default use --follow mode inside of less if called without args?
1985 - maybe add API to send pairs of iovecs via sd_journal_send
1986 - journal: add a setgid "systemd-journal" utility to invoke from libsystemd-journal, which passes fds via STDOUT and does PK access
1987 - journalctl: support negative filtering, i.e. FOOBAR!="waldo",
1988 and !FOOBAR for events without FOOBAR.
1989 - journal: store timestamp of journal_file_set_offline() in the header,
1990 so it is possible to display when the file was last synced.
1991 - journal-send.c, log.c: when the log socket is clogged, and we drop, count this and write a message about this when it gets unclogged again.
1992 - journal: find a way to allow dropping history early, based on priority, other rules
1993 - journal: When used on NFS, check payload hashes
1994 - journald: add kernel cmdline option to disable ratelimiting for debug purposes
1995 - refuse taking lower-case variable names in sd_journal_send() and friends.
1996 - journald: we currently rotate only after MaxUse+MaxFilesize has been reached.
1997 - journal: deal nicely with byte-by-byte copied files, especially with regard to the header
1998 - journal: sanely deal with entries which are larger than the individual file size, but where the components would fit
1999 - Replace utmp, wtmp, btmp, and lastlog completely with journal
2000 - journalctl: instead of --after-cursor= maybe have a --cursor=XYZ+1 syntax?
2001 - when a kernel driver logs in a tight loop, we should ratelimit that too.
2002 - journald: optionally, log debug messages to /run but everything else to /var
2003 - journald: when we drop syslog messages because the syslog socket is
2004 full, make sure to write how many messages are lost as first thing
2005 to syslog when it works again.
2006 - journald: allow per-priority and per-service retention times when rotating/vacuuming
2007 - journald: make use of uid-range.h to manage uid ranges to split
2008 journals in.
2009 - journalctl: add the ability to look for the most recent process of a binary. journalctl /usr/bin/X11 --pid=-1 or so...
2010 - improve journalctl performance by loading journal files
2011 lazily. Encode just enough information in the file name, so that we
2012 do not have to open it to know that it is not interesting for us, for
2013 the most common operations.
2014 - man: document that corrupted journal files are nothing to act on
2015 - rework journald sigbus stuff to use mutex
2016 - Set RLIMIT_NPROC for systemd-journal-xyz, and all other of our
2017 services that run under their own user ids, and use User= (but only
2018 in a world where userns is ubiquitous since otherwise we cannot
2019 invoke those daemons on the host AND in a container anymore). Also,
2020 if LimitNPROC= is used without User= we should warn and refuse
2021 operation.
2022 - journalctl --verify: don't show files that are currently being
2023 written to as FAIL, but instead show that they are being written to.
2024 - add journalctl -H that talks via ssh to a remote peer and passes through
2025 binary logs data
2026 - add a version of --merge which also merges /var/log/journal/remote
2027 - journalctl: -m should access container journals directly by enumerating
2028 them via machined, and also watch containers coming and going.
2029 Benefit: nspawn --ephemeral would start working nicely with the journal.
2030 - assign MESSAGE_ID to log messages about failed services
2031 - check if loop in decompress_blob_xz() is necessary
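
The negative filtering item in the journal list above (FOOBAR!="waldo" and !FOOBAR) could behave roughly as in the following sketch. It is illustrative only: entries are modelled as plain dicts with one value per field, the helper names are hypothetical, and it assumes that FOOBAR!="waldo" also matches entries that carry no FOOBAR field at all.

```python
def parse_match(expr):
    """Parse one match expression into (field, operator, value)."""
    if expr.startswith("!"):
        return expr[1:], "absent", None
    idx = expr.index("=")
    if idx > 0 and expr[idx - 1] == "!":
        return expr[:idx - 1], "ne", expr[idx + 1:].strip('"')
    return expr[:idx], "eq", expr[idx + 1:].strip('"')


def entry_matches(entry, exprs):
    """Return True if the entry satisfies every match expression."""
    for field, op, value in map(parse_match, exprs):
        if op == "absent":
            if field in entry:
                return False
        elif op == "ne":
            if entry.get(field) == value:
                return False
        elif entry.get(field) != value:
            return False
    return True
```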
2032
2033 * journald: support RFC3164 fully for the incoming syslog transport, see
2034 https://github.com/systemd/systemd/issues/19251#issuecomment-816601955
2035
2036 * Hook up journald's FSS logic with TPM2: seal the verification disk by
2037 time-based policy, so that the verification key can remain on host and be
2038 validated via TPM.
2039
2040 * build short web pages out of each catalog entry, build them along with man
2041 pages, and include hyperlinks to them in the journal output
2042
2043 * journald: do journal file writing out-of-process, with one writer process per
2044 client UID, so that synthetic hash table collisions can slow down a specific
2045 user's journal stream but not the others.
2046
2047 * tweak journald context caching. In addition to caching per-process attributes
2048 keyed by PID, cache per-cgroup attributes (i.e. the various xattrs we read)
2049 keyed by cgroup path, and guarded by ctime changes. This should provide us
2050 with a nice speed-up on services that have many processes running in the same
2051 cgroup.
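
A minimal sketch of such a cache, assuming attributes are looked up by cgroup path and a cached entry is reused only while the ctime of that path is unchanged. The class name and the `loader` callback (which stands in for reading the xattrs) are hypothetical.

```python
import os


class CgroupAttrCache:
    """Cache per-cgroup attributes, keyed by path and guarded by ctime."""

    def __init__(self, loader, stat=os.stat):
        self._loader = loader    # expensive lookup, e.g. reading xattrs
        self._stat = stat        # injectable so the cache can be tested
        self._entries = {}       # path -> (ctime_ns, attrs)

    def get(self, path):
        ctime = self._stat(path).st_ctime_ns
        cached = self._entries.get(path)
        if cached is not None and cached[0] == ctime:
            return cached[1]     # unchanged since last lookup: reuse
        attrs = self._loader(path)
        self._entries[path] = (ctime, attrs)
        return attrs
```

The cache still performs one stat call per lookup, but avoids repeating the attribute reads for every process in the same cgroup.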
2052
2053 * maybe add a call sd_journal_set_block_timeout() or so to set SO_SNDTIMEO for
2054 the sd-journal logging socket, and, if the timeout is set to 0, set
2055 O_NONBLOCK on it. That way people can control if and when to block for
2056 logging.
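
The intended socket behaviour can be sketched as follows. This is not the proposed C API, only an illustration of the two cases; the function name is hypothetical, and packing struct timeval as two native longs is an assumption that holds on common 64-bit Linux systems.

```python
import socket
import struct


def set_block_timeout(sock: socket.socket, timeout_usec: int) -> None:
    """Apply a send timeout; a timeout of 0 switches the socket to non-blocking."""
    if timeout_usec == 0:
        sock.setblocking(False)  # sets O_NONBLOCK on the underlying fd
        return
    sock.setblocking(True)
    # struct timeval { tv_sec, tv_usec }, assumed to be two native longs.
    tv = struct.pack("ll", timeout_usec // 1_000_000, timeout_usec % 1_000_000)
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_SNDTIMEO, tv)
```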
2057
2058 * journalctl: make sure -f ends when the container indicated by -M terminates
2059
2060 * journald: sigbus API via a signal-handler safe function that people may call
2061 from the SIGBUS handler
2062
2063 * add a test if all entries in the catalog are properly formatted.
2064 (Adding dashes in a catalog entry currently results in the catalog entry
2065 being silently skipped. journalctl --update-catalog must warn about this,
2066 and we should also have a unit test to check that all our messages are OK.)
2067
2068 * homed:
2069 - when user tries to log into record signed by unrecognized key, automatically add key to our chain after polkit auth
2070 - rollback when resize fails mid-operation
2071 - GNOME's side for forget key on suspend (requires rework so that lock screen runs outside of uid)
2072 - update LUKS password on login if we find there's a password that unlocks the JSON record but not the LUKS device.
2073 - create on activate?
2074 - properties: icon url?, preferred session type?, administrator bool (which translates to 'wheel' membership)?, address?, telephone?, vcard?, samba stuff?, parental controls?
2075 - communicate clearly when usb stick is safe to remove. probably involves
2076 beefing up logind to make pam session close hook synchronous and wait until
2077 systemd --user is shut down.
2078 - logind: maybe keep a "busy fd" as long as there's a non-released session around or the user@.service
2079 - maybe make automatic, read-only, time-based reflink-copies of LUKS disk
2080 images (and btrfs snapshots of subvolumes) (think: time machine)
2081 - distinguish destroy / remove (i.e. currently we can unregister a user, unregister+remove their home directory, but not just remove their home directory)
2082 - in systemd's PAMName= logic: query passwords with ssh-askpassword, so that we can make "loginctl set-linger" mode work
2083 - fingerprint authentication, pattern authentication, …
2084 - make sure "classic" user records can also be managed by homed
2085 - make size of $XDG_RUNTIME_DIR configurable in user record
2086 - query password from kernel keyring first
2087 - update even if record is "absent"
2088 - move acct mgmt stuff from pam_systemd_home to pam_systemd?
2089 - when "homectl --pkcs11-token-uri=" is used, synthesize ssh-authorized-keys records for all keys we have private keys on the stick for
2090 - make slice for users configurable (requires logind rework)
2091 - logind: populate auto-login list bus property from PKCS#11 token
2092 - when determining state of a LUKS home directory, check DM suspended sysfs file
2093 - when homed is in use, maybe start the user session manager in a mount namespace with MS_SLAVE,
2094 so that mounts propagate down but not up - eg, user A setting up a backup volume
2095 doesn't mean user B sees it
2096 - use credentials logic/TPM2 logic to store homed signing key
2097 - permit multiple user record signing keys to be used locally, and pick
2098 the right one for signing records automatically depending on a pre-existing
2099 signature
2100 - add a way to "adopt" a home directory, i.e. strip foreign signatures
2101 and insert a local signature instead.
2102 - as an extension to the directory+subvolume backend: if located on
2103 especially marked fs, then sync down password into LUKS header of that fs,
2104 and always verify passwords against it too. Bootstrapping is a problem
2105 though: if no one is logged in (or no other user even exists yet), how do you
2106 unlock the volume in order to create the first user and add the first pw.
2107 - support new FS_IOC_ADD_ENCRYPTION_KEY ioctl for setting up fscrypt
2108 - maybe pre-create ~/.cache as subvol so that it can have separate quota
2109 easily?
2110 - add a switch to homectl (maybe called --first-boot) where it will check if
2111 any non-system users exist, and if not prompts interactively for basic user
2112 info, mimicking systemd-firstboot. Then, place this in a service that runs
2113 after systemd-homed, but before gdm and friends, as a simple, barebones
2114 fallback logic to get a regular user created on uninitialized systems.
2115 - store PKCS#11 + FIDO2 token info in LUKS2 header, compatible with
2116 systemd-cryptsetup, so that it can unlock homed volumes
2117 - maybe make all *.home files owned by `systemd-home` user or so, so that we
2118 can easily set overall quota for all users
2119 - on login, if we can't fallocate initially, but rebalance is on, then allow
2120 login in discard mode, then immediately rebalance, then turn off discard
2121 - extend user records with optional "bulk" data. Specifically, a user
2122 avatar/photo or so. This data should be stored along with the user record,
2123 but probably shouldn't be part of the record itself, since it might be
2124 large.
2125
2126 * add a new switch --auto-definitions=yes/no or so to systemd-repart. If
2127 specified, synthesize a definition automatically if we can: enlarge last
2128 partition on disk, but only if it is marked for growing and not read-only.
2129
2130 * systemd-repart: read LUKS encryption key from $CREDENTIALS_DIRECTORY
2131
2132 * systemd-repart: add a switch to factory reset the partition table without
2133 immediately applying the new configuration again. i.e. --factory-reset=leave
2134 or so. (this is useful to factory reset an image, then putting it into
2135 another machine, ensuring that luks key is generated on new machine, not old)
2136
2137 * systemd-repart: support setting up dm-integrity with HMAC
2138
2139 * systemd-repart: maybe remove half-initialized image on failure. It fails
2140 if the output file exists, so a repeated invocation will usually fail if
2141 something goes wrong on the way.
2142
2143 * systemd-repart: drop pager mode on normal operation?
2144
2145 * systemd-repart: by default generate minimized partition tables (i.e. tables
2146 that only cover the space actually used, excluding any free space at the
2147 end), in order to maximize dd'ability. Requires libfdisk work, see
2148 https://github.com/karelzak/util-linux/issues/907
2149
2150 * systemd-repart: MBR partition table support. Care needs to be taken regarding
2151 Type=, so that partition definitions can sanely apply to both the GPT and the
2152 MBR case. Idea: accept syntax "Type=gpt:home mbr:0x83" for setting the types
2153 for the two partition types explicitly. And provide an internal mapping so
2154 that "Type=linux-generic" maps to the right types for both partition tables
2155 automatically.
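
A sketch of how the proposed syntax might be parsed, assuming whitespace-separated "table:type" words and a small alias table for generic names. The function name and the table are hypothetical; the two values shown for "linux-generic" are the well-known Linux filesystem type identifiers for GPT and MBR.

```python
# Hypothetical alias table: one generic name mapped to both table formats.
GENERIC_TYPES = {
    "linux-generic": {
        "gpt": "0fc63daf-8483-4772-8e79-3d69d8477de4",
        "mbr": "0x83",
    },
}


def parse_type(value: str) -> dict:
    """Parse 'gpt:home mbr:0x83' or a generic alias into a per-table mapping."""
    words = value.split()
    if len(words) == 1 and ":" not in words[0]:
        if words[0] not in GENERIC_TYPES:
            raise ValueError(f"unknown generic type: {words[0]}")
        return dict(GENERIC_TYPES[words[0]])
    result = {}
    for word in words:
        table, sep, type_name = word.partition(":")
        if not sep or table not in ("gpt", "mbr") or not type_name or table in result:
            raise ValueError(f"invalid type specification: {word}")
        result[table] = type_name
    return result
```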
2156
2157 * systemd-repart: allow sizing partitions as factor of available RAM, so that
2158 we can reasonably size swap partitions for hibernation.
2159
2160 * systemd-repart: allow boolean option that ensures that if existing partition
2161 doesn't exist within the configured size bounds the whole command fails. This
2162 is useful to implement ESP vs. XBOOTLDR schemes in installers: have one set
2163 of repart files for the case where ESP is large enough and one where it isn't
2164 and XBOOTLDR is added in instead. Then apply the former first, and if it
2165 fails to apply use the latter.
2166
2167 * systemd-repart: add per-partition option to never reuse existing partition
2168 and always create anew even if matching partition already exists.
2169
2170 * systemd-repart: add per-partition option to fail if partition already exists,
2171 i.e. is not added new. Similarly, add option to fail if partition does not exist yet.
2172
2173 * systemd-repart: allow disabling growing of specific partitions, or making
2174 them (think ESP: we don't ever want to grow it, since we cannot resize vfat)
2175 Also add option to disable operation via kernel command line.
2176
2177 * systemd-repart: make it a static checker during early boot for existence and
2178 absence of other partitions for trusted boot environments
2179
2180 * systemd-repart: add support for SD_GPT_FLAG_GROWFS also on real systems, i.e.
2181 generate some unit to actually enlarge the fs after growing the partition
2182 during boot.
2183
2184 * systemd-repart: do not print "Successfully resized …" when no change was done.
2185
2186 * document:
2187 - document that deps in [Unit] sections ignore Alias= fields in
2188 [Install] sections of other units, unless those units are disabled
2189 - man: clarify that time-sync.target is not only sysv compat but also useful otherwise. Same for similar targets
2190 - document that service reload may be implemented as service reexec
2191 - add a man page containing packaging guidelines and recommending usage of things like Documentation=, PrivateTmp=, PrivateNetwork= and ReadOnlyDirectories=/etc /usr.
2192 - document systemd-journal-flush.service properly
2193 - documentation: recommend to connect the timer units of a service to the service via Also= in [Install]
2194 - man: document the very specific env the shutdown drop-in tools live in
2195 - man: add more examples to man pages,
2196 - in particular an example how to do the equivalent of switching runlevels
2197 - man: maybe sort directives in man pages, and take sections from --help and apply them to man too
2198 - document root=gpt-auto properly
2199
2200 * systemctl:
2201 - add systemctl switch to dump transaction without executing it
2202 - Add a verbose mode to "systemctl start" and friends that explains what is being done or not done
2203 - "systemctl disable" on a static unit prints no message and does
2204 nothing. "systemctl enable" does nothing, and gives a bad message
2205 about it. Should fix both to print nice actionable messages.
2206 - print nice message from systemctl --failed if there are no entries shown, and hook that into ExecStartPre of rescue.service/emergency.service
2207 - add new command to systemctl: "systemctl system-reexec" which reexecs as many daemons as virtually possible
2208 - systemctl enable: fail if target to alias into does not exist? maybe show how many units are enabled afterwards?
2209 - systemctl: "Journal has been rotated since unit was started." message is misleading
2210 - systemctl status output should include list of triggering units and their status
2211
2212 * introduce an option (or replacement) for "systemctl show" that outputs all
2213 properties as JSON, similar to busctl's new JSON output. In contrast to that
2214 it should skip the variant type string though.
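
As a rough illustration of the desired shape, the sketch below converts the KEY=VALUE lines that "systemctl show" prints into a flat JSON object with no type information attached. The function name is hypothetical, and keeping every value as a string is a simplification; a native implementation would presumably emit proper JSON types.

```python
import json


def show_to_json(text: str) -> str:
    """Convert 'systemctl show' style KEY=VALUE lines into a JSON object."""
    props = {}
    for line in text.splitlines():
        # Split at the first '=' only, since values may contain '=' themselves.
        key, sep, value = line.partition("=")
        if not sep:
            continue  # skip lines that are not property assignments
        props[key] = value
    return json.dumps(props, indent=2, sort_keys=True)
```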
2215
2216 * add an explicit "vertical" mode to format-table, so that "systemctl
2217 status"-like outputs (i.e. with a series of field names left and values
2218 right) become genuine first class citizens, and we gain automatic, sane JSON
2219 output for them.
2220
2221 * Add a "systemctl list-units --by-slice" mode or so, which rearranges the
2222 output of "systemctl list-units" slightly by showing the tree structure of
2223 the slices, and the units attached to them.
2224
2225 * add "systemctl wait" or so, which does what "systemd-run --wait" does, but
2226 for all units. It should be both a way to pin units into memory as well as a
2227 way to retrieve their exit data.
2228
2229 * show whether a service has out-of-date configuration in "systemctl status" by
2230 using mtime data of ConfigurationDirectory=.
2231
2232 * "systemctl preset-all" should probably order the unit files it
2233 operates on lexicographically before starting to work, in order to
2234 ensure deterministic behaviour if two unit files conflict (like DMs
2235 do, for example)
2236
2237 * add "systemctl start -v foobar.service" that shows logs of a service
2238 while the start command runs. This is non-trivial to do without
2239 races though, since we should flush out all journal messages before
2240 returning from the "systemctl stop".
2241
2242 * systemctl: if some operation fails, show log output?
2243
2244 * Add a new verb "systemctl top"
2245
2246 * unit install:
2247 - "systemctl mask" should find all names by which a unit is accessible
2248 (i.e. by scanning for symlinks to it) and link them all to /dev/null
2249
2250 * nspawn:
2251 - emulate /dev/kmsg using CUSE and turn off the syslog syscall
2252 with seccomp. That should provide us with a useful log buffer that
2253 systemd can log to during early boot, and disconnect container logs
2254 from the kernel's logs.
2255 - as soon as networkd has a bus interface, hook up --network-interface=,
2256 --network-bridge= with networkd, to trigger netdev creation should an
2257 interface be missing
2258 - a nice way to boot up without machine id set, so that it is set at boot
2259 automatically for supporting --ephemeral. Maybe hash the host machine id
2260 together with the machine name to generate the machine id for the container
2261 - fix logic to always print a final newline on output.
2262 https://github.com/systemd/systemd/pull/272#issuecomment-113153176
2263 - should optionally support receiving WATCHDOG=1 messages from its payload
2264 PID 1...
2265 - optionally automatically add FORWARD rules to iptables whenever nspawn is
2266 running, remove them when shut down.
2267 - add support for sysext extensions, too. i.e. a new --extension= switch that
2268 takes one or more arguments, and applies the extensions already during
2269 startup.
2270 - when main nspawn supervisor process gets suspended due to SIGSTOP/SIGTTOU
2271 or so, freeze the payload too.
2272 - support time namespaces
2273 - on cgroupsv1 issue cgroup empty handler process based on host events, so
2274 that we make cgroup agent logic safe
2275 - add API to invoke binary in container, then use that as fallback in
2276 "machinectl shell"
2277 - make nspawn suitable for shell pipelines: instead of triggering a hangup
2278 when input is finished, send ^D, which synthesizes an EOF. Then wait for
2279 hangup or ^D before passing on the EOF.
2280 - greater control over selinux label?
2281 - support that /proc, /sys/, /dev are pre-mounted
2282 - maybe allow TPM passthrough, backed by swtpm, and measure --image= hash
2283 into its PCR 11, so that nspawn instances can be TPM enabled, and partake
2284 in measurements/remote attestation and such. swtpm would run outside of
2285 control of container, and ideally would itself bind its encryption keys to
2286 host TPM.
2287 - make boot assessment do something sensible in a container. i.e. send an
2288 sd_notify() from payload to container manager once boot-up is completed
2289 successfully, and use that in nspawn for dealing with boot counting,
2290 implemented in the partition table labels and directory names.
2291 - optionally set up nftables/iptables routes that forward UDP/TCP traffic on
2292 port 53 to resolved stub 127.0.0.54
2293 - maybe optionally insert .nspawn file as GPT partition into images, so that
2294 such container images are entirely stand-alone and can be updated as one.
2295 - The subreaper logic we currently have seems overly complex. We should
2296 investigate whether creating the inner child with CLONE_PARENT isn't better.
2297 - Reduce the number of sockets that are currently in use and just rely on one
2298 or two sockets.
2299 - Support running nspawn as an unprivileged user.
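
For the --ephemeral machine ID item in the nspawn list above, hashing the host machine ID together with the machine name could look like the following sketch. HMAC-SHA256 keyed by the host ID and the UUID v4 bit fix-ups are choices made here for illustration, not something the item prescribes, and the function name is hypothetical.

```python
import hashlib
import hmac


def derive_container_machine_id(host_machine_id: str, machine_name: str) -> str:
    """Derive a stable 128-bit machine ID from host machine ID and machine name."""
    key = bytes.fromhex(host_machine_id)
    digest = hmac.new(key, machine_name.encode("utf-8"), hashlib.sha256).digest()
    raw = bytearray(digest[:16])          # truncate to 128 bits
    raw[6] = (raw[6] & 0x0F) | 0x40       # mark as UUID version 4
    raw[8] = (raw[8] & 0x3F) | 0x80       # set the RFC 4122 variant bits
    return raw.hex()
```

The result is deterministic for a given host and machine name, so repeated ephemeral boots of the same container would see the same ID without it being stored in the image.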
2300
2301 * machined: add API to acquire UID range. add API to mount/dissect loopback
2302 file. Both protected by PK. Then make nspawn use these APIs to run
2303 unprivileged containers. i.e. push the truly privileged bits into machined,
2304 so that the client side can remain entirely unprivileged, without SUID or
2305 anything like that.
2306
2307 * machined:
2308 - add an API so that libvirt-lxc can inform us about network interfaces being
2309 removed or added to an existing machine
2310 - "machinectl migrate" or similar to copy a container from or to a
2311 different host, via ssh
2312 - introduce systemd-nspawn-ephemeral@.service, and hook it into
2313 "machinectl start" with a new --ephemeral switch
2314 - "machinectl status" should also show internal logs of the container in
2315 question
2316 - "machinectl history"
2317 - "machinectl diff"
2318 - "machinectl commit" that takes a writable snapshot of a tree, invokes a
2319 shell in it, and marks it read-only after use
2320
2321 * udev:
2322 - move to LGPL
2323 - kill scsi_id
2324 - add trigger --subsystem-match=usb/usb_device device
2325 - reimport udev db after MOVE events for devices without dev_t
2326 - re-enable ProtectClock= once only cgroupsv2 is supported.
2327 See f562abe2963bad241d34e0b308e48cf114672c84.
2328
2329 * coredump:
2330 - save coredump in Windows/Mozilla minidump format
2331 - when truncating coredumps, also log the full size that the process had, and make a metadata field so we can report truncated coredumps
2332 - add examples for other distros in ELF_PACKAGE_METADATA
2333
2334 * support crash reporting operation modes (https://live.gnome.org/GnomeOS/Design/Whiteboards/ProblemReporting)
2335
2336 * tmpfiles:
2337 - apply "x" on "D" too (see patch from William Douglas)
2338 - allow time-based cleanup in r and R too
2339 - instead of ignoring unknown fields, reject them.
2340 - creating new directories/subvolumes/fifos/device nodes
2341 should not follow symlinks. None of the other adjustment or creation
2342 calls follow symlinks.
2343 - add --test mode
2344 - teach tmpfiles.d q/Q logic something sensible in the context of XFS/ext4
2345 project quota
2346 - teach tmpfiles.d m/M to move / atomic move + symlink old -> new
2347 - add new line type for setting btrfs subvolume attributes (i.e. rw/ro)
2348 - add new line type for setting fcaps
2349
2350 * udev-link-config:
2351 - Make sure ID_PATH is always exported and complete for
2352 network devices where possible, so we can safely rely
2353 on Path= matching
2354
2355 * sd-rtnl:
2356 - add support for more attribute types
2357 - inbuilt piping support (essentially degenerate async)? see loopback-setup.c and other places
2358
2359 * networkd:
2360 - add more keys to [Route] and [Address] sections
2361 - add support for more DHCPv4 options (and, longer term, other kinds of dynamic config)
2362 - add reduced [Link] support to .network files
2363 - properly handle routerless dhcp leases
2364 - work with non-Ethernet devices
2365 - dhcp: do we allow configuring dhcp routes on interfaces that are not the one we got the dhcp info from?
2366 - the DHCP lease data (such as NTP/DNS) is still made available when
2367 a carrier is lost on a link. It should be removed instantly.
2368 - expose in the API the following bits:
2369 - option 15, domain name
2370 - option 12, hostname and/or option 81, fqdn
2371 - option 123, 144, geolocation
2372 - option 252, configure http proxy (PAC/wpad)
2373 - provide a way to define a per-network interface default metric value
2374 for all routes to it. possibly a second default for DHCP routes.
2375 - allow Name= to be specified repeatedly in the [Match] section. Maybe also
2376 support Name=foo*|bar*|baz ?
2377 - whenever uplink info changes, make DHCP server send out FORCERENEW
2378
2379 * in networkd, when matching device types, fix up DEVTYPE rubbish the kernel passes to us
2380
2381 * Figure out how to do unit tests of networkd's state serialization
2382
2383 * dhcp:
2384 - figure out how much we can increase Maximum Message Size
2385
2386 * dhcp6:
2387 - add functions to set previously stored IPv6 addresses on startup and get
2388 them at shutdown; store them in client->ia_na
2389 - write more test cases
2390 - implement reconfigure support, see 5.3., 15.11. and 22.20.
2391 - implement support for temporary addresses (IA_TA)
2392 - implement dhcpv6 authentication
2393 - investigate the usefulness of Confirm messages; i.e. are there any
2394 situations where the link changes without any loss in carrier detection
2395 or interface down?
2396 - some servers don't do rapid commit without a filled in IA_NA, verify
2397 this behavior
2398 - RouteTable= ?