]> git.ipfire.org Git - thirdparty/systemd.git/blob - docs/PORTABLE_SERVICES.md
docs: place all our markdown docs in rough categories
[thirdparty/systemd.git] / docs / PORTABLE_SERVICES.md
1 ---
2 title: Portable Services Introduction
3 category: Concepts
4 ---
5
6 # Portable Services Introduction
7
8 This systemd version includes a preview of the "portable service"
9 concept. "Portable Services" are supposed to be an incremental improvement over
10 traditional system services, making two specific facets of container management
11 available to system services more readily. Specifically:
12
13 1. The bundling of applications, i.e. packing up multiple services, their
14 binaries and all their dependencies in a single image, and running them
15 directly from it.
16
17 2. Stricter default security policies, i.e. sand-boxing of applications.
18
19 The primary tool for interfacing with "portable services" is the new
20 "portablectl" program. It's currently shipped in /usr/lib/systemd/portablectl
21 (i.e. not in the `$PATH`), since it's not yet considered part of the officially
22 supported systemd interfaces — it's a preview still after all.
23
24 Portable services don't bring anything inherently new to the table. All they do
25 is put together known concepts in a slightly nicer way to cover a specific set
26 of use-cases in a nicer way.
27
28 ## So, what *is* a "Portable Service"?
29
30 A portable service is ultimately just an OS tree, either inside of a directory
31 tree, or inside a raw disk image containing a Linux file system. This tree is
32 called the "image". It can be "attached" or "detached" from the system. When
33 "attached" specific systemd units from the image are made available on the host
34 system, then behaving pretty much exactly like locally installed system
35 services. When "detached" these units are removed again from the host, leaving
36 no artifacts around (except maybe messages they might have logged).
37
38 The OS tree/image can be created with any tool of your choice. For example, you
39 can use `dnf --installroot=` if you like, or `debootstrap`, the image format is
40 entirely generic, and doesn't have to carry any specific metadata beyond what
41 distribution images carry anyway. Or to say this differently: the image format
42 doesn't define any new metadata as unit files and OS tree directories or disk
43 images are already sufficient, and pretty universally available these days. One
44 particularly nice tool for creating suitable images is
45 [mkosi](https://github.com/systemd/mkosi), but many other existing tools will
46 do too.
47
48 If you so will, "Portable Services" are a nicer way to manage chroot()
49 environments, with better security, tooling and behavior.
50
51 ## Where's the difference to a "Container"?
52
53 "Container" is a very vague term, after all it is used for
54 systemd-nspawn/LXC-type OS containers, for Docker/rkt-like micro service
55 containers, and even certain 'lightweight' VM runtimes.
56
57 The "portable service" concept ultimately will not provide a fully isolated
58 environment to the payload, like containers mostly intend to. Instead they are
59 from the beginning more alike regular system services, can be controlled with
60 the same tools, are exposed the same way in all infrastructure and so on. Their
61 main difference is that the use a different root directory than the rest of the
62 system. Hence, the intention is not to run code in a different, isolated world
63 from the host — like most containers would do it —, but to run it in the same
64 world, but with stricter access controls on what the service can see and do.
65
66 As one point of differentiation: as programs run as "portable services" are
67 pretty much regular system services, they won't run as PID 1 (like Docker would
68 do it), but as normal process. A corollary of that is that they aren't supposed
69 to manage anything in their own environment (such as the network) as the
70 execution environment is mostly shared with the rest of the system.
71
72 The primary focus use-case of "portable services" is to extend the host system
73 with encapsulated extensions, but provide almost full integration with the rest
74 of the system, though possibly restricted by effective security knobs. This
75 focus includes system extensions otherwise sometimes called "super-privileged
76 containers".
77
78 Note that portable services are only available for system services, not for
79 user services. i.e. the functionality cannot be used for the stuff
80 bubblewrap/flatpak is focusing on.
81
82 ## Mode of Operation
83
84 If you have portable service image, maybe in a raw disk image called
85 `foobar_0.7.23.raw`, then attaching the services to the host is as easy as:
86
87 ```
88 # /usr/lib/systemd/portablectl attach foobar_0.7.23.raw
89 ```
90
91 This command does the following:
92
93 1. It dissects the image, checks and validates the `/etc/os-release`
94 (or `/usr/lib/os-release`, see below) data of the image, and looks for
95 all included unit files.
96
97 2. It copies out all unit files with a suffix of `.service`, `.socket`,
98 `.target`, `.timer` and `.path`. whose name begins with the image's name
99 (with the .raw removed), truncated at the first underscore (if there is
100 one). This prefix name generated from the image name must be followed by a
101 ".", "-" or "@" character in the unit name. Or in other words, given the
102 image name of `foobar_0.7.23.raw` all unit files matching
103 `foobar-*.{service|socket|target|timer|path}`,
104 `foobar@.{service|socket|target|timer|path}` as well as
105 `foobar.*.{service|socket|target|timer|path}` and
106 `foobar.{service|socket|target|timer|path}` are copied out. These unit files
107 are placed in `/etc/systemd/system.attached/` (which is part of the normal
108 unit file search path of PID 1, and thus loaded exactly like regular unit
109 files). Within the images the unit files are looked for at the usual
110 locations, i.e. in `/usr/lib/systemd/system/` and `/etc/systemd/system/` and
111 so on, relative to the image's root.
112
113 3. For each such unit file a drop-in file is created. Let's say
114 `foobar-waldo.service` was one of the unit files copied to
115 `/etc/systemd/system.attached/`, then a drop-in file
116 `/etc/systemd/system.attached/foobar-waldo.service.d/20-portable.conf` is
117 created, containing a few lines of additional configuration:
118
119 ```
120 [Service]
121 RootImage=/path/to/foobar.raw
122 Environment=PORTABLE=foobar
123 LogExtraFields=PORTABLE=foobar
124 ```
125
126 4. For each such unit a "profile" drop-in is linked in. This "profile" drop-in
127 generally contains security options that lock down the service. By default
128 the `default` profile is used, which provides a medium level of
129 security. There's also `trusted` which runs the service at the highest
130 privileges, i.e. host's root and everything. The `strict` profile comes with
131 the toughest security restrictions. Finally, `nonetwork` is like `default`
132 but without network access. Users may define their own profiles too (or
133 modify the existing ones)
134
135 And that's already it.
136
137 Note that the images need to stay around (and the same location) as long as the
138 portable service is attached. If an image is moved, the `RootImage=` line
139 written to the unit drop-in would point to an non-existing place, and break the
140 logic.
141
142 The `portablectl detach` command executes the reverse operation: it looks for
143 the drop-ins and the unit files associated with the image, and removes them
144 again.
145
146 Note that `portable attach` won't enable or start any of the units it copies
147 out. This still has to take place in a second, separate step. (That said We
148 might add options to do this automatically later on.).
149
150 ## Requirements on Images
151
152 Note that portable services don't introduce any new image format, but most OS
153 images should just work the way they are. Specifically, the following
154 requirements are made for an image that can be attached/detached with
155 `portablectl`.
156
157 1. It must contain an executable that shall be invoked, along with all its
158 dependencies. If binary code, the code needs to be compiled for an
159 architecture compatible with the host.
160
161 2. The image must either be a plain sub-directory (or btrfs subvolume)
162 containing the binaries and its dependencies in a classic Linux OS tree, or
163 must be a raw disk image either containing only one, naked file system, or
164 an image with a partition table understood by the Linux kernel with only a
165 single partition defined, or alternatively, a GPT partition table with a set
166 of properly marked partitions following the [Discoverable Partitions
167 Specification](https://www.freedesktop.org/wiki/Specifications/DiscoverablePartitionsSpec/).
168
169 3. The image must at least contain one matching unit file, with the right name
170 prefix and suffix (see above). The unit file is searched in the usual paths,
171 i.e. primarily /etc/systemd/system/ and /usr/lib/systemd/system/ within the
172 image. (The implementation will check a couple of other paths too, but it's
173 recommended to use these two paths.)
174
175 4. The image must contain an os-release file, either in `/etc/os-release` or
176 `/usr/lib/os-release`. The file should follow the standard format.
177
178 5. The image must contain the files `/etc/resolv.conf` and `/etc/machine-id`
179 (empty files are ok), they will be bind mounted from the host at runtime.
180
181 6. The image must contain directories `/proc/`, `/sys/`, `/dev/`, `/run/`,
182 `/tmp/`, `/var/tmp/` that can be mounted over with the corresponding version
183 from the host.
184
185 7. The OS might require other files or directories to be in place. For example,
186 if the image is built based on glibc, the dynamic loader needs to be
187 available in `/lib/ld-linux.so.2` or `/lib64/ld-linux-x86-64.so.2` (or
188 similar, depending on architecture), and if the distribution implements a
189 merged `/usr/` tree, this means `/lib` and/or `/lib64` need to be symlinks
190 to their respective counterparts below `/usr/`. For details see your
191 distribution's documentation.
192
193 Note that images created by tools such as `debootstrap`, `dnf --installroot=`
194 or `mkosi` generally qualify for all of the above in one way or another. If you
195 wonder what the most minimal image would be that complies with the requirements
196 above, it could consist of this:
197
198 ```
199 /usr/bin/minimald # a statically compiled binary
200 /usr/lib/systemd/system/minimal-test.service # the unit file for the service, with ExecStart=/usr/bin/minimald
201 /usr/lib/os-release # an os-release file explaining what this is
202 /etc/resolv.conf # empty file to mount over with host's version
203 /etc/machine-id # ditto
204 /proc/ # empty directory to use as mount point for host's API fs
205 /sys/ # ditto
206 /dev/ # ditto
207 /run/ # ditto
208 /tmp/ # ditto
209 /var/tmp/ # ditto
210 ```
211
212 And that's it.
213
214 Note that qualifying images do not have to contain an init system of their
215 own. If they do, it's fine, it will be ignored by the portable service logic,
216 but they generally don't have to, and it might make sense to avoid any, to keep
217 images minimal.
218
219 If the image is writable, and some of the files or directories that are
220 overmounted from the host do not exist yet they are automatically created. On
221 read-only, immutable images (e.g. squashfs images) all files and directories to
222 over-mount must exist already.
223
224 Note that as no new image format or metadata is defined, it's very
225 straight-forward to define images than can be made use of it a number of
226 different ways. For example, by using `mkosi -b` you can trivially build a
227 single, unified image that:
228
229 1. Can be attached as portable service, to run any container services natively
230 on the host.
231
232 2. Can be run as OS container, using `systemd-nspawn`, by booting the image
233 with `systemd-nspawn -i -b`.
234
235 3. Can be booted directly as VM image, using a generic VM executor such as
236 `virtualbox`/`qemu`/`kvm`
237
238 4. Can be booted directly on bare-metal systems.
239
240 Of course, to facilitate 2, 3 and 4 you need to include an init system in the
241 image. To facility 3 and 4 you also need to include a boot loader in the
242 image. As mentioned `mkosi -b` takes care of all of that for you, but any other
243 image generator should work too.
244
245 ## Execution Environment
246
247 Note that the code in portable service images is run exactly like regular
248 services. Hence there's no new execution environment to consider. Oh, unlike
249 Docker would do it, as these are regular system services they aren't run as PID
250 1 either, but with regular PID values.
251
252 ## Access to host resources
253
254 If services shipped with this mechanism shall be able to access host resources
255 (such as files or AF_UNIX sockets for IPC), use the normal `BindPaths=` and
256 `BindReadOnlyPaths=` settings in unit files to mount them in. In fact the
257 `default` profile mentioned above makes use of this to ensure
258 `/etc/resolv.conf`, the D-Bus system bus socket or write access to the logging
259 subsystem are available to the service.
260
261 ## Instantiation
262
263 Sometimes it makes sense to instantiate the same set of services multiple
264 times. The portable service concept does not introduce a new logic for this. It
265 is recommended to use the regular unit templating of systemd for this, i.e. to
266 include template units such as `foobar@.service`, so that instantiation is as
267 simple as:
268
269 ```
270 # /usr/lib/systemd/portablectl attach foobar_0.7.23.raw
271 # systemctl enable --now foobar@instancea.service
272 # systemctl enable --now foobar@instanceb.service
273
274 ```
275
276 The benefit of this approach is that templating works exactly the same for
277 units shipped with the OS itself as for attached portable services.
278
279 ## Immutable images with local data
280
281 It's a good idea to keep portable service images read-only during normal
282 operation. In fact all but the `trusted` profile will default to this kind of
283 behaviour, by setting the `ProtectSystem=strict` option. In this case writable
284 service data may be placed on the host file system. Use `StateDirectory=` in
285 the unit files to enable such behaviour and add a local data directory to the
286 services copied onto the host.