]> git.ipfire.org Git - thirdparty/systemd.git/blob - man/systemd-nspawn.xml
nspawn: make -U a tiny bit smarter
[thirdparty/systemd.git] / man / systemd-nspawn.xml
1 <?xml version='1.0'?> <!--*- Mode: nxml; nxml-child-indent: 2; indent-tabs-mode: nil -*-->
2 <!DOCTYPE refentry PUBLIC "-//OASIS//DTD DocBook XML V4.2//EN"
3 "http://www.oasis-open.org/docbook/xml/4.2/docbookx.dtd">
4
5 <!--
6 This file is part of systemd.
7
8 Copyright 2010 Lennart Poettering
9
10 systemd is free software; you can redistribute it and/or modify it
11 under the terms of the GNU Lesser General Public License as published by
12 the Free Software Foundation; either version 2.1 of the License, or
13 (at your option) any later version.
14
15 systemd is distributed in the hope that it will be useful, but
16 WITHOUT ANY WARRANTY; without even the implied warranty of
17 MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
18 Lesser General Public License for more details.
19
20 You should have received a copy of the GNU Lesser General Public License
21 along with systemd; If not, see <http://www.gnu.org/licenses/>.
22 -->
23
24 <refentry id="systemd-nspawn"
25 xmlns:xi="http://www.w3.org/2001/XInclude">
26
27 <refentryinfo>
28 <title>systemd-nspawn</title>
29 <productname>systemd</productname>
30
31 <authorgroup>
32 <author>
33 <contrib>Developer</contrib>
34 <firstname>Lennart</firstname>
35 <surname>Poettering</surname>
36 <email>lennart@poettering.net</email>
37 </author>
38 </authorgroup>
39 </refentryinfo>
40
41 <refmeta>
42 <refentrytitle>systemd-nspawn</refentrytitle>
43 <manvolnum>1</manvolnum>
44 </refmeta>
45
46 <refnamediv>
47 <refname>systemd-nspawn</refname>
48 <refpurpose>Spawn a namespace container for debugging, testing and building</refpurpose>
49 </refnamediv>
50
51 <refsynopsisdiv>
52 <cmdsynopsis>
53 <command>systemd-nspawn</command>
54 <arg choice="opt" rep="repeat">OPTIONS</arg>
55 <arg choice="opt"><replaceable>COMMAND</replaceable>
56 <arg choice="opt" rep="repeat">ARGS</arg>
57 </arg>
58 </cmdsynopsis>
59 <cmdsynopsis>
60 <command>systemd-nspawn</command>
61 <arg choice="plain">--boot</arg>
62 <arg choice="opt" rep="repeat">OPTIONS</arg>
63 <arg choice="opt" rep="repeat">ARGS</arg>
64 </cmdsynopsis>
65 </refsynopsisdiv>
66
67 <refsect1>
68 <title>Description</title>
69
70 <para><command>systemd-nspawn</command> may be used to run a
71 command or OS in a light-weight namespace container. In many ways
72 it is similar to
73 <citerefentry project='man-pages'><refentrytitle>chroot</refentrytitle><manvolnum>1</manvolnum></citerefentry>,
74 but more powerful since it fully virtualizes the file system
75 hierarchy, as well as the process tree, the various IPC subsystems
76 and the host and domain name.</para>
77
78 <para><command>systemd-nspawn</command> limits access to various
79 kernel interfaces in the container to read-only, such as
80 <filename>/sys</filename>, <filename>/proc/sys</filename> or
81 <filename>/sys/fs/selinux</filename>. Network interfaces and the
82 system clock may not be changed from within the container. Device
83 nodes may not be created. The host system cannot be rebooted and
84 kernel modules may not be loaded from within the container.</para>
85
86 <para>Note that even though these security precautions are taken
87 <command>systemd-nspawn</command> is not suitable for fully secure
88 container setups. Many of the security features may be
89 circumvented and are hence primarily useful to avoid accidental
90 changes to the host system from the container.</para>
91
92 <para>In contrast to
93 <citerefentry project='man-pages'><refentrytitle>chroot</refentrytitle><manvolnum>1</manvolnum></citerefentry> <command>systemd-nspawn</command>
94 may be used to boot full Linux-based operating systems in a
95 container.</para>
96
97 <para>Use a tool like
98 <citerefentry project='mankier'><refentrytitle>dnf</refentrytitle><manvolnum>8</manvolnum></citerefentry>,
99 <citerefentry project='die-net'><refentrytitle>debootstrap</refentrytitle><manvolnum>8</manvolnum></citerefentry>,
100 or
101 <citerefentry project='archlinux'><refentrytitle>pacman</refentrytitle><manvolnum>8</manvolnum></citerefentry>
102 to set up an OS directory tree suitable as file system hierarchy
103 for <command>systemd-nspawn</command> containers.</para>
104
105 <para>Note that <command>systemd-nspawn</command> will mount file
106 systems private to the container to <filename>/dev</filename>,
107 <filename>/run</filename> and similar. These will not be visible
108 outside of the container, and their contents will be lost when the
109 container exits.</para>
110
111 <para>Note that running two <command>systemd-nspawn</command>
112 containers from the same directory tree will not make processes in
113 them see each other. The PID namespace separation of the two
114 containers is complete and the containers will share very few
115 runtime objects except for the underlying file system. Use
116 <citerefentry><refentrytitle>machinectl</refentrytitle><manvolnum>1</manvolnum></citerefentry>'s
117 <command>login</command> command to request an additional login
118 prompt in a running container.</para>
119
120 <para><command>systemd-nspawn</command> implements the
121 <ulink
122 url="http://www.freedesktop.org/wiki/Software/systemd/ContainerInterface">Container
123 Interface</ulink> specification.</para>
124
125 <para>As a safety check <command>systemd-nspawn</command> will
126 verify the existence of <filename>/usr/lib/os-release</filename>
127 or <filename>/etc/os-release</filename> in the container tree
128 before starting the container (see
129 <citerefentry><refentrytitle>os-release</refentrytitle><manvolnum>5</manvolnum></citerefentry>).
130 It might be necessary to add this file to the container tree
131 manually if the OS of the container is too old to contain this
132 file out-of-the-box.</para>
133 </refsect1>
134
135 <refsect1>
136 <title>Options</title>
137
138 <para>If option <option>-b</option> is specified, the arguments
139 are used as arguments for the init binary. Otherwise,
140 <replaceable>COMMAND</replaceable> specifies the program to launch
141 in the container, and the remaining arguments are used as
142 arguments for this program. If <option>-b</option> is not used and
143 no arguments are specified, a shell is launched in the
144 container.</para>
145
146 <para>The following options are understood:</para>
147
148 <variablelist>
149 <varlistentry>
150 <term><option>-D</option></term>
151 <term><option>--directory=</option></term>
152
153 <listitem><para>Directory to use as file system root for the
154 container.</para>
155
156 <para>If neither <option>--directory=</option>, nor
157 <option>--image=</option> is specified the directory is
158 determined by searching for a directory named the same as the
159 machine name specified with <option>--machine=</option>. See
160 <citerefentry><refentrytitle>machinectl</refentrytitle><manvolnum>1</manvolnum></citerefentry>
161 section "Files and Directories" for the precise search path.</para>
162
163 <para>If neither <option>--directory=</option>,
164 <option>--image=</option>, nor <option>--machine=</option>
165 are specified, the current directory will
166 be used. May not be specified together with
167 <option>--image=</option>.</para></listitem>
168 </varlistentry>
169
170 <varlistentry>
171 <term><option>--template=</option></term>
172
173 <listitem><para>Directory or <literal>btrfs</literal>
174 subvolume to use as template for the container's root
175 directory. If this is specified and the container's root
176 directory (as configured by <option>--directory=</option>)
177 does not yet exist it is created as <literal>btrfs</literal>
178 subvolume and populated from this template tree. Ideally, the
179 specified template path refers to the root of a
180 <literal>btrfs</literal> subvolume, in which case a simple
181 copy-on-write snapshot is taken, and populating the root
182 directory is instant. If the specified template path does not
183 refer to the root of a <literal>btrfs</literal> subvolume (or
184 not even to a <literal>btrfs</literal> file system at all),
185 the tree is copied, which can be substantially more
186 time-consuming. Note that if this option is used the
187 container's root directory (in contrast to the template
188 directory!) must be located on a <literal>btrfs</literal> file
189 system, so that the <literal>btrfs</literal> subvolume may be
190 created. May not be specified together with
191 <option>--image=</option> or
192 <option>--ephemeral</option>.</para>
193
194 <para>Note that this switch leaves host name, machine ID and
195 all other settings that could identify the instance
196 unmodified.</para></listitem>
197 </varlistentry>
198
199 <varlistentry>
200 <term><option>-x</option></term>
201 <term><option>--ephemeral</option></term>
202
203 <listitem><para>If specified, the container is run with a
204 temporary <literal>btrfs</literal> snapshot of its root
205 directory (as configured with <option>--directory=</option>),
206 that is removed immediately when the container terminates.
207 This option is only supported if the root file system is
208 <literal>btrfs</literal>. May not be specified together with
209 <option>--image=</option> or
210 <option>--template=</option>.</para>
211 <para>Note that this switch leaves host name, machine ID and
212 all other settings that could identify the instance
213 unmodified.</para></listitem>
214 </varlistentry>
215
216 <varlistentry>
217 <term><option>-i</option></term>
218 <term><option>--image=</option></term>
219
220 <listitem><para>Disk image to mount the root directory for the
221 container from. Takes a path to a regular file or to a block
222 device node. The file or block device must contain
223 either:</para>
224
225 <itemizedlist>
226 <listitem><para>An MBR partition table with a single
227 partition of type 0x83 that is marked
228 bootable.</para></listitem>
229
230 <listitem><para>A GUID partition table (GPT) with a single
231 partition of type
232 0fc63daf-8483-4772-8e79-3d69d8477de4.</para></listitem>
233
234 <listitem><para>A GUID partition table (GPT) with a marked
235 root partition which is mounted as the root directory of the
236 container. Optionally, GPT images may contain a home and/or
237 a server data partition which are mounted to the appropriate
238 places in the container. All these partitions must be
239 identified by the partition types defined by the <ulink
240 url="http://www.freedesktop.org/wiki/Specifications/DiscoverablePartitionsSpec/">Discoverable
241 Partitions Specification</ulink>.</para></listitem>
242 </itemizedlist>
243
244 <para>Any other partitions, such as foreign partitions, swap
245 partitions or EFI system partitions are not mounted. May not
246 be specified together with <option>--directory=</option>,
247 <option>--template=</option> or
248 <option>--ephemeral</option>.</para></listitem>
249 </varlistentry>
250
251 <varlistentry>
252 <term><option>-a</option></term>
253 <term><option>--as-pid2</option></term>
254
255 <listitem><para>Invoke the shell or specified program as process ID (PID) 2 instead of PID 1 (init). By
256 default, if neither this option nor <option>--boot</option> is used, the selected binary is run as process with
257 PID 1, a mode only suitable for programs that are aware of the special semantics that the process with PID 1
258 has on UNIX. For example, it needs to reap all processes reparented to it, and should implement
259 <command>sysvinit</command> compatible signal handling (specifically: it needs to reboot on SIGINT, reexecute
260 on SIGTERM, reload configuration on SIGHUP, and so on). With <option>--as-pid2</option> a minimal stub init
261 process is run as PID 1 and the selected binary is executed as PID 2 (and hence does not need to implement any
262 special semantics). The stub init process will reap processes as necessary and react appropriately to
263 signals. It is recommended to use this mode to invoke arbitrary commands in containers, unless they have been
264 modified to run correctly as PID 1. Or in other words: this switch should be used for pretty much all commands,
265 except when the command refers to an init or shell implementation, as these are generally capable of running
266 correctly as PID 1. This option may not be combined with <option>--boot</option> or
267 <option>--share-system</option>.</para>
268 </listitem>
269 </varlistentry>
270
271 <varlistentry>
272 <term><option>-b</option></term>
273 <term><option>--boot</option></term>
274
275 <listitem><para>Automatically search for an init binary and invoke it as PID 1, instead of a shell or a user
276 supplied program. If this option is used, arguments specified on the command line are used as arguments for the
277 init binary. This option may not be combined with <option>--as-pid2</option> or
278 <option>--share-system</option>.</para>
279
280 <para>The following table explains the different modes of invocation and relationship to
281 <option>--as-pid2</option> (see above):</para>
282
283 <table>
284 <title>Invocation Mode</title>
285 <tgroup cols='2' align='left' colsep='1' rowsep='1'>
286 <colspec colname="switch" />
287 <colspec colname="explanation" />
288 <thead>
289 <row>
290 <entry>Switch</entry>
291 <entry>Explanation</entry>
292 </row>
293 </thead>
294 <tbody>
295 <row>
296 <entry>Neither <option>--as-pid2</option> nor <option>--boot</option> specified</entry>
297 <entry>The passed parameters are interpreted as the command line, which is executed as PID 1 in the container.</entry>
298 </row>
299
300 <row>
301 <entry><option>--as-pid2</option> specified</entry>
302 <entry>The passed parameters are interpreted as the command line, which is executed as PID 2 in the container. A stub init process is run as PID 1.</entry>
303 </row>
304
305 <row>
306 <entry><option>--boot</option> specified</entry>
307 <entry>An init binary as automatically searched and run as PID 1 in the container. The passed parameters are used as invocation parameters for this process.</entry>
308 </row>
309
310 </tbody>
311 </tgroup>
312 </table>
313 </listitem>
314 </varlistentry>
315
316 <varlistentry>
317 <term><option>--chdir=</option></term>
318
319 <listitem><para>Change to the specified working directory before invoking the process in the container. Expects
320 an absolute path in the container's file system namespace.</para></listitem>
321 </varlistentry>
322
323 <varlistentry>
324 <term><option>-u</option></term>
325 <term><option>--user=</option></term>
326
327 <listitem><para>After transitioning into the container, change
328 to the specified user-defined in the container's user
329 database. Like all other systemd-nspawn features, this is not
330 a security feature and provides protection against accidental
331 destructive operations only.</para></listitem>
332 </varlistentry>
333
334 <varlistentry>
335 <term><option>-M</option></term>
336 <term><option>--machine=</option></term>
337
338 <listitem><para>Sets the machine name for this container. This
339 name may be used to identify this container during its runtime
340 (for example in tools like
341 <citerefentry><refentrytitle>machinectl</refentrytitle><manvolnum>1</manvolnum></citerefentry>
342 and similar), and is used to initialize the container's
343 hostname (which the container can choose to override,
344 however). If not specified, the last component of the root
345 directory path of the container is used, possibly suffixed
346 with a random identifier in case <option>--ephemeral</option>
347 mode is selected. If the root directory selected is the host's
348 root directory the host's hostname is used as default
349 instead.</para></listitem>
350 </varlistentry>
351
352 <varlistentry>
353 <term><option>--uuid=</option></term>
354
355 <listitem><para>Set the specified UUID for the container. The
356 init system will initialize
357 <filename>/etc/machine-id</filename> from this if this file is
358 not set yet. Note that this option takes effect only if
359 <filename>/etc/machine-id</filename> in the container is
360 unpopulated.</para></listitem>
361 </varlistentry>
362
363 <varlistentry>
364 <term><option>--slice=</option></term>
365
366 <listitem><para>Make the container part of the specified
367 slice, instead of the default
368 <filename>machine.slice</filename>. This is only applies if
369 the machine is run in its own scope unit, i.e. if
370 <option>--keep-unit</option> is not used.</para>
371 </listitem>
372 </varlistentry>
373
374 <varlistentry>
375 <term><option>--property=</option></term>
376
377 <listitem><para>Set a unit property on the scope unit to
378 register for the machine. This only applies if the machine is
379 run in its own scope unit, i.e. if
380 <option>--keep-unit</option> is not used. Takes unit property
381 assignments in the same format as <command>systemctl
382 set-property</command>. This is useful to set memory limits
383 and similar for machines.</para>
384 </listitem>
385 </varlistentry>
386
387 <varlistentry>
388 <term><option>--private-users=</option></term>
389
390 <listitem><para>Controls user namespacing. If enabled, the container will run with its own private set of UNIX
391 user and group ids (UIDs and GIDs). This involves mapping the private UIDs/GIDs used in the container (starting
392 with the container's root user 0 and up) to a range of UIDs/GIDs on the host that are not used for other
393 purposes (usually in the range beyond the host's UID/GID 65536). The parameter may be specified as follows:</para>
394
395 <orderedlist>
396 <listitem><para>The value <literal>no</literal> turns off user namespacing. This is the default.</para></listitem>
397
398 <listitem><para>The value <literal>yes</literal> (or the omission of a parameter) turns on user
399 namespacing. The UID/GID range to use is determined automatically from the file ownership of the root
400 directory of the container's directory tree. To use this option, make sure to prepare the directory tree in
401 advance, and ensure that all files and directories in it are owned by UIDs/GIDs in the range you'd like to
402 use. Also, make sure that used file ACLs exclusively reference UIDs/GIDs in the appropriate range. If this
403 mode is used the number of UIDs/GIDs assigned to the container for use is 65536, and the UID/GID of the
404 root directory must be a multiple of 65536.</para></listitem>
405
406 <listitem><para>The value "pick" turns on user namespacing. In this case the UID/GID range is automatically
407 chosen. As first step, the file owner of the root directory of the container's directory tree is read, and it
408 is checked that it is currently not used by the system otherwise (in particular, that no other container is
409 using it). If this check is successful, the UID/GID range determined this way is used, similar to the
410 behaviour if "yes" is specified. If the check is not successful (and thus the UID/GID range indicated in the
411 root directory's file owner is already used elsewhere) a new – currently unused – UID/GID range of 65536
412 UIDs/GIDs is randomly chosen between the host UID/GIDs of 524288 and 1878982656, always starting at a
413 multiple of 65536. This setting implies <option>--private-users-chown</option> (see below), which has the
414 effect that the files and directories in the container's directory tree will be owned by the appropriate
415 users of the range picked. Using this option makes user namespace behaviour fully automatic. Note that the
416 first invocation of a previously unused container image might result in picking a new UID/GID range for it,
417 and thus in the (possibly expensive) file ownership adjustment operation. However, subsequent invocations of
418 the container will be cheap (unless of course the picked UID/GID range is assigned to a different use by
419 then).</para></listitem>
420
421 <listitem><para>Finally if one or two colon-separated numeric parameters are specified, user namespacing is
422 turned on, too. The first parameter specifies the first host UID/GID to assign to the container, the second
423 parameter specifies the number of host UIDs/GIDs to assign to the container. If the second parameter is
424 omitted, 65536 UIDs/GIDs are assigned.</para></listitem>
425 </orderedlist>
426
427 <para>It is recommended to assign at least 65536 UIDs/GIDs to each container, so that the usable UID/GID range in the
428 container covers 16 bit. For best security, do not assign overlapping UID/GID ranges to multiple containers. It is
429 hence a good idea to use the upper 16 bit of the host 32-bit UIDs/GIDs as container identifier, while the lower 16
430 bit encode the container UID/GID used. This is in fact the behaviour enforced by the
431 <option>--private-users=pick</option> option.</para>
432
433 <para>When user namespaces are used, the GID range assigned to each container is always chosen identical to the
434 UID range.</para>
435
436 <para>In most cases, using <option>--private-users=pick</option> is the recommended option as it enhances
437 container security massively and operates fully automatically in most cases.</para>
438
439 <para>Note that the picked UID/GID range is not written to <filename>/etc/passwd</filename> or
440 <filename>/etc/group</filename>. In fact, the allocation of the range is not stored persistently anywhere,
441 except in the file ownership of the files and directories of the container.</para></listitem>
442 </varlistentry>
443
444 <varlistentry>
445 <term><option>-U</option></term>
446
447 <listitem><para>If the kernel supports the user namespaces feature, equivalent to
448 <option>--private-users=pick</option>, otherwise equivalent to
449 <option>--private-users=no</option>.</para></listitem>
450 </varlistentry>
451
452 <varlistentry>
453 <term><option>--private-users-chown</option></term>
454
455 <listitem><para>If specified, all files and directories in the container's directory tree will adjusted so that
456 they are owned to the appropriate UIDs/GIDs selected for the container (see above). This operation is
457 potentially expensive, as it involves descending and iterating through the full directory tree of the
458 container. Besides actual file ownership, file ACLs are adjusted as well.</para>
459
460 <para>This option is implied if <option>--private-users=pick</option> is used. This option has no effect if
461 user namespacing is not used.</para></listitem>
462 </varlistentry>
463
464 <varlistentry>
465 <term><option>--private-network</option></term>
466
467 <listitem><para>Disconnect networking of the container from
468 the host. This makes all network interfaces unavailable in the
469 container, with the exception of the loopback device and those
470 specified with <option>--network-interface=</option> and
471 configured with <option>--network-veth</option>. If this
472 option is specified, the CAP_NET_ADMIN capability will be
473 added to the set of capabilities the container retains. The
474 latter may be disabled by using
475 <option>--drop-capability=</option>.</para></listitem>
476 </varlistentry>
477
478 <varlistentry>
479 <term><option>--network-interface=</option></term>
480
481 <listitem><para>Assign the specified network interface to the
482 container. This will remove the specified interface from the
483 calling namespace and place it in the container. When the
484 container terminates, it is moved back to the host namespace.
485 Note that <option>--network-interface=</option> implies
486 <option>--private-network</option>. This option may be used
487 more than once to add multiple network interfaces to the
488 container.</para></listitem>
489 </varlistentry>
490
491 <varlistentry>
492 <term><option>--network-macvlan=</option></term>
493
494 <listitem><para>Create a <literal>macvlan</literal> interface
495 of the specified Ethernet network interface and add it to the
496 container. A <literal>macvlan</literal> interface is a virtual
497 interface that adds a second MAC address to an existing
498 physical Ethernet link. The interface in the container will be
499 named after the interface on the host, prefixed with
500 <literal>mv-</literal>. Note that
501 <option>--network-macvlan=</option> implies
502 <option>--private-network</option>. This option may be used
503 more than once to add multiple network interfaces to the
504 container.</para></listitem>
505 </varlistentry>
506
507 <varlistentry>
508 <term><option>--network-ipvlan=</option></term>
509
510 <listitem><para>Create an <literal>ipvlan</literal> interface
511 of the specified Ethernet network interface and add it to the
512 container. An <literal>ipvlan</literal> interface is a virtual
513 interface, similar to a <literal>macvlan</literal> interface,
514 which uses the same MAC address as the underlying interface.
515 The interface in the container will be named after the
516 interface on the host, prefixed with <literal>iv-</literal>.
517 Note that <option>--network-ipvlan=</option> implies
518 <option>--private-network</option>. This option may be used
519 more than once to add multiple network interfaces to the
520 container.</para></listitem>
521 </varlistentry>
522
523 <varlistentry>
524 <term><option>-n</option></term>
525 <term><option>--network-veth</option></term>
526
527 <listitem><para>Create a virtual Ethernet link
528 (<literal>veth</literal>) between host and container. The host
529 side of the Ethernet link will be available as a network
530 interface named after the container's name (as specified with
531 <option>--machine=</option>), prefixed with
532 <literal>ve-</literal>. The container side of the Ethernet
533 link will be named <literal>host0</literal>. Note that
534 <option>--network-veth</option> implies
535 <option>--private-network</option>.</para></listitem>
536 </varlistentry>
537
538 <varlistentry>
539 <term><option>--network-veth-extra=</option></term>
540
541 <listitem><para>Adds an additional virtual Ethernet link
542 between host and container. Takes a colon-separated pair of
543 host interface name and container interface name. The latter
544 may be omitted in which case the container and host sides will
545 be assigned the same name. This switch is independent of
546 <option>--network-veth</option>, and — in contrast — may be
547 used multiple times, and allows configuration of the network
548 interface names. Note that <option>--network-bridge=</option>
549 has no effect on interfaces created with
550 <option>--network-veth-extra=</option>.</para></listitem>
551 </varlistentry>
552
553 <varlistentry>
554 <term><option>--network-bridge=</option></term>
555
556 <listitem><para>Adds the host side of the Ethernet link
557 created with <option>--network-veth</option> to the specified
558 bridge. Note that <option>--network-bridge=</option> implies
559 <option>--network-veth</option>. If this option is used, the
560 host side of the Ethernet link will use the
561 <literal>vb-</literal> prefix instead of
562 <literal>ve-</literal>.</para></listitem>
563 </varlistentry>
564
565 <varlistentry>
566 <term><option>-p</option></term>
567 <term><option>--port=</option></term>
568
569 <listitem><para>If private networking is enabled, maps an IP
570 port on the host onto an IP port on the container. Takes a
571 protocol specifier (either <literal>tcp</literal> or
572 <literal>udp</literal>), separated by a colon from a host port
573 number in the range 1 to 65535, separated by a colon from a
574 container port number in the range from 1 to 65535. The
575 protocol specifier and its separating colon may be omitted, in
576 which case <literal>tcp</literal> is assumed. The container
577 port number and its colon may be omitted, in which case the
578 same port as the host port is implied. This option is only
579 supported if private networking is used, such as with
580 <option>--network-veth</option> or
581 <option>--network-bridge=</option>.</para></listitem>
582 </varlistentry>
583
584 <varlistentry>
585 <term><option>-Z</option></term>
586 <term><option>--selinux-context=</option></term>
587
588 <listitem><para>Sets the SELinux security context to be used
589 to label processes in the container.</para>
590 </listitem>
591 </varlistentry>
592
593 <varlistentry>
594 <term><option>-L</option></term>
595 <term><option>--selinux-apifs-context=</option></term>
596
597 <listitem><para>Sets the SELinux security context to be used
598 to label files in the virtual API file systems in the
599 container.</para>
600 </listitem>
601 </varlistentry>
602
603 <varlistentry>
604 <term><option>--capability=</option></term>
605
606 <listitem><para>List one or more additional capabilities to
607 grant the container. Takes a comma-separated list of
608 capability names, see
609 <citerefentry project='man-pages'><refentrytitle>capabilities</refentrytitle><manvolnum>7</manvolnum></citerefentry>
610 for more information. Note that the following capabilities
611 will be granted in any way: CAP_CHOWN, CAP_DAC_OVERRIDE,
612 CAP_DAC_READ_SEARCH, CAP_FOWNER, CAP_FSETID, CAP_IPC_OWNER,
613 CAP_KILL, CAP_LEASE, CAP_LINUX_IMMUTABLE,
614 CAP_NET_BIND_SERVICE, CAP_NET_BROADCAST, CAP_NET_RAW,
615 CAP_SETGID, CAP_SETFCAP, CAP_SETPCAP, CAP_SETUID,
616 CAP_SYS_ADMIN, CAP_SYS_CHROOT, CAP_SYS_NICE, CAP_SYS_PTRACE,
617 CAP_SYS_TTY_CONFIG, CAP_SYS_RESOURCE, CAP_SYS_BOOT,
618 CAP_AUDIT_WRITE, CAP_AUDIT_CONTROL. Also CAP_NET_ADMIN is
619 retained if <option>--private-network</option> is specified.
620 If the special value <literal>all</literal> is passed, all
621 capabilities are retained.</para></listitem>
622 </varlistentry>
623
624 <varlistentry>
625 <term><option>--drop-capability=</option></term>
626
627 <listitem><para>Specify one or more additional capabilities to
628 drop for the container. This allows running the container with
629 fewer capabilities than the default (see
630 above).</para></listitem>
631 </varlistentry>
632
633 <varlistentry>
634 <term><option>--kill-signal=</option></term>
635
636 <listitem><para>Specify the process signal to send to the
637 container's PID 1 when nspawn itself receives SIGTERM, in
638 order to trigger an orderly shutdown of the
639 container. Defaults to SIGRTMIN+3 if <option>--boot</option>
640 is used (on systemd-compatible init systems SIGRTMIN+3
641 triggers an orderly shutdown). For a list of valid signals, see
642 <citerefentry project='man-pages'><refentrytitle>signal</refentrytitle><manvolnum>7</manvolnum></citerefentry>.</para></listitem>
643 </varlistentry>
644
645 <varlistentry>
646 <term><option>--link-journal=</option></term>
647
648 <listitem><para>Control whether the container's journal shall
649 be made visible to the host system. If enabled, allows viewing
650 the container's journal files from the host (but not vice
651 versa). Takes one of <literal>no</literal>,
652 <literal>host</literal>, <literal>try-host</literal>,
653 <literal>guest</literal>, <literal>try-guest</literal>,
654 <literal>auto</literal>. If <literal>no</literal>, the journal
655 is not linked. If <literal>host</literal>, the journal files
656 are stored on the host file system (beneath
657 <filename>/var/log/journal/<replaceable>machine-id</replaceable></filename>)
658 and the subdirectory is bind-mounted into the container at the
659 same location. If <literal>guest</literal>, the journal files
660 are stored on the guest file system (beneath
661 <filename>/var/log/journal/<replaceable>machine-id</replaceable></filename>)
662 and the subdirectory is symlinked into the host at the same
663 location. <literal>try-host</literal> and
664 <literal>try-guest</literal> do the same but do not fail if
665 the host does not have persistent journalling enabled. If
666 <literal>auto</literal> (the default), and the right
667 subdirectory of <filename>/var/log/journal</filename> exists,
668 it will be bind mounted into the container. If the
669 subdirectory does not exist, no linking is performed.
670 Effectively, booting a container once with
671 <literal>guest</literal> or <literal>host</literal> will link
672 the journal persistently if further on the default of
673 <literal>auto</literal> is used.</para></listitem>
674 </varlistentry>
675
676 <varlistentry>
677 <term><option>-j</option></term>
678
679 <listitem><para>Equivalent to
680 <option>--link-journal=try-guest</option>.</para></listitem>
681 </varlistentry>
682
683 <varlistentry>
684 <term><option>--read-only</option></term>
685
686 <listitem><para>Mount the root file system read-only for the
687 container.</para></listitem>
688 </varlistentry>
689
690 <varlistentry>
691 <term><option>--bind=</option></term>
692 <term><option>--bind-ro=</option></term>
693
694 <listitem><para>Bind mount a file or directory from the host
695 into the container. Takes one of: a path argument — in which
696 case the specified path will be mounted from the host to the
697 same path in the container —, or a colon-separated pair of
698 paths — in which case the first specified path is the source
699 in the host, and the second path is the destination in the
700 container —, or a colon-separated triple of source path,
701 destination path and mount options. Mount options are
702 comma-separated and currently, only "rbind" and "norbind"
703 are allowed. Defaults to "rbind". Backslash escapes are interpreted, so
704 <literal>\:</literal> may be used to embed colons in either path.
705 This option may be specified multiple times for
706 creating multiple independent bind mount points. The
707 <option>--bind-ro=</option> option creates read-only bind
708 mounts.</para></listitem>
709 </varlistentry>
710
711 <varlistentry>
712 <term><option>--tmpfs=</option></term>
713
714 <listitem><para>Mount a tmpfs file system into the container.
715 Takes a single absolute path argument that specifies where to
716 mount the tmpfs instance to (in which case the directory
717 access mode will be chosen as 0755, owned by root/root), or
718 optionally a colon-separated pair of path and mount option
719 string that is used for mounting (in which case the kernel
720 default for access mode and owner will be chosen, unless
721 otherwise specified). This option is particularly useful for
722 mounting directories such as <filename>/var</filename> as
723 tmpfs, to allow state-less systems, in particular when
724 combined with <option>--read-only</option>.
725 Backslash escapes are interpreted in the path, so
726 <literal>\:</literal> may be used to embed colons in the path.
727 </para></listitem>
728 </varlistentry>
729
730 <varlistentry>
731 <term><option>--overlay=</option></term>
732 <term><option>--overlay-ro=</option></term>
733
734 <listitem><para>Combine multiple directory trees into one
735 overlay file system and mount it into the container. Takes a
736 list of colon-separated paths to the directory trees to
737 combine and the destination mount point.</para>
738
739 <para>Backslash escapes are interpreted in the paths, so
740 <literal>\:</literal> may be used to embed colons in the paths.
741 </para>
742
743 <para>If three or more paths are specified, then the last
744 specified path is the destination mount point in the
745 container, all paths specified before refer to directory trees
746 on the host and are combined in the specified order into one
747 overlay file system. The left-most path is hence the lowest
748 directory tree, the second-to-last path the highest directory
749 tree in the stacking order. If <option>--overlay-ro=</option>
750 is used instead of <option>--overlay=</option>, a read-only
751 overlay file system is created. If a writable overlay file
752 system is created, all changes made to it are written to the
753 highest directory tree in the stacking order, i.e. the
754 second-to-last specified.</para>
755
756 <para>If only two paths are specified, then the second
757 specified path is used both as the top-level directory tree in
758 the stacking order as seen from the host, as well as the mount
759 point for the overlay file system in the container. At least
760 two paths have to be specified.</para>
761
762 <para>For details about overlay file systems, see <ulink
763 url="https://www.kernel.org/doc/Documentation/filesystems/overlayfs.txt">overlayfs.txt</ulink>. Note
764 that the semantics of overlay file systems are substantially
765 different from normal file systems, in particular regarding
766 reported device and inode information. Device and inode
767 information may change for a file while it is being written
768 to, and processes might see out-of-date versions of files at
769 times. Note that this switch automatically derives the
770 <literal>workdir=</literal> mount option for the overlay file
771 system from the top-level directory tree, making it a sibling
772 of it. It is hence essential that the top-level directory tree
773 is not a mount point itself (since the working directory must
774 be on the same file system as the top-most directory
775 tree). Also note that the <literal>lowerdir=</literal> mount
776 option receives the paths to stack in the opposite order of
777 this switch.</para></listitem>
778 </varlistentry>
779
780 <varlistentry>
781 <term><option>-E <replaceable>NAME</replaceable>=<replaceable>VALUE</replaceable></option></term>
782 <term><option>--setenv=<replaceable>NAME</replaceable>=<replaceable>VALUE</replaceable></option></term>
783
784 <listitem><para>Specifies an environment variable assignment
785 to pass to the init process in the container, in the format
786 <literal>NAME=VALUE</literal>. This may be used to override
787 the default variables or to set additional variables. This
788 parameter may be used more than once.</para></listitem>
789 </varlistentry>
790
791 <varlistentry>
792 <term><option>--share-system</option></term>
793
794 <listitem><para>Allows the container to share certain system
795 facilities with the host. More specifically, this turns off
796 PID namespacing, UTS namespacing and IPC namespacing, and thus
797 allows the guest to see and interact more easily with
798 processes outside of the container. Note that using this
799 option makes it impossible to start up a full Operating System
800 in the container, as an init system cannot operate in this
801 mode. It is only useful to run specific programs or
802 applications this way, without involving an init system in the
803 container. This option implies <option>--register=no</option>.
804 This option may not be combined with
805 <option>--boot</option>.</para></listitem>
806 </varlistentry>
807
808 <varlistentry>
809 <term><option>--register=</option></term>
810
811 <listitem><para>Controls whether the container is registered
812 with
813 <citerefentry><refentrytitle>systemd-machined</refentrytitle><manvolnum>8</manvolnum></citerefentry>.
814 Takes a boolean argument, which defaults to <literal>yes</literal>.
815 This option should be enabled when the container runs a full
816 Operating System (more specifically: an init system), and is
817 useful to ensure that the container is accessible via
818 <citerefentry><refentrytitle>machinectl</refentrytitle><manvolnum>1</manvolnum></citerefentry>
819 and shown by tools such as
820 <citerefentry project='man-pages'><refentrytitle>ps</refentrytitle><manvolnum>1</manvolnum></citerefentry>.
821 If the container does not run an init system, it is
822 recommended to set this option to <literal>no</literal>. Note
823 that <option>--share-system</option> implies
824 <option>--register=no</option>. </para></listitem>
825 </varlistentry>
826
827 <varlistentry>
828 <term><option>--keep-unit</option></term>
829
830 <listitem><para>Instead of creating a transient scope unit to
831 run the container in, simply register the service or scope
832 unit <command>systemd-nspawn</command> has been invoked in
833 with
834 <citerefentry><refentrytitle>systemd-machined</refentrytitle><manvolnum>8</manvolnum></citerefentry>.
835 This has no effect if <option>--register=no</option> is used.
836 This switch should be used if
837 <command>systemd-nspawn</command> is invoked from within a
838 service unit, and the service unit's sole purpose is to run a
839 single <command>systemd-nspawn</command> container. This
840 option is not available if run from a user
841 session.</para></listitem>
842 </varlistentry>
843
844 <varlistentry>
845 <term><option>--personality=</option></term>
846
847 <listitem><para>Control the architecture ("personality")
848 reported by
849 <citerefentry project='man-pages'><refentrytitle>uname</refentrytitle><manvolnum>2</manvolnum></citerefentry>
850 in the container. Currently, only <literal>x86</literal> and
851 <literal>x86-64</literal> are supported. This is useful when
852 running a 32-bit container on a 64-bit host. If this setting
853 is not used, the personality reported in the container is the
854 same as the one reported on the host.</para></listitem>
855 </varlistentry>
856
857 <varlistentry>
858 <term><option>-q</option></term>
859 <term><option>--quiet</option></term>
860
861 <listitem><para>Turns off any status output by the tool
862 itself. When this switch is used, the only output from nspawn
863 will be the console output of the container OS
864 itself.</para></listitem>
865 </varlistentry>
866
867 <varlistentry>
868 <term><option>--volatile</option></term>
869 <term><option>--volatile=</option><replaceable>MODE</replaceable></term>
870
871 <listitem><para>Boots the container in volatile mode. When no
872 mode parameter is passed or when mode is specified as
873 <option>yes</option>, full volatile mode is enabled. This
874 means the root directory is mounted as a mostly unpopulated
875 <literal>tmpfs</literal> instance, and
876 <filename>/usr</filename> from the OS tree is mounted into it
877 in read-only mode (the system thus starts up with read-only OS
878 resources, but pristine state and configuration, any changes
879 to the either are lost on shutdown). When the mode parameter
880 is specified as <option>state</option>, the OS tree is
881 mounted read-only, but <filename>/var</filename> is mounted as
882 a <literal>tmpfs</literal> instance into it (the system thus
883 starts up with read-only OS resources and configuration, but
884 pristine state, and any changes to the latter are lost on
885 shutdown). When the mode parameter is specified as
886 <option>no</option> (the default), the whole OS tree is made
887 available writable.</para>
888
889 <para>Note that setting this to <option>yes</option> or
890 <option>state</option> will only work correctly with
891 operating systems in the container that can boot up with only
892 <filename>/usr</filename> mounted, and are able to populate
893 <filename>/var</filename> automatically, as
894 needed.</para></listitem>
895 </varlistentry>
896
897 <varlistentry>
898 <term><option>--settings=</option><replaceable>MODE</replaceable></term>
899
900 <listitem><para>Controls whether
901 <command>systemd-nspawn</command> shall search for and use
902 additional per-container settings from
903 <filename>.nspawn</filename> files. Takes a boolean or the
904 special values <option>override</option> or
905 <option>trusted</option>.</para>
906
907 <para>If enabled (the default), a settings file named after the
908 machine (as specified with the <option>--machine=</option>
909 setting, or derived from the directory or image file name)
910 with the suffix <filename>.nspawn</filename> is searched in
911 <filename>/etc/systemd/nspawn/</filename> and
912 <filename>/run/systemd/nspawn/</filename>. If it is found
913 there, its settings are read and used. If it is not found
914 there, it is subsequently searched in the same directory as the
915 image file or in the immediate parent of the root directory of
916 the container. In this case, if the file is found, its settings
917 will be also read and used, but potentially unsafe settings
918 are ignored. Note that in both these cases, settings on the
919 command line take precedence over the corresponding settings
920 from loaded <filename>.nspawn</filename> files, if both are
921 specified. Unsafe settings are considered all settings that
922 elevate the container's privileges or grant access to
923 additional resources such as files or directories of the
924 host. For details about the format and contents of
925 <filename>.nspawn</filename> files, consult
926 <citerefentry><refentrytitle>systemd.nspawn</refentrytitle><manvolnum>5</manvolnum></citerefentry>.</para>
927
928 <para>If this option is set to <option>override</option>, the
929 file is searched, read and used the same way, however, the order of
930 precedence is reversed: settings read from the
931 <filename>.nspawn</filename> file will take precedence over
932 the corresponding command line options, if both are
933 specified.</para>
934
935 <para>If this option is set to <option>trusted</option>, the
936 file is searched, read and used the same way, but regardless
937 of being found in <filename>/etc/systemd/nspawn/</filename>,
938 <filename>/run/systemd/nspawn/</filename> or next to the image
939 file or container root directory, all settings will take
940 effect, however, command line arguments still take precedence
941 over corresponding settings.</para>
942
943 <para>If disabled, no <filename>.nspawn</filename> file is read
944 and no settings except the ones on the command line are in
945 effect.</para></listitem>
946 </varlistentry>
947
948 <xi:include href="standard-options.xml" xpointer="help" />
949 <xi:include href="standard-options.xml" xpointer="version" />
950 </variablelist>
951
952 </refsect1>
953
954 <refsect1>
955 <title>Examples</title>
956
957 <example>
958 <title>Download a Fedora image and start a shell in it</title>
959
960 <programlisting># machinectl pull-raw --verify=no http://ftp.halifax.rwth-aachen.de/fedora/linux/releases/21/Cloud/Images/x86_64/Fedora-Cloud-Base-20141203-21.x86_64.raw.xz
961 # systemd-nspawn -M Fedora-Cloud-Base-20141203-21</programlisting>
962
963 <para>This downloads an image using
964 <citerefentry><refentrytitle>machinectl</refentrytitle><manvolnum>1</manvolnum></citerefentry>
965 and opens a shell in it.</para>
966 </example>
967
968 <example>
969 <title>Build and boot a minimal Fedora distribution in a container</title>
970
971 <programlisting># dnf -y --releasever=23 --installroot=/srv/mycontainer --disablerepo='*' --enablerepo=fedora --enablerepo=updates install systemd passwd dnf fedora-release vim-minimal
972 # systemd-nspawn -bD /srv/mycontainer</programlisting>
973
974 <para>This installs a minimal Fedora distribution into the
975 directory <filename noindex='true'>/srv/mycontainer/</filename>
976 and then boots an OS in a namespace container in it.</para>
977 </example>
978
979 <example>
980 <title>Spawn a shell in a container of a minimal Debian unstable distribution</title>
981
982 <programlisting># debootstrap --arch=amd64 unstable ~/debian-tree/
983 # systemd-nspawn -D ~/debian-tree/</programlisting>
984
985 <para>This installs a minimal Debian unstable distribution into
986 the directory <filename>~/debian-tree/</filename> and then
987 spawns a shell in a namespace container in it.</para>
988 </example>
989
990 <example>
991 <title>Boot a minimal Arch Linux distribution in a container</title>
992
993 <programlisting># pacstrap -c -d ~/arch-tree/ base
994 # systemd-nspawn -bD ~/arch-tree/</programlisting>
995
996 <para>This installs a minimal Arch Linux distribution into the
997 directory <filename>~/arch-tree/</filename> and then boots an OS
998 in a namespace container in it.</para>
999 </example>
1000
1001 <example>
1002 <title>Boot into an ephemeral <literal>btrfs</literal> snapshot of the host system</title>
1003
1004 <programlisting># systemd-nspawn -D / -xb</programlisting>
1005
1006 <para>This runs a copy of the host system in a
1007 <literal>btrfs</literal> snapshot which is removed immediately
1008 when the container exits. All file system changes made during
1009 runtime will be lost on shutdown, hence.</para>
1010 </example>
1011
1012 <example>
1013 <title>Run a container with SELinux sandbox security contexts</title>
1014
1015 <programlisting># chcon system_u:object_r:svirt_sandbox_file_t:s0:c0,c1 -R /srv/container
1016 # systemd-nspawn -L system_u:object_r:svirt_sandbox_file_t:s0:c0,c1 -Z system_u:system_r:svirt_lxc_net_t:s0:c0,c1 -D /srv/container /bin/sh</programlisting>
1017 </example>
1018 </refsect1>
1019
1020 <refsect1>
1021 <title>Exit status</title>
1022
1023 <para>The exit code of the program executed in the container is
1024 returned.</para>
1025 </refsect1>
1026
1027 <refsect1>
1028 <title>See Also</title>
1029 <para>
1030 <citerefentry><refentrytitle>systemd</refentrytitle><manvolnum>1</manvolnum></citerefentry>,
1031 <citerefentry><refentrytitle>systemd.nspawn</refentrytitle><manvolnum>5</manvolnum></citerefentry>,
1032 <citerefentry project='man-pages'><refentrytitle>chroot</refentrytitle><manvolnum>1</manvolnum></citerefentry>,
1033 <citerefentry project='mankier'><refentrytitle>dnf</refentrytitle><manvolnum>8</manvolnum></citerefentry>,
1034 <citerefentry project='die-net'><refentrytitle>debootstrap</refentrytitle><manvolnum>8</manvolnum></citerefentry>,
1035 <citerefentry project='archlinux'><refentrytitle>pacman</refentrytitle><manvolnum>8</manvolnum></citerefentry>,
1036 <citerefentry><refentrytitle>systemd.slice</refentrytitle><manvolnum>5</manvolnum></citerefentry>,
1037 <citerefentry><refentrytitle>machinectl</refentrytitle><manvolnum>1</manvolnum></citerefentry>,
1038 <citerefentry project='man-pages'><refentrytitle>btrfs</refentrytitle><manvolnum>8</manvolnum></citerefentry>
1039 </para>
1040 </refsect1>
1041
1042 </refentry>