1 .\" Copyright (C) 1998 Andries Brouwer (aeb@cwi.nl)
2 .\" and Copyright (C) 2002, 2006, 2008, 2012, 2013 Michael Kerrisk <mtk.manpages@gmail.com>
3 .\" and Copyright Guillem Jover <guillem@hadrons.org>
4 .\" and Copyright (C) 2014 Dave Hansen / Intel
6 .\" %%%LICENSE_START(VERBATIM)
7 .\" Permission is granted to make and distribute verbatim copies of this
8 .\" manual provided the copyright notice and this permission notice are
9 .\" preserved on all copies.
11 .\" Permission is granted to copy and distribute modified versions of this
12 .\" manual under the conditions for verbatim copying, provided that the
13 .\" entire resulting derived work is distributed under the terms of a
14 .\" permission notice identical to this one.
16 .\" Since the Linux kernel and libraries are constantly changing, this
17 .\" manual page may be incorrect or out-of-date. The author(s) assume no
18 .\" responsibility for errors or omissions, or for damages resulting from
19 .\" the use of the information contained herein. The author(s) may not
20 .\" have taken the same level of care in the production of this manual,
21 .\" which is licensed free of charge, as they might when working
24 .\" Formatted or processed versions of this manual, if unaccompanied by
25 .\" the source, must acknowledge the copyright and authors of this work.
28 .\" Modified Thu Nov 11 04:19:42 MET 1999, aeb: added PR_GET_PDEATHSIG
29 .\" Modified 27 Jun 02, Michael Kerrisk
30 .\" Added PR_SET_DUMPABLE, PR_GET_DUMPABLE,
31 .\" PR_SET_KEEPCAPS, PR_GET_KEEPCAPS
32 .\" Modified 2006-08-30 Guillem Jover <guillem@hadrons.org>
33 .\" Updated Linux versions where the options where introduced.
34 .\" Added PR_SET_TIMING, PR_GET_TIMING, PR_SET_NAME, PR_GET_NAME,
35 .\" PR_SET_UNALIGN, PR_GET_UNALIGN, PR_SET_FPEMU, PR_GET_FPEMU,
36 .\" PR_SET_FPEXC, PR_GET_FPEXC
37 .\" 2008-04-29 Serge Hallyn, Document PR_CAPBSET_READ and PR_CAPBSET_DROP
38 .\" 2008-06-13 Erik Bosman, <ejbosman@cs.vu.nl>
39 .\" Document PR_GET_TSC and PR_SET_TSC.
40 .\" 2008-06-15 mtk, Document PR_SET_SECCOMP, PR_GET_SECCOMP
41 .\" 2009-10-03 Andi Kleen, document PR_MCE_KILL
42 .\" 2012-04 Cyrill Gorcunov, Document PR_SET_MM
43 .\" 2012-04-25 Michael Kerrisk, Document PR_TASK_PERF_EVENTS_DISABLE and
44 .\" PR_TASK_PERF_EVENTS_ENABLE
45 .\" 2012-09-20 Kees Cook, update PR_SET_SECCOMP for mode 2
46 .\" 2012-09-20 Kees Cook, document PR_SET_NO_NEW_PRIVS, PR_GET_NO_NEW_PRIVS
47 .\" 2012-10-25 Michael Kerrisk, Document PR_SET_TIMERSLACK and
49 .\" 2013-01-10 Kees Cook, document PR_SET_PTRACER
50 .\" 2012-02-04 Michael Kerrisk, document PR_{SET,GET}_CHILD_SUBREAPER
51 .\" 2014-11-10 Dave Hansen, document PR_MPX_{EN,DIS}ABLE_MANAGEMENT
54 .TH PRCTL 2 2020-04-11 "Linux" "Linux Programmer's Manual"
56 prctl \- operations on a process
59 .B #include <sys/prctl.h>
61 .BI "int prctl(int " option ", unsigned long " arg2 ", unsigned long " arg3 ,
62 .BI " unsigned long " arg4 ", unsigned long " arg5 );
66 is called with a first argument describing what to do
67 (with values defined in \fI<linux/prctl.h>\fP), and further
68 arguments with a significance depending on the first one.
69 The first argument can be:
72 .BR PR_CAP_AMBIENT " (since Linux 4.3)"
73 .\" commit 58319057b7847667f0c9585b9de0e8932b0fdb08
74 Reads or changes the ambient capability set of the calling thread,
75 according to the value of
77 which must be one of the following:
81 .B PR_CAP_AMBIENT_RAISE
82 The capability specified in
84 is added to the ambient set.
85 The specified capability must already be present in
86 both the permitted and the inheritable sets of the process.
87 This operation is not permitted if the
88 .B SECBIT_NO_CAP_AMBIENT_RAISE
91 .B PR_CAP_AMBIENT_LOWER
92 The capability specified in
94 is removed from the ambient set.
96 .B PR_CAP_AMBIENT_IS_SET
99 call returns 1 if the capability in
101 is in the ambient set and 0 if it is not.
103 .BR PR_CAP_AMBIENT_CLEAR_ALL
104 All capabilities will be removed from the ambient set.
105 This operation requires setting
110 In all of the above operations,
114 must be specified as 0.
116 Higher-level interfaces layered on top of the above operations are
119 library in the form of
120 .BR cap_get_ambient (3),
121 .BR cap_set_ambient (3),
123 .BR cap_reset_ambient (3).
125 .BR PR_CAPBSET_READ " (since Linux 2.6.25)"
126 Return (as the function result) 1 if the capability specified in
128 is in the calling thread's capability bounding set,
130 (The capability constants are defined in
131 .IR <linux/capability.h> .)
132 The capability bounding set dictates
133 whether the process can receive the capability through a
134 file's permitted capability set on a subsequent call to
137 If the capability specified in
139 is not valid, then the call fails with the error
142 A higher-level interface layered on top of this operation is provided in the
144 library in the form of
145 .BR cap_get_bound (3).
147 .BR PR_CAPBSET_DROP " (since Linux 2.6.25)"
148 If the calling thread has the
150 capability within its user namespace, then drop the capability specified by
152 from the calling thread's capability bounding set.
153 Any children of the calling thread will inherit the newly
154 reduced bounding set.
156 The call fails with the error:
158 if the calling thread does not have the
163 does not represent a valid capability; or
165 if file capabilities are not enabled in the kernel,
166 in which case bounding sets are not supported.
168 A higher-level interface layered on top of this operation is provided in the
170 library in the form of
171 .BR cap_drop_bound (3).
173 .BR PR_SET_CHILD_SUBREAPER " (since Linux 3.4)"
174 .\" commit ebec18a6d3aa1e7d84aab16225e87fd25170ec2b
178 set the "child subreaper" attribute of the calling process;
181 is zero, unset the attribute.
183 A subreaper fulfills the role of
185 for its descendant processes.
186 When a process becomes orphaned
187 (i.e., its immediate parent terminates),
188 then that process will be reparented to
189 the nearest still living ancestor subreaper.
190 Subsequently, calls to
192 in the orphaned process will now return the PID of the subreaper process,
193 and when the orphan terminates, it is the subreaper process that
196 signal and will be able to
198 on the process to discover its termination status.
200 The setting of the "child subreaper" attribute
201 is not inherited by children created by
205 The setting is preserved across
208 Establishing a subreaper process is useful in session management frameworks
209 where a hierarchical group of processes is managed by a subreaper process
210 that needs to be informed when one of the processes\(emfor example,
211 a double-forked daemon\(emterminates
212 (perhaps so that it can restart that process).
217 employ a subreaper process for similar reasons.
219 .BR PR_GET_CHILD_SUBREAPER " (since Linux 3.4)"
220 Return the "child subreaper" setting of the caller,
221 in the location pointed to by
222 .IR "(int\ *) arg2" .
224 .BR PR_SET_DUMPABLE " (since Linux 2.3.20)"
225 Set the state of the "dumpable" attribute,
226 which determines whether core dumps are produced for the calling process
227 upon delivery of a signal whose default behavior is to produce a core dump.
229 In kernels up to and including 2.6.12,
232 .RB ( SUID_DUMP_DISABLE ,
233 process is not dumpable) or 1
234 .RB ( SUID_DUMP_USER ,
235 process is dumpable).
236 Between kernels 2.6.13 and 2.6.17,
237 .\" commit abf75a5033d4da7b8a7e92321d74021d1fcfb502
238 the value 2 was also permitted,
239 which caused any binary which normally would not be dumped
240 to be dumped readable by root only;
241 for security reasons, this feature has been removed.
242 .\" See http://marc.theaimsgroup.com/?l=linux-kernel&m=115270289030630&w=2
243 .\" Subject: Fix prctl privilege escalation (CVE-2006-2451)
244 .\" From: Marcel Holtmann <marcel () holtmann ! org>
245 .\" Date: 2006-07-12 11:12:00
246 (See also the description of
247 .I /proc/sys/fs/\:suid_dumpable
251 Normally, the "dumpable" attribute is set to 1.
252 However, it is reset to the current value contained in the file
253 .IR /proc/sys/fs/\:suid_dumpable
254 (which by default has the value 0),
255 in the following circumstances:
256 .\" See kernel/cred.c::commit_creds() (Linux 3.18 sources)
259 The process's effective user or group ID is changed.
261 The process's filesystem user or group ID is changed (see
262 .BR credentials (7)).
266 a set-user-ID or set-group-ID program, resulting in a change
267 of either the effective user ID or the effective group ID.
271 a program that has file capabilities (see
272 .BR capabilities (7)),
273 .\" See kernel/cred.c::commit_creds()
274 but only if the permitted capabilities
275 gained exceed those already permitted for the process.
276 .\" Also certain namespace operations;
279 Processes that are not dumpable can not be attached via
286 If a process is not dumpable,
287 the ownership of files in the process's
289 directory is affected as described in
292 .BR PR_GET_DUMPABLE " (since Linux 2.3.20)"
293 Return (as the function result) the current state of the calling
294 process's dumpable attribute.
295 .\" Since Linux 2.6.13, the dumpable flag can have the value 2,
296 .\" but in 2.6.13 PR_GET_DUMPABLE simply returns 1 if the dumpable
297 .\" flags has a nonzero value. This was fixed in 2.6.14.
299 .BR PR_SET_ENDIAN " (since Linux 2.6.18, PowerPC only)"
300 Set the endian-ness of the calling process to the value given
301 in \fIarg2\fP, which should be one of the following:
302 .\" Respectively 0, 1, 2
304 .BR PR_ENDIAN_LITTLE ,
306 .B PR_ENDIAN_PPC_LITTLE
307 (PowerPC pseudo little endian).
309 .BR PR_GET_ENDIAN " (since Linux 2.6.18, PowerPC only)"
310 Return the endian-ness of the calling process,
311 in the location pointed to by
312 .IR "(int\ *) arg2" .
314 .BR PR_SET_FP_MODE " (since Linux 4.0, only on MIPS)"
315 .\" commit 9791554b45a2acc28247f66a5fd5bbc212a6b8c8
316 On the MIPS architecture,
317 user-space code can be built using an ABI which permits linking
318 with code that has more restrictive floating-point (FP) requirements.
319 For example, user-space code may be built to target the O32 FPXX ABI
320 and linked with code built for either one of the more restrictive
322 When more restrictive code is linked in,
323 the overall requirement for the process is to use the more
324 restrictive floating-point mode.
326 Because the kernel has no means of knowing in advance
327 which mode the process should be executed in,
328 and because these restrictions can
329 change over the lifetime of the process, the
331 operation is provided to allow control of the floating-point mode
334 .\" https://dmz-portal.mips.com/wiki/MIPS_O32_ABI_-_FR0_and_FR1_Interlinking
336 .I (unsigned int) arg2
337 argument is a bit mask describing the floating-point mode used:
345 mode), the 32 floating-point registers are 32 bits wide,
346 and 64-bit registers are represented as a pair of registers
347 (even- and odd- numbered,
348 with the even-numbered register containing the lower 32 bits,
349 and the odd-numbered register containing the higher 32 bits).
353 (on supported hardware),
354 the 32 floating-point registers are 64 bits wide (so called
357 Note that modern MIPS implementations (MIPS R6 and newer) support
362 Applications that use the O32 FP32 ABI can operate only when this bit is
365 or they can be used with FRE enabled, see below).
366 Applications that use the O32 FP64 ABI
367 (and the O32 FP64A ABI, which exists to
368 provide the ability to operate with existing FP32 code; see below)
369 can operate only when this bit is
372 Applications that use the O32 FPXX ABI can operate with either
378 Enable emulation of 32-bit floating-point mode.
379 When this mode is enabled,
380 it emulates 32-bit floating-point operations
381 by raising a reserved-instruction exception
382 on every instruction that uses 32-bit formats and
383 the kernel then handles the instruction in software.
384 (The problem lies in the discrepancy of handling odd-numbered registers
385 which are the high 32 bits of 64-bit registers with even numbers in
387 mode and the lower 32-bit parts of odd-numbered 64-bit registers in
390 Enabling this bit is necessary when code with the O32 FP32 ABI should operate
391 with code with compatible the O32 FPXX or O32 FP64A ABIs (which require
393 FPU mode) or when it is executed on newer hardware (MIPS R6 onwards)
396 mode support when a binary with the FP32 ABI is used.
398 Note that this mode makes sense only when the FPU is in 64-bit mode
401 Note that the use of emulation inherently has a significant performance hit
402 and should be avoided if possible.
405 In the N32/N64 ABI, 64-bit floating-point mode is always used,
406 so FPU emulation is not required and the FPU always operates in
410 This option is mainly intended for use by the dynamic linker
420 .BR PR_GET_FP_MODE " (since Linux 4.0, only on MIPS)"
421 Return (as the function result)
422 the current floating-point mode (see the description of
427 the call returns a bit mask which represents the current floating-point mode.
437 .BR PR_SET_FPEMU " (since Linux 2.4.18, 2.5.9, only on ia64)"
438 Set floating-point emulation control bits to \fIarg2\fP.
441 to silently emulate floating-point operation accesses, or
443 to not emulate floating-point operations and send
447 .BR PR_GET_FPEMU " (since Linux 2.4.18, 2.5.9, only on ia64)"
448 Return floating-point emulation control bits,
449 in the location pointed to by
450 .IR "(int\ *) arg2" .
452 .BR PR_SET_FPEXC " (since Linux 2.4.21, 2.5.32, only on PowerPC)"
453 Set floating-point exception mode to \fIarg2\fP.
454 Pass \fBPR_FP_EXC_SW_ENABLE\fP to use FPEXC for FP exception enables,
455 \fBPR_FP_EXC_DIV\fP for floating-point divide by zero,
456 \fBPR_FP_EXC_OVF\fP for floating-point overflow,
457 \fBPR_FP_EXC_UND\fP for floating-point underflow,
458 \fBPR_FP_EXC_RES\fP for floating-point inexact result,
459 \fBPR_FP_EXC_INV\fP for floating-point invalid operation,
460 \fBPR_FP_EXC_DISABLED\fP for FP exceptions disabled,
461 \fBPR_FP_EXC_NONRECOV\fP for async nonrecoverable exception mode,
462 \fBPR_FP_EXC_ASYNC\fP for async recoverable exception mode,
463 \fBPR_FP_EXC_PRECISE\fP for precise exception mode.
465 .BR PR_GET_FPEXC " (since Linux 2.4.21, 2.5.32, only on PowerPC)"
466 Return floating-point exception mode,
467 in the location pointed to by
468 .IR "(int\ *) arg2" .
470 .BR PR_SET_KEEPCAPS " (since Linux 2.2.18)"
471 Set the state of the calling thread's "keep capabilities" flag.
472 The effect of this flag is described in
473 .BR capabilities (7).
475 must be either 0 (clear the flag)
477 The "keep capabilities" value will be reset to 0 on subsequent calls to
480 .BR PR_GET_KEEPCAPS " (since Linux 2.2.18)"
481 Return (as the function result) the current state of the calling thread's
482 "keep capabilities" flag.
485 for a description of this flag.
487 .BR PR_MCE_KILL " (since Linux 2.6.32)"
488 Set the machine check memory corruption kill policy for the calling thread.
492 .BR PR_MCE_KILL_CLEAR ,
493 clear the thread memory corruption kill policy and use the system-wide default.
494 (The system-wide default is defined by
495 .IR /proc/sys/vm/memory_failure_early_kill ;
501 .BR PR_MCE_KILL_SET ,
502 use a thread-specific memory corruption kill policy.
505 defines whether the policy is
507 .RB ( PR_MCE_KILL_EARLY ),
509 .RB ( PR_MCE_KILL_LATE ),
510 or the system-wide default
511 .RB ( PR_MCE_KILL_DEFAULT ).
512 Early kill means that the thread receives a
514 signal as soon as hardware memory corruption is detected inside
516 In late kill mode, the process is killed only when it accesses a corrupted page.
519 for more information on the
522 The policy is inherited by children.
525 arguments must be zero for future compatibility.
527 .BR PR_MCE_KILL_GET " (since Linux 2.6.32)"
528 Return (as the function result)
529 the current per-process machine check kill policy.
532 arguments must be zero.
534 .BR PR_SET_MM " (since Linux 3.3)"
535 .\" commit 028ee4be34a09a6d48bdf30ab991ae933a7bc036
536 Modify certain kernel memory map descriptor fields
537 of the calling process.
538 Usually these fields are set by the kernel and dynamic loader (see
540 for more information) and a regular application should not use this feature.
541 However, there are cases, such as self-modifying programs,
542 where a program might find it useful to change its own memory map.
544 The calling process must have the
549 is one of the options below, while
551 provides a new value for the option.
556 arguments must be zero if unused.
559 .\" commit 52b3694157e3aa6df871e283115652ec6f2d31e0
560 this feature is available only if the kernel is built with the
561 .BR CONFIG_CHECKPOINT_RESTORE
565 .BR PR_SET_MM_START_CODE
566 Set the address above which the program text can run.
567 The corresponding memory area must be readable and executable,
568 but not writable or shareable (see
572 for more information).
574 .BR PR_SET_MM_END_CODE
575 Set the address below which the program text can run.
576 The corresponding memory area must be readable and executable,
577 but not writable or shareable.
579 .BR PR_SET_MM_START_DATA
580 Set the address above which initialized and
581 uninitialized (bss) data are placed.
582 The corresponding memory area must be readable and writable,
583 but not executable or shareable.
585 .B PR_SET_MM_END_DATA
586 Set the address below which initialized and
587 uninitialized (bss) data are placed.
588 The corresponding memory area must be readable and writable,
589 but not executable or shareable.
591 .BR PR_SET_MM_START_STACK
592 Set the start address of the stack.
593 The corresponding memory area must be readable and writable.
595 .BR PR_SET_MM_START_BRK
596 Set the address above which the program heap can be expanded with
599 The address must be greater than the ending address of
600 the current program data segment.
601 In addition, the combined size of the resulting heap and
602 the size of the data segment can't exceed the
611 The requirements for the address are the same as for the
612 .BR PR_SET_MM_START_BRK
615 The following options are available since Linux 3.5.
616 .\" commit fe8c7f5cbf91124987106faa3bdf0c8b955c4cf7
618 .BR PR_SET_MM_ARG_START
619 Set the address above which the program command line is placed.
621 .BR PR_SET_MM_ARG_END
622 Set the address below which the program command line is placed.
624 .BR PR_SET_MM_ENV_START
625 Set the address above which the program environment is placed.
627 .BR PR_SET_MM_ENV_END
628 Set the address below which the program environment is placed.
630 The address passed with
631 .BR PR_SET_MM_ARG_START ,
632 .BR PR_SET_MM_ARG_END ,
633 .BR PR_SET_MM_ENV_START ,
635 .BR PR_SET_MM_ENV_END
636 should belong to a process stack area.
637 Thus, the corresponding memory area must be readable, writable, and
638 (depending on the kernel configuration) have the
644 Set a new auxiliary vector.
647 argument should provide the address of the vector.
650 is the size of the vector.
652 .BR PR_SET_MM_EXE_FILE
653 .\" commit b32dfe377102ce668775f8b6b1461f7ad428f8b6
656 symbolic link with a new one pointing to a new executable file
657 identified by the file descriptor provided in
660 The file descriptor should be obtained with a regular
664 To change the symbolic link, one needs to unmap all existing
665 executable memory areas, including those created by the kernel itself
666 (for example the kernel usually creates at least one executable
667 memory area for the ELF
671 In Linux 4.9 and earlier, the
672 .\" commit 3fb4afd9a504c2386b8435028d43283216bf588e
673 .BR PR_SET_MM_EXE_FILE
674 operation can be performed only once in a process's lifetime;
675 attempting to perform the operation a second time results in the error
677 This restriction was enforced for security reasons that were subsequently
679 and the restriction was removed in Linux 4.10 because some
680 user-space applications needed to perform this operation more than once.
682 The following options are available since Linux 3.18.
683 .\" commit f606b77f1a9e362451aca8f81d8f36a3a112139e
686 Provides one-shot access to all the addresses by passing in a
687 .I struct prctl_mm_map
688 (as defined in \fI<linux/prctl.h>\fP).
691 argument should provide the size of the struct.
693 This feature is available only if the kernel is built with the
694 .BR CONFIG_CHECKPOINT_RESTORE
697 .BR PR_SET_MM_MAP_SIZE
698 Returns the size of the
699 .I struct prctl_mm_map
701 This allows user space to find a compatible struct.
704 argument should be a pointer to an unsigned int.
706 This feature is available only if the kernel is built with the
707 .BR CONFIG_CHECKPOINT_RESTORE
711 .BR PR_MPX_ENABLE_MANAGEMENT ", " PR_MPX_DISABLE_MANAGEMENT " (since Linux 3.19) "
712 .\" commit fe3d197f84319d3bce379a9c0dc17b1f48ad358c
713 .\" See also http://lwn.net/Articles/582712/
714 .\" See also https://gcc.gnu.org/wiki/Intel%20MPX%20support%20in%20the%20GCC%20compiler
715 Enable or disable kernel management of Memory Protection eXtensions (MPX)
723 .\" commit e9d1b4f3c60997fe197bf0243cb4a41a44387a88
724 arguments must be zero.
726 MPX is a hardware-assisted mechanism for performing bounds checking on
728 It consists of a set of registers storing bounds information
729 and a set of special instruction prefixes that tell the CPU on which
730 instructions it should do bounds enforcement.
731 There is a limited number of these registers and
732 when there are more pointers than registers,
733 their contents must be "spilled" into a set of tables.
734 These tables are called "bounds tables" and the MPX
737 whether the kernel manages their allocation and freeing.
739 When management is enabled, the kernel will take over allocation
740 and freeing of the bounds tables.
741 It does this by trapping the #BR exceptions that result
742 at first use of missing bounds tables and
743 instead of delivering the exception to user space,
744 it allocates the table and populates the bounds directory
745 with the location of the new table.
746 For freeing, the kernel checks to see if bounds tables are
747 present for memory which is not allocated, and frees them if so.
749 Before enabling MPX management using
750 .BR PR_MPX_ENABLE_MANAGEMENT ,
751 the application must first have allocated a user-space buffer for
752 the bounds directory and placed the location of that directory in the
756 These calls fail if the CPU or kernel does not support MPX.
757 Kernel support for MPX is enabled via the
758 .BR CONFIG_X86_INTEL_MPX
759 configuration option.
760 You can check whether the CPU supports MPX by looking for the 'mpx'
761 CPUID bit, like with the following command:
765 cat /proc/cpuinfo | grep ' mpx '
769 A thread may not switch in or out of long (64-bit) mode while MPX is
772 All threads in a process are affected by these calls.
776 inherits the state of MPX management.
779 MPX management is reset to a state as if
780 .BR PR_MPX_DISABLE_MANAGEMENT
783 For further information on Intel MPX, see the kernel source file
784 .IR Documentation/x86/intel_mpx.txt .
786 .BR PR_SET_NAME " (since Linux 2.6.9)"
787 Set the name of the calling thread,
788 using the value in the location pointed to by
789 .IR "(char\ *) arg2" .
790 The name can be up to 16 bytes long,
791 .\" TASK_COMM_LEN in include/linux/sched.h
792 including the terminating null byte.
793 (If the length of the string, including the terminating null byte,
794 exceeds 16 bytes, the string is silently truncated.)
795 This is the same attribute that can be set via
796 .BR pthread_setname_np (3)
798 .BR pthread_getname_np (3).
799 The attribute is likewise accessible via
800 .IR /proc/self/task/[tid]/comm ,
803 is the name of the calling thread.
805 .BR PR_GET_NAME " (since Linux 2.6.11)"
806 Return the name of the calling thread,
807 in the buffer pointed to by
808 .IR "(char\ *) arg2" .
809 The buffer should allow space for up to 16 bytes;
810 the returned string will be null-terminated.
812 .BR PR_SET_NO_NEW_PRIVS " (since Linux 3.5)"
813 Set the calling thread's
815 attribute to the value in
821 promises not to grant privileges to do anything
822 that could not have been done without the
825 rendering the set-user-ID and set-group-ID mode bits,
826 and file capabilities non-functional).
829 attribute cannot be unset.
830 The setting of this attribute is inherited by children created by
838 the value of a thread's
840 attribute can be viewed via the
843 .IR /proc/[pid]/status
846 For more information, see the kernel source file
847 .IR Documentation/userspace\-api/no_new_privs.rst
848 .\" commit 40fde647ccb0ae8c11d256d271e24d385eed595b
850 .IR Documentation/prctl/no_new_privs.txt
855 .BR PR_GET_NO_NEW_PRIVS " (since Linux 3.5)"
856 Return (as the function result) the value of the
858 attribute for the calling thread.
859 A value of 0 indicates the regular
862 A value of 1 indicates
864 will operate in the privilege-restricting mode described above.
866 .BR PR_SET_PDEATHSIG " (since Linux 2.1.57)"
867 Set the parent-death signal
868 of the calling process to \fIarg2\fP (either a signal value
869 in the range 1..maxsig, or 0 to clear).
870 This is the signal that the calling process will get when its
874 .\" https://bugzilla.kernel.org/show_bug.cgi?id=43300
875 the "parent" in this case is considered to be the
877 that created this process.
878 In other words, the signal will be sent when that thread terminates
880 .BR pthread_exit (3)),
881 rather than after all of the threads in the parent process terminate.
883 The parent-death signal is sent upon subsequent termination of the parent
884 thread and also upon termination of each subreaper process
885 (see the description of
886 .B PR_SET_CHILD_SUBREAPER
887 above) to which the caller is subsequently reparented.
888 If the parent thread and all ancestor subreapers have already terminated
891 operation, then no parent-death signal is sent to the caller.
893 The parent-death signal is process-directed (see
895 and, if the child installs a handler using the
902 argument of the handler contains the PID of the terminating parent process.
904 The parent-death signal setting is cleared for the child of a
907 (since Linux 2.4.36 / 2.6.23)
908 .\" commit d2d56c5f51028cb9f3d800882eb6f4cbd3f9099f
909 cleared when executing a set-user-ID or set-group-ID binary,
910 or a binary that has associated capabilities (see
911 .BR capabilities (7));
912 otherwise, this value is preserved across
915 .BR PR_GET_PDEATHSIG " (since Linux 2.3.15)"
916 Return the current value of the parent process death signal,
917 in the location pointed to by
918 .IR "(int\ *) arg2" .
920 .BR PR_SET_PTRACER " (since Linux 3.4)"
921 .\" commit 2d514487faf188938a4ee4fb3464eeecfbdcf8eb
922 .\" commit bf06189e4d14641c0148bea16e9dd24943862215
923 This is meaningful only when the Yama LSM is enabled and in mode 1
924 ("restricted ptrace", visible via
925 .IR /proc/sys/kernel/yama/ptrace_scope ).
926 When a "ptracer process ID" is passed in \fIarg2\fP,
927 the caller is declaring that the ptracer process can
929 the calling process as if it were a direct process ancestor.
932 operation replaces the previous "ptracer process ID".
937 set to 0 clears the caller's "ptracer process ID".
941 .BR PR_SET_PTRACER_ANY ,
942 the ptrace restrictions introduced by Yama are effectively disabled for the
945 For further information, see the kernel source file
946 .IR Documentation/admin\-guide/LSM/Yama.rst
947 .\" commit 90bb766440f2147486a2acc3e793d7b8348b0c22
949 .IR Documentation/security/Yama.txt
952 .BR PR_SET_SECCOMP " (since Linux 2.6.23)"
953 .\" See http://thread.gmane.org/gmane.linux.kernel/542632
954 .\" [PATCH 0 of 2] seccomp updates
955 .\" andrea@cpushare.com
956 Set the secure computing (seccomp) mode for the calling thread, to limit
957 the available system calls.
960 system call provides a superset of the functionality of
963 The seccomp mode is selected via
965 (The seccomp constants are defined in
966 .IR <linux/seccomp.h> .)
971 .BR SECCOMP_MODE_STRICT ,
972 the only system calls that the thread is permitted to make are
980 Other system calls result in the delivery of a
983 Strict secure computing mode is useful for number-crunching applications
984 that may need to execute untrusted byte code,
985 perhaps obtained by reading from a pipe or socket.
986 This operation is available only
987 if the kernel is configured with
994 .BR SECCOMP_MODE_FILTER " (since Linux 3.5),"
995 the system calls allowed are defined by a pointer
996 to a Berkeley Packet Filter passed in
998 This argument is a pointer to
999 .IR "struct sock_fprog" ;
1000 it can be designed to filter
1001 arbitrary system calls and system call arguments.
1002 This mode is available only if the kernel is configured with
1003 .B CONFIG_SECCOMP_FILTER
1007 .BR SECCOMP_MODE_FILTER
1010 then the seccomp mode is inherited by children created by
1014 is permitted, then the seccomp mode is preserved across
1016 If the filters permit
1018 calls, then additional filters can be added;
1019 they are run in order until the first non-allow result is seen.
1021 For further information, see the kernel source file
1022 .IR Documentation/userspace\-api/seccomp_filter.rst
1023 .\" commit c061f33f35be0ccc80f4b8e0aea5dfd2ed7e01a3
1025 .IR Documentation/prctl/seccomp_filter.txt
1028 .BR PR_GET_SECCOMP " (since Linux 2.6.23)"
1029 Return (as the function result)
1030 the secure computing mode of the calling thread.
1031 If the caller is not in secure computing mode, this operation returns 0;
1032 if the caller is in strict secure computing mode, then the
1036 signal to be sent to the process.
1037 If the caller is in filter mode, and this system call is allowed by the
1038 seccomp filters, it returns 2; otherwise, the process is killed with a
1041 This operation is available only
1042 if the kernel is configured with
1046 Since Linux 3.8, the
1049 .IR /proc/[pid]/status
1050 file provides a method of obtaining the same information,
1051 without the risk that the process is killed; see
1054 .BR PR_SET_SECUREBITS " (since Linux 2.6.26)"
1055 Set the "securebits" flags of the calling thread to the value supplied in
1058 .BR capabilities (7).
1060 .BR PR_GET_SECUREBITS " (since Linux 2.6.26)"
1061 Return (as the function result)
1062 the "securebits" flags of the calling thread.
1064 .BR capabilities (7).
1066 .BR PR_GET_SPECULATION_CTRL " (since Linux 4.17)"
1067 Return (as the function result)
1068 the state of the speculation misfeature specified in
1070 Currently, the only permitted value for this argument is
1071 .BR PR_SPEC_STORE_BYPASS
1072 (otherwise the call fails with the error
1075 The return value uses bits 0-3 with the following meaning:
1079 Mitigation can be controlled per thread by
1080 .B PR_SET_SPECULATION_CTRL
1083 The speculation feature is enabled, mitigation is disabled.
1086 The speculation feature is disabled, mitigation is enabled
1088 .BR PR_SPEC_FORCE_DISABLE
1091 but cannot be undone.
1095 then the CPU is not affected by the speculation misfeature.
1099 is set, then per-thread control of the mitigation is available.
1102 for the speculation misfeature will fail.
1109 arguments must be specified as 0; otherwise the call fails with the error
1112 .BR PR_SET_SPECULATION_CTRL " (since Linux 4.17)"
1113 .\" commit b617cfc858161140d69cc0b5cc211996b557a1c7
1114 .\" commit 356e4bfff2c5489e016fdb925adbf12a1e3950ee
1115 Sets the state of the speculation misfeature specified in
1117 Currently, the only permitted value for this argument is
1118 .B PR_SPEC_STORE_BYPASS
1119 (otherwise the call fails with the error
1121 This setting is a per-thread attribute.
1124 argument is used to hand in the control value,
1125 which is one of the following:
1129 The speculation feature is enabled, mitigation is disabled.
1132 The speculation feature is disabled, mitigation is enabled
1134 .BR PR_SPEC_FORCE_DISABLE
1137 but cannot be undone.
1140 prctl(..., PR_SPEC_ENABLE)
1141 will fail with the error
1147 will result in the call failing with the error
1154 arguments must be specified as 0; otherwise the call fails with the error
1157 The speculation feature can also be controlled by the
1158 .B spec_store_bypass_disable
1160 This parameter may enforce a read-only policy which will result in the
1162 call failing with the error
1164 For further details, see the kernel source file
1165 .IR Documentation/admin-guide/kernel-parameters.txt .
1167 .BR PR_SET_THP_DISABLE " (since Linux 3.15)"
1168 .\" commit a0715cc22601e8830ace98366c0c2bd8da52af52
1169 Set the state of the "THP disable" flag for the calling thread.
1172 has a nonzero value, the flag is set, otherwise it is cleared.
1173 Setting this flag provides a method
1174 for disabling transparent huge pages
1175 for jobs where the code cannot be modified, and using a malloc hook with
1177 is not an option (i.e., statically allocated data).
1178 The setting of the "THP disable" flag is inherited by a child created via
1180 and is preserved across
1184 .BR PR_TASK_PERF_EVENTS_DISABLE " (since Linux 2.6.31)"
1185 Disable all performance counters attached to the calling process,
1186 regardless of whether the counters were created by
1187 this process or another process.
1188 Performance counters created by the calling process for other
1189 processes are unaffected.
1190 For more information on performance counters, see the Linux kernel source file
1191 .IR tools/perf/design.txt .
1194 .BR PR_TASK_PERF_COUNTERS_DISABLE ;
1195 .\" commit 1d1c7ddbfab358445a542715551301b7fc363e28
1196 renamed (retaining the same numerical value)
1200 .BR PR_TASK_PERF_EVENTS_ENABLE " (since Linux 2.6.31)"
1202 .BR PR_TASK_PERF_EVENTS_DISABLE ;
1203 enable performance counters attached to the calling process.
1206 .BR PR_TASK_PERF_COUNTERS_ENABLE ;
1207 .\" commit 1d1c7ddbfab358445a542715551301b7fc363e28
1209 .\" commit cdd6c482c9ff9c55475ee7392ec8f672eddb7be6
1213 .BR PR_GET_THP_DISABLE " (since Linux 3.15)"
1214 Return (as the function result) the current setting of the "THP disable"
1215 flag for the calling thread:
1216 either 1, if the flag is set, or 0, if it is not.
1218 .BR PR_GET_TID_ADDRESS " (since Linux 3.5)"
1219 .\" commit 300f786b2683f8bb1ec0afb6e1851183a479c86d
1223 .BR set_tid_address (2)
1226 .B CLONE_CHILD_CLEARTID
1227 flag, in the location pointed to by
1228 .IR "(int\ **)\ arg2" .
1229 This feature is available only if the kernel is built with the
1230 .BR CONFIG_CHECKPOINT_RESTORE
1234 system call does not have a compat implementation for
1235 the AMD64 x32 and MIPS n32 ABIs,
1236 and the kernel writes out a pointer using the kernel's pointer size,
1237 this operation expects a user-space buffer of 8 (not 4) bytes on these ABIs.
1239 .BR PR_SET_TIMERSLACK " (since Linux 2.6.28)"
1240 .\" See https://lwn.net/Articles/369549/
1241 .\" commit 6976675d94042fbd446231d1bd8b7de71a980ada
1242 Each thread has two associated timer slack values:
1243 a "default" value, and a "current" value.
1244 This operation sets the "current" timer slack value for the calling thread.
1246 is an unsigned long value, then maximum "current" value is ULONG_MAX and
1247 the minimum "current" value is 1.
1248 If the nanosecond value supplied in
1250 is greater than zero, then the "current" value is set to this value.
1254 the "current" timer slack is reset to the
1255 thread's "default" timer slack value.
1257 The "current" timer slack is used by the kernel to group timer expirations
1258 for the calling thread that are close to one another;
1259 as a consequence, timer expirations for the thread may be
1260 up to the specified number of nanoseconds late (but will never expire early).
1261 Grouping timer expirations can help reduce system power consumption
1262 by minimizing CPU wake-ups.
1264 The timer expirations affected by timer slack are those set by
1270 .BR epoll_pwait (2),
1271 .BR clock_nanosleep (2),
1275 (and thus the library functions implemented via futexes, including
1276 .\" List obtained by grepping for futex usage in glibc source
1277 .BR pthread_cond_timedwait (3),
1278 .BR pthread_mutex_timedlock (3),
1279 .BR pthread_rwlock_timedrdlock (3),
1280 .BR pthread_rwlock_timedwrlock (3),
1282 .BR sem_timedwait (3)).
1284 Timer slack is not applied to threads that are scheduled under
1285 a real-time scheduling policy (see
1286 .BR sched_setscheduler (2)).
1288 When a new thread is created,
1289 the two timer slack values are made the same as the "current" value
1290 of the creating thread.
1291 Thereafter, a thread can adjust its "current" timer slack value via
1292 .BR PR_SET_TIMERSLACK .
1293 The "default" value can't be changed.
1294 The timer slack values of
1296 (PID 1), the ancestor of all processes,
1297 are 50,000 nanoseconds (50 microseconds).
1298 The timer slack value is inherited by a child created via
1300 and is preserved across
1303 Since Linux 4.6, the "current" timer slack value of any process
1304 can be examined and changed via the file
1305 .IR /proc/[pid]/timerslack_ns .
1309 .BR PR_GET_TIMERSLACK " (since Linux 2.6.28)"
1310 Return (as the function result)
1311 the "current" timer slack value of the calling thread.
1313 .BR PR_SET_TIMING " (since Linux 2.6.0)"
1314 .\" Precisely: Linux 2.6.0-test4
1315 Set whether to use (normal, traditional) statistical process timing or
1316 accurate timestamp-based process timing, by passing
1317 .B PR_TIMING_STATISTICAL
1320 .B PR_TIMING_TIMESTAMP
1323 .B PR_TIMING_TIMESTAMP
1324 is not currently implemented
1325 (attempting to set this mode will yield the error
1327 .\" PR_TIMING_TIMESTAMP doesn't do anything in 2.6.26-rc8,
1328 .\" and looking at the patch history, it appears
1329 .\" that it never did anything.
1331 .BR PR_GET_TIMING " (since Linux 2.6.0)"
1332 .\" Precisely: Linux 2.6.0-test4
1333 Return (as the function result) which process timing method is currently
1336 .BR PR_SET_TSC " (since Linux 2.6.26, x86 only)"
1337 Set the state of the flag determining whether the timestamp counter
1338 can be read by the process.
1343 to allow it to be read, or
1347 when the process tries to read the timestamp counter.
1349 .BR PR_GET_TSC " (since Linux 2.6.26, x86 only)"
1350 Return the state of the flag determining whether the timestamp counter
1352 in the location pointed to by
1353 .IR "(int\ *) arg2" .
1356 (Only on: ia64, since Linux 2.3.48; parisc, since Linux 2.6.15;
1357 PowerPC, since Linux 2.6.18; Alpha, since Linux 2.6.22;
1358 .\" sh: 94ea5e449ae834af058ef005d16a8ad44fcf13d6
1359 .\" tile: 2f9ac29eec71a696cb0dcc5fb82c0f8d4dac28c9
1360 sh, since Linux 2.6.34; tile, since Linux 3.12)
1361 Set unaligned access control bits to \fIarg2\fP.
1363 \fBPR_UNALIGN_NOPRINT\fP to silently fix up unaligned user accesses,
1364 or \fBPR_UNALIGN_SIGBUS\fP to generate
1366 on unaligned user access.
1367 Alpha also supports an additional flag with the value
1368 of 4 and no corresponding named constant,
1369 which instructs kernel to not fix up
1370 unaligned accesses (it is analogous to providing the
1376 system call on Tru64).
1381 for information on versions and architectures.)
1382 Return unaligned access control bits, in the location pointed to by
1383 .IR "(unsigned int\ *) arg2" .
1385 .BR PR_SET_IO_FLUSHER " (since Linux 5.6)"
1386 If a user process is involved in the block layer or filesystem I/O path,
1387 and can allocate memory while processing I/O requests it must set
1389 This will put the process in the IO_FLUSHER state,
1390 which allows it special treatment to make progress when allocating memory.
1391 If \fIarg2\fP is 0, the process will clear the IO_FLUSHER state, and
1392 the default behavior will be used.
1394 The calling process must have the
1395 .BR CAP_SYS_RESOURCE
1404 The IO_FLUSHER state is inherited by a child process created via
1406 and is preserved across
1409 Examples of IO_FLUSHER applications are FUSE daemons, SCSI device
1410 emulation daemons, and daemons that perform error handling like multipath
1411 path recovery applications.
1413 .B PR_GET_IO_FLUSHER (Since Linux 5.6)
1414 Return (as the function result) the IO_FLUSHER state of the caller.
1415 A value of 1 indicates that the caller is in the IO_FLUSHER state;
1416 0 indicates that the caller is not in the IO_FLUSHER state.
1418 The calling process must have the
1419 .BR CAP_SYS_RESOURCE
1430 .BR PR_GET_DUMPABLE ,
1431 .BR PR_GET_FP_MODE ,
1432 .BR PR_GET_KEEPCAPS ,
1433 .BR PR_GET_NO_NEW_PRIVS ,
1434 .BR PR_GET_THP_DISABLE ,
1435 .BR PR_CAPBSET_READ ,
1437 .BR PR_GET_TIMERSLACK ,
1438 .BR PR_GET_SECUREBITS ,
1439 .BR PR_GET_SPECULATION_CTRL ,
1440 .BR PR_MCE_KILL_GET ,
1441 .BR PR_CAP_AMBIENT + PR_CAP_AMBIENT_IS_SET ,
1442 .BR PR_GET_IO_FLUSHER ,
1445 return the nonnegative values described above.
1448 values return 0 on success.
1449 On error, \-1 is returned, and
1451 is set appropriately.
1461 .BR SECCOMP_MODE_FILTER ,
1462 but the process does not have the
1464 capability or has not set the
1466 attribute (see the discussion of
1467 .BR PR_SET_NO_NEW_PRIVS
1477 .BR PR_SET_MM_EXE_FILE ,
1478 the file is not executable.
1486 .BR PR_SET_MM_EXE_FILE ,
1487 and the file descriptor passed in
1497 .BR PR_SET_MM_EXE_FILE ,
1498 and this the second attempt to change the
1500 symbolic link, which is prohibited.
1504 is an invalid address.
1509 .BR PR_SET_SECCOMP ,
1512 .BR SECCOMP_MODE_FILTER ,
1513 the system was built with
1514 .BR CONFIG_SECCOMP_FILTER ,
1517 is an invalid address.
1534 arguments were not specified as zero.
1538 is not valid value for this
1546 .BR PR_GET_SECCOMP ,
1547 and the kernel was not configured with
1548 .BR CONFIG_SECCOMP .
1553 .BR PR_SET_SECCOMP ,
1556 .BR SECCOMP_MODE_FILTER ,
1557 and the kernel was not configured with
1558 .BR CONFIG_SECCOMP_FILTER .
1564 and one of the following is true
1575 (the limit on the size of the user address space for this architecture);
1579 .BR PR_SET_MM_START_CODE ,
1580 .BR PR_SET_MM_END_CODE ,
1581 .BR PR_SET_MM_START_DATA ,
1582 .BR PR_SET_MM_END_DATA ,
1584 .BR PR_SET_MM_START_STACK ,
1585 and the permissions of the corresponding memory area are not as required;
1589 .BR PR_SET_MM_START_BRK
1594 is less than or equal to the end of the data segment
1595 or specifies a value that would cause the
1597 resource limit to be exceeded.
1607 .BR PR_SET_PTRACER_ANY ,
1608 or the PID of an existing process.
1616 is not a valid signal number.
1625 .B SUID_DUMP_DISABLE
1627 .BR SUID_DUMP_USER .
1636 .BR PR_TIMING_STATISTICAL .
1641 .BR PR_SET_NO_NEW_PRIVS
1655 .BR PR_GET_NO_NEW_PRIVS
1667 .BR PR_SET_THP_DISABLE
1678 .BR PR_GET_THP_DISABLE
1691 and an unused argument
1696 .BR PR_CAP_AMBIENT_CLEAR_ALL ,
1700 has an invalid value;
1704 .BR PR_CAP_AMBIENT_LOWER ,
1705 .BR PR_CAP_AMBIENT_RAISE ,
1707 .BR PR_CAP_AMBIENT_IS_SET
1710 does not specify a valid capability.
1715 .BR PR_GET_SPECULATION_CTRL
1717 .BR PR_SET_SPECULATION_CTRL
1718 and unused arguments to
1725 .BR PR_SET_SPECULATION_CTRL
1726 the kernel or CPU does not support the requested speculation misfeature.
1731 .BR PR_MPX_ENABLE_MANAGEMENT
1733 .BR PR_MPX_DISABLE_MANAGEMENT
1734 and the kernel or the CPU does not support MPX management.
1735 Check that the kernel and processor have MPX support.
1740 .BR PR_SET_SPECULATION_CTRL
1741 implies that the control of the selected speculation misfeature is not possible.
1743 .BR PR_GET_SPECULATION_CTRL
1744 for the bit fields to determine which option is available.
1752 has an invalid or unsupported value.
1757 .BR PR_SET_SECUREBITS ,
1758 and the caller does not have the
1761 or tried to unset a "locked" flag,
1762 or tried to set a flag whose corresponding locked flag was set
1764 .BR capabilities (7)).
1769 .BR PR_SET_SPECULATION_CTRL
1770 wherein the speculation was disabled with
1771 .B PR_SPEC_FORCE_DISABLE
1772 and caller tried to enable it again.
1777 .BR PR_SET_KEEPCAPS ,
1779 .B SECBIT_KEEP_CAPS_LOCKED
1782 .BR capabilities (7)).
1787 .BR PR_CAPBSET_DROP ,
1788 and the caller does not have the
1796 and the caller does not have the
1807 .BR PR_CAP_AMBIENT_RAISE ,
1808 but either the capability specified in
1810 is not present in the process's permitted and inheritable capability sets,
1812 .B PR_CAP_AMBIENT_LOWER
1813 securebit has been set.
1818 .BR PR_SET_SPECULATION_CTRL
1822 .BR PR_SPEC_ENABLE ,
1823 .BR PR_SPEC_DISABLE ,
1825 .BR PR_SPEC_FORCE_DISABLE .
1829 system call was introduced in Linux 2.1.57.
1830 .\" The library interface was added in glibc 2.0.6
1832 This call is Linux-specific.
1835 system call (also introduced in Linux 2.1.44
1836 as irix_prctl on the MIPS architecture),
1841 .BI "ptrdiff_t prctl(int " option ", int " arg2 ", int " arg3 );
1845 and options to get the maximum number of processes per user,
1846 get the maximum number of processors the calling process can use,
1847 find out whether a specified process is currently blocked,
1848 get or set the maximum stack size, and so on.