git.ipfire.org Git - thirdparty/man-pages.git/commit

author	Michal Hocko <mhocko@suse.cz>
	Thu, 14 May 2015 13:27:13 +0000 (15:27 +0200)
committer	Michael Kerrisk <mtk.manpages@gmail.com>
	Thu, 14 May 2015 13:31:11 +0000 (15:31 +0200)
commit	7e3786bcdce27f99e547cd1b3bc13e7c85cf46e6
tree	30a7c4378d85b67e8f17d59bf4c80efa314339e3	tree \| snapshot
parent	79ae0b1fbd496f488c97c700c217e03044f40464	commit \| diff

mmap.2: Clarify MAP_LOCKED semantics

MAP_LOCKED had a subtly different semantic from mmap(2)+mlock(2)
since it has been introduced.
mlock(2) fails if the memory range cannot get populated to
guarantee that no future major faults will happen on the range.
mmap(MAP_LOCKED) on the other hand silently succeeds even if
the range was populated only partially.

Fixing this subtle difference in the kernel is rather awkward
because the memory population happens after mm locks have been
dropped and so the cleanup before returning failure (munlock)
could operate on something else than the originally mapped area.

E.g. speculative userspace page fault handler catching SEGV and
doing mmap(fault_addr, MAP_FIXED|MAP_LOCKED) might discard portion
of a racing mmap and lead to lost data. Although it is not clear
whether such a usage would be valid, mmap page doesn't explicitly
describe requirements for threaded applications so we cannot
exclude this possibility.

This patch makes the semantic of MAP_LOCKED explicit and suggests
using mmap + mlock as the only way to guarantee no later major
page faults.

Reviewed-by: Eric B Munson <emunson@akamai.com>
Signed-off-by: Michal Hocko <mhocko@suse.cz>
Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>