git.ipfire.org Git - thirdparty/kernel/linux.git/commit

LoongArch: Add THREAD_INFO_IN_TASK implementation

Like other architectures such as x86, arm64, riscv, powerpc and s390,
select THREAD_INFO_IN_TASK for LoongArch to move thread_info off the
stack into task_struct. This follows modern kernel standards and also
makes the system more secure.

With this patch, thread_info is included in task_struct at an offset
of 0 instead of being placed at the bottom of the kernel stack. Thus,
the $tp register points to both thread_info and task_struct.

To support this, introduce a per-CPU variable cpu_tasks to store the
pointer to the current task_struct. This decouples the recovery of the
$tp register from the stack pointer during exception entry.

Then initialize cpu_tasks for the primary and secondary CPUs during
arch-specific setup and SMP boot paths. To eliminate the dangerous
windows during the early initialization where the cpu_tasks remains
uninitialized, set_current() is invoked as early as possible in both
setup_arch() and start_secondary(). This ensures the $tp recovery
barrier is armed in case any early boot exceptions or kernel panics
occur.

Modify SAVE_SOME and handle_syscall to restore the $tp register from
cpu_tasks, and also use the la_abs absolute addressing for cpu_tasks
access in assembly to bypass the relocation limits within exception
handling sections. By advancing the preservation of u0 in SAVE_SOME,
we reuse the PERCPU_BASE_KS value in u0 for the cpu_tasks calculation,
effectively eliminating a duplicate csrrd instruction execution on SMP
platforms.

Update <asm/switch_to.h> and <kernel/switch.S> to fully support the
CONFIG_THREAD_INFO_IN_TASK feature.

Remove the obsolete next_ti argument from __switch_to(), which shifts
the remaining arguments ahead in the calling convention (sched_ra from
a3 to a2, and sched_cfa from a4 to a3). Under the new configuration,
__switch_to() now directly derives the thread pointer ($tp) from the
next task_struct pointer in a1.

To preserve the optimal and clean "move tp, a1" path for 64-bit kernels,
the thread pointer ($tp) is assigned directly from a1 in the core path.
For 32-bit kernels, where a1 carries a 2000-byte structural pointer bias
at entry, an explicit adjustment "PTR_ADDI tp, tp, -TASK_STRUCT_OFFSET"
is introduced at the function exit.

In the context of __switch_to(), local interrupts are disabled, and the
kernel is in a critical switching phase where handling any synchronous
exception is practically impossible and prohibited.

If any synchronous exception or watchpoint does trigger in this narrow
window, it constitutes a fatal double fault and the kernel is expected
to die/panic immediately anyway. Therefore, the temporary biased value
in $tp is safe and acceptable here.

Additionally, evaluate the stack lookup as a single load instruction
"LONG_LPTR t0, a1, (TASK_STACK - TASK_STRUCT_OFFSET)", this perfectly
satisfies both 32-bit and 64-bit kernels. Using the "next" pointer in
a1 as the base register, rather than $tp, effectively unchains the data
dependency (RAW hazard) from the preceding move instruction, maximizing
the instruction-level parallelism and superscalar execution efficiency
while naturally adapting the structural shift.

With CONFIG_THREAD_INFO_IN_TASK enabled, the kernel stack life cycle is
decoupled from task_struct and can be freed concurrently.

Currently, show_stacktrace() reads raw stack data via __get_addr() and
subsequently calls show_backtrace() to unwind the frame, without holding
any reference to the target task's stack. If show_stacktrace() is called
on a concurrently exiting task, it could attempt to read from a freed or
reallocated kernel stack. This introduces a severe use-after-free (UAF)
read risk or kernel panics.

Wrap the entire stack inspection process inside show_stacktrace() with
a try_get_task_stack() and put_task_stack() pair. This ensures the task
stack remains pinned safely during both the raw stack data dump loop and
the subsequent stack unwinding phase.

Also, ensure that the task pointer is initialized to "current" early if
it is NULL, so that try_get_task_stack() always operates on a valid task
reference.

Signed-off-by: Tiezhu Yang <yangtiezhu@loongson.cn>
Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>

author	Tiezhu Yang <yangtiezhu@loongson.cn>
	Thu, 25 Jun 2026 05:03:47 +0000 (13:03 +0800)
committer	Huacai Chen <chenhuacai@loongson.cn>
	Thu, 25 Jun 2026 05:03:47 +0000 (13:03 +0800)
commit	7b5944d6ed369e43aeaf37beba9f89f7fb6c633b
tree	441bb6231c14f5e15c82649d54f81d51d822fb8d	tree \| snapshot
parent	872cc6abda1507429b97cc7b42a9dae51ee0a668	commit \| diff

Documentation/features/core/thread-info-in-task/arch-support.txt		diff \| blob \| blame \| history
arch/loongarch/Kconfig		diff \| blob \| blame \| history
arch/loongarch/include/asm/current.h	[new file with mode: 0644]	blob
arch/loongarch/include/asm/percpu.h		diff \| blob \| blame \| history
arch/loongarch/include/asm/smp.h		diff \| blob \| blame \| history
arch/loongarch/include/asm/stackframe.h		diff \| blob \| blame \| history
arch/loongarch/include/asm/switch_to.h		diff \| blob \| blame \| history
arch/loongarch/include/asm/thread_info.h		diff \| blob \| blame \| history
arch/loongarch/kernel/asm-offsets.c		diff \| blob \| blame \| history
arch/loongarch/kernel/entry.S		diff \| blob \| blame \| history
arch/loongarch/kernel/head.S		diff \| blob \| blame \| history
arch/loongarch/kernel/process.c		diff \| blob \| blame \| history
arch/loongarch/kernel/relocate.c		diff \| blob \| blame \| history
arch/loongarch/kernel/setup.c		diff \| blob \| blame \| history
arch/loongarch/kernel/smp.c		diff \| blob \| blame \| history
arch/loongarch/kernel/switch.S		diff \| blob \| blame \| history
arch/loongarch/kernel/traps.c		diff \| blob \| blame \| history