]> git.ipfire.org Git - thirdparty/kernel/linux.git/commit
drm/amdgpu: Fix error handling in slot reset
authorLijo Lazar <lijo.lazar@amd.com>
Tue, 24 Feb 2026 04:48:51 +0000 (10:18 +0530)
committerAlex Deucher <alexander.deucher@amd.com>
Wed, 25 Feb 2026 22:57:55 +0000 (17:57 -0500)
commitb57c4ec98c17789136a4db948aec6daadceb5024
treeea9cfc65f48f79d61c3e40f044ebbe6167e98894
parenta5fe1a54513196e4bc8f9170006057dc31e7155e
drm/amdgpu: Fix error handling in slot reset

If the device has not recovered after slot reset is called, it goes to
out label for error handling. There it could make decision based on
uninitialized hive pointer and could result in accessing an uninitialized
list.

Initialize the list and hive properly so that it handles the error
situation and also releases the reset domain lock which is acquired
during error_detected callback.

Fixes: 732c6cefc1ec ("drm/amdgpu: Replace tmp_adev with hive in amdgpu_pci_slot_reset")
Signed-off-by: Lijo Lazar <lijo.lazar@amd.com>
Reviewed-by: Ce Sun <cesun102@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
(cherry picked from commit bb71362182e59caa227e4192da5a612b09349696)
drivers/gpu/drm/amd/amdgpu/amdgpu_device.c