git.ipfire.org Git - thirdparty/kernel/stable.git/commit
drm/amdgpu: fix KIQ ring test fail in TDR of SRIOV
author Monk Liu <Monk.Liu@amd.com>
Tue, 17 Dec 2019 10:16:44 +0000 (18:16 +0800)
committer Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Mon, 24 Feb 2020 07:38:17 +0000 (08:38 +0100)
commit 7429e7ff873c77044f4d714633e9bd8578063c06
tree dcad9094d7fe3d269ff61b471229cbe2eba9e4d7
parent 02dbc003c9661ad05059aad49bf9bd29f4fb9ef9

[ Upstream commit 5a7489a7e189ee2be889485f90c8cf24ea4b9a40 ]

issue:
MEC is ruined by amdkfd_pre_reset() when it runs after the VF FLR is done

fix:
amdkfd_pre_reset() ruins MEC if it runs after the hypervisor has finished
the VF FLR. The correct sequence is to call amdkfd_pre_reset() before the
VF FLR, but there is a limitation that blocks this sequence:
if we call pre_reset() before the VF FLR, it goes through KIQ for register
access and gets stuck there, because KIQ probably won't work by that time
(e.g. GFX has already hung)

so the best option right now is to simply remove the amdkfd_pre_reset() call.

Signed-off-by: Monk Liu <Monk.Liu@amd.com>
Reviewed-by: Emily Deng <Emily.Deng@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Sasha Levin <sashal@kernel.org>
drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
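
The ordering argument in the commit message can be illustrated with a small toy model. This is only a sketch under stated assumptions, not the driver code: every name below (struct toy_vf, toy_pre_reset(), toy_vf_flr(), toy_kiq_ring_test(), toy_tdr()) is hypothetical, and the model further assumes that a pre_reset issued before the FLR is harmless when KIQ still works.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Toy model only: none of these types or helpers exist in amdgpu. */
enum pre_reset_order {
    PRE_RESET_BEFORE_FLR,  /* the "correct" order, blocked by KIQ */
    PRE_RESET_AFTER_FLR,   /* the old behaviour described as the issue */
    PRE_RESET_SKIPPED      /* what the commit message settles on */
};

struct toy_vf {
    bool kiq_alive;   /* false once GFX has hung */
    bool flr_done;    /* hypervisor has finished the VF FLR */
    bool mec_intact;  /* false once MEC state has been ruined */
    bool stuck;       /* a register access via KIQ never returned */
};

static void toy_pre_reset(struct toy_vf *vf)
{
    assert(vf != NULL);
    if (!vf->flr_done) {
        /* Before the FLR, register access goes through KIQ. */
        if (!vf->kiq_alive)
            vf->stuck = true;
        return;
    }
    /* After the FLR, pre_reset ruins the freshly reset MEC. */
    vf->mec_intact = false;
}

static void toy_vf_flr(struct toy_vf *vf)
{
    /* Assumption: the FLR leaves the VF with a clean MEC. */
    vf->flr_done = true;
    vf->mec_intact = true;
}

static bool toy_kiq_ring_test(const struct toy_vf *vf)
{
    /* KIQ is serviced by MEC, so the test needs an intact MEC. */
    return !vf->stuck && vf->mec_intact;
}

/* Returns true when the modelled TDR ends with a passing KIQ ring test. */
bool toy_tdr(enum pre_reset_order order, bool gfx_hung)
{
    struct toy_vf vf = {
        .kiq_alive = !gfx_hung,
        .flr_done = false,
        .mec_intact = true,
        .stuck = false,
    };

    if (order == PRE_RESET_BEFORE_FLR)
        toy_pre_reset(&vf);
    if (vf.stuck)
        return false;  /* recovery never reaches the FLR */

    toy_vf_flr(&vf);

    if (order == PRE_RESET_AFTER_FLR)
        toy_pre_reset(&vf);

    return toy_kiq_ring_test(&vf);
}
```

In this model, with GFX hung, toy_tdr() returns false for both PRE_RESET_BEFORE_FLR (stuck on KIQ) and PRE_RESET_AFTER_FLR (MEC ruined), and true only for PRE_RESET_SKIPPED, which mirrors the reasoning given for removing the call.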