From: Philip Yang
Date: Mon, 10 Feb 2025 14:42:31 +0000 (-0500)
Subject: drm/amdkfd: debugfs hang_hws skip GPU with MES
X-Git-Tag: v6.15-rc1~120^2~12^2~88
X-Git-Url: http://git.ipfire.org/?a=commitdiff_plain;h=fe9d0061c413f8fb8c529b18b592b04170850ded;p=thirdparty%2Fkernel%2Flinux.git

drm/amdkfd: debugfs hang_hws skip GPU with MES

The debugfs hang_hws interface is used by the GPU reset test with HWS.
For MES, this crashes the kernel with a NULL pointer access because
dqm->packet_mgr is not set up for the MES path.

Skip GPUs with MES for now; the MES hang_hws debugfs interface will be
supported later.

Signed-off-by: Philip Yang
Reviewed-by: Kent Russell
Reviewed-by: Felix Kuehling
Signed-off-by: Alex Deucher
---

diff --git a/drivers/gpu/drm/amd/amdkfd/kfd_device.c b/drivers/gpu/drm/amd/amdkfd/kfd_device.c
index 095e73790dc48..b9c82be6ce134 100644
--- a/drivers/gpu/drm/amd/amdkfd/kfd_device.c
+++ b/drivers/gpu/drm/amd/amdkfd/kfd_device.c
@@ -1604,6 +1604,11 @@ int kfd_debugfs_hang_hws(struct kfd_node *dev)
 		return -EINVAL;
 	}
 
+	if (dev->kfd->shared_resources.enable_mes) {
+		dev_err(dev->adev->dev, "Inducing MES hang is not supported\n");
+		return -EINVAL;
+	}
+
 	return dqm_debugfs_hang_hws(dev->dqm);
 }
 
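
For readers outside the kernel tree, the guard ordering added by this patch can be sketched as a small userspace mock. This is only an illustration, not the driver code: every mock_* name and the MOCK_POLICY_* values are hypothetical stand-ins, the packet manager is reduced to a counter, and the first check is assumed to be a scheduling-policy test because only its tail (return -EINVAL) is visible in the hunk context.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical, simplified stand-ins for the kernel structures. */
enum mock_sched_policy { MOCK_POLICY_HWS, MOCK_POLICY_NO_HWS };

struct mock_packet_mgr {
	int hang_requests; /* counts how often a hang was requested */
};

struct mock_node {
	enum mock_sched_policy sched_policy; /* assumption: first guard tests this */
	bool enable_mes;                     /* mirrors shared_resources.enable_mes */
	struct mock_packet_mgr *packet_mgr;  /* left NULL on the MES path */
};

/*
 * Stand-in for dqm_debugfs_hang_hws(): it uses packet_mgr unconditionally,
 * so the caller has to reject MES nodes first.
 */
static int mock_dqm_hang_hws(struct mock_node *node)
{
	assert(node->packet_mgr != NULL); /* the kernel would oops here instead */
	node->packet_mgr->hang_requests++;
	return 0;
}

/* Stand-in for kfd_debugfs_hang_hws() with the new MES guard. */
static int mock_debugfs_hang_hws(struct mock_node *node)
{
	if (node->sched_policy != MOCK_POLICY_HWS) {
		fprintf(stderr, "HWS is not enabled\n");
		return -EINVAL;
	}

	/* New in this patch: bail out before packet_mgr is ever touched. */
	if (node->enable_mes) {
		fprintf(stderr, "Inducing MES hang is not supported\n");
		return -EINVAL;
	}

	return mock_dqm_hang_hws(node);
}
```

In this mock, a node with enable_mes set and a NULL packet_mgr gets -EINVAL from mock_debugfs_hang_hws() and never reaches the dereference, while a plain HWS node still has its hang request forwarded.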