git.ipfire.org Git - thirdparty/kernel/stable.git/commit

author	Lang Yu <Lang.Yu@amd.com>
	Fri, 26 Apr 2024 06:56:35 +0000 (14:56 +0800)
committer	Greg Kroah-Hartman <gregkh@linuxfoundation.org>
	Thu, 11 Jul 2024 10:51:21 +0000 (12:51 +0200)
commit	8d656c7a0fed8e1675a170d0d17039fe713beb59
tree	fe467dcaefb345f656ef226ea7b83f477ec285b4
parent	d7aeca648788f417953ea9c9834740e8731c82c2

drm/amdkfd: Let VRAM allocations go to GTT domain on small APUs

[ Upstream commit eb853413d02c8d9b27942429b261a9eef228f005 ]

Small APUs (i.e., consumer and embedded products) usually have a small
carveout of device memory, which can't satisfy the memory allocation
requirements of most compute workloads.

We can't even run the basic MNIST example
(https://github.com/pytorch/examples/tree/main/mnist) with a default
512MB carveout. Error log:

"torch.cuda.OutOfMemoryError: HIP out of memory. Tried to allocate
84.00 MiB. GPU 0 has a total capacity of 512.00 MiB of which 0 bytes
is free. Of the allocated memory 103.83 MiB is allocated by PyTorch,
and 22.17 MiB is reserved by PyTorch but unallocated"

We can change BIOS settings to enlarge the carveout size, but that is
inflexible and may draw complaints. A fixed carveout also means the
memory resource can't be shared effectively between host and device.

The solution is the MI300A approach, i.e., let VRAM allocations go to GTT.
Then device and host can flexibly and effectively share the memory resource.

v2: Report local_mem_size_private as 0. (Felix)
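
The idea above can be illustrated with a minimal sketch. This is not the
patch itself: the enum, the struct, its is_small_apu flag, and both
functions are hypothetical names invented for illustration, and the sketch
assumes the v2 change (reporting local_mem_size_private as 0) applies only
on the small-APU path.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <stddef.h>

/* Hypothetical stand-ins, not the real amdgpu/amdkfd definitions. */
enum sketch_domain { SKETCH_DOMAIN_VRAM, SKETCH_DOMAIN_GTT };

struct sketch_device {
	bool is_small_apu;       /* assumed flag: APU with a small carveout */
	uint64_t carveout_bytes; /* size of the dedicated device memory */
};

/*
 * Choose the backing domain for a request that asked for VRAM.
 * On a small APU the request is redirected to GTT (system memory
 * mapped for the GPU) instead of the carveout.
 */
enum sketch_domain sketch_pick_domain(const struct sketch_device *dev)
{
	assert(dev != NULL);
	return dev->is_small_apu ? SKETCH_DOMAIN_GTT : SKETCH_DOMAIN_VRAM;
}

/*
 * Size reported as private local memory. Assumption: 0 on the
 * small-APU path, since VRAM requests no longer land in the carveout.
 */
uint64_t sketch_local_mem_size_private(const struct sketch_device *dev)
{
	assert(dev != NULL);
	return dev->is_small_apu ? 0 : dev->carveout_bytes;
}
```

In this sketch a device with a 512MB carveout and is_small_apu set would
serve VRAM requests from GTT and report 0 bytes of private local memory,
while any other device keeps its existing behavior.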

Signed-off-by: Lang Yu <Lang.Yu@amd.com>
Reviewed-by: Felix Kuehling <felix.kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Signed-off-by: Sasha Levin <sashal@kernel.org>

drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd.c
drivers/gpu/drm/amd/amdgpu/amdgpu_amdkfd_gpuvm.c
drivers/gpu/drm/amd/amdkfd/kfd_migrate.c
drivers/gpu/drm/amd/amdkfd/kfd_svm.c
drivers/gpu/drm/amd/amdkfd/kfd_svm.h