git.ipfire.org Git - thirdparty/kernel/linux.git/commit

author	Christoph Hellwig <hch@lst.de>
	Wed, 25 Jun 2025 11:35:03 +0000 (13:35 +0200)
committer	Jens Axboe <axboe@kernel.dk>
	Mon, 30 Jun 2025 21:50:53 +0000 (15:50 -0600)
commit	7ce3c1dd78fca86ea8b9aee370db10c7a8cfc3c2
tree	07fc6ee37c261567ecc8065d3bcd8b6e5a9cf649	tree \| snapshot
parent	deecd1c49cd36df206a20afb82399cbc9058c16d	commit \| diff

nvme-pci: convert the data mapping to blk_rq_dma_map

Use the blk_rq_dma_map API to DMA map requests instead of scatterlists.
This removes the need to allocate a scatterlist covering every segment,
and thus the overall transfer length limit based on the scatterlist
allocation.

Instead the DMA mapping is done by iterating the bio_vec chain in the
request directly.  The unmap is handled differently depending on how
we mapped:

- when using an IOMMU only a single IOVA is used, and it is stored in
   iova_state
- for direct mappings that don't use swiotlb and are cache coherent,
   unmap is not needed at all
- for direct mappings that are not cache coherent or use swiotlb, the
   physical addresses are rebuild from the PRPs or SGL segments

The latter unfortunately adds a fair amount of code to the driver, but
it is code not used in the fast path.

The conversion only covers the data mapping path, and still uses a
scatterlist for the multi-segment metadata case.  I plan to convert that
as soon as we have good test coverage for the multi-segment metadata
path.

Thanks to Chaitanya Kulkarni for an initial attempt at a new DMA API
conversion for nvme-pci, Kanchan Joshi for bringing back the single
segment optimization, Leon Romanovsky for shepherding this through a
gazillion rebases and Nitesh Shetty for various improvements.

Signed-off-by: Christoph Hellwig <hch@lst.de>
Reviewed-by: Keith Busch <kbusch@kernel.org>
Reviewed-by: Leon Romanovsky <leonro@nvidia.com>
Link: https://lore.kernel.org/r/20250625113531.522027-7-hch@lst.de
Signed-off-by: Jens Axboe <axboe@kernel.dk>