net/mlx5e: SHAMPO, Switch to header memcpy

Previously, the HW-GRO code used a separate page_pool for the header
buffer, and the pages of the header buffer were replenished via UMR.
This mechanism has several drawbacks:
- Reference counting on the page_pool page frags is not cheap.
- UMRs incur HW overhead both for updates and for access, especially
  for the KLM type that was previously used.
- The UMR code for headers is complex.
This patch switches to a static memory area (a static MTT MKEY) for
the header buffer and memcpys the headers instead. The copy happens
only once per GRO session. The SKB is allocated from the per-CPU NAPI
SKB cache.
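
As a rough illustration of the new receive flow (names like
build_hdr_skb(), hdr and hdr_len are hypothetical, not the driver's
actual identifiers; this is a sketch, not the real mlx5e code):

  #include <linux/skbuff.h>

  /* Sketch: build the SKB for a HW-GRO session by copying the packet
   * headers out of the statically mapped header buffer.
   */
  static struct sk_buff *build_hdr_skb(struct napi_struct *napi,
                                       const void *hdr,
                                       unsigned int hdr_len)
  {
          struct sk_buff *skb;

          /* Allocate from the per-CPU NAPI SKB cache. */
          skb = napi_alloc_skb(napi, hdr_len);
          if (unlikely(!skb))
                  return NULL;

          /* The single header copy for this GRO session. */
          skb_put_data(skb, hdr, hdr_len);

          /* Payload pages would then be attached as frags (e.g. via
           * skb_add_rx_frag()), so no page refcounting is needed for
           * the header area itself.
           */
          return skb;
  }
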
Performance numbers for x86:

+----------------------------------------------------------+
| Test                | Baseline    | Header Copy | Change |
|---------------------+-------------+-------------+--------|
| iperf3 oncpu        | 59.50 Gbps  | 64.00 Gbps  |    7 % |
| iperf3 offcpu       | 102.50 Gbps | 104.20 Gbps |    2 % |
| kperf oncpu         | 115.00 Gbps | 130.00 Gbps |   12 % |
| XDP_DROP (skb mode) | 3.90 Mpps   | 3.90 Mpps   |    0 % |
+----------------------------------------------------------+

Notes on tests:
- System: Intel(R) Xeon(R) Platinum 8380 CPU @ 2.30GHz
- oncpu: NAPI and application running on same CPU
- offcpu: NAPI and application running on different CPUs
- MTU: 1500
- iperf3 tests are single stream, 60s with IPv6 (for slightly larger
headers)
- kperf version from [1]

[1] git://git.kernel.dk/kperf.git

Suggested-by: Eric Dumazet <edumazet@google.com>
Signed-off-by: Dragos Tatulea <dtatulea@nvidia.com>
Signed-off-by: Tariq Toukan <tariqt@nvidia.com>
Reviewed-by: Jacob Keller <jacob.e.keller@intel.com>
Link: https://patch.msgid.link/20260204200345.1724098-1-tariqt@nvidia.com
Signed-off-by: Jakub Kicinski <kuba@kernel.org>