git.ipfire.org Git - thirdparty/kernel/linux.git/commit

net/mlx5e: Use regular ICOSQ for triggering NAPI

Before the cited commit, ICOSQ is used to post NOP WQE to trigger
hardware interrupt and start NAPI, but this mechanism suffers from
a race condition: mlx5e_alloc_rx_mpwqe may post UMR WQEs to ICOSQ
_before_ NOP WQE is posted. The cited commit fixes the issue by
replacing ICOSQ with async ICOSQ, as a new way to post the NOP WQE
to trigger the hardware interrupt and NAPI.

The patch changes it back by replacing async ICOSQ with regular
ICOSQ, for the purpose of saving memory in later patches, and solves
the issue by adding a new SQ state, MLX5E_SQ_STATE_LOCK_NEEDED
for syncing the start of NAPI.

What it does:
- Switch trigger path from async ICOSQ to regular ICOSQ to reduce
  need for async SQ.
- Introduce MLX5E_SQ_STATE_LOCK_NEEDED and mlx5e_icosq_sync_lock(),
  unlock() to prevent the race where UMR WQEs could be posted before
  the NOP WQE used to trigger NAPI.
- Use synchronize_net() once per trigger cycle to quiesce in-flight
  softirqs before serializing the NOP WQE and any UMR postings via
  the ICOSQ lock.
- Wrap ICOSQ UMR posting in en_rx.c and xsk/rx.c with the new
  conditional lock.

The conditional locking approach is critical for performance: always
locking would impose unnecessary overhead. Synchronization is not needed
between regular NAPI cycles once the channel is activated and running.
The lock is only required to protect against the race during channel
activation—specifically, when the very first NOP WQE is posted to trigger
NAPI. After that initial trigger, normal NAPI polling handles subsequent
work without contention. The MLX5E_SQ_STATE_LOCK_NEEDED flag ensures we
pay the synchronization cost only when necessary.

Signed-off-by: William Tu <witu@nvidia.com>
Signed-off-by: Tariq Toukan <tariqt@nvidia.com>
Link: https://patch.msgid.link/1768376800-1607672-3-git-send-email-tariqt@nvidia.com
Signed-off-by: Jakub Kicinski <kuba@kernel.org>

author	William Tu <witu@nvidia.com>
	Wed, 14 Jan 2026 07:46:38 +0000 (09:46 +0200)
committer	Jakub Kicinski <kuba@kernel.org>
	Mon, 19 Jan 2026 20:26:42 +0000 (12:26 -0800)
commit	56aca3e0f7308821b6404730a0a6bfd9f26fa04c
tree	2d646e0c4d76b208cc5457ec8bc578e38820453f	tree
parent	ea945f4f399130658a358103b57f37c2d2150458	commit \| diff

drivers/net/ethernet/mellanox/mlx5/core/en.h		diff \| blob \| blame \| history
drivers/net/ethernet/mellanox/mlx5/core/en/reporter_tx.c		diff \| blob \| blame \| history
drivers/net/ethernet/mellanox/mlx5/core/en/xsk/rx.c		diff \| blob \| blame \| history
drivers/net/ethernet/mellanox/mlx5/core/en_main.c		diff \| blob \| blame \| history
drivers/net/ethernet/mellanox/mlx5/core/en_rx.c		diff \| blob \| blame \| history