memory-copy functions of the CUDA library. Higher dimensions will
call those functions in a loop and are therefore supported.
@item Low-latency memory (@code{omp_low_lat_mem_space}) is supported when the
- the @code{access} trait is set to @code{cgroup}, the ISA is at least
- @code{sm_53}, and the PTX version is at least 4.1. The default pool size
+ the @code{access} trait is set to @code{cgroup}, and libgomp has
+ been built for PTX ISA version 4.1 or higher (such as in GCC's
+ default configuration). @c -mptx=4.1
+ The default pool size
is 8 kiB per team, but may be adjusted at runtime by setting environment
variable @code{GOMP_NVPTX_LOWLAT_POOL=@var{bytes}}. The maximum value is
limited by the available hardware, and care should be taken that the