git.ipfire.org Git - thirdparty/gcc.git/commit

SVE intrinsics: Fold svmul by -1 to svneg for unsigned types

As follow-up to
https://gcc.gnu.org/pipermail/gcc-patches/2024-October/665472.html,
this patch implements folding of svmul by -1 to svneg for
unsigned SVE vector types. The key idea is to reuse the existing code that
does this fold for signed types and feed it as callback to a helper function
that adds the necessary type conversions.

For example, for the test case
svuint64_t foo (svuint64_t x, svbool_t pg)
{
  return svmul_n_u64_x (pg, x, -1);
}

the following gimple sequence is emitted (-O2 -mcpu=grace):
svuint64_t foo (svuint64_t x, svbool_t pg)
{
  svint64_t D.12921;
  svint64_t D.12920;
  svuint64_t D.12919;

  D.12920 = VIEW_CONVERT_EXPR<svint64_t>(x);
  D.12921 = svneg_s64_x (pg, D.12920);
  D.12919 = VIEW_CONVERT_EXPR<svuint64_t>(D.12921);
  goto <D.12922>;
  <D.12922>:
  return D.12919;
}

In general, the new helper gimple_folder::convert_and_fold
- takes a target type and a function pointer,
- converts the lhs and all non-boolean vector types to the target type,
- passes the converted lhs and arguments to the callback,
- receives the new gimple statement from the callback function,
- adds the necessary view converts to the gimple sequence,
- and returns the new call.

Because all arguments are converted to the same target types, the helper
function is only suitable for folding calls whose arguments are all of
the same type. If necessary, this could be extended to convert the
arguments to different types differentially.

The patch was bootstrapped and tested on aarch64-linux-gnu, no regression.
OK for mainline?

Signed-off-by: Jennifer Schmitz <jschmitz@nvidia.com>
gcc/ChangeLog:

* config/aarch64/aarch64-sve-builtins-base.cc
(svmul_impl::fold): Wrap code for folding to svneg in lambda
function and pass to gimple_folder::convert_and_fold to enable
the transform for unsigned types.
* config/aarch64/aarch64-sve-builtins.cc
(gimple_folder::convert_and_fold): New function that converts
operands to target type before calling callback function, adding the
necessary conversion statements.
(gimple_folder::redirect_call): Set fntype of redirected call.
(get_vector_type): Move from here to aarch64-sve-builtins.h.
* config/aarch64/aarch64-sve-builtins.h
(gimple_folder::convert_and_fold): Declare function.
(get_vector_type): Move here as inline function.

gcc/testsuite/ChangeLog:

* gcc.target/aarch64/sve/acle/asm/mul_u8.c: Adjust expected outcome.
* gcc.target/aarch64/sve/acle/asm/mul_u16.c: Likewise.
* gcc.target/aarch64/sve/acle/asm/mul_u32.c: Likewise.
* gcc.target/aarch64/sve/acle/asm/mul_u64.c: New test and adjust
expected outcome.

author	Jennifer Schmitz <jschmitz@nvidia.com>
	Thu, 7 Nov 2024 16:44:30 +0000 (08:44 -0800)
committer	Jennifer Schmitz <jschmitz@nvidia.com>
	Mon, 6 Jan 2025 11:56:57 +0000 (12:56 +0100)
commit	f9c99d403c9a0948936e3120ad97b4f10998351f
tree	49f3bcef0764d1dfd5bf21d2010a11b982c49e0a	tree
parent	144ddb0cdfa7a09ccbaaea5ec72837346fbc1d8d	commit \| diff

gcc/config/aarch64/aarch64-sve-builtins-base.cc		diff \| blob \| blame \| history
gcc/config/aarch64/aarch64-sve-builtins.cc		diff \| blob \| blame \| history
gcc/config/aarch64/aarch64-sve-builtins.h		diff \| blob \| blame \| history
gcc/testsuite/gcc.target/aarch64/sve/acle/asm/mul_u16.c		diff \| blob \| blame \| history
gcc/testsuite/gcc.target/aarch64/sve/acle/asm/mul_u32.c		diff \| blob \| blame \| history
gcc/testsuite/gcc.target/aarch64/sve/acle/asm/mul_u64.c		diff \| blob \| blame \| history
gcc/testsuite/gcc.target/aarch64/sve/acle/asm/mul_u8.c		diff \| blob \| blame \| history