git.ipfire.org Git - thirdparty/gcc.git/commit

author	Tamar Christina <tamar.christina@arm.com>
	Fri, 12 Sep 2025 07:30:55 +0000 (08:30 +0100)
committer	Tamar Christina <tamar.christina@arm.com>
	Fri, 12 Sep 2025 07:30:55 +0000 (08:30 +0100)
commit	443fc6ade9f476b18ef4ce14a95839648fa9956d
tree	2e528a309210e8de8af98ccfb81e094f41798d1b	tree
parent	4ce2556991764eb9a31ed3419da85163c2ad4bcc	commit \| diff

middle-end: Use addhn for compression instead of inclusive OR when reducing comparison values

Given a sequence such as

int foo ()
{
#pragma GCC unroll 4
  for (int i = 0; i < N; i++)
    if (a[i] == 124)
      return 1;

  return 0;
}

where a[i] is long long, we will unroll the loop and use an OR reduction for
early break on Adv. SIMD.  Afterwards the sequence is followed by a compression
sequence to compress the 128-bit vectors into 64-bits for use by the branch.

However if we have support for add halving and narrowing then we can instead of
using an OR, use an ADDHN which will do the combining and narrowing.

Note that for now I only do the last OR, however if we have more than one level
of unrolling we could technically chain them.  I will revisit this in another
up coming early break series, however an unroll of 2 is fairly common.

gcc/ChangeLog:

* internal-fn.def (VEC_TRUNC_ADD_HIGH): New.
* doc/generic.texi: Document it.
* optabs.def (vec_trunc_add_high): New.
* doc/md.texi: Document it.
* tree-vect-stmts.cc (vectorizable_early_exit): Use addhn if supported.

gcc/testsuite/ChangeLog:

* gcc.target/aarch64/vect-early-break-addhn_1.c: New test.
* gcc.target/aarch64/vect-early-break-addhn_2.c: New test.
* gcc.target/aarch64/vect-early-break-addhn_3.c: New test.
* gcc.target/aarch64/vect-early-break-addhn_4.c: New test.

gcc/doc/generic.texi		diff \| blob \| blame \| history
gcc/doc/md.texi		diff \| blob \| blame \| history
gcc/internal-fn.def		diff \| blob \| blame \| history
gcc/optabs.def		diff \| blob \| blame \| history
gcc/testsuite/gcc.target/aarch64/vect-early-break-addhn_1.c	[new file with mode: 0644]	blob
gcc/testsuite/gcc.target/aarch64/vect-early-break-addhn_2.c	[new file with mode: 0644]	blob
gcc/testsuite/gcc.target/aarch64/vect-early-break-addhn_3.c	[new file with mode: 0644]	blob
gcc/testsuite/gcc.target/aarch64/vect-early-break-addhn_4.c	[new file with mode: 0644]	blob
gcc/tree-vect-stmts.cc		diff \| blob \| blame \| history