]> git.ipfire.org Git - thirdparty/gcc.git/commit
RISC-V: Allow dest operand and accumulator operand overlap of widen reduction instruc...
authorJuzhe-Zhong <juzhe.zhong@rivai.ai>
Wed, 1 Nov 2023 06:56:39 +0000 (14:56 +0800)
committerPan Li <pan2.li@intel.com>
Thu, 2 Nov 2023 00:51:15 +0000 (08:51 +0800)
commit1a0af6e5a99cd895a663f0221c25321ae802413f
tree0402ab6261de29021fa24ab0f7386627b1f42e86
parentc73d2d49f9beec33bb843a0c04bde8bc41d7a0b9
RISC-V: Allow dest operand and accumulator operand overlap of widen reduction instruction[PR112327]

Consider this following intrinsic code:

void rvv_dot_prod(int16_t *pSrcA, int16_t *pSrcB, uint32_t n, int64_t *result)
{
    size_t vl;
    vint16m4_t vSrcA, vSrcB;
    vint64m1_t vSum = __riscv_vmv_s_x_i64m1(0, 1);
    while (n > 0) {
        vl = __riscv_vsetvl_e16m4(n);
        vSrcA = __riscv_vle16_v_i16m4(pSrcA, vl);
        vSrcB = __riscv_vle16_v_i16m4(pSrcB, vl);
        vSum = __riscv_vwredsum_vs_i32m8_i64m1(__riscv_vwmul_vv_i32m8(vSrcA, vSrcB, vl), vSum, vl);
        pSrcA += vl;
        pSrcB += vl;
        n -= vl;
    }
    *result = __riscv_vmv_x_s_i64m1_i64(vSum);
}

https://godbolt.org/z/vWd35W7G6

Before this patch:

...
Loop:
...
vmv1r.v v2,v1
...
vwredsum.vs     v1,v8,v2
...

After this patch:

...
Loop:
...
vwredsum.vs v1,v8,v1
...

PR target/112327

gcc/ChangeLog:

* config/riscv/vector.md: Add '0'.

gcc/testsuite/ChangeLog:

* gcc.target/riscv/rvv/base/pr112327-1.c: New test.
* gcc.target/riscv/rvv/base/pr112327-2.c: New test.
gcc/config/riscv/vector.md
gcc/testsuite/gcc.target/riscv/rvv/base/pr112327-1.c [new file with mode: 0644]
gcc/testsuite/gcc.target/riscv/rvv/base/pr112327-2.c [new file with mode: 0644]