git.ipfire.org Git - thirdparty/gcc.git/log

]> git.ipfire.org Git - thirdparty/gcc.git/log

Kito Cheng [Mon, 27 Nov 2023 12:50:11 +0000 (20:50 +0800)]

RISC-V: Check if zcd conflicts with zcmt and zcmp

gcc/ChangeLog:

* common/config/riscv/riscv-common.cc
(riscv_subset_list::check_conflict_ext): Check zcd conflicts
with zcmt and zcmp.

gcc/testsuite/ChangeLog:

* gcc.target/riscv/arch-29.c: New test.
* gcc.target/riscv/arch-30.c: New test.

commit | commitdiff | tree

Richard Sandiford [Tue, 5 Dec 2023 09:20:55 +0000 (09:20 +0000)]

lra: Updates of biggest mode for hard regs [PR112278]

LRA keeps track of the biggest mode for both hard registers and
pseudos.  The updates assume that the modes are ordered, i.e. that
we can tell whether one is no bigger than the other at compile time.

That is (or at least seemed to be) a reasonable restriction for pseudos.
But it isn't necessarily so for hard registers, since the uses of hard
registers can be logically distinct.  The testcase is an example of this.

The biggest mode of hard registers is also special for other reasons.
As the existing comment says:

  /* A reg can have a biggest_mode of VOIDmode if it was only ever seen as
     part of a multi-word register.  In that case, just use the reg_rtx
     mode.  Do the same also if the biggest mode was larger than a register
     or we can not compare the modes.  Otherwise, limit the size to that of
     the biggest access in the function or to the natural mode at least.  */

This patch applies the same approach to the updates.

gcc/
PR rtl-optimization/112278
* lra-int.h (lra_update_biggest_mode): New function.
* lra-coalesce.cc (merge_pseudos): Use it.
* lra-lives.cc (process_bb_lives): Likewise.
* lra.cc (new_insn_reg): Likewise.

gcc/testsuite/
PR rtl-optimization/112278
* gcc.target/aarch64/sve/pr112278.c: New test.

commit | commitdiff | tree

Jakub Jelinek [Tue, 5 Dec 2023 08:45:40 +0000 (09:45 +0100)]

lower-bitint: Make temporarily wrong IL less wrong [PR112843]

As discussed in the PR, for the middle (on x86-64 65..128 bit) _BitInt
types like
  _1 = x_4(D) * 5;
where _1 and x_4(D) have _BitInt(128) type and x is PARM_DECL, the bitint
lowering pass wants to replace this with
  _13 = (int128_t) x_4(D);
  _12 = _13 * 5;
  _1 = (_BitInt(128)) _12;
where _13 and _12 have int128_t type and the ranger ICEs when the IL is
temporarily invalid:
during GIMPLE pass: bitintlower
pr112843.c: In function ‘foo’:
pr112843.c:7:1: internal compiler error: Segmentation fault
    7 | foo (_BitInt (128) x, _BitInt (256) y)
      | ^~~
0x152943f crash_signal
        ../../gcc/toplev.cc:316
0x25c21c8 ranger_cache::range_of_expr(vrange&, tree_node*, gimple*)
        ../../gcc/gimple-range-cache.cc:1204
0x25cdcf9 fold_using_range::range_of_range_op(vrange&, gimple_range_op_handler&, fur_source&)
        ../../gcc/gimple-range-fold.cc:671
0x25cf9a0 fold_using_range::fold_stmt(vrange&, gimple*, fur_source&, tree_node*)
        ../../gcc/gimple-range-fold.cc:602
0x25b5520 gimple_ranger::update_stmt(gimple*)
        ../../gcc/gimple-range.cc:564
0x16f1234 update_stmt_operands(function*, gimple*)
        ../../gcc/tree-ssa-operands.cc:1150
0x117a5b6 update_stmt_if_modified(gimple*)
        ../../gcc/gimple-ssa.h:187
0x117a5b6 update_stmt_if_modified(gimple*)
        ../../gcc/gimple-ssa.h:184
0x117a5b6 update_modified_stmt
        ../../gcc/gimple-iterator.cc:44
0x117a5b6 gsi_insert_after(gimple_stmt_iterator*, gimple*, gsi_iterator_update)
        ../../gcc/gimple-iterator.cc:544
0x25abc2f gimple_lower_bitint
        ../../gcc/gimple-lower-bitint.cc:6348

What the code does right now is, it first creates a new SSA_NAME (_12
above), adds the
  _1 = (_BitInt(128)) _12;
stmt after it (where it crashes, because _12 has no SSA_NAME_DEF_STMT yet),
then sets lhs of the previous stmt to _12 (this is also temporarily
incorrect, there are incompatible types involved in the stmt), later on
changes also operands and finally update_stmt it.

The following patch instead changes the lhs of the stmt before adding the
cast after it.  The question is if this is less or more wrong temporarily
(but the ICE is gone).  In addition to that the patch moves the operand
adjustments before the lhs adjustment.

The reason I tweaked the lhs first is that it then just uses gimple_op and
iterates over all ops, if that is done before lhs it would need to special
case which op to skip because it is lhs (I'm using gimple_get_lhs for the
lhs, but this isn't done for GIMPLE_CALL nor GIMPLE_PHI, so GIMPLE_ASSIGN
or say GIMPLE_GOTO etc. are the only options).

2023-12-05  Jakub Jelinek  <jakub@redhat.com>

PR tree-optimization/112843
* gimple-lower-bitint.cc (gimple_lower_bitint): Change lhs of stmt
to lhs2 before building and inserting lhs = (cast) lhs2; assignment.
Adjust stmt operands before adjusting lhs.

* gcc.dg/bitint-47.c: New test.

commit | commitdiff | tree

xuli [Tue, 5 Dec 2023 05:58:35 +0000 (05:58 +0000)]

RISC-V: FAIL:g++.dg/torture/vshuf-v[2|4]di.C -Os (execution test) on RV32

This patch fixs the issue of g++.dg/torture/vshuf-v2di.C
and g++.dg/torture/vshuf-v4di.C -Os execution failure with
-march=rv32gcv -mabi=ilp32d.

Consider the following code:
typedef unsigned long long V __attribute__((vector_size(16)));

.LC0: 0xc1c2c3c4c5c6c7c8

before this patch:

lui a5,%hi(.LC0)
addi a5,a5,%lo(.LC0)
lw a6,4(a5)//0xc1c2c3c4
lw a5,0(a5)//0xc5c6c7c8
vsetivli zero,2,e64,m1,ta,mu
vmv.v.x v2,a5//v2 is {0xffffffffc5c6c7c8, 0xffffffffc5c6c7c8}

after this patch:

lui a5,%hi(.LC0)
addi a5,a5,%lo(.LC0)
vsetivli zero,2,e64,m1,ta,mu
vlse64.v v2,0(a5),zero//v2 is {0xc1c2c3c4c5c6c7c8, 0xc1c2c3c4c5c6c7c8}

gcc/ChangeLog:

* config/riscv/riscv-v.cc (sew64_scalar_helper): Bugfix.

commit | commitdiff | tree

Jakub Jelinek [Tue, 5 Dec 2023 08:08:45 +0000 (09:08 +0100)]

i386: Improve code generation for vector __builtin_signbit (x.x[i]) ? -1 : 0 [PR112816]

On the testcase I've recently fixed I've noticed bad code generation,
we emit
        pxor    %xmm1, %xmm1
        psrld   $31, %xmm0
        pcmpeqd %xmm1, %xmm0
        pcmpeqd %xmm1, %xmm0
or
        vpxor   %xmm1, %xmm1, %xmm1
        vpsrld  $31, %xmm0, %xmm0
        vpcmpeqd        %xmm1, %xmm0, %xmm0
        vpcmpeqd        %xmm1, %xmm0, %xmm2
rather than
        psrad   $31, %xmm2
or
        vpsrad  $31, %xmm1, %xmm2
The following patch fixes that using a combiner splitter.

2023-12-05  Jakub Jelinek  <jakub@redhat.com>

PR target/112816
* config/i386/sse.md ((eq (eq (lshiftrt x elt_bits-1) 0) 0)): New
splitter to turn psrld $31; pcmpeq; pcmpeq into psrad $31.

* gcc.target/i386/pr112816.c: New test.

commit | commitdiff | tree

Juzhe-Zhong [Tue, 5 Dec 2023 03:22:50 +0000 (11:22 +0800)]

RISC-V: Add blocker for gather/scatter auto-vectorization

This patch fixes ICE exposed on full coverage testing:

                                === g++: Unexpected fails for rv64gc_zve32f_zvfh_zfh lp64d medlow --param=riscv-autovec-lmul=dynamic ===
FAIL: g++.dg/pr106219.C  -std=gnu++14 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++17 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++20 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++98 (internal compiler error: in require, at machmode.h:313)
                                === g++: Unexpected fails for rv64gc_zve32f_zvfh_zfh lp64d medlow --param=riscv-autovec-lmul=dynamic --param=riscv-autovec-preference=fixed-vlmax ===
FAIL: g++.dg/pr106219.C  -std=gnu++14 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++17 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++20 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++98 (internal compiler error: in require, at machmode.h:313)
                                === g++: Unexpected fails for rv64gc_zve32f_zvfh_zfh lp64d medlow --param=riscv-autovec-lmul=m4 ===
FAIL: g++.dg/pr106219.C  -std=gnu++14 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++17 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++20 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++98 (internal compiler error: in require, at machmode.h:313)
                                === g++: Unexpected fails for rv64gc_zve32f_zvfh_zfh lp64d medlow --param=riscv-autovec-lmul=m4 --param=riscv-autovec-preference=fixed-vlmax ===
FAIL: g++.dg/pr106219.C  -std=gnu++14 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++17 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++20 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++98 (internal compiler error: in require, at machmode.h:313)
                                === g++: Unexpected fails for rv64gc_zve32f_zvfh_zfh lp64d medlow --param=riscv-autovec-lmul=m8 ===
FAIL: g++.dg/pr106219.C  -std=gnu++14 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++17 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++20 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++98 (internal compiler error: in require, at machmode.h:313)
                                === g++: Unexpected fails for rv64gc_zve32f_zvfh_zfh lp64d medlow --param=riscv-autovec-lmul=m8 --param=riscv-autovec-preference=fixed-vlmax ===
FAIL: g++.dg/pr106219.C  -std=gnu++14 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++17 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++20 (internal compiler error: in require, at machmode.h:313)
FAIL: g++.dg/pr106219.C  -std=gnu++98 (internal compiler error: in require, at machmode.h:313)

The rootcause is we can't extend RVVM4SImode into RVVM8DImode on zve32f.
Add a blocker of it to disable such auto-vectorization in this situation.

gcc/ChangeLog:

* config/riscv/autovec.md: Add blocker.
* config/riscv/riscv-protos.h (gather_scatter_valid_offset_p): New function.
* config/riscv/riscv-v.cc (gather_scatter_valid_offset_p): Ditto.

gcc/testsuite/ChangeLog:

* g++.target/riscv/rvv/autovec/bug-2.C: New test.

commit | commitdiff | tree

Richard Biener [Mon, 4 Dec 2023 13:03:37 +0000 (14:03 +0100)]

c/89270 - honor registered_builtin_types in type_for_size

The following fixes the intermediate conversions inserted by
convert_to_integer when facing address-spaces and converts
to their effective [u]intptr_t when they are registered_builtin_types
by considering those also from c_common_type_for_size and not
only from c_common_type_for_mode.

PR c/89270
gcc/c-family/
* c-common.cc (c_common_type_for_size): Consider
registered_builtin_types.

gcc/testsuite/
* gcc.target/avr/pr89270.c: New testcase.

commit | commitdiff | tree

Richard Biener [Mon, 4 Dec 2023 12:31:35 +0000 (13:31 +0100)]

c/86869 - preserve address-space info when building qualified ARRAY_TYPE

The following adjusts the C FE specific qualified type building
to preserve address-space info also for ARRAY_TYPE.

PR c/86869
gcc/c/
* c-typeck.cc (c_build_qualified_type): Preserve address-space
info for ARRAY_TYPE.

gcc/testsuite/
* gcc.target/avr/pr86869.c: New testcase.

commit | commitdiff | tree

Richard Biener [Mon, 4 Dec 2023 14:46:38 +0000 (15:46 +0100)]

tree-optimization/112827 - more SCEV cprop fixes

The insert iteration can be corrupted by foldings of replace_uses_by,
within this particular PHI replacement but also with subsequent ones.
Recompute the insert location before insertion instead.

This fixes an obvserved ICE of gcc.dg/tree-ssa/ssa-sink-16.c.

PR tree-optimization/112827
PR tree-optimization/112848
* tree-scalar-evolution.cc (final_value_replacement_loop):
Compute the insert location for each insert.

commit | commitdiff | tree

liuhongt [Mon, 27 Nov 2023 05:35:41 +0000 (13:35 +0800)]

Take register pressure into account for vec_construct/scalar_to_vec when the components are not loaded from memory.

For vec_contruct, the components must be live at the same time if
they're not loaded from memory, when the number of those components
exceeds available registers, spill happens. Try to account that with a
rough estimation.
??? Ideally, we should have an overall estimation of register pressure
if we know the live range of all variables.

gcc/ChangeLog:

* config/i386/i386.cc (ix86_vector_costs::add_stmt_cost):
Count sse_reg/gpr_regs for components not loaded from memory.
(ix86_vector_costs:ix86_vector_costs): New constructor.
(ix86_vector_costs::m_num_gpr_needed[3]): New private memeber.
(ix86_vector_costs::m_num_sse_needed[3]): Ditto.
(ix86_vector_costs::finish_cost): Estimate overall register
pressure cost.
(ix86_vector_costs::ix86_vect_estimate_reg_pressure): New
function.

commit | commitdiff | tree

liuhongt [Mon, 4 Dec 2023 03:47:32 +0000 (11:47 +0800)]

Support udot_prodv*qi with emulation sdot_prodv*hi

Like r14-5990-gb4a7c1c8c59d19, but the patch optimized for udot_prod.

Since (zero_extend) (unsigned char)-> int is equal
to (zero_extend)(unsigned char) -> short
+ (sign_extend) (short) -> int

It should be safe to emulate udot_prodv*qi with

     vec_unpacku_lo_v32qi
     vec_unpacku_lo_v32qi
     vec_unpacku_hi_v32qi
     vec_unpacku_hi_v32qi
     sdot_prodv16hi
     sdot_prodv16hi
     add3v8si

gcc/ChangeLog:

* config/i386/sse.md (udot_prodv64qi): New expander.
(udot_prod<mode>): Emulates with VEC_UNPACKU_EXPR +
DOT_PROD (short, int).

gcc/testsuite/ChangeLog:

* gcc.target/i386/udotprodint8_emulate.c: New test.

commit | commitdiff | tree

Marek Polacek [Tue, 19 Sep 2023 20:31:17 +0000 (16:31 -0400)]

c++: implement P2564, consteval needs to propagate up [PR107687]

This patch implements P2564, described at <wg21.link/p2564>, whereby
certain functions are promoted to consteval.  For example:

  consteval int id(int i) { return i; }

  template <typename T>
  constexpr int f(T t)
  {
    return t + id(t); // id causes f<int> to be promoted to consteval
  }

  void g(int i)
  {
    f (3);
  }

now compiles.  Previously the code was ill-formed: we would complain
that 't' in 'f' is not a constant expression.  Since 'f' is now
consteval, it means that the call to id(t) is in an immediate context,
so doesn't have to produce a constant -- this is how we allow consteval
functions composition.  But making 'f<int>' consteval also means that
the call to 'f' in 'g' must yield a constant; failure to do so results
in an error.  I made the effort to have cc1plus explain to us what's
going on.  For example, calling f(i) produces this neat diagnostic:

w.C:11:11: error: call to consteval function 'f<int>(i)' is not a constant expression
   11 |         f (i);
      |         ~~^~~
w.C:11:11: error: 'i' is not a constant expression
w.C:6:22: note: 'constexpr int f(T) [with T = int]' was promoted to an immediate function because its body contains an immediate-escalating expression 'id(t)'
    6 |         return t + id(t); // id causes f<int> to be promoted to consteval
      |                    ~~^~~

which hopefully makes it clear what's going on.

Implementing this proposal has been tricky.  One problem was delayed
instantiation: instantiating a function can set off a domino effect
where one call promotes a function to consteval but that then means
that another function should also be promoted, etc.

In v1, I addressed the delayed instantiation problem by instantiating
trees early, so that we can escalate functions right away.  That caused
a number of problems, and in certain cases, like consteval-prop3.C, it
can't work, because we need to wait till EOF to see the definition of
the function anyway.  Overeager instantiation tends to cause diagnostic
problems too.

In v2, I attempted to move the escalation to the gimplifier, at which
point all templates have been instantiated.  That attempt flopped,
however, because once we've gimplified a function, its body is discarded
and as a consequence, you can no longer evaluate a call to that function
which is required for escalating, which needs to decide if a call is
a constant expression or not.

Therefore, we have to perform the escalation before gimplifying, but
after instantiate_pending_templates.  That's not easy because we have
no way to walk all the trees.  In the v2 patch, I use two vectors: one
to store function decls that may become consteval, and another to
remember references to immediate-escalating functions.  Unfortunately
the latter must also stash functions that call immediate-escalating
functions.  Consider:

  int g(int i)
  {
    f<int>(i); // f is immediate-escalating
  }

where g itself is not immediate-escalating, but we have to make sure
that if f gets promoted to consteval, we give an error.

A new option, -fno-immediate-escalation, is provided to suppress
escalating functions.

v2 also adds a new flag, DECL_ESCALATION_CHECKED_P, so that we don't
escalate a function multiple times, and so that we can distinguish between
explicitly consteval functions and functions that have been promoted
to consteval.

In v3, I removed one of the new vectors and changed the other one
to a hash set.  This version also contains numerous cleanups.

v4 merges find_escalating_expr_r into cp_fold_immediate_r.  It also
adds a new optimization in cp_fold_function.

v5 greatly simplifies the code.

v6 simplifies the code further and removes an ff_ flag.

v7 removes maybe_promote_function_to_consteval and further simplifies
cp_fold_immediate_r logic.

v8 removes maybe_store_immediate_escalating_fn.

PR c++/107687
PR c++/110997

gcc/c-family/ChangeLog:

* c-cppbuiltin.cc (c_cpp_builtins): Update __cpp_consteval.
* c-opts.cc (c_common_post_options): Pre-C++20, unset
flag_immediate_escalation.
* c.opt (fimmediate-escalation): New option.

gcc/cp/ChangeLog:

* call.cc (in_immediate_context): No longer static.
* constexpr.cc (cxx_eval_call_expression): Adjust assert.
* cp-gimplify.cc (deferred_escalating_exprs): New vec.
(remember_escalating_expr): New.
(enum fold_flags): Remove ff_fold_immediate.
(immediate_escalating_function_p): New.
(unchecked_immediate_escalating_function_p): New.
(promote_function_to_consteval): New.
(cp_fold_immediate): Move above.  Return non-null if any errors were
emitted.
(maybe_explain_promoted_consteval): New.
(cp_gimplify_expr) <case CALL_EXPR>: Assert we've handled all
immediate invocations.
(taking_address_of_imm_fn_error): New.
(cp_fold_immediate_r): Merge ADDR_EXPR and PTRMEM_CST cases.  Implement
P2564 - promoting functions to consteval.
<case CALL_EXPR>: Implement P2564 - promoting functions to consteval.
(cp_fold_r): If an expression turns into a CALL_EXPR after cp_fold,
call cp_fold_immediate_r on the CALL_EXPR.
(cp_fold_function): Set DECL_ESCALATION_CHECKED_P if
deferred_escalating_exprs does not contain current_function_decl.
(process_and_check_pending_immediate_escalating_fns): New.
* cp-tree.h (struct lang_decl_fn): Add escalated_p bit-field.
(DECL_ESCALATION_CHECKED_P): New.
(immediate_invocation_p): Declare.
(process_pending_immediate_escalating_fns): Likewise.
* decl2.cc (c_parse_final_cleanups): Set at_eof to 2 after all
templates have been instantiated; and to 3 at the end of the function.
Call process_pending_immediate_escalating_fns.
* error.cc (dump_template_bindings): Check at_eof against an updated
value.
* module.cc (trees_out::lang_decl_bools): Stream escalated_p.
(trees_in::lang_decl_bools): Likewise.
* pt.cc (push_tinst_level_loc): Set at_eof to 3, not 2.
* typeck.cc (cp_build_addr_expr_1): Don't check
DECL_IMMEDIATE_FUNCTION_P.

gcc/ChangeLog:

* doc/invoke.texi: Document -fno-immediate-escalation.

libstdc++-v3/ChangeLog:

* testsuite/18_support/comparisons/categories/zero_neg.cc: Add
dg-prune-output.
* testsuite/std/format/string_neg.cc: Add dg-error.

gcc/testsuite/ChangeLog:

* g++.dg/cpp23/consteval-if10.C: Remove dg-error.
* g++.dg/cpp23/consteval-if2.C: Likewise.
* g++.dg/cpp23/feat-cxx2b.C: Adjust expected value of __cpp_consteval.
* g++.dg/cpp26/feat-cxx26.C: Likewise.
* g++.dg/cpp2a/consteval-memfn1.C: Add dg-error.
* g++.dg/cpp2a/consteval11.C: Likewise.
* g++.dg/cpp2a/consteval3.C: Adjust dg-error.
* g++.dg/cpp2a/consteval34.C: Add dg-error.
* g++.dg/cpp2a/consteval36.C: Likewise.
* g++.dg/cpp2a/consteval9.C: Likewise.
* g++.dg/cpp2a/feat-cxx2a.C: Adjust expected value of __cpp_consteval.
* g++.dg/cpp2a/spaceship-synth9.C: Adjust dg-error.
* g++.dg/cpp2a/consteval-prop1.C: New test.
* g++.dg/cpp2a/consteval-prop10.C: New test.
* g++.dg/cpp2a/consteval-prop11.C: New test.
* g++.dg/cpp2a/consteval-prop12.C: New test.
* g++.dg/cpp2a/consteval-prop13.C: New test.
* g++.dg/cpp2a/consteval-prop14.C: New test.
* g++.dg/cpp2a/consteval-prop15.C: New test.
* g++.dg/cpp2a/consteval-prop16.C: New test.
* g++.dg/cpp2a/consteval-prop17.C: New test.
* g++.dg/cpp2a/consteval-prop18.C: New test.
* g++.dg/cpp2a/consteval-prop19.C: New test.
* g++.dg/cpp2a/consteval-prop20.C: New test.
* g++.dg/cpp2a/consteval-prop2.C: New test.
* g++.dg/cpp2a/consteval-prop3.C: New test.
* g++.dg/cpp2a/consteval-prop4.C: New test.
* g++.dg/cpp2a/consteval-prop5.C: New test.
* g++.dg/cpp2a/consteval-prop6.C: New test.
* g++.dg/cpp2a/consteval-prop7.C: New test.
* g++.dg/cpp2a/consteval-prop8.C: New test.
* g++.dg/cpp2a/consteval-prop9.C: New test.

commit | commitdiff | tree

GCC Administrator [Tue, 5 Dec 2023 00:17:20 +0000 (00:17 +0000)]

Daily bump.

commit | commitdiff | tree

Jason Merrill [Mon, 4 Dec 2023 22:42:13 +0000 (17:42 -0500)]

c++: fix constexpr noreturn diagnostic

Mentioning a noreturn function does not involve an lvalue-rvalue
conversion.

gcc/cp/ChangeLog:

* constexpr.cc (potential_constant_expression_1): Fix
check for loading volatile lvalue.

gcc/testsuite/ChangeLog:

* g++.dg/cpp0x/constexpr-noreturn1.C: New test.

commit | commitdiff | tree

Andrew Pinski [Sun, 12 Nov 2023 04:33:28 +0000 (20:33 -0800)]

MATCH: Fix zero_one_valued_p's convert pattern

While working on PR 111972, I was getting a regression
due to zero_one_valued_p matching a signed 1 bit integer
when it came to convert. This patch fixes that by checking
the outer type too.

Bootstrapped and tested on x86_64-linux-gnu with no regressions.

gcc/ChangeLog:

* match.pd (zero_one_valued_p): For convert
make sure type is not a signed 1-bit integer.

Signed-off-by: Andrew Pinski <quic_apinski@quicinc.com>

commit | commitdiff | tree

Jeff Law [Mon, 4 Dec 2023 17:06:49 +0000 (10:06 -0700)]

[committed] Fix HImode load mnemonic on microblaze port

The tester recently started failing va-arg-22.c on microblaze-linux:

gcc.c-torture/execute/va-arg-22.c   -O0  (test for excess errors)

It was failing with an undefined reference to "r7" at link time.  This was
ultimately tracked down to a HImode load using (reg+reg) addressing mode, but
which used the lhui instruction instead of lhu.  The "i" means it's supposed to
be (reg+disp) so the assembler tried to interpret "r7" as an immediate/symbol.

The port uses %i<opnum> as an output modifier to select between sh/shi and
various other mnemonics for loads/stores.  The movhi pattern simply failed to
use it for the two cases where it's loading from memory (interestingly enough
it was used for stores).

Clearly we aren't using reg+reg much for HImode loads as this didn't fix
anything else in the testsuite.

gcc/
* config/microblaze/microblaze.md (movhi): Use %i for half-word
loads to properly select between lhu/lhui.

commit | commitdiff | tree

Robin Dapp [Tue, 21 Nov 2023 12:31:05 +0000 (13:31 +0100)]

RISC-V: testsuite: Remove redundant vector_hw and zvfh_hw.

This replaces the now-redundant vector_hw and zvfh_hw checks in the
testsuite by riscv_v and riscv_zvfh.

gcc/testsuite/ChangeLog:

* gcc.target/riscv/rvv/autovec/binop/copysign-zvfh-run.c:
Replace riscv_zvfh_hw with riscv_zvfh.
* gcc.target/riscv/rvv/autovec/binop/vadd-zvfh-run.c: Ditto.
* gcc.target/riscv/rvv/autovec/binop/vdiv-zvfh-run.c: Ditto.
* gcc.target/riscv/rvv/autovec/binop/vmax-zvfh-run.c: Ditto.
* gcc.target/riscv/rvv/autovec/binop/vmin-zvfh-run.c: Ditto.
* gcc.target/riscv/rvv/autovec/binop/vmul-zvfh-run.c: Ditto.
* gcc.target/riscv/rvv/autovec/cond/cond_copysign-zvfh-run.c:
Ditto.
* gcc.target/riscv/rvv/autovec/struct/struct_vect_run-10.c:
Ditto.
* gcc.target/riscv/rvv/autovec/struct/struct_vect_run-6.c: Allow
overriding N.
* gcc.target/riscv/rvv/autovec/unop/abs-zvfh-run.c: Replace
riscv zvfh_hw with riscv_zvfh.
* gcc.target/riscv/rvv/autovec/unop/vneg-zvfh-run.c: Ditto.
* gcc.target/riscv/rvv/autovec/widen/widen_run_zvfh-1.c: Ditto.
* gcc.target/riscv/rvv/autovec/widen/widen_run_zvfh-10.c: Ditto.
* gcc.target/riscv/rvv/autovec/widen/widen_run_zvfh-11.c: Ditto.
* gcc.target/riscv/rvv/autovec/widen/widen_run_zvfh-12.c: Ditto.
* gcc.target/riscv/rvv/autovec/widen/widen_run_zvfh-2.c: Ditto.
* gcc.target/riscv/rvv/autovec/widen/widen_run_zvfh-3.c: Ditto.
* gcc.target/riscv/rvv/autovec/widen/widen_run_zvfh-5.c: Ditto.
* gcc.target/riscv/rvv/autovec/widen/widen_run_zvfh-6.c: Ditto.
* gcc.target/riscv/rvv/autovec/widen/widen_run_zvfh-7.c: Ditto.
* gcc.target/riscv/rvv/autovec/widen/widen_run_zvfh-8.c: Ditto.
* lib/target-supports.exp: Remove riscv_vector_hw and
riscv_zvfh_hw.

commit | commitdiff | tree

Robin Dapp [Mon, 4 Dec 2023 12:22:18 +0000 (13:22 +0100)]

RISC-V: Fix two testscases related to -std changes.

Recent -std changes caused testsuite failures. Fix those by adding
-std=gnu99 and -Wno-incompatible-pointer-types.

gcc/testsuite/ChangeLog:

* gcc.target/riscv/rvv/autovec/pr112552.c: Add
-Wno-incompatible-pointer-types.
* gcc.target/riscv/rvv/autovec/struct/struct_vect_run-10.c:
Add -std=gnu99.

commit | commitdiff | tree

Robin Dapp [Fri, 1 Dec 2023 08:45:29 +0000 (09:45 +0100)]

RISC-V: Fix rawmemchr implementation.

This fixes a bug in the rawmemchr implementation by incrementing the
source address by vl * element_size instead of just vl.

This is normally harmless as we will just scan the same region more than
once but, in combination with an older qemu version, will lead to
an execution failure in SPEC2017.

gcc/ChangeLog:

* config/riscv/riscv-string.cc (expand_rawmemchr): Increment
source address by vl * element_size.

commit | commitdiff | tree

Robin Dapp [Fri, 1 Dec 2023 08:30:17 +0000 (09:30 +0100)]

RISC-V: Rename and unify stringop strategy handling.

In preparation for the vectorized strlen and strcmp support this NFC
patch unifies the stringop strategy handling a bit. The "auto"
strategy now is a combination of scalar and vector and an expander
should try the strategies in their preferred order.

For the block_move expander this patch does just that.

gcc/ChangeLog:

* config/riscv/riscv-opts.h (enum riscv_stringop_strategy_enum):
Rename...
(enum stringop_strategy_enum): ... to this.
* config/riscv/riscv-string.cc (riscv_expand_block_move): New
wrapper expander handling the strategies and delegation.
(riscv_expand_block_move_scalar): Rename function and make
static.
(expand_block_move): Remove strategy handling.
* config/riscv/riscv.md: Call expander wrapper.
* config/riscv/riscv.opt: Rename.

gcc/testsuite/ChangeLog:

* gcc.target/riscv/rvv/base/cpymem-strategy-1.c: Change to
-mstringop-strategy.
* gcc.target/riscv/rvv/base/cpymem-strategy-2.c: Ditto.
* gcc.target/riscv/rvv/base/cpymem-strategy-3.c: Ditto.
* gcc.target/riscv/rvv/base/cpymem-strategy-4.c: Ditto.
* gcc.target/riscv/rvv/base/cpymem-strategy-5.c: Ditto.

commit | commitdiff | tree

Richard Biener [Mon, 4 Dec 2023 13:50:59 +0000 (14:50 +0100)]

middle-end/112785 - guard against last_clique overflow

The PR shows that we'll ICE eventually when last_clique wraps. The
following avoids this by refusing to hand out new cliques after
exhausting them. We then use zero (no clique) as conservative
fallback.

PR middle-end/112785
* function.h (get_new_clique): New inline function handling
last_clique overflow.
* cfgrtl.cc (duplicate_insn_chain): Use it.
* tree-cfg.cc (gimple_duplicate_bb): Likewise.
* tree-inline.cc (remap_dependence_clique): Likewise.

commit | commitdiff | tree

Christoph Müllner [Sat, 2 Dec 2023 20:56:57 +0000 (21:56 +0100)]

RISC-V: Document optimization parameter riscv-strcmp-inline-limit

This patch documents the optimization parameter
riscv-strcmp-inline-limit, which can be used to tweak the behaviour
of -minline-strcmp and -minline-strncmp.

gcc/ChangeLog:

PR target/112650
* doc/invoke.texi: Document riscv-strcmp-inline-limit.

Signed-off-by: Christoph Müllner <christoph.muellner@vrull.eu>

commit | commitdiff | tree

Juzhe-Zhong [Mon, 4 Dec 2023 13:44:56 +0000 (21:44 +0800)]

RISC-V: Fix overlap group incorrect overlap on v0

In serious high register pressure case (appended in this patch):

We see vluxei8.v v0,(s1),v1,v0.t which is not allowed.
Since according to RVV ISA:

+;; The destination vector register group for a masked vector instruction cannot overlap the source mask register (v0),
+;; unless the destination vector register is being written with a mask value (e.g., compares) or the scalar result of a reduction.

Such case doesn't have spillings, however, we expect such case should be spilled and reload data.

The rootcause is I made a mistake in previous patch on matching dest operand and mask operand constraints:

dest: "=vr"
mask: "vmWc1"

After this patch:

dest: "vd,vr"
mask: "vm,Wc1"

make EEW widening pattern are same as other instruction patterns.

PR target/112431

gcc/ChangeLog:

* config/riscv/vector.md: Fix incorrect overlap in v0.

gcc/testsuite/ChangeLog:

* gcc.target/riscv/rvv/base/pr112431-34.c: New test.

commit | commitdiff | tree

Juzhe-Zhong [Mon, 4 Dec 2023 13:32:06 +0000 (21:32 +0800)]

RISC-V: Support highest-number regno overlap for widen ternary

Consider this example:

#include "riscv_vector.h"
void
foo6 (void *in, void *out)
{
  vfloat64m8_t accum = __riscv_vle64_v_f64m8 (in, 4);
  vfloat64m4_t high_eew64 = __riscv_vget_v_f64m8_f64m4 (accum, 1);
  vint64m4_t high_eew64_i = __riscv_vreinterpret_v_f64m4_i64m4 (high_eew64);
  vint32m4_t high_eew32_i = __riscv_vreinterpret_v_i64m4_i32m4 (high_eew64_i);
  vfloat32m4_t high_eew32 = __riscv_vreinterpret_v_i32m4_f32m4 (high_eew32_i);
  vfloat64m8_t result = __riscv_vfwnmsac_vf_f64m8 (accum, 64, high_eew32, 4);
  __riscv_vse64_v_f64m8 (out, result, 4);
}

Before this patch:

foo6:                                   # @foo6
        vsetivli        zero, 4, e32, m4, ta, ma
        vle64.v v8, (a0)
        lui     a0, 272384
        fmv.w.x fa5, a0
        vmv8r.v v16, v8
        vfwnmsac.vf     v16, fa5, v12
        vse64.v v16, (a1)
        ret

After this patch:

foo6:
.LFB5:
.cfi_startproc
lui a5,%hi(.LC0)
flw fa5,%lo(.LC0)(a5)
vsetivli zero,4,e32,m4,ta,ma
vle64.v v8,0(a0)
vfwnmsac.vf v8,fa5,v12
vse64.v v8,0(a1)
ret

PR target/112431

gcc/ChangeLog:

* config/riscv/vector.md: Add highest-number overlap support.

gcc/testsuite/ChangeLog:

* gcc.target/riscv/rvv/base/pr112431-37.c: New test.
* gcc.target/riscv/rvv/base/pr112431-38.c: New test.

commit | commitdiff | tree

Richard Biener [Mon, 4 Dec 2023 11:50:36 +0000 (12:50 +0100)]

tree-optimization/112818 - re-instantiate vector type size check for bswap

For __builtin_bswap vectorization we still require an equal vector
type size. Re-instantiate that check.

PR tree-optimization/112818
* tree-vect-stmts.cc (vectorizable_bswap): Check input and
output vector types have the same size.

* gcc.dg/vect/pr112818.c: New testcase.

commit | commitdiff | tree

Juzhe-Zhong [Mon, 4 Dec 2023 11:57:21 +0000 (19:57 +0800)]

RISC-V: Rename bug-01.C to bug-1.C

Rename test to make RVV tests consistent, prepare for the following patches.

gcc/testsuite/ChangeLog:

* g++.target/riscv/rvv/autovec/bug-01.C: Moved to...
* g++.target/riscv/rvv/autovec/bug-1.C: ...here.

commit | commitdiff | tree

Richard Biener [Mon, 4 Dec 2023 09:46:11 +0000 (10:46 +0100)]

tree-optimization/112827 - corrupt SCEV cache during SCCP

The following avoids corrupting the SCEV cache by my last change
to propagate constant final values immediately. The easiest fix
is to keep a dead initialization around.

PR tree-optimization/112827
* tree-scalar-evolution.cc (final_value_replacement_loop):
Do not release SSA name but keep a dead initialization around.

* gcc.dg/torture/pr112827-1.c: New testcase.
* gcc.dg/torture/pr112827-2.c: Likewise.

commit | commitdiff | tree

Juzhe-Zhong [Mon, 4 Dec 2023 08:51:06 +0000 (16:51 +0800)]

RISC-V: Remove earlyclobber from widen reduction

Since the destination of reduction is not a vector register group, there
is no need to apply overlap constraint.

Also confirm Clang:

The mir in LLVM has early clobber:
early-clobber %49:vrm2 = PseudoVWADD_VX_M1 $noreg(tied-def 0), killed %17:vr, %48:gpr, %0:gprnox0, 3, 0; example.c:59:24

The mir in LLVM doesn't have early clobber:
%48:vr = PseudoVWREDSUM_VS_M2_E8 $noreg(tied-def 0), %17:vrm2, killed %33:vr, %0:gprnox0, 3, 1; example.c:60:26

And also confirm both:

vwredsum.vs     v24, v8, v24 and vwredsum.vs     v8, v8, v24 all legal on LLVM.

Align with LLVM and honor RISC-V V spec, remove earlyclobber.

Before this patch:

vwredsum.vs     v8,v24,v8
        vwredsum.vs     v7,v22,v7
        vwredsum.vs     v6,v20,v6
        vwredsum.vs     v5,v18,v5
        vwredsum.vs     v4,v16,v4
        vwredsum.vs     v3,v14,v3
        vwredsum.vs     v2,v12,v2
        vwredsum.vs     v1,v10,v1
        vmv1r.v v9,v8
        vwredsum.vs     v9,v24,v9
        vmv1r.v v24,v7
        vwredsum.vs     v24,v22,v24
        vmv1r.v v22,v6
        vwredsum.vs     v22,v20,v22
        vmv1r.v v20,v5
        vwredsum.vs     v20,v18,v20
        vmv1r.v v18,v4
        vwredsum.vs     v18,v16,v18
        vmv1r.v v16,v3
        vwredsum.vs     v16,v14,v16
        vmv1r.v v14,v2
        vwredsum.vs     v14,v12,v14
        vmv1r.v v12,v1
        vwredsum.vs     v12,v10,v12

After this patch:

vfwredusum.vs v17,v12,v17
vfwredusum.vs v18,v10,v18
vfwredusum.vs v15,v26,v15
vfwredusum.vs v16,v24,v16
vfwredusum.vs v12,v12,v17
vfwredusum.vs v10,v10,v18
vfwredusum.vs v13,v6,v20
vfwredusum.vs v11,v8,v19
vfwredusum.vs v6,v6,v13
vfwredusum.vs v8,v8,v11
vfwredusum.vs v7,v4,v21
vfwredusum.vs v9,v2,v22
vfwredusum.vs v14,v26,v15
vfwredusum.vs v1,v24,v16
vfwredusum.vs v4,v4,v7
vfwredusum.vs v2,v2,v9

Same behavior as LLVM, and honor RISC-V V spec.

PR target/112431

gcc/ChangeLog:

* config/riscv/vector.md: Remove earlyclobber from widen reduction.

gcc/testsuite/ChangeLog:

* gcc.target/riscv/rvv/base/pr112431-35.c: New test.
* gcc.target/riscv/rvv/base/pr112431-36.c: New test.

commit | commitdiff | tree

Indu Bhagat [Mon, 4 Dec 2023 09:57:34 +0000 (01:57 -0800)]

BTF: fix PR debug/112656

PR debug/112656 - btf: function prototypes generated with name

With this patch, all BTF_KIND_FUNC_PROTO will appear anonymous in the
generated BTF section.

As noted in the discussion in the bugzilla, the number of
BTF_KIND_FUNC_PROTO types output varies across targets (BPF with -mco-re
vs non-BPF targets). Hence the check in the test case merely checks
that all BTF_KIND_FUNC_PROTO appear anonymous.

gcc/ChangeLog:

PR debug/112656
* btfout.cc (btf_asm_type): Fixup ctti_name for all
BTF types of kind BTF_KIND_FUNC_PROTO.

gcc/testsuite/ChangeLog:

PR debug/112656
* gcc.dg/debug/btf/btf-function-7.c: New test.

commit | commitdiff | tree

Indu Bhagat [Mon, 4 Dec 2023 09:57:25 +0000 (01:57 -0800)]

BTF: fix PR debug/112768

PR debug/112768 - btf: fix asm comment output for BTF_KIND_FUNC* kinds

The patch adds a small function to abstract out the detail and return
the name of the type.  The patch also fixes the issue of BTF_KIND_FUNC
appearing in the comments with a 'null' string.

For btf-function-6.c testcase, after the patch:

        .long   0       # TYPE 2 BTF_KIND_FUNC_PROTO ''
        .long   0xd000002       # btt_info: kind=13, kflag=0, vlen=2
        .long   0x1     # btt_type: (BTF_KIND_INT 'int')
        .long   0       # farg_name
        .long   0x1     # farg_type: (BTF_KIND_INT 'int')
        .long   0       # farg_name
        .long   0x1     # farg_type: (BTF_KIND_INT 'int')
        .long   0       # TYPE 3 BTF_KIND_FUNC_PROTO ''
        .long   0xd000001       # btt_info: kind=13, kflag=0, vlen=1
        .long   0x1     # btt_type: (BTF_KIND_INT 'int')
        .long   0x68    # farg_name
        .long   0x1     # farg_type: (BTF_KIND_INT 'int')
        .long   0x5     # TYPE 4 BTF_KIND_FUNC 'extfunc'
        .long   0xc000002       # btt_info: kind=12, kflag=0, linkage=2
        .long   0x2     # btt_type: (BTF_KIND_FUNC_PROTO '')
        .long   0xd     # TYPE 5 BTF_KIND_FUNC 'foo'
        .long   0xc000001       # btt_info: kind=12, kflag=0, linkage=1
        .long   0x3     # btt_type: (BTF_KIND_FUNC_PROTO '')

gcc/ChangeLog:

PR debug/112768
* btfout.cc (get_btf_type_name): New definition.
(btf_collect_datasec): Update dtd_name to the original type name
string.
(btf_asm_type_ref): Use the new get_btf_type_name function
instead.
(btf_asm_type): Likewise.
(btf_asm_func_type): Likewise.

gcc/testsuite/ChangeLog:

PR debug/112768
* gcc.dg/debug/btf/btf-function-6.c: Empty string expected with
BTF_KIND_FUNC_PROTO.

commit | commitdiff | tree

Pan Li [Mon, 4 Dec 2023 08:06:14 +0000 (16:06 +0800)]

RISC-V: Add test case for bug PR112813

The bugzilla 112813 has been fixed recently, add below test
case for the bug.

PR target/112813

gcc/testsuite/ChangeLog:

* gcc.target/riscv/rvv/vsetvl/pr112813-1.c: New test.

Signed-off-by: Pan Li <pan2.li@intel.com>

commit | commitdiff | tree

Jakub Jelinek [Mon, 4 Dec 2023 08:01:09 +0000 (09:01 +0100)]

i386: Fix rtl checking ICE in ix86_elim_entry_set_got [PR112837]

The following testcase ICEs with RTL checking, because it sets if
XINT (SET_SRC (set), 1) is UNSPEC_SET_GOT without checking if SET_SRC (set)
is actually an UNSPEC, so any time we see any other insn with PARALLEL
and a SET in it which is not an UNSPEC we ICE during RTL checking or
access there some other union member as if it was an rt_int.
The rest is just small cleanup.

2023-12-04 Jakub Jelinek <jakub@redhat.com>

PR target/112837
* config/i386/i386.cc (ix86_elim_entry_set_got): Before checking
for UNSPEC_SET_GOT check that SET_SRC is UNSPEC. Use SET_SRC and
SET_DEST macros instead of XEXP, rename vec variable to set.

* gcc.dg/pr112837.c: New test.

commit | commitdiff | tree

Jakub Jelinek [Mon, 4 Dec 2023 08:00:18 +0000 (09:00 +0100)]

i386: Fix up signbit<mode>2 expander [PR112816]

The following testcase ICEs, because the signbit<mode>2 expander uses an
explicit SUBREG in the pattern around match_operand with register_operand
predicate. If we are unlucky enough that expansion tries to expand it
with some SUBREG as operands[1], we have two nested SUBREGs in the IL,
which is not valid and causes ICE later.

2023-12-04 Jakub Jelinek <jakub@redhat.com>

PR target/112816
* config/i386/sse.md (signbit<mode>2): Force operands[1] into a REG.

* gcc.target/i386/sse2-pr112816.c: New test.

commit | commitdiff | tree

Jakub Jelinek [Mon, 4 Dec 2023 07:59:15 +0000 (08:59 +0100)]

c++: #pragma GCC unroll C++ fixes [PR112795]

foo in the unroll-5.C testcase ICEs because cp_parser_pragma_unroll
during parsing calls maybe_constant_value unconditionally, which is
fine if !processing_template_decl, but can ICE otherwise.

While just calling fold_non_dependent_expr there instead could be enough
to fix the ICE (and I guess the right thing to do for backports if any),
I don't see a reason why we couldn't handle a dependent #pragma GCC unroll
argument as well, the unrolling isn't done in the FE and all the middle-end
cares about is that ANNOTATE_EXPR has a 1..65534 last operand when it is
annot_expr_unroll_kind.

So, the following patch changes all the unsigned short unroll arguments
to tree unroll (and thus avoids the tree -> unsigned short -> tree
conversions), does the type and value checking during parsing only if
the argument isn't dependent and repeats it during instantiation.

2023-12-04  Jakub Jelinek  <jakub@redhat.com>

PR c++/112795
gcc/cp/
* cp-tree.h (cp_convert_range_for): Change UNROLL type from
unsigned short to tree.
(finish_while_stmt_cond, finish_do_stmt, finish_for_cond): Likewise.
* parser.cc (cp_parser_statement): Pass NULL_TREE rather than 0 to
cp_parser_iteration_statement UNROLL argument.
(cp_parser_for, cp_parser_c_for): Change UNROLL type from
unsigned short to tree.
(cp_parser_range_for): Likewise.  Set RANGE_FOR_UNROLL to just UNROLL
rather than build_int_cst from it.
(cp_convert_range_for, cp_parser_iteration_statement): Change UNROLL
type from unsigned short to tree.
(cp_parser_omp_loop_nest): Pass NULL_TREE rather than 0 to
cp_parser_range_for UNROLL argument.
(cp_parser_pragma_unroll): Return tree rather than unsigned short.
If parsed expression is type dependent, just return it, don't diagnose
issues with value if it is value dependent.
(cp_parser_pragma): Change UNROLL type from unsigned short to tree.
* semantics.cc (finish_while_stmt_cond): Change UNROLL type from
unsigned short to tree.  Build ANNOTATE_EXPR with UNROLL as its last
operand rather than build_int_cst from it.
(finish_do_stmt, finish_for_cond): Likewise.
* pt.cc (tsubst_stmt) <case RANGE_FOR_STMT>: Change UNROLL type from
unsigned short to tree and set it to RECUR on RANGE_FOR_UNROLL (t).
(tsubst_expr) <case ANNOTATE_EXPR>: For annot_expr_unroll_kind repeat
checks on UNROLL value from cp_parser_pragma_unroll.
gcc/testsuite/
* g++.dg/ext/unroll-5.C: New test.
* g++.dg/ext/unroll-6.C: New test.

commit | commitdiff | tree

Feng Wang [Mon, 4 Dec 2023 06:43:19 +0000 (06:43 +0000)]

RISC-V: Update crypto vector ISA info with latest spec

This patch add the Zvkb subset of crypto vector extension. The
corresponding test cases have aslo been modified.

gcc/ChangeLog:

* common/config/riscv/riscv-common.cc: Add zvkb ISA info.
* config/riscv/riscv.opt: Add Mask(ZVKB)

gcc/testsuite/ChangeLog:

* gcc.target/riscv/zvkn-1.c: Replace zvbb with zvkb.
* gcc.target/riscv/zvkn.c:   Ditto.
* gcc.target/riscv/zvknc-1.c:Ditto.
* gcc.target/riscv/zvknc-2.c:Ditto.
* gcc.target/riscv/zvknc.c:  Ditto.
* gcc.target/riscv/zvkng-1.c:Ditto.
* gcc.target/riscv/zvkng-2.c:Ditto.
* gcc.target/riscv/zvkng.c:  Ditto.
* gcc.target/riscv/zvks-1.c: Ditto.
* gcc.target/riscv/zvks.c:   Ditto.
* gcc.target/riscv/zvksc-1.c:Ditto.
* gcc.target/riscv/zvksc-2.c:Ditto.
* gcc.target/riscv/zvksc.c:  Ditto.
* gcc.target/riscv/zvksg-1.c:Ditto.
* gcc.target/riscv/zvksg-2.c:Ditto.
* gcc.target/riscv/zvksg.c:  Ditto.

commit | commitdiff | tree

Fei Gao [Tue, 28 Nov 2023 02:32:24 +0000 (02:32 +0000)]

prefer Zicond primitive semantics to SFB

Move Zicond md files ahead of SFB to recognize Zicond first.

Take the following case for example.

CFLAGS: -mtune=sifive-7-series -march=rv64gc_zicond -mabi=lp64d

long primitiveSemantics_00(long a, long b) { return a == 0 ? 0 : b; }

before patch:
primitiveSemantics_00:
bne a0,zero,1f # movcc
mv a1,zero
1:
mv a0,a1
ret

after patch:
primitiveSemantics_00:
czero.eqz a0,a1,a0
ret

Co-authored-by: Xiao Zeng<zengxiao@eswincomputing.com>
gcc/ChangeLog:

* config/riscv/riscv.md (*mov<GPR:mode><X:mode>cc):move to sfb.md
* config/riscv/sfb.md: New file.

gcc/testsuite/ChangeLog:

* gcc.target/riscv/zicond-sfb-primitiveSemantics.c: New test.

commit | commitdiff | tree

Kito Cheng [Mon, 4 Dec 2023 06:17:52 +0000 (14:17 +0800)]

RISC-V: Add sifive-x280 to -mcpu

x280 is one of SiFive core, and it release for a while, also
upstream LLVM already support that.

[1] https://www.sifive.com/cores/intelligence-x280

gcc/ChangeLog:

* config/riscv/riscv-cores.def: Add sifive-x280.
* doc/invoke.texi (RISC-V Options): Add sifive-x280

gcc/testsuite/ChangeLog:

* gcc.target/riscv/mcpu-sifive-x280.c: New test.

commit | commitdiff | tree

Kito Cheng [Mon, 27 Nov 2023 14:01:44 +0000 (22:01 +0800)]

RISC-V: Refactor riscv_implied_info_t to make it able to handle conditional implication [NFC]

RISC-V ISA implication rules become little bit complicated than before,
it may come with condition, so this commit extend the capability of
riscv_implied_info_t, also make it more...C++ize.

gcc/ChangeLog:

* common/config/riscv/riscv-common.cc (riscv_implied_predicator_t): New.
(riscv_implied_info_t::riscv_implied_info_t): New.
(riscv_implied_info_t::match): New.
(riscv_implied_info): New entry for zcf.
(riscv_subset_list::handle_implied_ext): Use
riscv_implied_info_t::match.
(riscv_subset_list::check_implied_ext): Ditto.
(riscv_subset_list::handle_combine_ext): Ditto.
(riscv_subset_list::parse): Move zcf implication handling to
riscv_implied_infos.

commit | commitdiff | tree

Kito Cheng [Mon, 27 Nov 2023 07:28:30 +0000 (15:28 +0800)]

RISC-V: Refine riscv_subset_list::parse [NFC]

Extract the logic of checking conflict extensions to a standard alone
function, prepare to add more checking logic.

gcc/ChangeLog:

* common/config/riscv/riscv-common.cc
(riscv_subset_list::check_conflict_ext): New.
(riscv_subset_list::parse): Move checking conflict ext. to
check_conflict_ext.
* config/riscv/riscv-subset.h:
Add riscv_subset_list::check_conflict_ext.

commit | commitdiff | tree

Hu, Lin1 [Mon, 27 Nov 2023 03:28:00 +0000 (11:28 +0800)]

i386: Fix CPUID of USER_MSR.

gcc/ChangeLog:

* common/config/i386/cpuinfo.h (get_available_features): Move USER_MSR
to the correct location.

gcc/testsuite/ChangeLog:

* gcc.target/i386/user_msr-1.c: Correct the MSR index for give the user
an proper example.

commit | commitdiff | tree

Gaius Mulley [Mon, 4 Dec 2023 01:35:46 +0000 (01:35 +0000)]

PR modula2/112825: modula2 builds target objects as part of all-gcc

This patch fixes the PR modula2/112825 which fails if the target
assembler is not present on the host.  This can be seen if the
build invokes make all-gcc.  m2 should not attempt to generate
target libraries when performing make all-gcc.

Prior to this patch it generated build/gcc/m2/gm2-libs/SYSTSEM.def
using the script gcc/m2/tools-src/makeSystem (and gm2 -c).
makeSystem should exec gm2 -S instead (and other flags)
to generate the list of target data types without requiring any
target tools.  The target types emitted are textually converted
into SYSTEM.def.

gcc/m2/ChangeLog:

PR modula2/112825
* tools-src/makeSystem: Change all occurrences of -c to -S.

Signed-off-by: Gaius Mulley <gaiusmod2@gmail.com>

commit | commitdiff | tree

Juzhe-Zhong [Sun, 3 Dec 2023 22:47:41 +0000 (06:47 +0800)]

RISC-V: Robostify the W43, W86, W87 constraint enabled attribute

Committed as it is obvious fix.

gcc/ChangeLog:

* config/riscv/riscv.md: Rostify the constraints.

commit | commitdiff | tree

chenxiaolong [Tue, 7 Nov 2023 03:53:39 +0000 (11:53 +0800)]

LoongArch: Add intrinsic function descriptions for LSX and LASX instructions to doc.

gcc/ChangeLog:

* doc/extend.texi: Add information about the intrinsic function of the vector
instruction.

commit | commitdiff | tree

GCC Administrator [Mon, 4 Dec 2023 00:16:38 +0000 (00:16 +0000)]

Daily bump.

commit | commitdiff | tree

Jakub Jelinek [Sun, 3 Dec 2023 19:03:27 +0000 (20:03 +0100)]

testsuite: Fix up gcc.target/aarch64/pr112406.c for modern C [PR112406]

On Fri, Nov 17, 2023 at 02:04:01PM +0100, Robin Dapp wrote:
> > Yes, your version is also OK.
>
> The attached was bootstrapped and regtested on aarch64, x86 and
> regtested on riscv.  Going to commit it later unless somebody objects.

Unfortunately the aarch64/pr112406.c was reduced too much and is rejected
since the switch to modern C patchset.

The following patch fixes that, I've verified the testcase
before/after the changes still ICEs in r14-5563 and doesn't with
r14-5564 and after the changes compiles fine with even latest trunk.
Everything admittedly with a cross-compiler, but that shouldn't change
anything.

Note, one of the modern C changes is that at least when people use
cvise/creduce/delta scripts which ensure no further errors are introduced
during the reduction then expected originally such reductions will not
appear anymore.

2023-12-03  Jakub Jelinek  <jakub@redhat.com>

PR middle-end/112406
* gcc.target/aarch64/pr112406.c (MagickPixelPacket): Add missing
semicolon.
(GetImageChannelMoments_image): Avoid using implicit int.
(SetMagickPixelPacket): Use void return type instead of implicit int.
(GetImageChannelMoments): Likewise.  Use __builtin_atan instead of
atan.

commit | commitdiff | tree

Jakub Jelinek [Sun, 3 Dec 2023 16:54:03 +0000 (17:54 +0100)]

lower-bitint: Fix up lower_addsub_overflow [PR112807]

lower_addsub_overflow uses handle_cast or handle_operand to extract current
limb from the operands.  Both of those functions heavily assume that they
return a large or huge BITINT_TYPE.  The problem in the testcase is that
this is violated.  Normally, lower_addsub_overflow isn't even called if
neither the return's type element type nor any of the operand is large/huge
BITINT_TYPE (on x86_64 129+ bits), for middle BITINT_TYPE (on x86_64 65-128
bits) some other code casts such operands to {,unsigned }__int128.
In the testcase the result is complex unsigned, so small, but one of the
arguments is _BitInt(256), so lower_addsub_overflow is called.  But
range_for_prec asks the ranger for ranges of the operands and in this
case the first argument has [0, 0xffffffff] range and second [-2, 1], so
unsigned 32-bit and signed 2-bit, and in such case the code for
handle_operand/handle_cast purposes would use the _BitInt(256) type for the
first operand (ok), but because prec3 aka maximum of result precision and
the VRP computes ranges of the arguments is 32, use cast to 32-bit
BITINT_TYPE, which is why it didn't work correctly.
The following patch ensures that in such cases we use handle_cast to the
type of the other argument.

Perhaps incrementally, we could try to optimize this in an earlier phase,
see that while the .{ADD,SUB}_OVERFLOW has large/huge _BitInt argument, as
ranger says it fits into a smaller type, add a cast of the larger argument
to the smaller precision type in which it fits.  Either in
gimple_lower_bitint, or match.pd.  An argument for the latter is that e.g.
complex unsigned .ADD_OVERFLOW (unsigned_long_long_arg, unsigned_arg)
where ranger says unsigned_long_long_arg fits into unsigned 32-bit could
be also more efficient as
.ADD_OVERFLOW ((unsigned) unsigned_long_long_arg, unsigned_arg)

2023-12-03  Jakub Jelinek  <jakub@redhat.com>

PR middle-end/112807
* gimple-lower-bitint.cc (bitint_large_huge::lower_addsub_overflow):
When choosing type0 and type1 types, if prec3 has small/middle bitint
kind, use maximum of type0 and type1's precision instead of prec3.

* gcc.dg/bitint-46.c: New test.

commit | commitdiff | tree

Saurabh Jha [Sun, 3 Dec 2023 16:15:24 +0000 (16:15 +0000)]

testsuite: Fix up pr112337.c test

gcc/testsuite/ChangeLog:

* gcc.target/arm/mve/pr112337.c: Use int32_t instead of int.

commit | commitdiff | tree

Pan Li [Sun, 3 Dec 2023 14:13:00 +0000 (22:13 +0800)]

RISC-V: Fix typo in test abi configuration

It should be -mabi=lp64d instead of -mabi=lp64, committed in as obvious.

gcc/testsuite/ChangeLog:

* gcc.target/riscv/rvv/base/pr112743-1.c: Fix typo.
* gcc.target/riscv/rvv/base/pr112743-2.c: Ditto.

Signed-off-by: Pan Li <pan2.li@intel.com>

commit | commitdiff | tree

Jeff Law [Sun, 3 Dec 2023 05:54:46 +0000 (22:54 -0700)]

[committed] Fix gnu23-builtins-no-dfp

Last patch for the night.  There's still a bit of minor fallout left in GCC
(loongarch testsuite for example).  But things are looking good on the targets
I test.  The plan is to start submitting the various newlib/libgloss fixes
tomorrow.

Anyway, this test was the one I was most concerned about.  Basically we're
testing that on a !dfp target that the builtins are not available.  It expects
a warning, but gets an error by default now.  I just changed the test to use
-fpermissive, so that the test behaves as it did previously.

Pushed to the trunk.

gcc/testsuite
* gcc.dg/gnu23-builtins-no-dfp-1.c: Add -fpermissive.

commit | commitdiff | tree

Jeff Law [Sun, 3 Dec 2023 05:45:48 +0000 (22:45 -0700)]

[committed] Fix build of libgcc on ports using FDPIC

read_encoded_value_with_base has an ifdef'd code path conditional on __FDPIC__
which was calling _Unwind_gnu_Find_got without a prototype. This naturally
caused various build failures.

This adds a suitable prototype.

Pushed to the trunk.

libgcc

* unwind-pe.h (_Unwind_gnu_Find_got): Add prototype.

commit | commitdiff | tree

Jeff Law [Sun, 3 Dec 2023 05:40:41 +0000 (22:40 -0700)]

[committed] Fix pr65369.c

There's a caller/callee type mismatch in this test that shows up on targets
where ints are something other than 32 bit types.

Based on reviewing the original bug report, the fix and the part of the test
this fixes, I'm reasonably confident this hasn't compromised the test.

gcc/testsuite
* gcc.c-torture/execute/pr65369.c: Fix type mismatch.

commit | commitdiff | tree

Jeff Law [Sun, 3 Dec 2023 05:32:22 +0000 (22:32 -0700)]

[committed] Fix comp-goto-1.c on 16 bit targets

I don't remember what port triggered this, but it's obviously that
comp-goto-1.c needs to be fixed.

Basically the test has two implementations. One is just a dummy with no return
value on main() triggering the new errors.

gcc/testsuite
* gcc.c-torture/execute/comp-goto-1.c: Fix return value of main for
16 bit targets.

commit | commitdiff | tree

Jeff Law [Sun, 3 Dec 2023 05:16:33 +0000 (22:16 -0700)]

[committed] Fix a few arc tests

Similar to others. Where it's easy to fix the implicit types or add prototypes
I did. One was just ugly and I didn't want to think too hard, so I just added
-fpermissive.

Pushed to the trunk.

gcc/testsuite
* gcc.target/arc/lra-1.c: Fix missing prototypes and implicit
types in variable definitions.
* gcc.target/arc/pic-1.c: Similarly.
* gcc.target/arc/pr9001191897.c: Similarly.
* gcc.target/arc/pr9001195952.c: Add -fpermissive.

commit | commitdiff | tree

Jeff Law [Sun, 3 Dec 2023 05:12:55 +0000 (22:12 -0700)]

[committed] Fix nios2 tests

The nios2 port has two tests that are affected by the recent changes. In
cdx-ldstwm-1.c it was easiest to just add -fpermissive. for cdx-ldstwm-2.c
adding an prototype for exit and abort is all that's needed.

gcc/testsuite
* gcc.target/nios2/cdx-ldstwm-1.c: Add -fpermissive.
* gcc.target/nios2/cdx-ldstwm-2.c: Add prototypes fro abort and exit.

commit | commitdiff | tree

Jeff Law [Sun, 3 Dec 2023 05:07:59 +0000 (22:07 -0700)]

[committed] Fix rx build failure in libgcc

The rx port has a bunch of what I presume are ABI compatibility functions in
libgcc. Those compatibility functions routines such as __eqdf2 from libgcc,
but without a prototype. This patch adds the missing prototypes.

libgcc/
* config/rx/rx-abi-functions.c (__ltdf2, __gtdf2): Add prototype.
(__ledf2, __gedf2, __eqdf2, __nedf2): Likewise.
(__ltsf2, __gtsf2, __lesf2, __gesf2, __eqsf2, __nesf2): Likewise.

commit | commitdiff | tree

Jeff Law [Sun, 3 Dec 2023 05:03:28 +0000 (22:03 -0700)]

[committed] Fix minor testsuite problems on H8 after C99 changes

Two minor regressions on the H8 were triggered by the C99 changes.  First
pr58400.c has several functions without prototypes.  I just added -fpermissive
to that test.  Second pr17306-2.c has a single call to an unprototyped function
for which I added the prototype.

These are both H8 specific tests.

gcc/testsuite
* gcc.target/h8300/pr58400.c: Add -fpermissive.
* gcc.target/h8300/pr17306-2.c: Add missing prototype.

commit | commitdiff | tree

Jeff Law [Sun, 3 Dec 2023 04:54:36 +0000 (21:54 -0700)]

[committed] Fix frv build after C99 changes

Two issues prevent the frv-elf port from building after the C99 changes. First
the trampoline code emitted into libgcc has calls to exit, but no prototype.
Adding a trivial prototype for exit() into the macro fixes that little goof.

Second, frvbegin.c has a call to atexit, so a quick prototype is added into
frvbegin.c to fix that problem.

That's enough to get the compiler building again.

gcc/
* config/frv/frv.h (TRANSFER_FROM_TRAMPOLINE): Add prototype for exit.

libgcc/
* config/frv/frvbegin.c (atexit): Add prototype.

commit | commitdiff | tree

GCC Administrator [Sun, 3 Dec 2023 00:16:41 +0000 (00:16 +0000)]

Daily bump.

commit | commitdiff | tree

Alexandre Oliva [Sat, 2 Dec 2023 17:14:02 +0000 (14:14 -0300)]

libsupc++: try cxa_thread_atexit_impl at runtime

g++.dg/tls/thread_local-order2.C fails when the toolchain is built for
a platform that lacks __cxa_thread_atexit_impl, even if the program is
built and run using that toolchain on a (later) platform that offers
__cxa_thread_atexit_impl.

This patch adds runtime testing for __cxa_thread_atexit_impl on
platforms that support weak symbols.

for libstdc++-v3/ChangeLog

* libsupc++/atexit_thread.cc [__GXX_WEAK__]: Add dynamic
detection of __cxa_thread_atexit_impl.

commit | commitdiff | tree

Harald Anlauf [Fri, 1 Dec 2023 21:44:30 +0000 (22:44 +0100)]

Fortran: deferred-length character optional dummy arguments [PR93762,PR100651]

gcc/fortran/ChangeLog:

PR fortran/93762
PR fortran/100651
* trans-array.cc (gfc_trans_deferred_array): Add presence check
for optional deferred-length character dummy arguments.
* trans-expr.cc (gfc_conv_missing_dummy): The character length for
deferred-length dummy arguments is passed by reference, so that
its value can be returned. Adjust handling for optional dummies.

gcc/testsuite/ChangeLog:

PR fortran/93762
PR fortran/100651
* gfortran.dg/optional_deferred_char_1.f90: New test.

commit | commitdiff | tree

Richard Sandiford [Sat, 2 Dec 2023 13:49:55 +0000 (13:49 +0000)]

attribs: Namespace-aware lookup_attribute_spec

attribute_ignored_p already used a namespace-aware query
to find the attribute_spec for an existing attribute:

const attribute_spec *as = lookup_attribute_spec (TREE_PURPOSE (attr));

This patch does the same for other callers in the file.

gcc/
* attribs.cc (comp_type_attributes): Pass the full TREE_PURPOSE
to lookup_attribute_spec, rather than just the name.
(remove_attributes_matching): Likewise.

commit | commitdiff | tree

Richard Sandiford [Sat, 2 Dec 2023 13:49:54 +0000 (13:49 +0000)]

attribs: Consider namespaces when comparing attributes

decl_attributes and comp_type_attributes both had code that
iterated over one list of attributes and looked for coresponding
attributes in another list. This patch makes those lookups
namespace-aware.

gcc/
* attribs.cc (find_same_attribute): New function.
(decl_attributes, comp_type_attributes): Use it when looking
up one list's attributes in another list.

commit | commitdiff | tree

Richard Sandiford [Sat, 2 Dec 2023 13:49:54 +0000 (13:49 +0000)]

attribs: Cache the gnu namespace

Later patches add more calls to get_attribute_namespace.
For scoped attributes, this is a simple operation on tree pointers.
But for normal GNU attributes (the vast majority), it involves a
call to get_identifier ("gnu"). This patch caches the identifier
for speed.

gcc/
* Makefile.in (GTFILES): Add attribs.cc.
* attribs.cc (gnu_namespace_cache): New variable.
(get_gnu_namespace): New function.
(lookup_attribute_spec): Use it instead of get_identifier ("gnu").
(get_attribute_namespace, attribs_cc_tests): Likewise.

commit | commitdiff | tree

Richard Sandiford [Sat, 2 Dec 2023 13:49:53 +0000 (13:49 +0000)]

Tweak language choice in config-list.mk

When I tried to use config-list.mk, the build for every triple except
the build machine's failed for m2.  This is because, unlike other
languages, m2 builds target objects during all-gcc.  The build will
therefore fail unless you have access to an appropriate binutils
(or an equivalent).  That's quite a big ask for over 100 targets. :)

This patch therefore makes m2 an optional inclusion.

Doing that wasn't entirely straightforward though.  The current
configure line includes "--enable-languages=all,...", which means
that the "..." can only force languages to be added that otherwise
wouldn't have been.  (I.e. the only effect of the "..." is to
override configure autodetection.)

The choice of all,ada and:

  # Make sure you have a recent enough gcc (with ada support) in your path so
  # that --enable-werror-always will work.

make it clear that lack of GNAT should be a build failure rather than
silently ignored.  This predates the D frontend, which requires GDC
in the same way that Ada requires GNAT.  I don't know of a reason
why D should be treated differently.

The patch therefore expands the "all" into a specific list of
languages.

That in turn meant that Fortran had to be handled specially,
since bpf and mmix don't support Fortran.

Perhaps there's an argument that m2 shouldn't build target objects
during all-gcc, but (a) it works for practical usage and (b) the
patch is an easy workaround.  I'd be happy for the patch to be
reverted if the build system changes.

contrib/
* config-list.mk (OPT_IN_LANGUAGES): New variable.
($(LIST)): Replace --enable-languages=all with a specifc list.
Disable fortran on bpf and mmix.  Enable the languages in
OPT_IN_LANGUAGES.

commit | commitdiff | tree

Richard Sandiford [Sat, 2 Dec 2023 13:49:52 +0000 (13:49 +0000)]

Allow target attributes in non-gnu namespaces

Currently there are four static sources of attributes:

- LANG_HOOKS_ATTRIBUTE_TABLE
- LANG_HOOKS_COMMON_ATTRIBUTE_TABLE
- LANG_HOOKS_FORMAT_ATTRIBUTE_TABLE
- TARGET_ATTRIBUTE_TABLE

All of the attributes in these tables go in the "gnu" namespace.
This means that they can use the traditional GNU __attribute__((...))
syntax and the standard [[gnu::...]] syntax.

Standard attributes are registered dynamically with a null namespace.
There are no supported attributes in other namespaces (clang, vendor
namespaces, etc.).

This patch tries to generalise things by making the namespace
part of the attribute specification.

It's usual for multiple attributes to be defined in the same namespace,
so rather than adding the namespace to each individual definition,
it seemed better to group attributes in the same namespace together.
This would also allow us to reuse the same table for clang attributes
that are written with the GNU syntax, or other similar situations
where the attribute can be accessed via multiple "spellings".

The patch therefore adds a scoped_attribute_specs that contains
a namespace and a list of attributes in that namespace.

It's still possible to have multiple scoped_attribute_specs
for the same namespace.  E.g. it makes sense to keep the
C++-specific, C/C++-common, and format-related attributes in
separate tables, even though they're all GNU attributes.

Current lists of attributes are terminated by a null name.
Rather than keep that for the new structure, it seemed neater
to use an array_slice.  This also makes the tables slighly more
compact.

In general, a target might want to support attributes in multiple
namespaces.  Rather than have a separate hook for each possibility
(like the three langhooks above), it seemed better to make
TARGET_ATTRIBUTE_TABLE a table of tables.  Specifically, it's
an array_slice of scoped_attribute_specs.

We can do the same thing for langhooks, which allows the three hooks
above to be merged into a single LANG_HOOKS_ATTRIBUTE_TABLE.
It also allows the standard attributes to be registered statically
and checked by the usual attribs.cc checks.

The patch adds a TARGET_GNU_ATTRIBUTES helper for the common case
in which a target wants a single table of gnu attributes.  It can
only be used if the table is free of preprocessor directives.

There are probably other things we need to do to make vendor namespaces
work smoothly.  E.g. in principle it would be good to make exclusion
sets namespace-aware.  But to some extent we have that with standard
vs. gnu attributes too.  This patch is just supposed to be a first step.

gcc/
* attribs.h (scoped_attribute_specs): New structure.
(register_scoped_attributes): Take a reference to a
scoped_attribute_specs instead of separate namespace and array
parameters.
* plugin.h (register_scoped_attributes): Likewise.
* attribs.cc (register_scoped_attributes): Likewise.
(attribute_tables): Change into an array of scoped_attribute_specs
pointers.  Reduce to 1 element for frontends and 1 element for targets.
(empty_attribute_table): Delete.
(check_attribute_tables): Update for changes to attribute_tables.
Use a hash_set to identify duplicates.
(handle_ignored_attributes_option): Update for above changes.
(init_attributes): Likewise.
(excl_pair): Delete.
(test_attribute_exclusions): Update for above changes.  Don't
enforce symmetry for standard attributes in the top-level namespace.
* langhooks-def.h (LANG_HOOKS_COMMON_ATTRIBUTE_TABLE): Delete.
(LANG_HOOKS_FORMAT_ATTRIBUTE_TABLE): Likewise.
(LANG_HOOKS_INITIALIZER): Update accordingly.
(LANG_HOOKS_ATTRIBUTE_TABLE): Define to an empty constructor.
* langhooks.h (lang_hooks::common_attribute_table): Delete.
(lang_hooks::format_attribute_table): Likewise.
(lang_hooks::attribute_table): Redefine to an array of
scoped_attribute_specs pointers.
* target-def.h (TARGET_GNU_ATTRIBUTES): New macro.
* target.def (attribute_spec): Redefine to return an array of
scoped_attribute_specs pointers.
* tree-inline.cc (function_attribute_inlinable_p): Update accordingly.
* doc/tm.texi: Regenerate.
* config/aarch64/aarch64.cc (aarch64_attribute_table): Define using
TARGET_GNU_ATTRIBUTES.
* config/alpha/alpha.cc (vms_attribute_table): Likewise.
* config/avr/avr.cc (avr_attribute_table): Likewise.
* config/bfin/bfin.cc (bfin_attribute_table): Likewise.
* config/bpf/bpf.cc (bpf_attribute_table): Likewise.
* config/csky/csky.cc (csky_attribute_table): Likewise.
* config/epiphany/epiphany.cc (epiphany_attribute_table): Likewise.
* config/gcn/gcn.cc (gcn_attribute_table): Likewise.
* config/h8300/h8300.cc (h8300_attribute_table): Likewise.
* config/loongarch/loongarch.cc (loongarch_attribute_table): Likewise.
* config/m32c/m32c.cc (m32c_attribute_table): Likewise.
* config/m32r/m32r.cc (m32r_attribute_table): Likewise.
* config/m68k/m68k.cc (m68k_attribute_table): Likewise.
* config/mcore/mcore.cc (mcore_attribute_table): Likewise.
* config/microblaze/microblaze.cc (microblaze_attribute_table):
Likewise.
* config/mips/mips.cc (mips_attribute_table): Likewise.
* config/msp430/msp430.cc (msp430_attribute_table): Likewise.
* config/nds32/nds32.cc (nds32_attribute_table): Likewise.
* config/nvptx/nvptx.cc (nvptx_attribute_table): Likewise.
* config/riscv/riscv.cc (riscv_attribute_table): Likewise.
* config/rl78/rl78.cc (rl78_attribute_table): Likewise.
* config/rx/rx.cc (rx_attribute_table): Likewise.
* config/s390/s390.cc (s390_attribute_table): Likewise.
* config/sh/sh.cc (sh_attribute_table): Likewise.
* config/sparc/sparc.cc (sparc_attribute_table): Likewise.
* config/stormy16/stormy16.cc (xstormy16_attribute_table): Likewise.
* config/v850/v850.cc (v850_attribute_table): Likewise.
* config/visium/visium.cc (visium_attribute_table): Likewise.
* config/arc/arc.cc (arc_attribute_table): Likewise.  Move further
down file.
* config/arm/arm.cc (arm_attribute_table): Update for above changes,
using...
(arm_gnu_attributes, arm_gnu_attribute_table): ...these new globals.
* config/i386/i386-options.h (ix86_attribute_table): Delete.
(ix86_gnu_attribute_table): Declare.
* config/i386/i386-options.cc (ix86_attribute_table): Replace with...
(ix86_gnu_attributes, ix86_gnu_attribute_table): ...these two globals.
* config/i386/i386.cc (ix86_attribute_table): Define as an array of
scoped_attribute_specs pointers.
* config/ia64/ia64.cc (ia64_attribute_table): Update for above changes,
using...
(ia64_gnu_attributes, ia64_gnu_attribute_table): ...these new globals.
* config/rs6000/rs6000.cc (rs6000_attribute_table): Update for above
changes, using...
(rs6000_gnu_attributes, rs6000_gnu_attribute_table): ...these new
globals.

gcc/ada/
* gcc-interface/gigi.h (gnat_internal_attribute_table): Change
type to scoped_attribute_specs.
* gcc-interface/utils.cc (gnat_internal_attribute_table): Likewise,
using...
(gnat_internal_attributes): ...this as the underlying array.
* gcc-interface/misc.cc (gnat_attribute_table): New global.
(LANG_HOOKS_ATTRIBUTE_TABLE): Use it.

gcc/c-family/
* c-common.h (c_common_attribute_table): Replace with...
(c_common_gnu_attribute_table): ...this.
(c_common_format_attribute_table): Change type to
scoped_attribute_specs.
* c-attribs.cc (c_common_attribute_table): Replace with...
(c_common_gnu_attributes, c_common_gnu_attribute_table): ...these
new globals.
(c_common_format_attribute_table): Change type to
scoped_attribute_specs, using...
(c_common_format_attributes): ...this as the underlying array.

gcc/c/
* c-tree.h (std_attribute_table): Declare.
* c-decl.cc (std_attribute_table): Change type to
scoped_attribute_specs, using...
(std_attributes): ...this as the underlying array.
(c_init_decl_processing): Remove call to register_scoped_attributes.
* c-objc-common.h (c_objc_attribute_table): New global.
(LANG_HOOKS_ATTRIBUTE_TABLE): Use it.
(LANG_HOOKS_COMMON_ATTRIBUTE_TABLE): Delete.
(LANG_HOOKS_FORMAT_ATTRIBUTE_TABLE): Delete.

gcc/cp/
* cp-tree.h (cxx_attribute_table): Delete.
(cxx_gnu_attribute_table, std_attribute_table): Declare.
* cp-objcp-common.h (LANG_HOOKS_COMMON_ATTRIBUTE_TABLE): Delete.
(LANG_HOOKS_FORMAT_ATTRIBUTE_TABLE): Delete.
(cp_objcp_attribute_table): New table.
(LANG_HOOKS_ATTRIBUTE_TABLE): Redefine.
* tree.cc (cxx_attribute_table): Replace with...
(cxx_gnu_attributes, cxx_gnu_attribute_table): ...these globals.
(std_attribute_table): Change type to scoped_attribute_specs, using...
(std_attributes): ...this as the underlying array.
(init_tree): Remove call to register_scoped_attributes.

gcc/d/
* d-tree.h (d_langhook_attribute_table): Replace with...
(d_langhook_gnu_attribute_table): ...this.
(d_langhook_common_attribute_table): Change type to
scoped_attribute_specs.
* d-attribs.cc (d_langhook_common_attribute_table): Change type to
scoped_attribute_specs, using...
(d_langhook_common_attributes): ...this as the underlying array.
(d_langhook_attribute_table): Replace with...
(d_langhook_gnu_attributes, d_langhook_gnu_attribute_table): ...these
new globals.
(uda_attribute_p): Update accordingly, and update for new
targetm.attribute_table type.
* d-lang.cc (d_langhook_attribute_table): New global.
(LANG_HOOKS_COMMON_ATTRIBUTE_TABLE): Delete.

gcc/fortran/
* f95-lang.cc: Include attribs.h.
(gfc_attribute_table): Change to an array of scoped_attribute_specs
pointers, using...
(gfc_gnu_attributes, gfc_gnu_attribute_table): ...these new globals.

gcc/jit/
* dummy-frontend.cc (jit_format_attribute_table): Change type to
scoped_attribute_specs, using...
(jit_format_attributes): ...this as the underlying array.
(jit_attribute_table): Change to an array of scoped_attribute_specs
pointers, using...
(jit_gnu_attributes, jit_gnu_attribute_table): ...these new globals
for the original array.  Include the format attributes.
(LANG_HOOKS_COMMON_ATTRIBUTE_TABLE): Delete.
(LANG_HOOKS_FORMAT_ATTRIBUTE_TABLE): Delete.
(LANG_HOOKS_ATTRIBUTE_TABLE): Define.

gcc/lto/
* lto-lang.cc (lto_format_attribute_table): Change type to
scoped_attribute_specs, using...
(lto_format_attributes): ...this as the underlying array.
(lto_attribute_table): Change to an array of scoped_attribute_specs
pointers, using...
(lto_gnu_attributes, lto_gnu_attribute_table): ...these new globals
for the original array.  Include the format attributes.
(LANG_HOOKS_COMMON_ATTRIBUTE_TABLE): Delete.
(LANG_HOOKS_FORMAT_ATTRIBUTE_TABLE): Delete.
(LANG_HOOKS_ATTRIBUTE_TABLE): Define.

commit | commitdiff | tree

Roger Sayle [Sat, 2 Dec 2023 11:15:14 +0000 (11:15 +0000)]

RISC-V: Improve style to work around PR 60994 in host compiler.

This simple patch allows me to build a cross-compiler to riscv using
older versions of RedHat's system compiler.  The issue is PR c++/60994
where g++ doesn't like the same name (demand_flags) to be used by both
a variable and a (enumeration) type, which is also undesirable from a
(GNU) coding style perspective.  One solution is to rename the type
to demand_flags_t, but a less invasive change is to simply use another
identifier for the problematic local variable, renaming demand_flags
to dflags.

2023-12-02  Roger Sayle  <roger@nextmovesoftware.com>

gcc/ChangeLog
* config/riscv/riscv-vsetvl.cc (csetvl_info::parse_insn): Rename
local variable from demand_flags to dflags, to avoid conflicting
with (enumeration) type of the same name.

commit | commitdiff | tree

Francois-Xavier Coudert [Sat, 2 Dec 2023 09:02:10 +0000 (10:02 +0100)]

Testsuite, Darwin: skip -mcmodel=large test

-mcmodel=large is not supported (yet) on Darwin [PR90698]

gcc/testsuite/ChangeLog:

* gcc.target/i386/libcall-1.c: Skip on darwin.

commit | commitdiff | tree

Li Wei [Tue, 28 Nov 2023 07:39:00 +0000 (15:39 +0800)]

LoongArch: Optimize vector constant extract-{even/odd} permutation.

For vector constant extract-{even/odd} permutation replace the default
[x]vshuf instruction combination with [x]vilv{l/h} instruction, which
can reduce instructions and improves performance.

gcc/ChangeLog:

* config/loongarch/loongarch.cc (loongarch_is_odd_extraction):
Supplementary function prototype.
(loongarch_is_even_extraction): Adjust.
(loongarch_try_expand_lsx_vshuf_const): Adjust.
(loongarch_is_extraction_permutation): Adjust.
(loongarch_expand_vec_perm_const_2): Adjust.

gcc/testsuite/ChangeLog:

* gcc.target/loongarch/lasx-extract-even_odd-opt.c: New test.

commit | commitdiff | tree

Li Wei [Tue, 28 Nov 2023 07:38:37 +0000 (15:38 +0800)]

LoongArch: Accelerate optimization of scalar signed/unsigned popcount.

In LoongArch, the vector popcount has corresponding instructions, while
the scalar does not. Currently, the scalar popcount is calculated
through a loop, and the value of a non-power of two needs to be iterated
several times, so the vector popcount instruction is considered for
optimization.

gcc/ChangeLog:

* config/loongarch/loongarch.md (v2di): Used to simplify the
following templates.
(popcount<mode>2): New.

gcc/testsuite/ChangeLog:

* gcc.target/loongarch/popcnt.c: New test.
* gcc.target/loongarch/popcount.c: New test.

commit | commitdiff | tree

chenxiaolong [Tue, 28 Nov 2023 08:23:53 +0000 (16:23 +0800)]

LoongArch: Added vectorized hardware inspection for testsuite.

When GCC regression tests are executed on a cpu that does not support
vectorization, the loongarch/vector directory will have some FAIL entries for
all test cases related to vectorization runs. In order to solve this kind
of problem, a vectorized hardware detection function was added to the code,
which can only be compiled but not run.

gcc/testsuite/ChangeLog:

* gcc.target/loongarch/vector/lasx/lasx-xvabsd-1.c:Remove
the default Settings to run the behavior.
* gcc.target/loongarch/vector/lasx/lasx-xvabsd-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvadd.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvadda.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvaddi.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvaddwev-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvaddwev-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvaddwev-3.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvaddwod-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvaddwod-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvaddwod-3.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvand.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvandi.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvandn.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvavg-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvavg-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvavgr-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvavgr-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvbitclr.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvbitclri.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvbitrev.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvbitrevi.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvbitsel.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvbitseli.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvbitset.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvbitseti.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvbsll_v.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvbsrl_v.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvclo.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvclz.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvdiv-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvdiv-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvext2xv-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvext2xv-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvexth-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvexth-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvextl-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvextl-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvextrins.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfadd_d.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfadd_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfclass_d.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfclass_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfcmp_caf_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfcmp_ceq_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfcmp_cle_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfcmp_clt_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfcmp_cne_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfcmp_cor_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfcmp_cun_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfcmp_saf_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfcmp_seq_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfcmp_sle_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfcmp_slt_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfcmp_sne_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfcmp_sor_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfcmp_sun_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfcvt.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfcvth.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvffint-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvffint-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvffinth.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvflogb_d.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvflogb_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfmadd_d.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfmadd_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfmax_d.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfmax_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfmaxa_d.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfmaxa_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfnmadd_d.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfnmadd_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfrint_d.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfrint_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfrstp.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfrstpi.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfsqrt_d.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvfsqrt_s.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvftint-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvftint-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvftint-3.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvftintl.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvhaddw-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvhaddw-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvhsubw-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvhsubw-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvilvh.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvilvl.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvinsgr2vr.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvinsve0.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvld.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvldi.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmadd.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmaddwev-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmaddwev-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmaddwev-3.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmaddwod-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmaddwod-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmaddwod-3.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmax-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmax-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmaxi-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmaxi-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmin-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmin-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmini-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmini-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmod-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmod-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmskgez.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmskltz.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmsknz.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmsub.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmuh-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmuh-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmul.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmulwev-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmulwev-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmulwev-3.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmulwod-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmulwod-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvmulwod-3.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvneg.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvnor.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvnori.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvor.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvori.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvorn.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvpackev.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvpackod.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvpcnt.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvpickev.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvpickod.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvpickve.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvpickve2gr.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvprem.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvpremi.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvreplgr2vr.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvreplve.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvreplve0.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvreplvei.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvrotr.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvrotri.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsadd-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsadd-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsat-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsat-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvseq.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvseqi.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvshuf4i_b.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvshuf_b.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsigncov.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsle-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsle-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvslei-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvslei-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsll.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvslli.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsllwil-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsllwil-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvslt-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvslt-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvslti-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvslti-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsra.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsrai.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsran.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsrani.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsrar.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsrari.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsrarn.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsrarni.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsrl.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsrli.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsrln.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsrlni.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsrlr.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsrlri.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsrlrn.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsrlrni.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvssran.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvssrani.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvssrarn.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvssrarni.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvssrln.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvssrlni.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvssrlrn.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvssrlrni.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvssub-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvssub-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvst.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsub.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsubi.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsubwev-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsubwev-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsubwod-1.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvsubwod-2.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvxor.c:Dito.
* gcc.target/loongarch/vector/lasx/lasx-xvxori.c:Dito.
* gcc.target/loongarch/vector/loongarch-vector.exp:Added hardware
detection to set the behavior of program execution based on the
characteristics of the hardware.
* gcc.target/loongarch/vector/lsx/lsx-vabsd-1.c:Remove the default
Settings to run the behavior.
* gcc.target/loongarch/vector/lsx/lsx-vabsd-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vadd.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vadda.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vaddi.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vaddwev-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vaddwev-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vaddwev-3.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vaddwod-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vaddwod-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vaddwod-3.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vand.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vandi.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vandn.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vavg-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vavg-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vavgr-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vavgr-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vbitclr.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vbitclri.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vbitrev.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vbitrevi.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vbitsel.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vbitseli.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vbitset.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vbitseti.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vbsll.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vbsrl.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vclo.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vclz.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vdiv-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vdiv-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vexth-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vexth-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vextl-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vextl-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vextrins.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfadd_d.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfadd_s.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfclass_d.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfclass_s.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfcmp_caf.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfcmp_ceq.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfcmp_cle.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfcmp_clt.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfcmp_cne.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfcmp_cor.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfcmp_cun.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfcmp_saf.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfcmp_seq.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfcmp_sle.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfcmp_slt.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfcmp_sne.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfcmp_sor.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfcmp_sun.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfcvt-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfcvt-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vffint-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vffint-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vffint-3.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vflogb_d.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vflogb_s.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfmadd_d.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfmadd_s.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfmax_d.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfmax_s.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfmaxa_d.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfmaxa_s.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfnmadd_d.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfnmadd_s.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfrint_d.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfrint_s.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfrstp.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfrstpi.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfsqrt_d.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vfsqrt_s.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vftint-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vftint-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vftint-3.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vftint-4.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vhaddw-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vhaddw-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vhsubw-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vhsubw-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vilvh.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vilvl.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vinsgr2vr.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vld.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vldi.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmadd.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmaddwev-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmaddwev-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmaddwev-3.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmaddwod-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmaddwod-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmaddwod-3.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmax-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmax-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmaxi-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmaxi-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmin-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmin-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmini-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmini-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmod-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmod-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmskgez.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmskltz.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmsknz.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmsub.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmuh-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmuh-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmul.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmulwev-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmulwev-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmulwev-3.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmulwod-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmulwod-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vmulwod-3.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vneg.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vnor.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vnori.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vor.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vori.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vorn.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vpackev.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vpackod.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vpcnt.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vpickev.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vpickod.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vpickve2gr.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vpremi.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vreplgr2vr.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vreplve.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vreplvei.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vrotr.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vrotri.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsadd-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsadd-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsat-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsat-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vseq.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vseqi.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vshuf.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vshuf4i.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsigncov.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsle-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsle-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vslei-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vslei-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsll.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vslli.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsllwil-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsllwil-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vslt-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vslt-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vslti-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vslti-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsra.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsrai.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsran.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsrani.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsrar.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsrari.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsrarn.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsrarni.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsrl.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsrli.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsrln.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsrlni.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsrlr.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsrlri.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsrlrn.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsrlrni.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vssran.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vssrani.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vssrarn.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vssrarni.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vssrln.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vssrlni.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vssrlrn.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vssrlrni.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vssub-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vssub-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vst.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsub.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsubi.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsubwev-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsubwev-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsubwod-1.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vsubwod-2.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vxor.c:Dito.
* gcc.target/loongarch/vector/lsx/lsx-vxori.c:Dito.

commit | commitdiff | tree

Li Wei [Tue, 28 Nov 2023 07:56:35 +0000 (15:56 +0800)]

LoongArch: Remove duplicate definition of CLZ_DEFINED_VALUE_AT_ZERO.

In the r14-5547 commit, C[LT]Z_DEFINED_VALUE_AT_ZERO were defined at
the same time, but in fact, CLZ_DEFINED_VALUE_AT_ZERO has already been
defined, so remove the duplicate definition.

gcc/ChangeLog:

* config/loongarch/loongarch.h (CTZ_DEFINED_VALUE_AT_ZERO): Add
description.
(CLZ_DEFINED_VALUE_AT_ZERO): Remove duplicate definition.

commit | commitdiff | tree

Juzhe-Zhong [Fri, 1 Dec 2023 12:31:50 +0000 (20:31 +0800)]

RISC-V: Fix incorrect combine of extended scalar pattern

Background:
RVV ISA vx instructions for example vadd.vx,
When EEW = 64 and RV32. We can't directly use vadd.vx.
Instead, we need to use:

sw
sw
vlse
vadd.vv

However, we have some special situation that we still can directly use
vadd.vx directly for EEW=64 && RV32.

that is, when scalar is a known CONST_INT value that doesn't overflow 32-bit value.
So, we have a dedicated pattern for such situation:

...
(sign_extend:<VEL> (match_operand:<VSUBEL> 3 "register_operand"          " r,  r,  r,  r")).
...

We first force_reg such CONST_INT (within 32bit value) into a SImode reg.
Then use such special patterns.
Those pattern with this operand match should only value on! TARGET_64BIT.

The PR112801 combine into such patterns on RV64 incorrectly (Those patterns should be only value on RV32).

This is the bug:

        andi    a2,a2,2
        vsetivli        zero,2,e64,m1,ta,ma
        sext.w  a3,a4
        vmv.v.x v1,a2
        vslide1down.vx  v1,v1,a4    -> it should be a3 instead of a4.

Such incorrect codegen is caused by
...
                (sign_extend:DI (subreg:SI (reg:DI 135 [ f.0_3 ]) 0))
            ] UNSPEC_VSLIDE1DOWN)) 16935 {*pred_slide1downv2di_extended}
...

Incorretly combine into the patterns should not be valid on RV64 system.

So add !TARGET_64BIT to all same type patterns which can fix such issue as well as robostify the vector.md.

PR target/112801

gcc/ChangeLog:

* config/riscv/vector.md: Add !TARGET_64BIT.

gcc/testsuite/ChangeLog:

* gcc.target/riscv/rvv/autovec/pr112801.c: New test.

commit | commitdiff | tree

Pan Li [Thu, 30 Nov 2023 07:08:50 +0000 (15:08 +0800)]

RISC-V: Bugfix for legitimize move when get vec mode in zve32f

If we want to extract 64bit value but ELEN < 64, we use RVV
vector mode with EEW = 32 to extract the highpart and lowpart.
However, this approach doesn't honor DFmode when movdf pattern
when ZVE32f and of course results in ICE when zve32f.

This patch would like to reuse the approach with some additional
handing, consider lowpart bits is meaningless for FP mode, we need
one int reg as bridge here. For example:

rtx tmp = gen_rtx_reg (DImode)
reg:DI = reg:DF (fmv.d.x) // Move DF reg to DI
...
perform the extract for high and low parts
...
reg:DF = reg:DI (fmv.x.d) // Move DI reg back to DF after all done

PR target/112743

gcc/ChangeLog:

* config/riscv/riscv.cc (riscv_legitimize_move): Take the
exist (U *mode) and handle DFmode like DImode when EEW is
32bits for ZVE32F.

gcc/testsuite/ChangeLog:

* gcc.target/riscv/rvv/base/pr112743-2.c: New test.

Signed-off-by: Pan Li <pan2.li@intel.com>

commit | commitdiff | tree

GCC Administrator [Sat, 2 Dec 2023 00:16:54 +0000 (00:16 +0000)]

Daily bump.

commit | commitdiff | tree

Harald Anlauf [Thu, 30 Nov 2023 20:53:21 +0000 (21:53 +0100)]

Fortran: copy-out for possibly missing OPTIONAL CLASS arguments [PR112772]

gcc/fortran/ChangeLog:

PR fortran/112772
* trans-expr.cc (gfc_conv_class_to_class): Make copy-out conditional
on the presence of an OPTIONAL CLASS argument passed to an OPTIONAL
CLASS dummy.

gcc/testsuite/ChangeLog:

PR fortran/112772
* gfortran.dg/missing_optional_dummy_7.f90: New test.

commit | commitdiff | tree

Jason Merrill [Sat, 18 Nov 2023 02:57:52 +0000 (21:57 -0500)]

c++: mangling for CTAD placeholder

Per https://github.com/itanium-cxx-abi/cxx-abi/issues/109 mangle a C++17
CTAD placeholder as its template.

gcc/cp/ChangeLog:

* mangle.cc (write_type): Mangle placeholder as its template.

gcc/testsuite/ChangeLog:

* g++.dg/cpp2a/nontype-class4.C: Specify ABI v18.
* g++.dg/cpp2a/nontype-class4a.C: New test.

commit | commitdiff | tree

Jason Merrill [Mon, 25 Sep 2023 09:15:02 +0000 (10:15 +0100)]

c++: mangle function template constraints

Per https://github.com/itanium-cxx-abi/cxx-abi/issues/24 and
https://github.com/itanium-cxx-abi/cxx-abi/pull/166

We need to mangle constraints to be able to distinguish between function
templates that only differ in constraints.  From the latter link, we want to
use the template parameter mangling previously specified for lambdas to also
make explicit the form of a template parameter where the argument is not a
"natural" fit for it, such as when the parameter is constrained or deduced.

I'm concerned about how the latter link changes the mangling for some C++98
and C++11 patterns, so I've limited template_parm_natural_p to avoid two
cases found by running the testsuite with -Wabi forced on:

template <class T, T V> T f() { return V; }
int main() { return f<int,42>(); }

template <int i> int max() { return i; }
template <int i, int j, int... rest> int max()
{
  int sub = max<j, rest...>();
  return i > sub ? i : sub;
}
int main() {  return max<1,2,3>(); }

A third C++11 pattern is changed by this patch:

template <template <typename...> class TT, typename... Ts> TT<Ts...> f();
template <typename> struct A { };
int main() { f<A,int>(); }

I aim to resolve these with the ABI committee before GCC 14.1.

We also need to resolve https://github.com/itanium-cxx-abi/cxx-abi/issues/38
(mangling references to dependent template-ids where the name is fully
resolved) as references to concepts in std:: will consistently run into this
area.  This is why mangle-concepts1.C only refers to concepts in the global
namespace so far.

The library changes are to avoid trying to mangle builtins, which fails.

Demangler support and test coverage is not complete yet.

gcc/cp/ChangeLog:

* cp-tree.h (TEMPLATE_ARGS_TYPE_CONSTRAINT_P): New.
(get_concept_check_template): Declare.
* constraint.cc (combine_constraint_expressions)
(finish_shorthand_constraint): Use UNKNOWN_LOCATION.
* pt.cc (convert_generic_types_to_packs): Likewise.
* mangle.cc (write_constraint_expression)
(write_tparms_constraints, write_type_constraint)
(template_parm_natural_p, write_requirement)
(write_requires_expr): New.
(write_encoding): Mangle trailing requires-clause.
(write_name): Pass parms to write_template_args.
(write_template_param_decl): Factor out from...
(write_closure_template_head): ...here.
(write_template_args): Mangle non-natural parms
and requires-clause.
(write_expression): Handle REQUIRES_EXPR.

include/ChangeLog:

* demangle.h (enum demangle_component_type): Add
DEMANGLE_COMPONENT_CONSTRAINTS.

libiberty/ChangeLog:

* cp-demangle.c (d_make_comp): Handle
DEMANGLE_COMPONENT_CONSTRAINTS.
(d_count_templates_scopes): Likewise.
(d_print_comp_inner): Likewise.
(d_maybe_constraints): New.
(d_encoding, d_template_args_1): Call it.
(d_parmlist): Handle 'Q'.
* testsuite/demangle-expected: Add some constraint tests.

libstdc++-v3/ChangeLog:

* include/std/bit: Avoid builtins in requires-clauses.
* include/std/variant: Likewise.

gcc/testsuite/ChangeLog:

* g++.dg/abi/mangle10.C: Disable compat aliases.
* g++.dg/abi/mangle52.C: Specify ABI 18.
* g++.dg/cpp2a/class-deduction-alias3.C
* g++.dg/cpp2a/class-deduction-alias8.C:
Avoid builtins in requires-clauses.
* g++.dg/abi/mangle-concepts1.C: New test.
* g++.dg/abi/mangle-ttp1.C: New test.

commit | commitdiff | tree

Andrew MacLeod [Fri, 1 Dec 2023 16:15:33 +0000 (11:15 -0500)]

Use range_compatible_p in check_operands_p.

Instead of directly checking type precision, check_operands_p should
invoke range_compatible_p to keep the range checking centralized.

* gimple-range-fold.h (range_compatible_p): Relocate.
* value-range.h (range_compatible_p): Here.
* range-op-mixed.h (operand_equal::operand_check_p): Call
range_compatible_p rather than comparing precision.
(operand_not_equal::operand_check_p): Ditto.
(operand_not_lt::operand_check_p): Ditto.
(operand_not_le::operand_check_p): Ditto.
(operand_not_gt::operand_check_p): Ditto.
(operand_not_ge::operand_check_p): Ditto.
(operand_plus::operand_check_p): Ditto.
(operand_abs::operand_check_p): Ditto.
(operand_minus::operand_check_p): Ditto.
(operand_negate::operand_check_p): Ditto.
(operand_mult::operand_check_p): Ditto.
(operand_bitwise_not::operand_check_p): Ditto.
(operand_bitwise_xor::operand_check_p): Ditto.
(operand_bitwise_and::operand_check_p): Ditto.
(operand_bitwise_or::operand_check_p): Ditto.
(operand_min::operand_check_p): Ditto.
(operand_max::operand_check_p): Ditto.
* range-op.cc (operand_lshift::operand_check_p): Ditto.
(operand_rshift::operand_check_p): Ditto.
(operand_logical_and::operand_check_p): Ditto.
(operand_logical_or::operand_check_p): Ditto.
(operand_logical_not::operand_check_p): Ditto.

commit | commitdiff | tree

Alexandre Oliva [Fri, 1 Dec 2023 17:31:22 +0000 (14:31 -0300)]

hardcfr: make builtin_return tests more portable [PR112334]

Rework __builtin_return tests to explicitly call __builtin_apply and
use its return value rather than anything else.  Also require
untyped_assembly.  Avoid the noise out of exceptions escaping the
builtin-applied function, but add a test to cover their effects as
well.

for  gcc/testsuite/ChangeLog

PR target/112334
* c-c++-common/torture/harden-cfr-bret.c: Rework for stricter
untyped_return requirements.  Require untyped_assembly.
* c-c++-common/torture/harden-cfr-bret-except.c: New.
* c-c++-common/torture/harden-cfr-bret-always.c: Require
untyped_assembly.
* c-c++-common/torture/harden-cfr-bret-never.c: Likewise.
* c-c++-common/torture/harden-cfr-bret-noopt.c: Likewise.
* c-c++-common/torture/harden-cfr-bret-noret.c: Likewise.
* c-c++-common/torture/harden-cfr-bret-no-xthrow.c: Likewise.
* c-c++-common/torture/harden-cfr-bret-nothrow.c: Likewise.
* c-c++-common/torture/harden-cfr-bret-retcl.c: Likewise.

commit | commitdiff | tree

Alexandre Oliva [Fri, 1 Dec 2023 17:31:12 +0000 (14:31 -0300)]

hardcfr: libgcc sym versioning

The libgcc-exported runtime component of control flow redundancy
hardening was missing symbol versioning information. Add it.

for libgcc/ChangeLog

* libgcc-std.ver.in (__hardcfr_check): Add to GCC_14.0.0.

commit | commitdiff | tree

Vladimir N. Makarov [Fri, 1 Dec 2023 16:46:37 +0000 (11:46 -0500)]

[PR112445][LRA]: Fix "unable to find a register to spill" error

PR112445 is a very complicated bug occurring from interaction of
constraint subpass, inheritance, and hard reg live range splitting.
It is hard to debug this PR only from LRA standard logs.  Therefore I
added dumping all func insns at the end of complicated sub-passes
(constraint, inheritance, undoing inheritance, hard reg live range
splitting, and rematerialization).  As such output can be quite big,
it is switched only one level 7 of -fira-verbose value.  The reason
for the bug is a skip of live-range splitting of hard reg (dx) on the
1st live range splitting subpass.  Splitting is done for reload
pseudos around an original insn and its reload insns but the subpass
did not recognize such insn pattern because previous inheritance and
undoing inheritance subpasses extended a bit reload pseudo live range.
Although we undid inheritance in question, the result code was a bit
different from a code before the corresponding inheritance pass.  The
following fixes the bug by restoring exact code before the
inheritance.

gcc/ChangeLog:

PR target/112445
* lra.h (lra): Add one more arg.
* lra-int.h (lra_verbose, lra_dump_insns): New externals.
(lra_dump_insns_if_possible): Ditto.
* lra.cc (lra_dump_insns): Dump all insns.
(lra_dump_insns_if_possible):  Dump all insns for lra_verbose >= 7.
(lra_verbose): New global.
(lra): Add new arg.  Setup lra_verbose from its value.
* lra-assigns.cc (lra_split_hard_reg_for): Dump insns if rtl
was changed.
* lra-remat.cc (lra_remat): Dump insns if rtl was changed.
* lra-constraints.cc (lra_inheritance): Dump insns.
(lra_constraints, lra_undo_inheritance): Dump insns if rtl
was changed.
(remove_inheritance_pseudos): Use restore reg if it is set up.
* ira.cc: (lra): Pass internal_flag_ira_verbose.

gcc/testsuite/ChangeLog:

PR target/112445
* gcc.target/i386/pr112445.c: New test.

commit | commitdiff | tree

Jakub Jelinek [Fri, 1 Dec 2023 15:57:30 +0000 (16:57 +0100)]

extend.texi: Fix up defbuiltin* with spaces in return type

In https://gcc.gnu.org/onlinedocs/gcc/Other-Builtins.html#index-_005f_005fbuiltin_005fstdc_005fbit_005ffloor
I've noticed that while e.g. __builtin_stdc_bit_floor builtin is properly
rendered in bold and bigger size, for the __builtin_stdc_bit_width builtin
it is not the builtin name which is marked like that, but the keyword int
before it. Also, seems such builtins are missing from the index.

I've read the texinfo docs and they seem to suggest in
https://www.gnu.org/software/texinfo/manual/texinfo/html_node/Line-Macros.html
that return types of functions with spaces in the return type should be
wrapped with {}s and we already use that e.g. in
@defbuiltin{{void *} __builtin_thread_pointer (void)}

The following patch adjusts builtins I found which contained one or two
spaces in the return type name (plus two spots which used 2 spaces after
single keyword return type instead of 1 which triggered my search regex as
well).

2023-12-01 Jakub Jelinek <jakub@redhat.com>

* doc/extend.texi (__builtin_addc, __builtin_addcl, __builtin_addcll,
__builtin_subc, __builtin_subcl, __builtin_subcll,
__builtin_stdc_bit_width, __builtin_stdc_count_ones,
__builtin_stdc_count_zeros, __builtin_stdc_first_leading_one,
__builtin_stdc_first_leading_zero, __builtin_stdc_first_trailing_one,
__builtin_stdc_first_trailing_zero, __builtin_stdc_has_single_bit,
__builtin_stdc_leading_ones, __builtin_stdc_leading_zeros,
__builtin_stdc_trailing_ones, __builtin_stdc_trailing_zeros,
__builtin_nvptx_brev, __builtin_nvptx_brevll, __builtin_darn,
__builtin_darn_raw, __builtin_ia32_vec_ext_v2di,
__builtin_ia32_crc32qi, __builtin_ia32_crc32hi,
__builtin_ia32_crc32si, __builtin_ia32_crc32di): Put {}s around
return type with spaces in it.
(__builtin_rx_mvfachi, __builtin_rx_mvfacmi): Remove superfluous
whitespace.

commit | commitdiff | tree

Rainer Orth [Fri, 1 Dec 2023 15:47:28 +0000 (16:47 +0100)]

ada: Fix Ada bootstrap on macOS

The recent warning changes broke Ada bootstrap on macOS:

adaint.c: In function '__gnat_copy_attribs':
adaint.c:3336:10: error: implicit declaration of function 'utimes'; did you
mean 'utime'? [-Wimplicit-function-declaration]
3336 |      if (utimes (to, tbuf) == -1) {
      |          ^~~~~~
      |          utime
adaint.c: In function '__gnat_kill':
adaint.c:3597:3: error: implicit declaration of function 'kill'
[-Wimplicit-function-declaration]
3597 |   kill (pid, sig);
      |   ^~~~
terminals.c: In function 'allocate_pty_desc':
terminals.c:1196:12: error: implicit declaration of function 'openpty'; did
you mean 'openat'? [-Wimplicit-function-declaration]
1196 |   status = openpty (&master_fd, &slave_fd, NULL, NULL, NULL);
      |            ^~~~~~~
      |            openat
terminals.c: In function '__gnat_setup_winsize':
terminals.c:1392:6: error: implicit declaration of function 'kill'
[-Wimplicit-function-declaration]
1392 |      kill (desc->child_pid, SIGWINCH);
      |      ^~~~

This patch fixes this by including the necessary headers: <sys/time.h>
for utimes, <signal.h> for kill, and <util.h> for openpty.  With those
changes, the build completed on x86_64-apple-darwin2[0-3] (make check
still running).

2023-12-01  Rainer Orth  <ro@CeBiTec.Uni-Bielefeld.DE>

gcc/ada:
* adaint.c [__APPLE__]: Include <signal.h>, <sys/time.h>.
* terminals.c [!_WIN32]: Include <signal.h>.
[__APPLE__]: Include <util.h>.
Fix typos.

commit | commitdiff | tree

David Malcolm [Fri, 1 Dec 2023 13:47:41 +0000 (08:47 -0500)]

diagnostics, analyzer: add optional per-diagnostic property bags to SARIF

I've found it useful in debugging the analyzer for the SARIF output to
contain extra analyzer-specific data in each diagnostic.

This patch:
* adds a way for a diagnostic_metadata to populate a property
bag within a SARIF "result" object based on a new vfunc
* reworks how diagnostics are emitted within the analyzer so
that a custom diagnostic_metadata subclass is used, which populates
the property bag with information from the saved_diagnostic, and with
a vfunc hook allowing for per-pending_diagnotic-subclass extra
properties.

Doing so makes it trivial to go from the SARIF output back to
pertinent parts of the analyzer's internals (e.g. the index of
the diagnostic within the ana::diagnostic_manager, the index of
the ana::exploded_node, etc).

It also replaces a lot of boilerplate in the "emit" implementations
in the various pending_diagnostics subclasses. In particular, doing
so fixes missing CVE metadata for -Wanalyzer-fd-phase-mismatch (where
sm-fd.cc's fd_phase_mismatch::emit was failing to use its
diagnostic_metadata instance).

gcc/analyzer/ChangeLog:
* analyzer.h (class saved_diagnostic): New forward decl.
* bounds-checking.cc: Update for changes to
pending_diagnostic::emit.
* call-details.cc: Likewise.
* diagnostic-manager.cc: Include "diagnostic-format-sarif.h".
(saved_diagnostic::maybe_add_sarif_properties): New.
(class pending_diagnostic_metadata): New.
(diagnostic_manager::emit_saved_diagnostic): Create a
pending_diagnostic_metadata and a diagnostic_emission_context.
Pass the latter to the pending_diagnostic::emit vfunc.
* diagnostic-manager.h
(saved_diagnostic::maybe_add_sarif_properties): New decl.
* engine.cc: Update for changes to pending_diagnostic::emit.
* infinite-loop.cc: Likewise.
* infinite-recursion.cc: Likewise.
* kf-analyzer.cc: Likewise.
* kf.cc: Likewise.
* pending-diagnostic.cc
(diagnostic_emission_context::get_pending_diagnostic): New.
(diagnostic_emission_context::warn): New.
(diagnostic_emission_context::inform): New.
* pending-diagnostic.h (class diagnostic_emission_context): New.
(pending_diagnostic::emit): Update params.
(pending_diagnostic::maybe_add_sarif_properties): New vfunc.
* region.cc: Don't include "diagnostic-metadata.h".
* region-model.cc: Include "diagnostic-format-sarif.h". Update
for changes to pending_diagnostic::emit.
(exposure_through_uninit_copy::maybe_add_sarif_properties): New.
* sm-fd.cc: Update for changes to pending_diagnostic::emit.
* sm-file.cc: Likewise.
* sm-malloc.cc: Likewise.
* sm-pattern-test.cc: Likewise.
* sm-sensitive.cc: Likewise.
* sm-signal.cc: Likewise.
* sm-taint.cc: Likewise.
* store.cc: Don't include "diagnostic-metadata.h".
* varargs.cc: Update for changes to pending_diagnostic::emit.

gcc/ChangeLog:
* diagnostic-core.h (emit_diagnostic_valist): New overload decl.
* diagnostic-format-sarif.cc (sarif_builder::make_result_object):
When we have metadata, call its maybe_add_sarif_properties vfunc.
* diagnostic-metadata.h (class sarif_object): Forward decl.
(diagnostic_metadata::~diagnostic_metadata): New.
(diagnostic_metadata::maybe_add_sarif_properties): New vfunc.
* diagnostic.cc (emit_diagnostic_valist): New overload.

gcc/testsuite/ChangeLog:
* gcc.dg/analyzer/fd-accept.c: Update for fix to missing CWE
metadata for -Wanalyzer-fd-phase-mismatch.
* gcc.dg/analyzer/fd-bind.c: Likewise.
* gcc.dg/analyzer/fd-socket-misuse.c: Likewise.
* gcc.dg/plugin/analyzer_cpython_plugin.c: Update for changes to
pending_diagnostic::emit.
* gcc.dg/plugin/analyzer_gil_plugin.c: Likewise.

Signed-off-by: David Malcolm <dmalcolm@redhat.com>

commit | commitdiff | tree

David Malcolm [Fri, 1 Dec 2023 13:47:41 +0000 (08:47 -0500)]

docs: remove stray reference to -fanalyzer-checker=taint [PR103533]

I missed this one in r14-5464-gcfaaa8b11b8429.

gcc/ChangeLog:
PR analyzer/103533
* doc/extend.texi: Remove stray reference to
-fanalyzer-checker=taint.

Signed-off-by: David Malcolm <dmalcolm@redhat.com>

commit | commitdiff | tree

Juzhe-Zhong [Fri, 1 Dec 2023 07:00:27 +0000 (15:00 +0800)]

RISC-V: Support highpart register overlap for widen vx/vf instructions

This patch leverages the same approach as vwcvt.

Before this patch:

.L5:
        add     a3,s0,s1
        add     a4,s6,s1
        add     a5,s7,s1
        vsetvli zero,s0,e32,m4,ta,ma
        vle32.v v16,0(s1)
        vle32.v v12,0(a3)
        mv      s1,s2
        vle32.v v8,0(a4)
        vle32.v v4,0(a5)
        nop
        vfwadd.vf       v24,v16,fs0
        vfwadd.vf       v16,v12,fs0
        vs8r.v  v16,0(sp)                -----> spill
        vfwadd.vf       v16,v8,fs0
        vfwadd.vf       v8,v4,fs0
        nop
        vsetvli zero,zero,e64,m8,ta,ma
        vfmv.f.s        fa4,v24
        vl8re64.v       v24,0(sp)       -----> reload
        vfmv.f.s        fa5,v24
        fcvt.lu.d a0,fa4,rtz
        fcvt.lu.d a1,fa5,rtz
        vfmv.f.s        fa4,v16
        vfmv.f.s        fa5,v8
        fcvt.lu.d a2,fa4,rtz
        fcvt.lu.d a3,fa5,rtz
        add     s2,s2,s5
        call    sumation
        add     s3,s3,a0
        bgeu    s4,s2,.L5

After this patch:

.L5:
add a3,s0,s1
add a4,s6,s1
add a5,s7,s1
vsetvli zero,s0,e32,m4,ta,ma
vle32.v v4,0(s1)
vle32.v v28,0(a3)
mv s1,s2
vle32.v v20,0(a4)
vle32.v v12,0(a5)
vfwadd.vf v0,v4,fs0
vfwadd.vf v24,v28,fs0
vfwadd.vf v16,v20,fs0
vfwadd.vf v8,v12,fs0
vsetvli zero,zero,e64,m8,ta,ma
vfmv.f.s fa4,v0
vfmv.f.s fa5,v24
fcvt.lu.d a0,fa4,rtz
fcvt.lu.d a1,fa5,rtz
vfmv.f.s fa4,v16
vfmv.f.s fa5,v8
fcvt.lu.d a2,fa4,rtz
fcvt.lu.d a3,fa5,rtz
add s2,s2,s5
call sumation
add s3,s3,a0
bgeu s4,s2,.L5

PR target/112431

gcc/ChangeLog:

* config/riscv/vector.md: Support highpart overlap for vx/vf.

gcc/testsuite/ChangeLog:

* gcc.target/riscv/rvv/base/pr112431-22.c: New test.
* gcc.target/riscv/rvv/base/pr112431-23.c: New test.
* gcc.target/riscv/rvv/base/pr112431-24.c: New test.
* gcc.target/riscv/rvv/base/pr112431-25.c: New test.
* gcc.target/riscv/rvv/base/pr112431-26.c: New test.
* gcc.target/riscv/rvv/base/pr112431-27.c: New test.

commit | commitdiff | tree

Juzhe-Zhong [Fri, 1 Dec 2023 08:09:59 +0000 (16:09 +0800)]

RISC-V: Support highpart overlap for indexed load with SRC EEW < DEST EEW

Leverage previous approach.

Before this patch:

.L5:
        add     a3,s0,s2
        add     a4,s6,s2
        add     a5,s7,s2
        vsetvli zero,s0,e64,m8,ta,ma
        vle8.v  v4,0(s2)
        vle8.v  v3,0(a3)
        mv      s2,s1
        vle8.v  v2,0(a4)
        vle8.v  v1,0(a5)
        nop
        vluxei8.v       v8,(s1),v4
        vs8r.v  v8,0(sp)              ---> spill
        vluxei8.v       v8,(s1),v3
        vluxei8.v       v16,(s1),v2
        vluxei8.v       v24,(s1),v1
        nop
        vmv.x.s a1,v8
        vl8re64.v       v8,0(sp)     ---> reload
        vmv.x.s a3,v24
        vmv.x.s a2,v16
        vmv.x.s a0,v8
        add     s1,s1,s5
        call    sumation
        add     s3,s3,a0
        bgeu    s4,s1,.L5

After this patch:

.L5:
add a3,s0,s2
add a4,s6,s2
add a5,s7,s2
vsetvli zero,s0,e64,m8,ta,ma
vle8.v v15,0(s2)
vle8.v v23,0(a3)
mv s2,s1
vle8.v v31,0(a4)
vle8.v v7,0(a5)
vluxei8.v v8,(s1),v15
vluxei8.v v16,(s1),v23
vluxei8.v v24,(s1),v31
vluxei8.v v0,(s1),v7
vmv.x.s a3,v0
vmv.x.s a2,v24
vmv.x.s a1,v16
vmv.x.s a0,v8
add s1,s1,s5
call sumation
add s3,s3,a0
bgeu s4,s1,.L5

PR target/112431

gcc/ChangeLog:

* config/riscv/vector.md: Support highpart overlap for indexed load.

gcc/testsuite/ChangeLog:

* gcc.target/riscv/rvv/base/pr112431-28.c: New test.
* gcc.target/riscv/rvv/base/pr112431-29.c: New test.
* gcc.target/riscv/rvv/base/pr112431-30.c: New test.
* gcc.target/riscv/rvv/base/pr112431-31.c: New test.
* gcc.target/riscv/rvv/base/pr112431-32.c: New test.
* gcc.target/riscv/rvv/base/pr112431-33.c: New test.

commit | commitdiff | tree

Richard Biener [Fri, 1 Dec 2023 10:14:53 +0000 (11:14 +0100)]

Fix ambiguity between vect_get_vec_defs with/without vectype

When querying a single set of vector defs with the overloaded
vect_get_vec_defs API then when you try to use the overload with
the vector type specified the call will be ambiguous with the
variant without the vector type. The following fixes this by
re-ordering the vector type argument to come before the output
def vector argument.

I've changed vectorizable_conversion as that triggered this
so it has coverage showing this works. The motivation is to
reduce the number of (redundant) get_vectype_for_scalar_type
calls.

* tree-vectorizer.h (vect_get_vec_defs): Re-order arguments.
* tree-vect-stmts.cc (vect_get_vec_defs): Likewise.
(vectorizable_condition): Update caller.
(vectorizable_comparison_1): Likewise.
(vectorizable_conversion): Specify the vector type to be
used for invariant/external defs.
* tree-vect-loop.cc (vect_transform_reduction): Update caller.

commit | commitdiff | tree

Jakub Jelinek [Fri, 1 Dec 2023 11:45:58 +0000 (12:45 +0100)]

testsuite: Tweak some further tests for modern C changes

On IRC Richi mentioned some FAILs in gcc.target/x86_64 and in pr83126.c.

The following patch fixes the former ones (they need recent binutils to
be enabled), for pr83126.c because I didn't have graphite configured I've
just verified that the test compiles (didn't without the patch) and that
the gimple dump is identical with one from yesterday's gcc (as it was a
tree-parloops.cc ICE, I guess identical gimple is all we care about
and no need to verify it further).

2023-12-01 Jakub Jelinek <jakub@redhat.com>

* gcc.target/x86_64/abi/avx512fp16/m512h/test_passing_m512.c
(fun_check_passing_m512_8_values, fun_check_passing_m512h_8_values):
Add missing void return type.
* gcc.target/x86_64/abi/avx512fp16/m256h/test_passing_m256.c
(fun_check_passing_m256_8_values, fun_check_passing_m256h_8_values):
Likewise.
* gcc.dg/graphite/pr83126.c (ew): Add missing casts to __INTPTR_TYPE__
and then to int *.

commit | commitdiff | tree

Jakub Jelinek [Fri, 1 Dec 2023 09:56:52 +0000 (10:56 +0100)]

lower-bitint: Fix lowering of middle sized _BitInt operations which can throw [PR112770]

The middle kind _BitInt lowering is mostly done by casting the BITINT_TYPE
operands (if any) to a signed/unsigned integer type which has larger/equal
precision, using such integer type also for the lhs (if BITINT_TYPE) and
and adding a cast after the statement from that new lhs to the old
(BITINT_TYPE) lhs.  Note, for middle kind this isn't done for GIMPLE_CALLs.
Most of the time that works nicely, the exception as the following testcase
shows is -fnon-call-exceptions and some operations which can trap.  Because
inserting the cast to a new lhs after the statement results in a trapping
statement in the middle of a basic block.
The following patch fixes that by emitting the cast on the fallthru edge
instead.

2023-12-01  Jakub Jelinek  <jakub@redhat.com>

PR middle-end/112770
* gimple-lower-bitint.cc (gimple_lower_bitint): When adjusting
lhs of middle _BitInt setter which ends bb, insert cast on
the fallthru edge rather than after stmt.

* gcc.dg/bitint-45.c: New test.

commit | commitdiff | tree

Jakub Jelinek [Fri, 1 Dec 2023 09:55:49 +0000 (10:55 +0100)]

lower-bitint: Fix up handle_operand_addr for 0 constants [PR112771]

handle_operand_addr for INTEGER_CSTs uses wi::min_precision (UNSIGNED
for non-negative constants, SIGNED for negative ones) and from that
computes mp as minimum number of limbs which can represent that value,
and in some cases creates a test BITINT_TYPE with that precision to
categorize it and decide based on that what types to use on the constant
emitted into memory.  For the actual precisions (what will be passed
to libgcc) it actually already uses MAX/MIN to adjust the corner cases:
          *prec = MAX (min_prec, 1);
...
          *prec = MIN ((int) -min_prec, -2);
but for integer_zerop min_prec will be 0,
mp = CEIL (min_prec, limb_prec) * limb_prec;
will be also 0 and we ICE trying to build unsigned BITINT_TYPE with
0 precision.

Fixed thusly by noting even 0 has to be encoded at least as one limb.

2023-12-01  Jakub Jelinek  <jakub@redhat.com>

PR middle-end/112771
* gimple-lower-bitint.cc (bitint_large_huge::handle_operand_addr):
Use mp = 1 if it is zero.

* gcc.dg/bitint-44.c: New test.

commit | commitdiff | tree

Jose E. Marchesi [Fri, 1 Dec 2023 09:38:00 +0000 (10:38 +0100)]

bpf: quote section names whenever necessary in assembly output

In BPF section names are used to encode the kind of BPF program and
other metadata, involving all sort of non alphanumeric characters.

For example:

  /* use auto-attach format for section definition. */
  SEC("uretprobe//proc/self/exe:trigger_func2")
  int handle_uretprobe_byname(struct pt_regs *ctx)
  {
   uretprobe_byname_res = 6;
   return 0;
  }

The above requires to quote the section name in the output assembly
file, since otherwise the // in the name would be interpreted by the
assembler lexer as the beginning of a line comment.

This patch makes the BPF backend to emit quoted section names in
.section directives if the name requires to be quoted.  Simple section
names whose constituent characters are in the set [0-9a-zA-Z_] are
still emitted unquoted.

Tested in target bpf-unknown-none-gcc and host x86_64-linux-gnu.

gcc/ChangeLog

* config/bpf/bpf.cc (bpf_asm_named_section): New function.
(TARGET_ASM_NAMED_SECTION): Set to bpf_asm_named_section.

gcc/testsuite/ChangeLog

* gcc.target/bpf/section-name-quoting-1.c: New test.

commit | commitdiff | tree

Di Zhao [Fri, 1 Dec 2023 08:16:40 +0000 (16:16 +0800)]

aarch64: modify Ampere CPU tunings on reassociation/FMA

1. Allow reassociation on FP additions.

2. Avoid generating loop-dependant FMA chains. Added a tuning
option 'AARCH64_EXTRA_TUNE_AVOID_CROSS_LOOP_FMA' for this.

gcc/ChangeLog:

* config/aarch64/aarch64-tuning-flags.def
(AARCH64_EXTRA_TUNING_OPTION): New tuning option to avoid
cross-loop FMA.
* config/aarch64/aarch64.cc
(aarch64_override_options_internal): Set
param_avoid_fma_max_bits according to tuning option.
* config/aarch64/tuning_models/ampere1.h (ampere1_tunings):
Modify tunings related with FMA.
* config/aarch64/tuning_models/ampere1a.h (ampere1a_tunings):
Likewise.
* config/aarch64/tuning_models/ampere1b.h (ampere1b_tunings):
Likewise.

commit | commitdiff | tree

Rainer Orth [Fri, 1 Dec 2023 08:59:08 +0000 (09:59 +0100)]

libstdc++: Regenerate GCC_CHECK_ASSEMBLER_HWCAP users

This patch regenerates the remaining user of the GCC_CHECK_ASSEMBLER_HWCAP
macro. No functional changes.

2023-11-30 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>

libstdc++-v3:
* configure: Regenerate.

commit | commitdiff | tree

Sebastian Huber [Thu, 30 Nov 2023 16:16:53 +0000 (17:16 +0100)]

gcov: Fix use of __LIBGCC_HAVE_LIBATOMIC

libgcc/ChangeLog:

PR target/112777

* libgcov.h (GCOV_SUPPORTS_ATOMIC): Honor that __LIBGCC_HAVE_LIBATOMIC is
always defined as either 0 or 1.

commit | commitdiff | tree

Richard Sandiford [Fri, 1 Dec 2023 08:36:15 +0000 (08:36 +0000)]

aarch64: Add a result_mode helper function

SME will add more intrinsics whose expansion code requires
the mode of the function return value. This patch adds an
associated helper routine.

gcc/
* config/aarch64/aarch64-sve-builtins.h
(function_expander::result_mode): New member function.
* config/aarch64/aarch64-sve-builtins-base.cc
(svld234_impl::expand): Use it.
* config/aarch64/aarch64-sve-builtins.cc
(function_expander::get_reg_target): Likewise.

commit | commitdiff | tree

Jakub Jelinek [Fri, 1 Dec 2023 08:26:56 +0000 (09:26 +0100)]

lower-bitint: Fix up maximum addition/subtraction/multiplication result computations

When debugging PR112750, I've noticed some issues in the computation
of precisions and the following patch attempts to fix those.

The pass uses range_to_prec function, which possibly using ranger returns
minimum precision of some operand in the style that libgcc _BitInt
entrypoints expect, i.e. for operands with unsigned types either the
precision of that type or with help of ranger
wi::min_precision (upper_bound, UNSIGNED) (done both if the types
are really unsigned or even when lower_bound is non-negative), while
for operands with signed types either negated precision of that type or
with help of ranger negated value of maximum of SIGNED min_precisions
of lower and upper bound.
Because _BitInt in C only supports unsigned precisions >= 1 and signed
precisions >= 2, the function also ensures that 0 is never returned (returns
1 instead) and should ensure that -1 is never returned (should return -2).
That is the first bug I found though, for the ranger case it ensured that,
but if an operand would be signed 1-bit precision (that means
non-BITINT_TYPE) operand, it could return -1.

Another thing is that both lower_addsub_overflow and lower_mul_overflow
compute from the prec0 and prec1 of the operands (returned by range_to_prec
with the above value meanings) prec2, which is how many bits of the result
we actually need to compute to know the infinite precision result.
This is then used by arith_overflow function together with prec
(TYPE_PRECISION (type)), type (type of the result), prec0 and prec1 to
compute which range of bits should be tested (if any, or that there is never
an overflow) and with which kind (require those bits to be zero vs.
check if all those bits together all all zeros/ones).
The arith_overflow function has one special case, when
prec0 >= 0 && prec1 >= 0 and operation is not .SUB_OVERFLOW; in that case
we treat prec2 as minimum precision to express any infinite precision
unsigned result (the result is never negative in that case), while
in all other cases prec2 is treated as minimum precision to express
any infinite precision signed result (because the result can be also
negative).
The computations of those values were apparently incorrect for all of
.{ADD,SUB}_OVERFLOW (in that case only if one operand was signed and
the other unsigned) and for .MUL_OVERFLOW it was sometimes too large.

It took me a while to get to the right expression how to compute that,
I've started with writing into the comment the possible results for
different prec0 and prec1 values (used just 8/-8/10/-10 as placeholders
for equal or different absolute values of the 2 precisions and cases
with positive and/or negative signs) and then turned into the attached
test program that actually printed what I was writing to make sure
I didn't make mistakes in it and in the end also verified the computation,
this time for all combinations of 1..14 and -2..-14 precisions.
The UNSIGNED vs. SIGNED in the table is what arith_overflow expects
the prec2 to be (see above mentioned exception).

2023-12-01 Jakub Jelinek <jakub@redhat.com>

* gimple-lower-bitint.cc (range_to_prec): Don't return -1 for
signed types.
(bitint_large_huge::lower_addsub_overflow): Fix up computation of
prec2.
(bitint_large_huge::lower_mul_overflow): Likewise.

commit | commitdiff | tree

Jakub Jelinek [Fri, 1 Dec 2023 08:26:24 +0000 (09:26 +0100)]

lower-bitint: Fix ICE on bitint-39.c

torture/bitint-39.c ICEs with -O1; the problem is that the
finish_arith_overflow code in one spot replaces use_stmt with an
assignment or cast, but if unlucky and m_gsi iterator is the same statement,
when the code later
      tree clobber = build_clobber (TREE_TYPE (var), CLOBBER_EOL);
      g = gimple_build_assign (var, clobber);
      gsi_insert_after (&m_gsi, g, GSI_SAME_STMT);
it will insert after iterator which contains already replaced statement and
that causes the gimple chain corruption.

2023-12-01  Jakub Jelinek  <jakub@redhat.com>

* gimple-lower-bitint.cc (bitint_large_huge::finish_arith_overflow):
When replacing use_stmt which is gsi_stmt (m_gsi), update m_gsi to
the new statement.

commit | commitdiff | tree

Jakub Jelinek [Fri, 1 Dec 2023 08:25:45 +0000 (09:25 +0100)]

lower-bitint: Fix _BitInt .{ADD,SUB}_OVERFLOW lowering [PR112750]

The .{ADD,SUB}_OVERFLOW lowering is implemented by performing normal
addition/subtraction (perhaps extended to even more bits than normally by
continuing with extended values of operands after running of normal bits)
and in addition to that trying to compute if certain sets of bits are either
all zero or all sign extensions of the least significant bit.

That code is in a lot of cases guarded just by a single condition (which
can be idx > startlimb, idx >= startlimb or idx == startlimb) or by
2 conditions - if (idx >= startlimb) { if (idx == startlimb) ... else ... }
Now, if_then_if_then_else when the second argument is NULL works just as
if_then and sets m_gsi to be within the initially empty then block and that is
where we emit code for constant tidx startlimb + (cmp_code == GT_EXPR).
But in the 2 conditions case, m_gsi is set to the initially empty else
block, so using EQ_EXPR for the condition was incorrect (and strangely
nothing in the testsuite caught that), because the code for extracting the
lowest set of bits (i.e. when tidx is startlimb) is then done when idx
is not startlimb rather than when it is.
The following patch fixes that.

Note, when developing the lowering, I was using gcov to make sure all code
is covered by the testsuite with minimum exceptions, so no idea how this
slipped out.

2023-12-01 Jakub Jelinek <jakub@redhat.com>

PR middle-end/112750
* gimple-lower-bitint.cc (bitint_large_huge::lower_addsub_overflow):
Use NE_EXPR rather than EQ_EXPR for g2 if !single_comparison and
adjust probabilities.

* gcc.dg/bitint-41.c: Use -std=c23 rather than -std=c2x.
* gcc.dg/torture/bitint-43.c: Likewise.
* gcc.dg/torture/bitint-44.c: Likewise.
* gcc.dg/torture/bitint-45.c: New test.

commit | commitdiff | tree

Xi Ruoyao [Thu, 30 Nov 2023 09:35:10 +0000 (17:35 +0800)]

doc: Update the status of build directory not fully separated

Recently there are some people building GCC with srcdir == objdir and
the attempts just failed [1]. So stop to say "it should work". OTOH
objdir as a subdirectory of srcdir works: we've built GCC in LFS [2]
and BLFS [3] this way for decades and this is confirmed during the
review of a previous version of this patch [4].

[1]: https://gcc.gnu.org/pipermail/gcc-help/2023-November/143068.html
[2]: https://www.linuxfromscratch.org/lfs/view/12.0/chapter08/gcc.html
[3]: https://www.linuxfromscratch.org/blfs/view/12.0/general/gcc.html
[4]: https://gcc.gnu.org/pipermail/gcc-patches/2023-November/638760.html

gcc/ChangeLog:

* doc/install.texi: Deem srcdir == objdir broken, but objdir
as a subdirectory of srcdir fine.

Mirror of https://gcc.gnu.org/git/gcc.git

RSS Atom