1 2021-12-02 Ilya Lipnitskiy <ilya.lipnitskiy@gmail.com>
3 * configure: Regenerate.
4 * configure.ac: Define TARGET_LIBC_GNUSTACK on musl.
6 2021-12-02 Iain Sandoe <iain@sandoe.co.uk>
9 * config/host-darwin.c (TRY_EMPTY_VM_SPACE,
10 SAFE_ALLOC_SIZE): New.
11 (darwin_gt_pch_get_address): Rewrite to use nominated
12 memory segments rather than part of the compiler __DATA
14 (darwin_gt_pch_use_address): Likewise.
16 2021-12-02 Gerald Pfeifer <gerald@pfeifer.com>
18 * doc/install.texi (*-*-freebsd*): Remove references to
19 FreeBSD 1 and FreeBSD 2.
21 2021-12-02 Vladimir N. Makarov <vmakarov@redhat.com>
23 PR rtl-optimization/103437
24 * ira-color.c (setup_allocno_priorities): Switch off backup code
25 for overflow if compiler has __builtin_smul_overflow. Use <
26 for comparison with -INT_MAX.
28 2021-12-02 Bill Schmidt <wschmidt@linux.ibm.com>
30 * config/rs6000/altivec.h: Delete a number of #defines that are now
31 superfluous. Alphabetize. Include rs6000-vecdefines.h. Include some
33 * config/rs6000/rs6000-builtin-new.def (CMPB): Flag as no32bit.
34 (BPERMD): Flag as 32bit (needing special handling for 32-bit).
35 (UNPACK_TD): Return unsigned long long instead of unsigned long.
36 (GET_TEXASR): Return unsigned long instead of unsigned long long.
37 (GET_TEXASRU): Likewise.
38 (GET_TFHAR): Likewise.
39 (GET_TFIAR): Likewise.
40 (SET_TEXASR): Pass unsigned long instead of unsigned long long.
41 (SET_TEXASRU): Likewise.
42 (SET_TFHAR): Likewise.
43 (SET_TFIAR): Likewise.
45 (TABORTDCI): Likewise.
46 * config/rs6000/rs6000-call.c (rs6000_expand_new_builtin): Fix error
47 handling for no32bit. Add 32bit handling for RS6000_BIF_BPERMD.
48 * config/rs6000/rs6000-gen-builtins.c (write_init_file): Initialize
49 new_builtins_are_live to 1.
51 2021-12-02 Vladimir N. Makarov <vmakarov@redhat.com>
53 PR rtl-optimization/103437
54 * ira-color.c (setup_allocno_priorities): Use long long
55 multiplication as backup for overflow processing.
57 2021-12-02 Tamar Christina <tamar.christina@arm.com>
59 * config/aarch64/aarch64-protos.h (aarch64_gen_shareable_zero): New.
60 * config/aarch64/aarch64-simd.md (aarch64_rshrn<mode>,
61 aarch64_rshrn2<mode>): Generate rounding half-ing add when appropriate.
62 * config/aarch64/aarch64.c (aarch64_gen_shareable_zero): New.
64 2021-12-02 Vladimir N. Makarov <vmakarov@redhat.com>
66 PR rtl-optimization/103437
67 * ira-color.c (setup_allocno_priorities): Process multiplication
70 2021-12-02 Martin Liska <mliska@suse.cz>
73 * common.opt: New option.
74 * coverage.c (coverage_begin_function): Emit filename with
75 remap_profile_filename.
76 * doc/invoke.texi: Document the new option.
77 * file-prefix-map.c (add_profile_prefix_map): New.
78 (remap_profile_filename): Likewise.
79 * file-prefix-map.h (add_profile_prefix_map): Likewise.
80 (remap_profile_filename): Likewise.
81 * lto-opts.c (lto_write_options): Handle
82 OPT_fprofile_prefix_map_.
83 * opts-global.c (handle_common_deferred_options): Likewise.
84 * opts.c (common_handle_option): Likewise.
85 (gen_command_line_string): Likewise.
86 * profile.c (output_location): Emit filename with
87 remap_profile_filename.
89 2021-12-02 Richard Biener <rguenther@suse.de>
92 * internal-fn.c (expand_DEFERRED_INIT): When the base
93 of the LHS is a decl with matching constant size use
94 that as the initialization target instead of an
95 eventual VLA typed one.
97 2021-12-02 Richard Biener <rguenther@suse.de>
99 PR tree-optimization/103527
100 * tree-vect-stmts.c (vect_build_gather_load_calls): Always
101 use a truth type for building the vector mask.
103 2021-12-02 Rasmus Villemoes <rasmus.villemoes@prevas.dk>
105 * config/vxworks.h (LTO_PLUGIN_SPEC): Adapt to corrected
106 spelling of -linker-output-auto-nolto-rel.
108 2021-12-01 Uroš Bizjak <ubizjak@gmail.com>
111 * config/i386/sse.md (VI2F): Remove mode iterator.
112 (VI2F_256_512): New mode iterator.
113 (vec_set<V8_128:mode>_0): New insn pattern.
114 (vec_set<VI2F_256_512:mode>_0>): Rename from vec_set<VI2F:mode>mode.
115 Use VI2F_256_512 mode iterator instead of VI2F.
116 (*axv512fp16_movsh): Remove.
117 (<sse2p4_1>_pinsr<ssemodesuffix>): Add (x,x,x) AVX2 alternative.
118 Do not disable V8HF mode insn on AVX2 targets.
119 (pinsrw -> pbroadcast + pblendw peephole2): New peephole.
120 (pinsrw -> pbroadcast + pblendw splitter): New post-reload splitter.
121 * config/i386/i386.md (extendhfsf): Call gen_vec_setv8hf_0.
122 * config/i386/i386-expand.c (ix86_expand_vector_set)
123 <case E_V8HFmode>: Use vec_merge path for TARGET_AVX2.
125 2021-12-01 Roger Sayle <roger@nextmovesoftware.com>
126 Richard Biener <rguenther@suse.de>
128 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
129 Check if simplify_using_initial_conditions allows us to
130 simplify the expression for may_be_zero.
131 * match.pd (X != C ? -X : -C -> -X): New transform.
132 (X != C ? ~X : ~C -> ~X): Likewise.
133 ((X+1) > Y ? -X : 1 -> X >= Y ? -X : 1): Likewise.
135 2021-12-01 Peter Bergner <bergner@linux.ibm.com>
138 * gimplify.c (is_var_need_auto_init): Handle opaque types.
140 2021-12-01 Iain Buclaw <ibuclaw@gdcproject.org>
142 * common.opt (static-libphobos): Add option.
143 * config/darwin.h (LINK_SPEC): Substitute -lgphobos with libgphobos.a
144 when linking statically.
145 * gcc.c (driver_handle_option): Set -static-libphobos as always valid.
147 2021-12-01 Jason Merrill <jason@redhat.com>
150 * fold-const.c (maybe_nonzero_address): Use get_create or get
151 depending on folding_initializer.
152 (fold_binary_initializer_loc): New.
153 * fold-const.h (fold_binary_initializer_loc): Declare.
154 * varasm.c (mark_weak): Don't use the decl location.
155 * doc/invoke.texi: Document -fconstexpr-fp-except.
157 2021-12-01 Bill Schmidt <wschmidt@linux.ibm.com>
160 * config/rs6000/rs6000-c.c (rs6000-builtins.h): Stop including.
161 (rs6000-internal.h): Include.
162 (altivec_resolve_new_overloaded_builtin): Move call to
163 rs6000_invalid_new_builtin here from rs6000_new_builtin_decl.
164 * config/rs6000/rs6000-call.c (rs6000-builtins.h): Stop including.
165 (rs6000_invalid_new_builtin): Remove static qualifier.
166 (rs6000_new_builtin_decl): Remove test for supported builtin.
167 * config/rs6000/rs6000-internal.h (rs6000-builtins.h): Include.
168 (rs6000_invalid_new_builtin): Declare.
169 * config/rs6000/rs6000.c (rs6000-builtins.h): Don't include.
171 2021-12-01 Aldy Hernandez <aldyh@redhat.com>
173 PR tree-optimization/103409
174 * gimple-range-path.cc (path_range_query::compute_ranges_in_phis):
175 Do all the work with just one ssa_global_cache.
176 * gimple-range-path.h: Remove m_tmp_phi_cache.
178 2021-12-01 Richard Sandiford <richard.sandiford@arm.com>
180 PR tree-optimization/103517
181 * tree-vect-slp.c (vect_build_slp_tree_1): When allowing two
182 different component references, check the codes of both them,
183 rather than just the first.
185 2021-12-01 Andrew MacLeod <amacleod@redhat.com>
187 PR tree-optimization/103464
188 * gimple-range.cc (gimple_ranger::prefill_name): Process phis also.
189 (gimple_ranger::prefill_stmt_dependencies): Ditto.
191 2021-12-01 Iain Buclaw <ibuclaw@gdcproject.org>
193 * doc/install.texi (Prerequisites): Add note that D front end now
194 requires GDC installed in order to bootstrap.
195 (Building): Add D compiler section, referencing prerequisites.
197 2021-12-01 Jakub Jelinek <jakub@redhat.com>
199 * config/i386/x86-tune.def (X86_TUNE_SCHEDULE,
200 X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY,
201 X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Formatting fixes.
202 (X86_TUNE_USE_GATHER): Put m_GENERIC last for consistency.
204 2021-12-01 Jakub Jelinek <jakub@redhat.com>
206 PR rtl-optimization/102356
207 * rtl.h (simplify_context): Add assoc_count member and
208 max_assoc_count static member.
209 * simplify-rtx.c (simplify_associative_operation): Don't reassociate
210 more than max_assoc_count times within one outermost simplify_* call.
211 * dwarf2out.c (mem_loc_descriptor): Optimize binary operation
212 with both operands the same using DW_OP_dup.
214 2021-12-01 Tamar Christina <tamar.christina@arm.com>
216 * match.pd: Move below pattern that rewrites to EQ, NE.
217 * tree.c (bitmask_inv_cst_vector_p): Correct do .. while indentation.
219 2021-12-01 Siddhesh Poyarekar <siddhesh@gotplt.org>
220 Martin Liška <mliska@suse.cz>
222 PR tree-optimization/103456
223 * tree-object-size.c (merge_object_sizes): Update osi->changed
224 only if object_sizes_set succeeded.
226 2021-11-30 liuhongt <hongtao.liu@intel.com>
230 * config/i386/i386.md (*x86_64_shld_1): Set_attr
231 length_immediate to 1.
232 (*x86_shld_1): Ditto.
233 (*x86_64_shrd_1): Ditto.
234 (*x86_shrd_1): Ditto.
236 2021-11-30 Bill Schmidt <wschmidt@linux.ibm.com>
238 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
240 (altivec_resolve_new_overloaded_builtin): Likewise.
242 2021-11-30 Martin Jambor <mjambor@suse.cz>
245 * ipa-sra.c (scan_function): Also check ECF_LOOPING_CONST_OR_PURE flag.
247 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
249 PR tree-optimization/103494
250 * tree-vect-stmts.c (vect_get_gather_scatter_ops): Remove ncopies
251 argument and calculate ncopies from gs_info->offset_vectype
253 (vectorizable_store, vectorizable_load): Update accordingly.
255 2021-11-30 Iain Buclaw <ibuclaw@gdcproject.org>
257 * Makefile.in (GDC): New variable.
258 (GDCFLAGS): New variable.
259 * configure: Regenerate.
260 * configure.ac: Add call to ACX_PROG_GDC. Substitute GDCFLAGS.
262 2021-11-30 Martin Liska <mliska@suse.cz>
263 Martin Jambor <mjambor@suse.cz>
266 * ipa-param-manipulation.c
267 (ipa_param_body_adjustments::prepare_debug_expressions): Be
268 careful about hash_map reallocating itself. Simpify a return
269 which always returns true.
271 2021-11-30 Andrew MacLeod <amacleod@redhat.com>
273 PR tree-optimization/103440
274 * gimple-range-fold.cc (fold_using_range::range_of_phi): Continue
275 normal param processing for equiv params.
277 2021-11-30 Richard Biener <rguenther@suse.de>
279 * config/i386/i386.c (ix86_shift_rotate_cost): Remove
281 * tree-chrec.c (evolution_function_is_invariant_rec_p):
283 * tree-if-conv.c (if_convertible_stmt_p): Likewise.
284 * tree-ssa-pre.c (fully_constant_expression): Likewise.
285 * tree-vrp.c (operand_less_p): Likewise.
286 * reload.c (reg_overlap_mentioned_for_reload_p): Remove
287 unreachable gcc_unreachable ().
288 * sel-sched-ir.h (bb_next_bb): Likewise.
289 * varasm.c (compare_constant): Likewise.
291 2021-11-30 Richard Biener <rguenther@suse.de>
293 PR tree-optimization/103489
294 * tree-vect-loop.c (vectorizable_phi): Verify argument
295 vector type compatibility to mitigate bool pattern recog
298 2021-11-30 Thomas Schwinge <thomas@codesourcery.com>
300 * omp-offload.c (oacc_loop_auto_partitions): Remove erroneous
301 "Orphan reductions cannot have gang partitioning" handling.
303 2021-11-30 Thomas Schwinge <thomas@codesourcery.com>
305 * omp-offload.c (oacc_loop_process): Implement "gang reduction on
306 an orphan loop" checking.
308 2021-11-30 Cesar Philippidis <cesar@codesourcery.com>
309 Thomas Schwinge <thomas@codesourcery.com>
311 * omp-general.h (enum oacc_loop_flags): Add OLF_REDUCTION enum.
312 * omp-low.c (lower_oacc_head_mark): Use it to mark OpenACC
314 * omp-offload.c (oacc_loop_auto_partitions): Don't assign gang
315 level parallelism to orphan reductions.
317 2021-11-30 Aldy Hernandez <aldyh@redhat.com>
319 PR tree-optimization/103451
320 * range-op.cc (operator_div::wi_fold): Remove
321 can_throw_non_call_exceptions special case.
322 * tree-ssa-sink.c (sink_code_in_bb): Same.
324 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
326 * tree-vect-slp.c (arg1_arg4_map): New variable.
327 (vect_get_operand_map): Handle IFN_MASK_GATHER_LOAD.
328 (vect_build_slp_tree_1): Likewise.
329 (vect_build_slp_tree_2): Likewise.
330 * tree-vect-stmts.c (vectorizable_load): Expect the mask to be
331 the last SLP child node rather than the first.
333 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
335 * tree-if-conv.c: Include tree-eh.h.
336 (predicate_statements): Remove pe argument. Don't hoist
338 (combine_blocks): Remove pe argument.
339 (ifcvt_available_on_edge_p, ifcvt_can_hoist): New functions.
340 (ifcvt_hoist_invariants): Likewise.
341 (tree_if_conversion): Update call to combine_blocks. Call
342 ifcvt_hoist_invariants after VN.
344 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
346 PR tree-optimization/102467
347 * doc/sourcebuild.texi (vect_gather_load_ifn): Document.
348 * tree-vect-data-refs.c (vect_analyze_data_ref_dependence):
349 Commonize safelen handling. Punt for anything involving
350 gathers and scatters unless safelen says otherwise.
351 * tree-vect-slp.c (arg1_map): New variable.
352 (vect_get_operand_map): Handle IFN_GATHER_LOAD.
353 (vect_build_slp_tree_1): Likewise.
354 (vect_build_slp_tree_2): Likewise.
355 (compatible_calls_p): If vect_get_operand_map returns nonnull,
356 check that any skipped arguments are equal.
357 (vect_slp_analyze_node_operations_1): Tighten reduction check.
358 * tree-vect-stmts.c (check_load_store_for_partial_vectors): Take
360 (vect_get_gather_scatter_ops): Take slp_node and ncopies arguments.
362 (vectorizable_store, vectorizable_load): Adjust accordingly.
364 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
366 * gimple.h (gimple_num_args, gimple_arg, gimple_arg_ptr): New
368 * tree-vect-slp.c (cond_expr_maps, arg2_map): New variables.
369 (vect_get_operand_map): New function.
370 (vect_get_and_check_slp_defs): Fix outdated comment.
371 Use vect_get_operand_map and new gimple argument accessors.
372 (vect_build_slp_tree_2): Likewise.
374 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
376 * tree-vect-slp.c (vect_build_slp_tree_1): Use code_helper
377 to record the operations performed by statements, only using
378 CALL_EXPR for things that don't map to built-in or internal
379 functions. For shifts, require all shift amounts to be equal
380 if optab_vector is not supported but optab_scalar is.
382 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
384 * tree-vect-data-refs.c (vect_check_gather_scatter): Continue
385 processing conversions if the current offset is a pointer.
387 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
389 * tree-vectorizer.h (vect_is_reduction): Use STMT_VINFO_REDUC_IDX.
391 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
393 * genopinit.c (main): Turn supports_vec_gather_load and
394 supports_vec_scatter_store into signed char arrays and remove
395 supports_vec_gather_load_cached and supports_vec_scatter_store_cached.
396 * optabs-query.c (supports_vec_convert_optab_p): Add a mode parameter.
397 If the mode is not VOIDmode, test only for that mode.
398 (supports_vec_gather_load_p): Likewise.
399 (supports_vec_scatter_store_p): Likewise.
400 * optabs-query.h (supports_vec_gather_load_p): Likewise.
401 (supports_vec_scatter_store_p): Likewise.
402 * tree-vect-data-refs.c (vect_check_gather_scatter): Pass the
403 vector mode to supports_vec_gather_load_p and
404 supports_vec_scatter_store_p.
406 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
408 * internal-fn.c (commutative_binary_fn_p): Handle IFN_ADD_OVERFLOW
409 and IFN_MUL_OVERFLOW.
411 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
413 * internal-fn.c (commutative_binary_fn_p): Handle IFN_UBSAN_CHECK_ADD
414 and IFN_UBSAN_CHECK_MUL.
416 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
418 * internal-fn.c (commutative_binary_fn_p): Handle IFN_COMPLEX_MUL.
420 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
422 * gimple-fold.c: Include internal-fn.h.
423 (fold_stmt_1): If a function maps to an internal one, use
424 first_commutative_argument to canonicalize the order of
425 commutative arguments.
426 * gimple-match-head.c (gimple_resimplify2, gimple_resimplify3)
427 (gimple_resimplify4, gimple_resimplify5): Extend commutativity
430 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
432 * doc/md.texi (reduc_fmin_scal_@var{m}): Document.
433 (reduc_fmax_scal_@var{m}): Likewise.
434 * optabs.def (reduc_fmax_scal_optab): New optab.
435 (reduc_fmin_scal_optab): Likewise
436 * internal-fn.def (REDUC_FMAX, REDUC_FMIN): New functions.
437 * tree-vect-loop.c (reduction_fn_for_scalar_code): Handle
438 CASE_CFN_FMAX and CASE_CFN_FMIN.
439 (neutral_op_for_reduction): Likewise.
440 (needs_fold_left_reduction_p): Likewise.
441 * config/aarch64/iterators.md (FMAXMINV): New iterator.
442 (fmaxmin): Handle UNSPEC_FMAXNMV and UNSPEC_FMINNMV.
443 * config/aarch64/aarch64-simd.md (reduc_<optab>_scal_<mode>): Fix
445 (reduc_<fmaxmin>_scal_<mode>): New pattern.
446 * config/aarch64/aarch64-sve.md (reduc_<fmaxmin>_scal_<mode>):
449 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
451 * builtins.h (associated_internal_fn): Declare overload that
452 takes a (combined_cfn, return type) pair.
453 * builtins.c (associated_internal_fn): Split new overload out
454 of original fndecl version. Also provide an overload that takes
455 a (combined_cfn, return type) pair.
456 * internal-fn.h (commutative_binary_fn_p): Declare.
457 (commutative_ternary_fn_p): Likewise.
458 (associative_binary_fn_p): Likewise.
459 * internal-fn.c (commutative_binary_fn_p, commutative_ternary_fn_p):
460 New functions, split out from...
461 (first_commutative_argument): ...here.
462 (associative_binary_fn_p): New function.
463 * gimple-match.h (code_helper): Add a constructor that takes
465 (commutative_binary_op_p): Declare.
466 (commutative_ternary_op_p): Likewise.
467 (first_commutative_argument): Likewise.
468 (associative_binary_op_p): Likewise.
469 (canonicalize_code): Likewise.
470 (directly_supported_p): Likewise.
471 (get_conditional_internal_fn): Likewise.
472 (gimple_build): New overloads that takes a code_helper.
473 * gimple-fold.c (gimple_build): Likewise.
474 * gimple-match-head.c (commutative_binary_op_p): New function.
475 (commutative_ternary_op_p): Likewise.
476 (first_commutative_argument): Likewise.
477 (associative_binary_op_p): Likewise.
478 (canonicalize_code): Likewise.
479 (directly_supported_p): Likewise.
480 (get_conditional_internal_fn): Likewise.
481 * tree-vectorizer.h: Include gimple-match.h.
482 (neutral_op_for_reduction): Take a code_helper instead of a tree_code.
483 (needs_fold_left_reduction_p): Likewise.
484 (reduction_fn_for_scalar_code): Likewise.
485 (vect_can_vectorize_without_simd_p): Declare a nNew overload that takes
487 * tree-vect-loop.c: Include case-cfn-macros.h.
488 (fold_left_reduction_fn): Take a code_helper instead of a tree_code.
489 (reduction_fn_for_scalar_code): Likewise.
490 (neutral_op_for_reduction): Likewise.
491 (needs_fold_left_reduction_p): Likewise.
492 (use_mask_by_cond_expr_p): Likewise.
493 (build_vect_cond_expr): Likewise.
494 (vect_create_partial_epilog): Likewise. Use gimple_build rather
495 than gimple_build_assign.
496 (check_reduction_path): Handle calls and operate on code_helpers
497 rather than tree_codes.
498 (vect_is_simple_reduction): Likewise.
499 (vect_model_reduction_cost): Likewise.
500 (vect_find_reusable_accumulator): Likewise.
501 (vect_create_epilog_for_reduction): Likewise.
502 (vect_transform_cycle_phi): Likewise.
503 (vectorizable_reduction): Likewise. Make more use of
505 (vect_transform_reduction): Use gimple_extract_op but expect
507 (vect_can_vectorize_without_simd_p): New overload that takes
509 * tree-vect-stmts.c (vectorizable_call): Handle reductions in
511 * tree-vect-patterns.c (vect_mark_pattern_stmts): Use
512 gimple_extract_op when updating STMT_VINFO_REDUC_IDX.
514 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
516 * gimple-match.h (code_helper): Provide == and != overloads.
517 (code_helper::operator tree_code): Make explicit.
518 (code_helper::operator combined_fn): Likewise.
519 * gimple-match-head.c (convert_conditional_op): Use explicit
520 conversions where necessary.
521 (gimple_resimplify1, gimple_resimplify2, gimple_resimplify3): Likewise.
522 (maybe_push_res_to_seq, gimple_simplify): Likewise.
523 * gimple-fold.c (replace_stmt_with_simplification): Likewise.
525 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
527 * gimple-match.h (code_helper): Add functions for querying whether
528 the code represents an internal_fn or a built_in_function.
529 Provide explicit conversion operators for both cases.
530 (gimple_extract_op): Declare.
531 * gimple-match-head.c (gimple_extract): New function, extracted from...
532 (gimple_simplify): ...here.
533 (gimple_extract_op): New function.
535 2021-11-30 Eric Botcazou <ebotcazou@adacore.com>
538 * config/i386/i386.c (ix86_output_call_insn): Beef up comment about
539 nops emitted with SEH.
540 * config/i386/winnt.c (i386_pe_seh_unwind_emit): When switching to
541 the cold section, emit a nop before the directive if the previous
542 active instruction is a call.
544 2021-11-30 Roger Sayle <roger@nextmovesoftware.com>
545 Uroš Bizjak <ubizjak@gmail.com>
548 * config/i386/sse.md (define_split): New splitters to simplify
549 not;vec_duplicate;and as vec_duplicate;andn.
551 2021-11-30 Richard Biener <rguenther@suse.de>
554 * match.pd (-((int)x >> (prec - 1)) to (unsigned)x >> (prec - 1)):
555 Use VIEW_CONVERT_EXPR for vectors.
557 2021-11-30 Richard Biener <rguenther@suse.de>
559 * cfgrtl.c (skip_insns_after_block): Refactor code to
560 be more easily readable.
561 * expr.c (op_by_pieces_d::run): Remove unreachable
563 * sched-deps.c (sched_analyze): Remove unreachable
565 * sel-sched-ir.c (in_same_ebb_p): Likewise.
566 * tree-ssa-alias.c (nonoverlapping_refs_since_match_p):
567 Remove unreachable code.
568 * tree-vect-slp.c (vectorize_slp_instance_root_stmt):
569 Refactor to avoid unreachable loop iteration.
570 * tree.c (walk_tree_1): Remove unreachable break.
571 * vec-perm-indices.c (vec_perm_indices::series_p): Remove
574 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
577 * config/rs6000/rs6000-call.c (rs6000_builtin_decl): Remove builtin mask
580 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
582 * config/rs6000/rs6000.c
583 (rs6000_cost_data::update_target_cost_per_stmt): Adjust the way to
584 compute extra penalized cost. Remove useless parameter.
585 (rs6000_cost_data::rs6000_add_stmt_cost): Adjust the call to function
586 update_target_cost_per_stmt.
588 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
591 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
593 * config/visium/visium.md (*add<mode>3_insn, *addsi3_insn, *addi3_insn,
594 *sub<mode>3_insn, *subsi3_insn, *subdi3_insn, *neg<mode>2_insn,
595 *negdi2_insn, *and<mode>3_insn, *ior<mode>3_insn, *xor<mode>3_insn,
596 *one_cmpl<mode>2_insn, *ashl<mode>3_insn, *ashr<mode>3_insn,
597 *lshr<mode>3_insn, *trunchiqi2_insn, *truncsihi2_insn,
598 *truncdisi2_insn, *extendqihi2_insn, *extendqisi2_insn,
599 *extendhisi2_insn, *extendsidi2_insn, *zero_extendqihi2_insn,
600 *zero_extendqisi2_insn, *zero_extendsidi2_insn): Fix split condition.
602 2021-11-30 Andrew MacLeod <amacleod@redhat.com>
604 PR tree-optimization/103467
605 * gimple-range-gori.cc (range_def_chain::register_dependency): Don't
606 use an object reference after a potential resize.
608 2021-11-29 Uroš Bizjak <ubizjak@gmail.com>
611 * config/i386/i386.md (*movhi_internal): Introduce (*v,C) alternative.
612 Do not allocate non-GPR registers. Optimize xmm->xmm moves when
613 optimizing for size. Fix vpinsrw insn template.
614 (*movhf_internal): Fix pinsrw and pextrw insn templates for
615 AVX targets. Use sselog1 type instead of sselog. Optimize GPR moves.
616 Optimize xmm->xmm moves for TARGET_SSE_PARTIAL_REG_DEPENDENCY
617 and TARGET_SSE_SPLIT_REGS targets.
619 2021-11-29 Eric Gallager <egallager@gcc.gnu.org>
621 * configure: Re-regenerate.
623 2021-11-29 Eric Gallager <egallager@gcc.gnu.org>
626 * Makefile.in: Substitute CTAGS, ETAGS, and CSCOPE
627 variables. Use ETAGS variable in TAGS target.
628 * configure: Regenerate.
629 * configure.ac: Allow CTAGS, ETAGS, and CSCOPE
630 variables to be overridden.
632 2021-11-29 Paul A. Clarke <pc@us.ibm.com>
634 * config/rs6000/xmmintrin.h (_mm_movemask_ps): Use vec_extractm
636 * config/rs6000/emmintrin.h (_mm_movemask_pd): Likewise.
637 (_mm_movemask_epi8): Likewise.
639 2021-11-29 Richard Biener <rguenther@suse.de>
641 * read-rtl-function.c (function_reader::read_rtx_operand):
642 Return only after resetting m_in_call_function_usage.
644 2021-11-29 Roger Sayle <roger@nextmovesoftware.com>
646 * config/i386/i386-expand.c (ix86_expand_v1ti_to_ti): Perform the
647 conversion via V2DImode using vec_extractv2didi on TARGET_SSE2.
648 * config/i386/sse.md (rotlv1ti3, rotrv1ti3): Change constraint
649 on QImode shift amounts from const_int_operand to general_operand.
651 2021-11-29 Richard Biener <rguenther@suse.de>
653 * tree.h (reverse_storage_order_for_component_p): Remove
654 spurious gcc_unreachable.
655 * cfganal.c (dfs_find_deadend): Likewise.
656 * fold-const-call.c (fold_const_logb): Likewise.
657 (fold_const_significand): Likewise.
658 * gimple-ssa-store-merging.c (lhs_valid_for_store_merging_p):
661 2021-11-29 Richard Biener <rguenther@suse.de>
663 * vec.c (qsort_chk): Do not return the void return value
664 from the noreturn qsort_chk_error.
665 * ccmp.c (expand_ccmp_expr_1): Remove unreachable return.
666 * df-scan.c (df_ref_equal_p): Likewise.
667 * dwarf2out.c (is_base_type): Likewise.
668 (add_const_value_attribute): Likewise.
669 * fixed-value.c (fixed_arithmetic): Likewise.
670 * gimple-fold.c (gimple_fold_builtin_fputs): Likewise.
671 * gimple-ssa-strength-reduction.c (stmt_cost): Likewise.
672 * graphite-isl-ast-to-gimple.c
673 (gcc_expression_from_isl_expr_op): Likewise.
674 (gcc_expression_from_isl_expression): Likewise.
675 * ipa-fnsummary.c (will_be_nonconstant_expr_predicate):
677 * lto-streamer-in.c (lto_input_mode_table): Likewise.
679 2021-11-29 liuhongt <hongtao.liu@intel.com>
682 * config/i386/i386.c (inline_secondary_memory_needed): HImode
683 move between GPR and SSE registers is supported under
684 TARGET_SSE2 and above.
685 * config/i386/i386.md (extendhfsf2): Optimize expander.
687 * config/i386/sse.md (sse2p4_1): Adjust attr for V8HFmode to
690 2021-11-29 liuhongt <hongtao.liu@intel.com>
694 * config/i386/i386.c (ix86_secondary_reload): Without
695 TARGET_SSE4_1, General register is needed to move HImode from
696 sse register to memory.
697 * config/i386/sse.md (*vec_extrachf): Use %vpextrw instead of
698 pextrw in output templates.
699 * config/i386/i386.md (movhi_internal): Ditto, also fix typo of
700 MEM_P (operands[1]) and adjust mode/prefix/type attribute for
701 alternatives related to sse register.
703 2021-11-29 Richard Biener <rguenther@suse.de>
705 PR tree-optimization/103458
706 * tree-ssa-dce.c (make_forwarders_with_degenerate_phis): Do not
707 create forwarders for blocks with abnormal predecessors.
709 2021-11-29 Richard Biener <rguenther@suse.de>
711 * gimple-predicate-analysis.cc (can_be_invalidated_p):
712 Restore semantics to the one before the split from
715 2021-11-28 Jan Hubicka <hubicka@ucw.cz>
717 * profile.c: Include sreal.h
718 (struct bb_stats): New.
719 (cmp_stats): New function.
720 (compute_branch_probabilities): Output bb stats.
722 2021-11-28 Jan Hubicka <hubicka@ucw.cz>
724 * cfghooks.c: Include sreal.h, profile.h.
725 (profile_record_check_consistency): Fix checking of count counsistency;
726 record also dynamic mismatches.
727 * cfgrtl.c (rtl_account_profile_record): Similarly.
728 * tree-cfg.c (gimple_account_profile_record): Likewise.
729 * cfghooks.h (struct profile_record): Remove num_mismatched_freq_in,
730 num_mismatched_freq_out, turn time to double, add
731 dyn_mismatched_prob_out, dyn_mismatched_count_in,
732 num_mismatched_prob_out; remove num_mismatched_count_out.
733 * passes.c (account_profile_1): New function.
734 (account_profile_in_list): New function.
735 (pass_manager::dump_profile_report): Rewrite.
736 (execute_one_ipa_transform_pass): Check profile consistency after
738 (execute_all_ipa_transforms): Remove cfun test; record all transform
740 (execute_one_pass): Fix collecting of profile stats.
742 2021-11-27 Jakub Jelinek <jakub@redhat.com>
745 * config/i386/i386.md (x86_64_shld, x86_shld, x86_64_shrd, x86_shrd):
746 Change insn pattern to accurately describe the instructions.
747 (*x86_64_shld_1, *x86_shld_1, *x86_64_shrd_1, *x86_shrd_1): New
748 define_insn patterns.
749 (*x86_64_shld_2, *x86_shld_2, *x86_64_shrd_2, *x86_shrd_2): New
750 define_insn_and_split patterns.
751 (*ashl<dwi>3_doubleword_mask, *ashl<dwi>3_doubleword_mask_1,
752 *<insn><dwi>3_doubleword_mask, *<insn><dwi>3_doubleword_mask_1,
753 ix86_rotl<dwi>3_doubleword, ix86_rotr<dwi>3_doubleword): Adjust
754 splitters for x86_{,64_}sh{l,r}d pattern changes.
756 2021-11-27 Jakub Jelinek <jakub@redhat.com>
758 PR tree-optimization/103435
759 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Avoid UB if
760 n->range - rsize == 8, just clear both *cmpnop and *cmpxchg in that
763 2021-11-27 Martin Jambor <mjambor@suse.cz>
766 * ipa-prop.c (ipcp_transform_function): Call
767 delete_unreachable_blocks_update_callgraph instead of returning
770 2021-11-26 Siddhesh Poyarekar <siddhesh@gotplt.org>
772 * tree-object-size.c (object_sizes_grow, object_sizes_release,
773 object_sizes_unknown_p, object_sizes_get, object_size_set_force,
774 object_sizes_set): New functions.
775 (addr_object_size, compute_builtin_object_size,
776 expr_object_size, call_object_size, unknown_object_size,
777 merge_object_sizes, plus_stmt_object_size,
778 cond_expr_object_size, collect_object_sizes_for,
779 check_for_plus_in_loops_1, init_object_sizes,
780 fini_object_sizes): Adjust.
782 2021-11-26 Siddhesh Poyarekar <siddhesh@gotplt.org>
784 * tree-object-size.c: New enum.
785 (object_sizes, computed, addr_object_size,
786 compute_builtin_object_size, expr_object_size, call_object_size,
787 merge_object_sizes, plus_stmt_object_size,
788 collect_object_sizes_for, init_object_sizes, fini_object_sizes,
789 object_sizes_execute): Replace magic numbers with enums.
791 2021-11-26 Roger Sayle <roger@nextmovesoftware.com>
793 * tree-ssa-loop-ivopts.c (cand_value_at): Take a class
794 tree_niter_desc* argument instead of just a tree for NITER.
795 If we require the iv candidate value at the end of the final
796 loop iteration, try using the original loop bound as the
797 NITER for sufficiently simple loops.
798 (may_eliminate_iv): Update (only) call to cand_value_at.
800 2021-11-26 Jan Hubicka <hubicka@ucw.cz>
802 * ipa-modref.c (analyze_function): Drop parameter F and dump
803 cgraph node name rather than cfun name.
804 (modref_generate): Update.
805 (modref_summaries::insert):Update.
806 (modref_summaries_lto::insert):Update.
807 (pass_modref::execute):Update.
808 (ipa_merge_modref_summary_after_inlining): Improve combining of
811 2021-11-26 Jan Hubicka <jh@suse.cz>
814 * ipa-modref.c (update_escape_summary_1): Fix handling of min_flags.
816 2021-11-26 konglin1 <lingling.kong@intel.com>
819 * config/i386/i386.c (ix86_can_change_mode_class): Allow 16 bit data in XMM register
821 * config/i386/i386.md (extendhfsf2): Add extenndhfsf2 for TARGET_F16C.
822 (extendhfdf2): Restrict extendhfdf for TARGET_AVX512FP16 only.
823 (*extendhf<mode>2): Rename from extendhf<mode>2.
824 (truncsfhf2): Likewise.
825 (truncdfhf2): Likewise.
826 (*trunc<mode>2): Likewise.
828 2021-11-26 liuhongt <hongtao.liu@intel.com>
831 * match.pd: Fix typo, use the type of second parameter, not
834 2021-11-25 Jan Hubicka <jh@suse.cz>
836 * ipa-cp.c (ipa_value_range_from_jfunc): Remove forgotten early return.
838 2021-11-25 Roger Sayle <roger@nextmovesoftware.com>
841 * match.pd (minus @0 @0): Check tree_expr_maybe_infinite_p.
843 2021-11-25 Martin Jambor <mjambor@suse.cz>
846 * ipa-prop.h (ipa_get_param): New overload. Move bits of the existing
848 * ipa-param-manipulation.h (ipa_param_adjustments): New member
849 function get_updated_index_or_split.
850 * ipa-param-manipulation.c
851 (ipa_param_adjustments::get_updated_index_or_split): New function.
852 * ipa-prop.c (adjust_agg_replacement_values): Reimplement, add
853 capability to identify scalarized parameters and perform substitution
855 (ipcp_transform_function): Create descriptors earlier, handle new
856 return values of adjust_agg_replacement_values.
858 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
860 * gimple-range-path.cc (path_range_query::compute_ranges_defined): Remove.
861 (path_range_query::compute_ranges_in_block): Revert to bitmap order.
862 * gimple-range-path.h: Remove compute_ranges_defined.
864 2021-11-25 Andrew Stubbs <ams@codesourcery.com>
867 * config/gcn/gcn.c (move_callee_saved_registers): Ensure that the
868 number of spilled registers is counted correctly.
870 2021-11-25 Jan Hubicka <hubicka@ucw.cz>
872 * ipa-prop.h (ipa_node_params::ipa_node_params): Initialize
875 2021-11-25 Andrew MacLeod <amacleod@redhat.com>
877 PR tree-optimization/103359
878 * gimple-range-fold.cc (fold_using_range::range_of_phi): If arg is
879 equivalent to def, don't initially include it's range.
881 2021-11-25 Jan Hubicka <hubicka@ucw.cz>
883 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Do not check
886 2021-11-25 Richard Biener <rguenther@suse.de>
888 * cgraphunit.c (symbol_table::output_weakrefs): Remove
890 (get_alias_symbol): Remove now unused function.
892 2021-11-25 Richard Biener <rguenther@suse.de>
894 * cfgrtl.c (rtl_verify_fallthru): Do not stop verifying
896 (skip_insns_after_block): Remove unreachable break and continue.
898 2021-11-25 Richard Biener <rguenther@suse.de>
900 * cfgexpand.c (label_rtx_for_bb): Remove dead loop construct.
902 2021-11-25 Richard Biener <rguenther@suse.de>
904 * regset.h (REG_SET_EMPTY_P): New macro.
905 * cfgcleanup.c (thread_jump): Use REG_SET_EMPTY_P.
907 2021-11-25 Martin Liska <mliska@suse.cz>
909 * doc/invoke.texi: Use @option for -Wuninitialized.
911 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
913 PR tree-optimization/103254
914 * gimple-range-path.cc (path_range_query::compute_ranges): Move
915 exported boolean code...
916 (path_range_query::compute_imports): ...here.
918 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
920 PR tree-optimization/103254
921 * gimple-range-path.cc (path_range_query::compute_ranges_defined): New
922 (path_range_query::compute_ranges_in_block): Move to
923 compute_ranges_defined.
924 * gimple-range-path.h (compute_ranges_defined): New.
926 2021-11-25 Jakub Jelinek <jakub@redhat.com>
928 PR tree-optimization/103417
929 * match.pd ((X & Y) CMP 0): Only handle eq and ne. Commonalize
932 2021-11-25 Jakub Jelinek <jakub@redhat.com>
934 PR tree-optimization/103376
935 * gimple-ssa-store-merging.c (perform_symbolic_merge): For
936 BIT_IOR_EXPR, if masked1 && masked2 && masked1 != masked2, don't
937 punt, but set the corresponding result byte to MARKER_BYTE_UNKNOWN.
938 For BIT_XOR_EXPR similarly and if masked1 == masked2 and the
939 byte isn't MARKER_BYTE_UNKNOWN, set the corresponding result byte to
942 2021-11-25 Jakub Jelinek <jakub@redhat.com>
945 * doc/invoke.texi (-Wcomma-subscript): Document that for
946 -std=c++20 the option isn't enabled by default with -Wno-deprecated
947 but for -std=c++23 it is.
949 2021-11-24 Jan Hubicka <hubicka@ucw.cz>
951 * ipa-modref.c (implicit_eaf_flags_for_edge_and_arg): Break out from...
952 (modref_merge_call_site_flags): ... here.
953 (ipa_merge_modref_summary_after_inlining): Use it.
955 2021-11-24 Thomas Schwinge <thomas@codesourcery.com>
957 * cfgloop.c (verify_loop_structure): Reduce scope of
958 'class loop *loop' variable.
959 * ipa-fnsummary.c (analyze_function_body): Likewise.
960 * loop-init.c (fix_loop_structure): Likewise.
961 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
962 * predict.c (predict_loops): Likewise.
963 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
964 * tree-vectorizer.c (pass_vectorize::execute): Likewise.
966 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
968 PR tree-optimization/103231
969 * gimple-range.cc (gimple_ranger::gimple_ranger): Create stmt stack.
970 (gimple_ranger::gimple_ranger): Delete stmt stack.
971 (gimple_ranger::range_of_stmt): Process depenedencies if they have no
973 (gimple_ranger::prefill_name): New.
974 (gimple_ranger::prefill_stmt_dependencies): New.
975 * gimple-range.h (class gimple_ranger): Add prototypes.
977 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
979 * gimple-range-cache.cc (ranger_cache::get_global_range): Always
980 return a range, return if it came from the cache or not.
981 (get_non_stale_global_range): Rename to get_global_range, and return
982 the temporal state in a flag.
983 * gimple-range-cache.h (get_non_stale_global_range): Rename and adjust.
984 * gimple-range.cc (gimple_ranger::range_of_expr): No need to query
986 (gimple_ranger::range_of_stmt): Adjust for global cache temporal state
989 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
991 * gimple-range.cc (gimple_ranger::range_on_edge): Call trailer when
992 a constant is encountered to terminate the trace.
994 2021-11-24 Maciej W. Rozycki <macro@embecosm.com>
997 * reload.c (find_reloads_address_1): Also accept the ASHIFT form
998 of indexed addressing.
999 (find_reloads): Adjust accordingly.
1001 2021-11-24 Richard Biener <rguenther@suse.de>
1002 Jan Hubicka <jh@suse.cz>
1004 PR tree-optimization/103168
1005 * ipa-modref.h (struct modref_summary): Add load_accesses.
1006 * ipa-modref.c (modref_summary::finalize): Initialize load_accesses.
1007 * tree-ssa-sccvn.c (visit_reference_op_call): Use modref
1008 info to walk the virtual use->def chain to CSE const/pure
1009 function calls possibly reading from memory.
1011 2021-11-24 Thomas Schwinge <thomas@codesourcery.com>
1013 PR middle-end/103244
1014 * gimplify.c (gimplify_adjust_omp_clauses): Restore previous
1017 2021-11-24 Richard Biener <rguenther@suse.de>
1019 PR middle-end/103193
1020 * match.pd: Avoid canonicalizing (le/ge @0 @0) to (eq @0 @0)
1021 with NaNs and -ftrapping-math.
1023 2021-11-24 Jakub Jelinek <jakub@redhat.com>
1025 PR middle-end/103384
1026 * omp-general.c (omp_context_selector_matches): For ACCEL_COMPILER,
1027 return 0 for kind(host) and continue for kind(nohost).
1029 2021-11-24 Jakub Jelinek <jakub@redhat.com>
1031 PR middle-end/103365
1032 * attribs.h (lookup_attribute): Allow attr_name to start with
1033 underscore, as long as canonicalize_attr_name returns false.
1034 (lookup_attribute_by_prefix): Don't call get_attribute_name twice.
1035 * attribs.c (extract_attribute_substring): Reimplement using
1036 canonicalize_attr_name.
1037 (register_scoped_attribute): Change gcc_assert into
1038 gcc_checking_assert, verify !canonicalize_attr_name rather than
1039 that str.str doesn't start with '_'.
1041 2021-11-24 Jakub Jelinek <jakub@redhat.com>
1043 PR tree-optimization/103376
1044 * gimple-ssa-store-merging.c (perform_symbolic_merge): Add CODE
1045 argument. If CODE is not BIT_IOR_EXPR, ensure that one of masked1
1047 (find_bswap_or_nop_1, find_bswap_or_nop,
1048 imm_store_chain_info::try_coalesce_bswap): Adjust
1049 perform_symbolic_merge callers.
1051 2021-11-24 Richard Biener <rguenther@suse.de>
1053 * tree-ssa-loop-ivopts.c (find_givs): Take loop body as
1054 argument instead of re-computing it.
1055 (find_interesting_uses): Likewise.
1056 (find_induction_variables): Pass through loop body.
1057 (tree_ssa_iv_optimize_loop): Pass down loop body.
1059 2021-11-24 Tamar Christina <tamar.christina@arm.com>
1061 * tree-ssa-phiopt.c (spaceship_replacement): Handle new canonical
1064 2021-11-24 Tamar Christina <tamar.christina@arm.com>
1066 * tree.c (bitmask_inv_cst_vector_p): New.
1067 * tree.h (bitmask_inv_cst_vector_p): New.
1068 * match.pd: Use it in new bitmask compare pattern.
1070 2021-11-24 Jason Merrill <jason@redhat.com>
1072 * timevar.h (class auto_cond_timevar): New.
1074 2021-11-24 Hongtao Liu <hongtao.liu@intel.com>
1075 H.J. Lu <hongjiu.lu@intel.com>
1077 PR tree-optimization/103194
1078 * match.pd (gimple_nop_atomic_bit_test_and_p): Extended to
1080 * tree-ssa-ccp.c (gimple_nop_convert): Declare.
1081 (optimize_atomic_bit_test_and): Enhance
1082 optimize_atomic_bit_test_and to handle truncation.
1084 2021-11-23 Martin Sebor <msebor@redhat.com>
1087 * Makefile.in (OBJS): Add gimple-warn-recursion.o.
1088 * common.opt: Add -Winfinite-recursion.
1089 * doc/invoke.texi (-Winfinite-recursion): Document.
1090 * passes.def (pass_warn_recursion): Schedule a new pass.
1091 * tree-pass.h (make_pass_warn_recursion): Declare.
1092 * gimple-warn-recursion.c: New file.
1094 2021-11-23 Bill Schmidt <wschmidt@linux.ibm.com>
1096 * config/rs6000/rs6000-call.c (rs6000_invalid_new_builtin): Change
1097 error messages for ENB_P8V and ENB_P9V.
1099 2021-11-23 Bill Schmidt <wschmidt@linux.ibm.com>
1101 * config/rs6000/rs6000-builtin-new.def: Add power6-64 stanza. Move
1102 CMPB to power6-64 stanza.
1103 * config/rs6000/rs6000-call.c (rs6000_invalid_new_builtin): Handle
1105 (rs6000_new_builtin_is_supported): Likewise.
1106 (rs6000_expand_new_builtin): Likewise. Clean up formatting.
1107 (rs6000_init_builtins): Handle ENB_P6_64 case.
1108 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): Add BSTZ_P6_64.
1109 (stanza_map): Add entry mapping power6-64 to BSTZ_P6_64.
1110 (enable_string): Add "ENB_P6_64".
1111 (write_decls): Add ENB_P6_64 to bif_enable enum.
1113 2021-11-23 Jan Hubicka <jh@suse.cz>
1115 * ipa-modref-tree.h (struct modref_tree): Remove max_bases, max_refs
1117 (modref_tree::modref_tree): Remove parametr.
1118 (modref_tree::insert_base): Add max_bases parameter.
1119 (modref_tree::insert): Add max_bases, max_refs, max_accesses
1121 (modref_tree::insert): New member function.
1122 (modref_tree::merge): Add max_bases, max_refs, max_accesses
1124 (modref_tree::insert): New member function.
1125 * ipa-modref-tree.c (test_insert_search_collapse): Update.
1126 (test_merge): Update.
1127 * ipa-modref.c (dump_records): Don't dump max_refs and max_bases.
1128 (dump_lto_records): Likewise.
1129 (modref_summary::finalize): Fix whitespace.
1130 (get_modref_function_summary): Likewise.
1131 (modref_access_analysis::record_access): Update.
1132 (modref_access_analysis::record_access_lto): Update.
1133 (modref_access_analysis::process_fnspec): Update.
1134 (analyze_function): Update.
1135 (modref_summaries::duplicate): Update.
1136 (modref_summaries_lto::duplicate): Update.
1137 (write_modref_records): Update.
1138 (read_modref_records): Update.
1139 (read_section): Update.
1140 (propagate_unknown_call): Update.
1141 (modref_propagate_in_scc): Update.
1142 (ipa_merge_modref_summary_after_inlining): Update.
1144 2021-11-23 Martin Liska <mliska@suse.cz>
1146 * doc/invoke.texi: Remove 2 more duplicite param descriptions.
1148 2021-11-23 Richard Biener <rguenther@suse.de>
1150 PR tree-optimization/103361
1151 * gimple-loop-jam.c (adjust_unroll_factor): Use lambda_int
1152 for the dependence distance.
1153 * tree-data-ref.c (print_lambda_vector): Properly print a lambda_int.
1155 2021-11-23 Jakub Jelinek <jakub@redhat.com>
1157 * tree-inline.h (struct copy_body_data): Remove
1158 transform_lang_insert_block member.
1159 * tree-inline.c (remap_block): Don't call
1160 id->transform_lang_insert_block.
1161 (optimize_inline_calls, copy_gimple_seq_and_replace_locals,
1162 tree_function_versioning, maybe_inline_call_in_expr,
1163 copy_fn): Don't initialize id.transform_lang_insert_block.
1164 * gimplify.c (gimplify_omp_loop): Likewise.
1166 2021-11-23 Jan Hubicka <hubicka@ucw.cz>
1168 PR tree-optimization/103335
1169 * tree-ssa-dse.c (valid_ao_ref_for_dse): Rename to ...
1170 (valid_ao_ref_kill_for_dse): ... this; do not check that boundaries
1171 are divisible by BITS_PER_UNIT.
1172 (get_byte_aligned_range_containing_ref): New function.
1173 (get_byte_aligned_range_contained_in_ref): New function.
1174 (normalize_ref): Rename to ...
1175 (get_byte_range): ... this one; handle accesses not aligned to byte
1176 boundary; return range in bytes rater than updating ao_ref.
1177 (clear_live_bytes_for_ref): Take write ref by reference; simplify using
1179 (setup_live_bytes_from_ref): Likewise.
1180 (clear_bytes_written_by): Update.
1181 (live_bytes_read): Update.
1182 (dse_classify_store): Simplify tech before live_bytes_read checks.
1184 2021-11-23 Andrew Pinski <apinski@marvell.com>
1186 PR tree-optimization/102216
1187 * gimple-fold.c (fold_stmt_1): Add canonicalization
1188 of "&MEM[ssa_n, CST]" to "ssa_n p+ CST", note this
1189 can only be done if !in_place.
1191 2021-11-23 Jakub Jelinek <jakub@redhat.com>
1193 PR middle-end/102431
1194 * gimplify.c (replace_reduction_placeholders): Remove.
1195 (note_no_context_vars): New function.
1196 (gimplify_omp_loop): For OMP_PARALLEL's BIND_EXPR create a new
1197 BLOCK. Use copy_tree_body_r with walk_tree instead of unshare_expr
1198 and replace_reduction_placeholders for duplication of
1199 OMP_CLAUSE_REDUCTION_{INIT,MERGE} expressions. Ensure all mentioned
1200 automatic vars have DECL_CONTEXT set to non-NULL before doing so
1201 and reset it afterwards for those vars and their corresponding
1204 2021-11-23 Haochen Gui <guihaoc@gcc.gnu.org>
1207 * config/rs6000/altivec.md (altivec_vreve<mode>2 for VEC_K): Use
1208 xxbrq for v16qi, xxbrq + xxbrh for v8hi and xxbrq + xxbrw for v4si
1209 or v4sf when p9_vector is set.
1210 (altivec_vreve<mode>2 for VEC_64): Defined. Implemented by xxswapd.
1212 2021-11-23 Navid Rahimi <navidrahimi@microsoft.com>
1214 PR tree-optimization/102232
1215 * match.pd (x * (1 + y / x) - y) -> (x - y % x): New optimization.
1217 2021-11-23 Navid Rahimi <navidrahimi@microsoft.com>
1219 PR tree-optimization/96779
1220 * match.pd (-x == x) -> (x == 0): New optimization.
1222 2021-11-22 Roger Sayle <roger@nextmovesoftware.com>
1224 PR tree-optimization/98953
1225 PR tree-optimization/103345
1226 * gimple-ssa-store-merging.c (find_bswap_or_nop_1): Handle
1227 BIT_XOR_EXPR and PLUS_EXPR the same as BIT_IOR_EXPR.
1228 (pass_optimize_bswap::execute): Likewise.
1230 2021-11-22 Martin Liska <mliska@suse.cz>
1232 * doc/invoke.texi: Remove duplicate documentation for 3 params.
1234 2021-11-22 Richard Biener <rguenther@suse.de>
1236 PR tree-optimization/103351
1237 * tree-ssa-dce.c (sort_phi_args): Sort after e->dest_idx as
1240 2021-11-22 liuhongt <hongtao.liu@intel.com>
1243 * config/i386/constraints.md (Bk): New
1244 define_memory_constraint.
1245 * config/i386/i386-protos.h (ix86_gpr_tls_address_pattern_p):
1247 * config/i386/i386.c (ix86_gpr_tls_address_pattern_p): New
1249 * config/i386/i386.md (*movsi_internal): Don't allow
1250 mask/sse/mmx move in TLS code sequences.
1251 (*movdi_internal): Ditto.
1253 2021-11-22 Kewen Lin <linkw@linux.ibm.com>
1255 * config/xtensa/xtensa.md (movdi_internal, movdf_internal): Fix split
1258 2021-11-21 Jakub Jelinek <jakub@redhat.com>
1261 * config/i386/i386-options.c (ix86_valid_target_attribute_p): If
1262 fndecl already has DECL_FUNCTION_SPECIFIC_TARGET, use that as base
1263 instead of target_option_default_node.
1265 2021-11-21 Jan Hubicka <hubicka@ucw.cz>
1268 * ipa-modref.c (parm_map_for_arg): Rename to ...
1269 (parm_map_for_ptr): .. this one; handle static chain and calls to
1271 (modref_access_analysis::get_access): Use parm_map_for_ptr.
1272 (modref_access_analysis::process_fnspec): Update.
1273 (modref_access_analysis::analyze_load): Update.
1274 (modref_access_analysis::analyze_store): Update.
1276 2021-11-21 Jan Hubicka <hubicka@ucw.cz>
1278 * ipa-modref.c (ignore_nondeterminism_p): Move earlier in source
1280 (ignore_retval_p): Likewise.
1281 (ignore_stores_p): Likewise.
1282 (parm_map_for_arg): Likewise.
1283 (class modref_access_analysis): New class.
1284 (modref_access_analysis::set_side_effects): New member function.
1285 (modref_access_analysis::set_nondeterministic): New member function.
1286 (get_access): Turn to ...
1287 (modref_access_analysis::get_access): ... this one.
1288 (record_access): Turn to ...
1289 (modref_access_analysis::record_access): ... this one.
1290 (record_access_lto): Turn to ...
1291 (modref_access_analysis::record_access_lto): ... This one.
1292 (record_access_p): Turn to ...
1293 (modref_access_analysis::record_access_p): ... This one
1294 (modref_access_analysis::record_unknown_load): New member function.
1295 (modref_access_analysis::record_unknown_store): New member function.
1296 (get_access_for_fnspec): Turn to ...
1297 (modref_access_analysis::get_access_for_fnspec): ... this one.
1298 (merge_call_side_effects): Turn to ...
1299 (moderf_access_analysis::merge_call_side_effects): Turn to ...
1300 (collapse_loads): Move later in source code.
1301 (collapse_stores): Move later in source code.
1302 (process_fnspec): Turn to ...
1303 (modref_access_analysis::process_fnspec): ... this one.
1304 (analyze_call): Turn to ...
1305 (modref_access_analysis::analyze_call): ... this one.
1306 (struct summary_ptrs): Remove.
1307 (analyze_load): Turn to ...
1308 (modref_access_analysis::analyze_load): ... this one.
1309 (analyze_store): Turn to ...
1310 (modref_access_analysis::analyze_store): ... this one.
1311 (analyze_stmt): Turn to ...
1312 (modref_access_analysis::analyze_stmt): ... This one.
1313 (remove_summary): Remove.
1314 (modref_access_analysis::propagate): Break out from ...
1315 (modref_access_analysis::analyze): Break out from ...
1316 (analyze_function): ... here.
1318 2021-11-21 Roger Sayle <roger@nextmovesoftware.com>
1319 Robin Dapp <rdapp@linux.ibm.com>
1322 * tree-ssa-math-opts.c (convert_mult_to_widen): Recognize
1323 signed WIDEN_MULT_EXPR if the target supports umul_widen_optab.
1325 2021-11-20 Jan Hubicka <hubicka@ucw.cz>
1328 * ipa-modref.c (ignore_nondeterminism_p): Allow looping pure/cont.
1329 (merge_call_side_effects): Improve debug output.
1331 2021-11-20 Jan Hubicka <hubicka@ucw.cz>
1334 * ipa-pure-const.c (propagate_pure_const): Fix merging of loping flag.
1336 2021-11-20 Jeff Law <jeffreyalaw@gmail.com>
1338 PR tree-optimization/103226
1339 * config/bfin/bfin.md (doloop pattern, splitter and expander): Clobber
1342 2021-11-20 Andrew Pinski <apinski@marvell.com>
1344 PR tree-optimization/103220
1345 * match.pd ((type) X bitop CST): Don't check if CST
1346 fits into the type if only the sign changes.
1348 2021-11-20 Alexandre Oliva <oliva@adacore.com>
1350 PR tree-optimization/102988
1351 * gimple-harden-conditionals.cc (detach_value): Copy SSA_NAME
1352 without decl sharing.
1354 2021-11-19 Iain Sandoe <iain@sandoe.co.uk>
1356 * doc/invoke.texi: Remove whitespace after an @option.
1358 2021-11-19 Paul A. Clarke <pc@us.ibm.com>
1360 * config/rs6000/emmintrin.h (_mm_sad_epu8): Use vec_absd when
1361 _ARCH_PWR9, optimize vec_sum2s when LE.
1363 2021-11-19 Iain Sandoe <iain@sandoe.co.uk>
1366 * config/darwin-driver.c (darwin_driver_init): Handle exported
1367 symbols and symbol lists (suppress automatic export of the TLS
1369 * config/darwin.c (darwin_rename_builtins): Remove workaround.
1370 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): Likewise.
1371 (REAL_LIBGCC_SPEC): Handle revised library uses.
1372 * config/darwin.opt (nodefaultexport): New.
1373 * config/i386/darwin.h (PR80556_WORKAROUND): Remove.
1374 * config/i386/darwin32-biarch.h (PR80556_WORKAROUND): Likewise.
1375 * config/i386/darwin64-biarch.h (PR80556_WORKAROUND): Likewise.
1377 2021-11-19 Martin Jambor <mjambor@suse.cz>
1379 * opts.c (default_options_table): Switch off
1380 flag_semantic_interposition at Ofast.
1381 * doc/invoke.texi (Optimize Options): Document that Ofast switches off
1382 -fsemantic-interposition.
1384 2021-11-19 Jan Hubicka <hubicka@ucw.cz>
1386 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Do not guard modref
1387 by !gimple_call_chain.
1389 2021-11-19 Martin Sebor <msebor@redhat.com>
1393 * doc/invoke.texi (-Waddress): Update.
1395 2021-11-19 Andrew MacLeod <amacleod@redhat.com>
1397 PR tree-optimization/103254
1398 * gimple-range-gori.cc (range_def_chain::get_def_chain): Limit the
1399 depth for all statements with multple ssa names.
1401 2021-11-19 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
1403 * config/s390/s390.md (define_peephole2): Variable insn points
1404 to the first matched insn. Use peep2_next_insn(1) to refer to
1405 the second matched insn.
1407 2021-11-19 Tamar Christina <tamar.christina@arm.com>
1409 PR tree-optimization/103311
1411 * tree-vect-slp-patterns.c (vect_validate_multiplication): Fix CONJ
1412 test to new codegen.
1413 (complex_mul_pattern::matches): Move check downwards.
1415 2021-11-19 Martin Liska <mliska@suse.cz>
1418 2021-11-19 Martin Liska <mliska@suse.cz>
1420 * cfgexpand.c (pass_expand::execute): Use option directly.
1421 * function.c (allocate_struct_function): Likewise.
1422 * gimple-low.c (lower_function_body): Likewise.
1423 (lower_stmt): Likewise.
1424 * gimple-ssa-backprop.c (backprop::prepare_change): Likewise.
1425 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call): Likewise.
1426 * ipa-split.c (split_function): Likewise.
1427 * lto-streamer-in.c (input_function): Likewise.
1428 * sese.c (sese_insert_phis_for_liveouts): Likewise.
1429 * ssa-iterators.h (num_imm_uses): Likewise.
1430 * tree-cfg.c (make_blocks): Likewise.
1431 (gimple_merge_blocks): Likewise.
1432 * tree-inline.c (tree_function_versioning): Likewise.
1433 * tree-loop-distribution.c (generate_loops_for_partition): Likewise.
1434 * tree-sra.c (analyze_access_subtree): Likewise.
1435 * tree-ssa-dce.c (remove_dead_stmt): Likewise.
1436 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
1437 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
1438 * tree-ssa-reassoc.c (reassoc_remove_stmt): Likewise.
1439 * tree-ssa-tail-merge.c (tail_merge_optimize): Likewise.
1440 * tree-ssa-threadedge.c (propagate_threaded_block_debug_into): Likewise.
1441 * tree-ssa.c (gimple_replace_ssa_lhs): Likewise.
1442 (target_for_debug_bind): Likewise.
1443 (insert_debug_temp_for_var_def): Likewise.
1444 (insert_debug_temps_for_defs): Likewise.
1445 (reset_debug_uses): Likewise.
1446 * tree-ssanames.c (release_ssa_name_fn): Likewise.
1447 * tree-vect-loop-manip.c (adjust_vec_debug_stmts): Likewise.
1448 (adjust_debug_stmts): Likewise.
1449 (adjust_phi_and_debug_stmts): Likewise.
1450 (vect_do_peeling): Likewise.
1451 * tree-vect-loop.c (vect_transform_loop_stmt): Likewise.
1452 (vect_transform_loop): Likewise.
1453 * tree.h (MAY_HAVE_DEBUG_MARKER_STMTS): Remove
1454 (MAY_HAVE_DEBUG_BIND_STMTS): Remove.
1455 (MAY_HAVE_DEBUG_STMTS): Use options directly.
1457 2021-11-19 Giuliano Belinassi <gbelinassi@suse.de>
1459 * gcc.c (process_command): Skip dumpdir override if file is a
1461 * doc/invoke.texi: Update -dumpdir documentation.
1463 2021-11-19 Andrew Pinski <apinski@marvell.com>
1465 PR tree-optimization/103314
1466 * match.pd ((type) X op CST): Restrict the equal
1467 TYPE_PRECISION case to GIMPLE only.
1469 2021-11-19 Martin Liska <mliska@suse.cz>
1472 * ipa-modref-tree.h (struct modref_parm_map): Add default
1474 * ipa-modref.c (ipa_merge_modref_summary_after_inlining): Use it.
1476 2021-11-19 Richard Biener <rguenther@suse.de>
1478 PR middle-end/103248
1479 * tree-eh.c (operation_could_trap_helper_p): Properly handle
1480 fixed-point RDIV_EXPR.
1482 2021-11-19 Richard Biener <rguenther@suse.de>
1484 PR tree-optimization/102436
1485 * tree-ssa-loop-im.c (execute_sm_if_changed): Add mode
1486 to just create the if structure and return the then block.
1487 (execute_sm): Add flag to indicate the var will re-use
1489 (hoist_memory_references): Support a single conditional
1490 block with all stores as special case.
1492 2021-11-19 Andrew Pinski <apinski@marvell.com>
1494 PR tree-optimization/103317
1495 * tree-ssa-phiopt.c (minmax_replacement): For the non empty
1496 middle bb case, check to make sure it has a single predecessor.
1498 2021-11-19 Andrew Pinski <apinski@marvell.com>
1500 PR tree-optimization/103257
1502 ((m1 >/</>=/<= m2) * d -> (m1 >/</>=/<= m2) ? d : 0):
1503 Disable until !canonicalize_math_p ().
1505 2021-11-19 Marek Polacek <polacek@redhat.com>
1509 * doc/invoke.texi: Update documentation for -Wuninitialized.
1510 * tree.c (stabilize_reference): Set location.
1512 2021-11-19 liuhongt <hongtao.liu@intel.com>
1515 * config/i386/x86-tune-costs.h (skylake_cost): Reduce cost of
1516 storing 256/512-bit SSE register to be equal to cost of
1517 unaligned store to avoid odd alignment peeling.
1518 (icelake_cost): Ditto.
1520 2021-11-18 David Edelsohn <dje.gcc@gmail.com>
1522 * config/rs6000/predicates.md (current_file_function_operand):
1523 Add flag_semantic_interposition to call of decl_replaceable_p.
1525 2021-11-18 Martin Liska <mliska@suse.cz>
1527 * ipa-modref.c (analyze_function): Do not execute the code
1528 only if dump_file != NULL.
1530 2021-11-18 Martin Liska <mliska@suse.cz>
1532 * ipa-modref.c (analyze_function): Use fnode instead of repeated
1533 cgraph_node::get (current_function_decl).
1535 2021-11-18 Jan Hubicka <hubicka@ucw.cz>
1537 * cgraph.c (cgraph_node::get_availability): Update call of
1539 (cgraph_node::verify_node): Verify that semantic_interposition flag
1541 * cgraph.h: (symtab_node): Add semantic_interposition flag.
1542 * cgraphclones.c (set_new_clone_decl_and_node_flags): Clear
1543 semantic_interposition flag.
1544 * cgraphunit.c (cgraph_node::finalize_function): Set
1545 semantic_interposition flag.
1546 (cgraph_node::add_new_function): Likewise.
1547 (varpool_node::finalize_decl): Likewise.
1548 (cgraph_node::create_wrapper): Likewise.
1549 * common.opt (fsemantic-interposition): Turn to optimization node.
1550 * lto-cgraph.c (lto_output_node): Stream semantic_interposition.
1551 (lto_output_varpool_node): Likewise.
1552 (input_overwrite_node): Likewise.
1553 (input_varpool_node): Likewise.
1554 * symtab.c (symtab_node::dump_base): Dump new flag.
1555 * varasm.c (decl_replaceable_p): Add semantic_interposition_p
1557 * varasm.h (decl_replaceable_p): Update declaration.
1558 * varpool.c (varpool_node::ctor_useable_for_folding_p):
1559 Use semantic_interposition flag.
1560 (varpool_node::get_availability): Likewise.
1561 (varpool_node::create_alias): Copy semantic_interposition flag.
1563 2021-11-18 Jan Hubicka <hubicka@ucw.cz>
1566 * ipa-modref.c (modref_eaf_analysis::merge_call_lhs_flags): Unused
1567 parameter may still be returned.
1568 (modref_eaf_analysis::analyze_ssa_name): Call merge_call_lhs_flags
1569 even for unused function args.
1571 2021-11-18 Tamar Christina <tamar.christina@arm.com>
1573 PR tree-optimization/103311
1574 * tree-vect-slp-patterns.c (complex_mul_pattern::matches,
1575 complex_fms_pattern::matches): Check for multiplications.
1577 2021-11-18 H.J. Lu <hjl.tools@gmail.com>
1580 * config/i386/i386.c (ix86_output_jmp_thunk_or_indirect): Emit
1581 CS prefix for -mindirect-branch-cs-prefix.
1582 (ix86_output_indirect_branch_via_reg): Likewise.
1583 * config/i386/i386.opt: Add -mindirect-branch-cs-prefix.
1584 * doc/invoke.texi: Document -mindirect-branch-cs-prefix.
1586 2021-11-18 David Edelsohn <dje.gcc@gmail.com>
1588 * config/rs6000/driver-rs6000.c (detect_processor_aix): Add
1591 2021-11-18 Tamar Christina <tamar.christina@arm.com>
1593 PR tree-optimization/103253
1594 * tree-ssa-math-opts.c (convert_mult_to_fma): Check for LHS.
1596 2021-11-18 Matthias Kretz <m.kretz@gsi.de>
1598 * doc/extend.texi: Document __builtin_assoc_barrier.
1600 2021-11-18 Martin Liska <mliska@suse.cz>
1602 * cfgexpand.c (pass_expand::execute): Use option directly.
1603 * function.c (allocate_struct_function): Likewise.
1604 * gimple-low.c (lower_function_body): Likewise.
1605 (lower_stmt): Likewise.
1606 * gimple-ssa-backprop.c (backprop::prepare_change): Likewise.
1607 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call): Likewise.
1608 * ipa-split.c (split_function): Likewise.
1609 * lto-streamer-in.c (input_function): Likewise.
1610 * sese.c (sese_insert_phis_for_liveouts): Likewise.
1611 * ssa-iterators.h (num_imm_uses): Likewise.
1612 * tree-cfg.c (make_blocks): Likewise.
1613 (gimple_merge_blocks): Likewise.
1614 * tree-inline.c (tree_function_versioning): Likewise.
1615 * tree-loop-distribution.c (generate_loops_for_partition): Likewise.
1616 * tree-sra.c (analyze_access_subtree): Likewise.
1617 * tree-ssa-dce.c (remove_dead_stmt): Likewise.
1618 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
1619 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
1620 * tree-ssa-reassoc.c (reassoc_remove_stmt): Likewise.
1621 * tree-ssa-tail-merge.c (tail_merge_optimize): Likewise.
1622 * tree-ssa-threadedge.c (propagate_threaded_block_debug_into): Likewise.
1623 * tree-ssa.c (gimple_replace_ssa_lhs): Likewise.
1624 (target_for_debug_bind): Likewise.
1625 (insert_debug_temp_for_var_def): Likewise.
1626 (insert_debug_temps_for_defs): Likewise.
1627 (reset_debug_uses): Likewise.
1628 * tree-ssanames.c (release_ssa_name_fn): Likewise.
1629 * tree-vect-loop-manip.c (adjust_vec_debug_stmts): Likewise.
1630 (adjust_debug_stmts): Likewise.
1631 (adjust_phi_and_debug_stmts): Likewise.
1632 (vect_do_peeling): Likewise.
1633 * tree-vect-loop.c (vect_transform_loop_stmt): Likewise.
1634 (vect_transform_loop): Likewise.
1635 * tree.h (MAY_HAVE_DEBUG_MARKER_STMTS): Remove
1636 (MAY_HAVE_DEBUG_BIND_STMTS): Remove.
1637 (MAY_HAVE_DEBUG_STMTS): Use options directly.
1639 2021-11-18 Richard Biener <rguenther@suse.de>
1641 PR tree-optimization/103277
1642 * tree-ssa-dse.c (need_ab_cleanup): New.
1643 (dse_optimize_redundant_stores): Adjust.
1644 (delete_dead_or_redundant_assignment): Get extra
1645 need_ab_cleanup argument and set when abnormal cleanup is
1647 (dse_optimize_call): Adjust.
1648 (dse_optimize_stmt): Likewise.
1649 (pass_dse::execute): Allocate and deallocate need_ab_cleanup.
1650 Perform abnormal cleanup.
1651 * tree-ssa-dse.h (delete_dead_or_redundant_assignment): Adjust.
1653 2021-11-18 Hongyu Wang <hongyu.wang@intel.com>
1655 * config/i386/i386-expand.c (ix86_expand_atomic_fetch_op_loop):
1656 Adjust generated cfg to avoid infinite loop.
1658 2021-11-18 konglin1 <lingling.kong@intel.com>
1660 * config/i386/avx512fp16intrin.h (_mm512_mul_pch): Add alias for _mm512_fmul_pch.
1661 (_mm512_mask_mul_pch): Likewise.
1662 (_mm512_maskz_mul_pch): Likewise.
1663 (_mm512_mul_round_pch): Likewise.
1664 (_mm512_mask_mul_round_pch): Likewise.
1665 (_mm512_maskz_mul_round_pch): Likewise.
1666 (_mm512_cmul_pch): Likewise.
1667 (_mm512_mask_cmul_pch): Likewise.
1668 (_mm512_maskz_cmul_pch): Likewise.
1669 (_mm512_cmul_round_pch): Likewise.
1670 (_mm512_mask_cmul_round_pch): Likewise.
1671 (_mm512_maskz_cmul_round_pch): Likewise.
1672 (_mm_mul_sch): Likewise.
1673 (_mm_mask_mul_sch): Likewise.
1674 (_mm_maskz_mul_sch): Likewise.
1675 (_mm_mul_round_sch): Likewise.
1676 (_mm_mask_mul_round_sch): Likewise.
1677 (_mm_maskz_mul_round_sch): Likewise.
1678 (_mm_cmul_sch): Likewise.
1679 (_mm_mask_cmul_sch): Likewise.
1680 (_mm_maskz_cmul_sch): Likewise.
1681 (_mm_cmul_round_sch): Likewise.
1682 (_mm_mask_cmul_round_sch): Likewise.
1683 (_mm_maskz_cmul_round_sch): Likewise.
1684 * config/i386/avx512fp16vlintrin.h (_mm_mul_pch): Likewise.
1685 (_mm_mask_mul_pch): Likewise.
1686 (_mm_maskz_mul_pch): Likewise.
1687 (_mm256_mul_pch): Likewise.
1688 (_mm256_mask_mul_pch): Likewise.
1689 (_mm256_maskz_mul_pch): Likewise.
1690 (_mm_cmul_pch): Likewise.
1691 (_mm_mask_cmul_pch): Likewise.
1692 (_mm_maskz_cmul_pch): Likewise.
1693 (_mm256_cmul_pch): Likewise.
1694 (_mm256_mask_cmul_pch): Likewise.
1695 (_mm256_maskz_cmul_pch): Likewise.
1697 2021-11-17 Andrew Pinski <apinski@marvell.com>
1699 PR tree-optimization/103228
1700 PR tree-optimization/55177
1701 * match.pd ((type) X bitop CST): Also do this
1702 transformation for nop conversions.
1704 2021-11-17 Martin Sebor <msebor@redhat.com>
1706 PR tree-optimization/102759
1707 * gimple-array-bounds.cc (build_printable_array_type): Move...
1708 * gimple-ssa-warn-access.cc (build_printable_array_type): Avoid
1709 pathological function redeclarations that remove a previously
1711 Improve formatting of function arguments in informational notes.
1712 * pointer-query.cc (build_printable_array_type): ...to here.
1713 * pointer-query.h (build_printable_array_type): Declared.
1715 2021-11-17 H.J. Lu <hjl.tools@gmail.com>
1718 * config/i386/i386-opts.h (harden_sls): New enum.
1719 * config/i386/i386.c (output_indirect_thunk): Mitigate against
1720 SLS for function return.
1721 (ix86_output_function_return): Likewise.
1722 (ix86_output_jmp_thunk_or_indirect): Mitigate against indirect
1724 (ix86_output_indirect_jmp): Likewise.
1725 (ix86_output_call_insn): Likewise.
1726 * config/i386/i386.opt: Add -mharden-sls=.
1727 * doc/invoke.texi: Document -mharden-sls=.
1729 2021-11-17 H.J. Lu <hjl.tools@gmail.com>
1732 * config/i386/i386.c (ix86_code_end): Remove "%!" before ret.
1733 (ix86_output_function_return): Likewise.
1734 * config/i386/i386.md (simple_return_pop_internal): Likewise.
1736 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
1739 * ipa-modref.c (read_modref_records): Fix streaminig in of every_access
1742 2021-11-17 Uroš Bizjak <ubizjak@gmail.com>
1744 * config/i386/i386.c (indirect_thunks_used): Redefine as HARD_REG_SET.
1745 (ix86_code_end): Use TEST_HARD_REG_BIT on indirect_thunks_used.
1746 (ix86_output_indirect_branch_via_reg): Use SET_HARD_REG_BIT
1747 on indirect_thunks_used.
1748 (ix86_output_indirect_function_return): Ditto.
1750 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
1752 * ipa-modref-tree.c: Include cgraph.h and tree-streamer.h.
1753 (modref_access_node::stream_out): New member function.
1754 (modref_access_node::stream_in): New member function.
1755 * ipa-modref-tree.h (modref_access_node::stream_out,
1756 modref_access_node::stream_in): Declare.
1757 * ipa-modref.c (modref_summary_lto::useful_p): Free useless kills.
1758 (modref_summary_lto::dump): Dump kills.
1759 (analyze_store): Record kills for LTO
1760 (analyze_stmt): Likewise.
1761 (modref_summaries_lto::duplicate): Duplicate kills.
1762 (write_modref_records): Use new stream_out member function.
1763 (read_modref_records): Likewise.
1764 (modref_write): Stream out kills.
1765 (read_section): Stream in kills
1766 (remap_kills): New function.
1767 (update_signature): Use it.
1769 2021-11-17 Uroš Bizjak <ubizjak@gmail.com>
1771 * config/i386/i386.h (LEGACY_SSE_REGNO_P): New predicate.
1772 (SSE_REGNO_P): Use LEGACY_SSE_REGNO_P predicate.
1773 * config/i386/i386.c (zero_all_vector_registers):
1774 Use LEGACY_SSE_REGNO_P predicate.
1775 (ix86_register_priority): Use REX_INT_REGNO_P, REX_SSE_REGNO_P
1776 and EXT_REG_SSE_REGNO_P predicates.
1777 (ix86_hard_regno_call_part_clobbered): Use REX_SSE_REGNO_P
1778 and LEGACY_SSE_REGNO_P predicates.
1780 2021-11-17 Jason Merrill <jason@redhat.com>
1782 * doc/invoke.texi (C++ Dialect Options): Document
1783 -fimplicit-constexpr.
1785 2021-11-17 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
1787 * config/aarch64/aarch64-modes.def (VECTOR_MODE): New V8DI mode.
1788 * config/aarch64/aarch64.c (aarch64_hard_regno_mode_ok): Handle
1790 * config/aarch64/iterators.md (define_mode_attr nunits): Add entry
1793 2021-11-17 Martin Uecker <uecker@gcc.gnu.org>
1797 * gimplify.c (gimplify_var_or_parm_decl): Update comment.
1798 (gimplify_compound_lval): Gimplify base expression first.
1799 (gimplify_target_expr): Add comment.
1801 2021-11-17 Jakub Jelinek <jakub@redhat.com>
1803 PR tree-optimization/103192
1804 * tree-ssa-loop-im.c (move_computations_worker): Use
1805 reset_flow_sensitive_info instead of manually clearing
1806 SSA_NAME_RANGE_INFO and do it for all SSA_NAMEs, not just ones
1807 with integral types.
1809 2021-11-17 Jakub Jelinek <jakub@redhat.com>
1811 PR tree-optimization/103255
1812 * gimple-range-fold.cc (fold_using_range::range_of_address): Return
1813 range_nonzero rather than unadjusted base's range. Formatting fixes.
1815 2021-11-17 Richard Sandiford <richard.sandiford@arm.com>
1817 * doc/md.texi (cond_fmin@var{mode}, cond_fmax@var{mode}): Document.
1818 * optabs.def (cond_fmin_optab, cond_fmax_optab): New optabs.
1819 * internal-fn.def (COND_FMIN, COND_FMAX): New functions.
1820 * internal-fn.c (first_commutative_argument): Handle them.
1821 (FOR_EACH_COND_FN_PAIR): Likewise.
1822 * match.pd (UNCOND_BINARY, COND_BINARY): Likewise.
1823 * config/aarch64/aarch64-sve.md (cond_<fmaxmin><mode>): New
1826 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
1828 * config/i386/i386.md (*add<dwi>3_doubleword, *addv<dwi>4_doubleword,
1829 *addv<dwi>4_doubleword_1, *sub<dwi>3_doubleword,
1830 *subv<dwi>4_doubleword, *subv<dwi>4_doubleword_1,
1831 *add<dwi>3_doubleword_cc_overflow_1, *divmodsi4_const,
1832 *neg<dwi>2_doubleword, *tls_dynamic_gnu2_combine_64_<mode>): Fix split
1835 2021-11-17 Andrew Pinski <apinski@marvell.com>
1837 PR tree-optimization/103288
1838 * tree-ssa-phiopt.c (value_replacement): Return early if middle
1839 block has more than one pred.
1841 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
1843 * config/visium/visium.md (*add<mode>3_insn, *addsi3_insn, *addi3_insn,
1844 *sub<mode>3_insn, *subsi3_insn, *subdi3_insn, *neg<mode>2_insn,
1845 *negdi2_insn, *and<mode>3_insn, *ior<mode>3_insn, *xor<mode>3_insn,
1846 *one_cmpl<mode>2_insn, *ashl<mode>3_insn, *ashr<mode>3_insn,
1847 *lshr<mode>3_insn, *trunchiqi2_insn, *truncsihi2_insn,
1848 *truncdisi2_insn, *extendqihi2_insn, *extendqisi2_insn,
1849 *extendhisi2_insn, *extendsidi2_insn, *zero_extendqihi2_insn,
1850 *zero_extendqisi2_insn, *zero_extendsidi2_insn): Fix split condition.
1852 2021-11-17 Marek Polacek <polacek@redhat.com>
1854 PR preprocessor/103026
1855 * doc/invoke.texi: Document -Wbidi-chars.
1857 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
1860 * ipa-modref.c (ipa_merge_modref_summary_after_inlining): Fix clearing
1863 2021-11-16 Jan Hubicka <jh@suse.cz>
1865 * ipa-modref.c (get_modref_function_summary): Declare.
1866 * ipa-modref.h (get_modref_function_summary): New function.
1867 * tree-ssa-dse.c (clear_live_bytes_for_ref): Break out from ...
1868 (clear_bytes_written_by): ... here; also clear memory killed by
1871 2021-11-16 Iain Sandoe <iain@sandoe.co.uk>
1873 * ggc-common.c (gt_pch_save): If we cannot find a suitable
1874 memory segment for save, then error-out, do not try to
1876 (gt_pch_restore): Save the existing line table, and when
1877 the replacement is being read, use that when constructing
1880 2021-11-16 Peter Bergner <bergner@linux.ibm.com>
1883 * config/rs6000/mma.md (*vsx_assemble_pair): Add early-clobber for
1885 (*mma_assemble_acc): Likewise.
1887 2021-11-16 Martin Sebor <msebor@redhat.com>
1889 PR tree-optimization/102960
1890 * gimple-fold.c (get_range_strlen): Take bitmap as an argument rather
1891 than a pointer to it.
1892 (get_range_strlen_tree): Same. Remove bitmap allocation. Use
1894 (get_maxval_strlen): Use an auto_bitmap.
1895 * tree-ssa-strlen.c (get_range_strlen_dynamic): Factor out PHI
1897 (get_range_strlen_phi): ...into this function.
1898 Avoid assuming maximum string length is constant
1899 (printf_strlen_execute): Dump pointer query cache contents when
1900 details are requisted.
1902 2021-11-16 Jason Merrill <jason@redhat.com>
1904 * langhooks.h (struct lang_hooks): Adjust comment.
1905 * print-tree.c (print_node): Also call print_xnode hook for
1908 2021-11-16 Andrew Pinski <apinski@marvell.com>
1910 PR tree-optimization/103218
1911 * match.pd: New pattern for "((type)(a<0)) << SIGNBITOFA".
1913 2021-11-16 Claudiu Zissulescu <claziss@synopsys.com>
1915 * config/arc/arc.md (maddhisi4): Use a single move to accumulator.
1916 (umaddhisi4): Likewise.
1917 (machi): Update pattern.
1920 2021-11-16 Richard Biener <rguenther@suse.de>
1922 PR tree-optimization/102880
1923 * tree-ssa-dce.c (sort_phi_args): New function.
1924 (make_forwarders_with_degenerate_phis): Likewise.
1925 (perform_tree_ssa_dce): Call
1926 make_forwarders_with_degenerate_phis.
1928 2021-11-16 Richard Biener <rguenther@suse.de>
1930 PR tree-optimization/102880
1931 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Push
1932 single_pred (bb1) condition to places that really need it.
1933 (match_simplify_replacement): Likewise.
1934 (value_replacement): Likewise.
1935 (replace_phi_edge_with_variable): Deal with extra edges
1938 2021-11-16 Martin Jambor <mjambor@suse.cz>
1940 * cfgexpand.c (expand_gimple_basic_block): Use build_debug_expr_decl,
1941 add a fixme note about the mode assignment perhaps being unnecessary.
1942 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
1944 (ipa_param_body_adjustments::mark_dead_statements): Likewise.
1945 (ipa_param_body_adjustments::reset_debug_stmts): Likewise.
1946 * tree-inline.c (remap_ssa_name): Likewise.
1947 (tree_function_versioning): Likewise.
1948 * tree-into-ssa.c (rewrite_debug_stmt_uses): Likewise.
1949 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
1950 * tree-ssa.c (insert_debug_temp_for_var_def): Likewise.
1952 2021-11-16 Jakub Jelinek <jakub@redhat.com>
1954 PR tree-optimization/103208
1955 * omp-expand.c (expand_omp_build_cond): New function.
1956 (expand_omp_for_init_counts, expand_omp_for_init_vars,
1957 expand_omp_for_static_nochunk, expand_omp_for_static_chunk): Use it.
1959 2021-11-16 Jakub Jelinek <jakub@redhat.com>
1961 PR tree-optimization/102009
1962 * gimple-ssa-warn-access.cc (pass_waccess::check_alloc_size_call):
1963 Punt if any of alloc_size arguments is out of bounds vs. number of
1966 2021-11-16 Roger Sayle <roger@nextmovesoftware.com>
1968 * config/i386/i386.md (*bmi2_rorx<mode3>_1): Make conditional
1969 on !optimize_function_for_size_p.
1970 (*<any_rotate><mode>3_1): Add preferred_for_size attribute.
1971 (define_splits): Conditionalize on !optimize_function_for_size_p.
1972 (*bmi2_rorxsi3_1_zext): Likewise.
1973 (*<any_rotate>si2_1_zext): Add preferred_for_size attribute.
1974 (define_splits): Conditionalize on !optimize_function_for_size_p.
1976 2021-11-16 Jan Hubicka <jh@suse.cz>
1979 * ipa-modref.c (merge_call_side_effects): Fix uninitialized
1982 2021-11-16 Andrew Pinski <apinski@marvell.com>
1984 PR tree-optimization/103245
1985 * match.pd: Combine the abs pattern matching using multiplication.
1986 Adding optional nop_convert too.
1988 2021-11-16 H.J. Lu <hjl.tools@gmail.com>
1990 PR middle-end/103268
1991 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Add a missing
1994 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
1996 * gimple-fold.c (gimple_fold_builtin_strncat): Use ranges to
1997 determine if it is safe to transform to strcat.
1998 (gimple_fold_builtin_snprintf): Likewise.
2000 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
2002 * gimple-fold.c (known_lower): New function.
2003 (gimple_fold_builtin_strncat_chk,
2004 gimple_fold_builtin_memory_chk, gimple_fold_builtin_stxcpy_chk,
2005 gimple_fold_builtin_stxncpy_chk,
2006 gimple_fold_builtin_snprintf_chk,
2007 gimple_fold_builtin_sprintf_chk): Use it.
2009 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
2011 * gimple-fold.c (dump_transformation): New function.
2012 (gimple_fold_builtin_stxcpy_chk,
2013 gimple_fold_builtin_stxncpy_chk): Use it. Simplify to
2014 BUILT_IN_STRNCPY if return value is not used.
2016 2021-11-15 H.J. Lu <hjl.tools@gmail.com>
2018 PR middle-end/103184
2019 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Check optab
2020 before transforming equivalent, but slighly different cases to
2021 their canonical forms.
2023 2021-11-15 Iain Sandoe <iain@sandoe.co.uk>
2026 * config/darwin.h (TARGET_DTORS_FROM_CXA_ATEXIT): New.
2027 * doc/tm.texi: Regenerated.
2028 * doc/tm.texi.in: Add TARGET_DTORS_FROM_CXA_ATEXIT hook.
2029 * ipa.c (cgraph_build_static_cdtor_1): Return the built
2031 (build_cxa_atexit_decl): New.
2032 (build_dso_handle_decl): New.
2033 (build_cxa_dtor_registrations): New.
2034 (compare_cdtor_tu_order): New.
2035 (build_cxa_atexit_fns): New.
2036 (ipa_cdtor_merge): If dtors_from_cxa_atexit is set,
2037 process the DTORs/CTORs accordingly.
2038 (pass_ipa_cdtor_merge::gate): Also run if
2039 dtors_from_cxa_atexit is set.
2040 * target.def (dtors_from_cxa_atexit): New hook.
2042 2021-11-15 Iain Sandoe <iain@sandoe.co.uk>
2044 * config.in: Regenerate.
2045 * configure: Regenerate.
2046 * configure.ac: Test ld64 for -platform-version support.
2048 2021-11-15 Thomas Schwinge <thomas@codesourcery.com>
2050 * diagnostic-spec.h (typedef xint_hash_t)
2051 (typedef xint_hash_map_t): Replace with...
2052 (typedef nowarn_map_t): ... this.
2053 (nowarn_map): Adjust.
2054 * diagnostic-spec.c (nowarn_map, suppress_warning_at): Likewise.
2056 2021-11-15 Thomas Schwinge <thomas@codesourcery.com>
2058 * profile.c (branch_prob): Use 'location_hash' for
2061 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
2063 PR tree-optimization/103207
2064 * value-range.cc (irange::set): Drop overflow.
2066 2021-11-15 Jakub Jelinek <jakub@redhat.com>
2068 * gimplify.c (optimize_target_teams): Only add OMP_CLAUSE_THREAD_LIMIT
2069 to OMP_TARGET_CLAUSES if it isn't there already.
2071 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
2073 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
2074 Remove useless code.
2075 (path_range_query::ssa_defined_in_bb): New.
2076 (path_range_query::ssa_range_in_phi): Avoid fold_range call that
2077 could trigger additional lookups.
2078 Do not use the cache for ARGs defined in this block.
2079 (path_range_query::compute_ranges_in_block): Use ssa_defined_in_bb.
2080 (path_range_query::maybe_register_phi_relation): Same.
2081 (path_range_query::range_of_stmt): Adjust comment.
2082 * gimple-range-path.h (ssa_defined_in_bb): New.
2084 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
2086 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
2087 Default to global range if nothing found.
2089 2021-11-15 Richard Biener <rguenther@suse.de>
2091 PR tree-optimization/103237
2092 * tree-vect-loop.c (vect_is_simple_reduction): Fail for
2093 double reductions with multiple inner loop LC PHI nodes.
2095 2021-11-15 Hongyu Wang <hongyu.wang@intel.com>
2098 * config/i386/i386-expand.c (ix86_expand_atomic_fetch_op_loop):
2099 New expand function.
2100 * config/i386/i386-options.c (ix86_target_string): Add
2101 -mrelax-cmpxchg-loop flag.
2102 (ix86_valid_target_attribute_inner_p): Likewise.
2103 * config/i386/i386-protos.h (ix86_expand_atomic_fetch_op_loop):
2104 New expand function prototype.
2105 * config/i386/i386.opt: Add -mrelax-cmpxchg-loop.
2106 * config/i386/sync.md (atomic_fetch_<logic><mode>): New expander
2108 (atomic_<logic>_fetch<mode>): Likewise.
2109 (atomic_fetch_nand<mode>): Likewise.
2110 (atomic_nand_fetch<mode>): Likewise.
2111 (atomic_fetch_<logic><mode>): New expander for DI,TI modes.
2112 (atomic_<logic>_fetch<mode>): Likewise.
2113 (atomic_fetch_nand<mode>): Likewise.
2114 (atomic_nand_fetch<mode>): Likewise.
2115 * doc/invoke.texi: Document -mrelax-cmpxchg-loop.
2117 2021-11-15 Richard Biener <rguenther@suse.de>
2119 PR tree-optimization/103219
2120 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Use single_exit
2121 to determine the exit for the VN region.
2123 2021-11-15 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
2125 * tree-ssa-loop.c (pass_vectorize): Move to tree-vectorizer.c.
2126 (pass_data_vectorize): Likewise.
2127 (make_pass_vectorize): Likewise.
2128 * tree-vectorizer.c (vectorize_loops): Merge with
2129 pass_vectorize::execute and replace cfun occurences with fun param.
2130 (adjust_simduid_builtins): Add fun param, replace cfun occurences with
2131 fun, and adjust callers approrpiately.
2132 (note_simd_array_uses): Likewise.
2133 (vect_loop_dist_alias_call): Likewise.
2134 (set_uid_loop_bbs): Likewise.
2135 (vect_transform_loops): Likewise.
2136 (try_vectorize_loop_1): Likewise.
2137 (try_vectorize_loop): Likewise.
2139 2021-11-15 Jakub Jelinek <jakub@redhat.com>
2142 * config/i386/sync.md (atomic_bit_test_and_set<mode>,
2143 atomic_bit_test_and_complement<mode>,
2144 atomic_bit_test_and_reset<mode>): Use OPTAB_WIDEN instead of
2147 2021-11-15 Maciej W. Rozycki <macro@embecosm.com>
2149 * config/vax/vax.c (vax_lra_p): New prototype and function.
2150 (TARGET_LRA_P): Wire it.
2151 * config/vax/vax.opt (mlra): New option.
2152 * doc/invoke.texi (Option Summary, VAX Options): Document the
2155 2021-11-14 Jan Hubicka <jh@suse.cz>
2157 * ipa-modref.h (struct modref_summary): Add nondeterministic
2158 and calls_interposable flags.
2159 * ipa-modref.c (modref_summary::modref_summary): Initialize new flags.
2160 (modref_summary::useful_p): Check new flags.
2161 (struct modref_summary_lto): Add nondeterministic and
2162 calls_interposable flags.
2163 (modref_summary_lto::modref_summary_lto): Initialize new flags.
2164 (modref_summary_lto::useful_p): Check new flags.
2165 (modref_summary::dump): Dump new flags.
2166 (modref_summary_lto::dump): Dump new flags.
2167 (ignore_nondeterminism_p): New function.
2168 (merge_call_side_effects): Merge new flags.
2169 (process_fnspec): Likewise.
2170 (analyze_load): Volatile access is nondeterministic.
2171 (analyze_store): Liekwise.
2172 (analyze_stmt): Volatile ASM is nondeterministic.
2173 (analyze_function): Clear new flags.
2174 (modref_summaries::duplicate): Duplicate new flags.
2175 (modref_summaries_lto::duplicate): Duplicate new flags.
2176 (modref_write): Stream new flags.
2177 (read_section): Stream new flags.
2178 (propagate_unknown_call): Update new flags.
2179 (modref_propagate_in_scc): Propagate new flags.
2180 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Check
2182 * tree-ssa-structalias.c (determine_global_memory_access):
2185 2021-11-14 Maciej W. Rozycki <macro@embecosm.com>
2187 * config/vax/vax.h (SET_RATIO): New macro.
2188 * config/vax/vax.md (UNSPEC_SETMEM_FILL): New constant.
2189 (setmemhi): New expander.
2190 (setmemhi1): New insn and splitter.
2191 (*setmemhi1): New insn.
2193 2021-11-14 Jan Hubicka <hubicka@ucw.cz>
2195 * ipa-modref-tree.c (modref_access_node::update_for_kills): New
2197 (modref_access_node::merge_for_kills): Likewise.
2198 (modref_access_node::insert_kill): Likewise.
2199 * ipa-modref-tree.h (modref_access_node::update_for_kills,
2200 modref_access_node::merge_for_kills, modref_access_node::insert_kill):
2202 (modref_access_node::useful_for_kill): New member function.
2203 * ipa-modref.c (modref_summary::useful_p): Release useless kills.
2204 (lto_modref_summary): Add kills.
2205 (modref_summary::dump): Dump kills.
2206 (record_access): Add mdoref_access_node parameter.
2207 (record_access_lto): Likewise.
2208 (merge_call_side_effects): Merge kills.
2209 (analyze_call): Add ALWAYS_EXECUTED param and pass it around.
2210 (struct summary_ptrs): Add always_executed filed.
2211 (analyze_load): Update.
2212 (analyze_store): Update; record kills.
2213 (analyze_stmt): Add always_executed; record kills in clobbers.
2214 (analyze_function): Track always_executed.
2215 (modref_summaries::duplicate): Duplicate kills.
2216 (update_signature): Release kills.
2217 * ipa-modref.h (struct modref_summary): Add kills.
2218 * tree-ssa-alias.c (alias_stats): Add kill stats.
2219 (dump_alias_stats): Dump kill stats.
2220 (store_kills_ref_p): Break out from ...
2221 (stmt_kills_ref_p): Use it; handle modref info based kills.
2223 2021-11-14 Aldy Hernandez <aldyh@redhat.com>
2225 PR tree-optimization/103229
2226 * gimple-range-cache.cc (ssa_global_cache::clear): Do not pass
2227 null value to memset.
2229 2021-11-14 Jan Hubicka <hubicka@ucw.cz>
2231 * ipa-modref-tree.c (modref_access_node::get_call_arg): New member
2233 (modref_access_node::get_ao_ref): Likewise.
2234 * ipa-modref-tree.h (modref_access_node::get_call_arg): Declare.
2235 (modref_access_node::get_ao_ref): Declare.
2236 * tree-ssa-alias.c (modref_may_conflict): Use new accessors.
2237 * tree-ssa-dse.c (dse_optimize_call): Use new accessors.
2239 2021-11-13 Jan Hubicka <jh@suse.cz>
2242 * dbgcnt.def (ipa_attr): New counters.
2243 * ipa-pure-const.c: Include dbgcnt.c
2244 (ipa_make_function_const): Use debug counter.
2245 (ipa_make_function_pure): Likewise.
2246 (propagate_pure_const): Fix bug in my previous change.
2248 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
2250 * ipa-modref-tree.c (modref_access_node::range_info_useful_p):
2251 Offline from ipa-modref-tree.h.
2252 (modref_access_node::dump): Move from ipa-modref.c; make member
2254 * ipa-modref-tree.h (modref_access_node::range_info_useful_p.
2255 modref_access_node::dump): Declare.
2256 * ipa-modref.c (dump_access): Remove.
2257 (dump_records): Update.
2258 (dump_lto_records): Update.
2259 (record_access): Update.
2260 (record_access_lto): Update.
2262 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
2264 * ipa-modref.c (modref_summary::modref_summary): Clear new flags.
2265 (modref_summary::dump): Dump try_dse.
2266 (modref_summary::finalize): Add FUN attribute; compute try-dse.
2267 (analyze_function): Update.
2268 (read_section): Update.
2269 (update_signature): Update.
2270 (pass_ipa_modref::execute): Update.
2271 * ipa-modref.h (struct modref_summary):
2272 * tree-ssa-alias.c (ao_ref_init_from_ptr_and_range): Export.
2273 * tree-ssa-alias.h (ao_ref_init_from_ptr_and_range): Declare.
2274 * tree-ssa-dse.c (dse_optimize_call): New function.
2275 (dse_optimize_stmt): Use it.
2277 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
2279 * ipa-modref-tree.c: Move #if CHECKING_P to proper place.
2281 2021-11-13 Jan Hubicka <jh@suse.cz>
2284 (struct modref_access_node): Move longer member functions to
2286 (modref_ref_node::try_merge_with): Turn into modreef_acces_node member
2288 * ipa-modref-tree.c (modref_access_node::contains): Move here
2289 from ipa-modref-tree.h.
2290 (modref_access_node::update): Likewise.
2291 (modref_access_node::merge): Likewise.
2292 (modref_access_node::closer_pair_p): Likewise.
2293 (modref_access_node::forced_merge): Likewise.
2294 (modref_access_node::update2): Likewise.
2295 (modref_access_node::combined_offsets): Likewise.
2296 (modref_access_node::try_merge_with): Likewise.
2297 (modref_access_node::insert): Likewise.
2299 2021-11-13 Jan Hubicka <jh@suse.cz>
2301 * ipa-modref.c (modref_summary::global_memory_read_p): Remove.
2302 (modref_summary::global_memory_written_p): Remove.
2303 (modref_summary::dump): Dump new flags.
2304 (modref_summary::finalize): New member function.
2305 (analyze_function): Call it.
2306 (read_section): Call it.
2307 (update_signature): Call it.
2308 (pass_ipa_modref::execute): Call it.
2309 * ipa-modref.h (struct modref_summary): Remove
2310 global_memory_read_p and global_memory_written_p.
2311 Add global_memory_read, global_memory_written.
2312 * tree-ssa-structalias.c (determine_global_memory_access):
2315 2021-11-13 Jan Hubicka <jh@suse.cz>
2317 * ipa-fnsummary.c (compute_fn_summary): Use type_attribut_allowed_p
2318 * ipa-param-manipulation.c
2319 (ipa_param_adjustments::type_attribute_allowed_p):
2320 New member function.
2321 (drop_type_attribute_if_params_changed_p): New function.
2322 (build_adjusted_function_type): Use it.
2323 * ipa-param-manipulation.h: Add type_attribute_allowed_p.
2325 2021-11-13 David Malcolm <dmalcolm@redhat.com>
2327 * doc/invoke.texi (Static Analyzer Options): Add
2328 -Wno-analyzer-tainted-allocation-size,
2329 -Wno-analyzer-tainted-divisor, -Wno-analyzer-tainted-offset, and
2330 -Wno-analyzer-tainted-size to list. Add
2331 -Wanalyzer-tainted-allocation-size, -Wanalyzer-tainted-divisor,
2332 -Wanalyzer-tainted-offset, and -Wanalyzer-tainted-size to list
2333 of options effectively enabled by -fanalyzer.
2334 (-Wanalyzer-tainted-allocation-size): New.
2335 (-Wanalyzer-tainted-array-index): Tweak wording; add link to CWE.
2336 (-Wanalyzer-tainted-divisor): New.
2337 (-Wanalyzer-tainted-offset): New.
2338 (-Wanalyzer-tainted-size): New.
2340 2021-11-13 Jan Hubicka <jh@suse.cz>
2342 * attr-fnspec.h (attr_fnspec::arg_eaf_flags): Break out from ...
2343 * gimple.c (gimple_call_arg_flags): ... here.
2344 * ipa-modref.c (analyze_parms): Record flags known from fnspec.
2345 (modref_merge_call_site_flags): Use arg_eaf_flags.
2347 2021-11-13 Aldy Hernandez <aldyh@redhat.com>
2349 PR tree-optimization/103222
2350 * gimple-range-path.cc (path_range_query::compute_ranges_in_phis):
2352 (path_range_query::compute_ranges_in_block): Call
2353 compute_ranges_in_phis.
2354 * gimple-range-path.h (path_range_query::compute_ranges_in_phis):
2357 2021-11-13 Jan Hubicka <jh@suse.cz>
2359 * ipa-fnsummary.c (compute_fn_summary): Do not give up on signature
2360 changes on "fn spec" attribute; give up on varadic types.
2361 * ipa-param-manipulation.c: Include attribs.h.
2362 (build_adjusted_function_type): New parameter ARG_MODIFIED; if it is
2363 true remove "fn spec" attribute.
2364 (ipa_param_adjustments::build_new_function_type): Update.
2365 (ipa_param_body_adjustments::modify_formal_parameters): update.
2366 * ipa-sra.c: Include attribs.h.
2367 (ipa_sra_preliminary_function_checks): Do not check for TYPE_ATTRIBUTES.
2369 2021-11-13 Aldy Hernandez <aldyh@redhat.com>
2371 * gimple-range-path.cc (path_range_query::path_range_query): Merge
2373 (path_range_query::import_p): Move from header file.
2374 (path_range_query::~path_range_query): Adjust for combined ctors.
2375 * gimple-range-path.h: Merge ctors.
2376 (path_range_query::import_p): Move to .cc file.
2378 2021-11-13 Jan Hubicka <jh@suse.cz>
2380 PR tree-optimization/103182
2381 * ipa-modref.c (callee_to_caller_flags): Fix merging of flags.
2382 (modref_eaf_analysis::analyze_ssa_name): Fix merging of flags.
2384 2021-11-12 Stafford Horne <shorne@gmail.com>
2386 * config/or1k/or1k-protos.h (or1k_profile_hook): New function.
2387 * config/or1k/or1k.h (PROFILE_HOOK): Change macro to reference
2388 new function or1k_profile_hook.
2389 * config/or1k/or1k.c (struct machine_function): Add new field
2390 set_mcount_arg_insn.
2391 (or1k_profile_hook): New function.
2392 (or1k_init_pic_reg): Update to inject pic rtx after _mcount arg
2394 (or1k_frame_pointer_required): Frame pointer no longer needed
2397 2021-11-12 Jan Hubicka <jh@suse.cz>
2399 PR tree-optimization/103209
2400 * tree-ssa-structalias.c (find_func_aliases_for_call): Fix
2401 use of handle_rhs_call
2403 2021-11-12 Aldy Hernandez <aldyh@redhat.com>
2405 PR tree-optimization/103202
2406 * gimple-range-path.cc
2407 (path_range_query::compute_ranges_in_block): Solve PHI imports first.
2409 2021-11-12 Jan Hubicka <jh@suse.cz>
2411 * ipa-pure-const.c (propagate_pure_const): Remove redundant check;
2412 fix call of ipa_make_function_const and ipa_make_function_pure.
2414 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2416 * config/aarch64/aarch64.c (aarch64_vector_op::n_advsimd_ops): Delete.
2417 (aarch64_vector_op::m_seen_loads): Likewise.
2418 (aarch64_vector_costs::aarch64_vector_costs): Don't push to
2420 (aarch64_vector_op::count_ops): Remove vectype and factor parameters.
2421 Remove code that tries to predict different vec_flags from the
2423 (aarch64_vector_costs::add_stmt_cost): Update accordingly.
2424 Remove m_advsimd_ops handling.
2426 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2428 * config/aarch64/aarch64.c (aarch64_vector_costs::m_saw_sve_only_op)
2429 (aarch64_sve_only_stmt_p): Delete.
2430 (aarch64_vector_costs::prefer_unrolled_loop): New function,
2431 extracted from adjust_body_cost.
2432 (aarch64_vector_costs::better_main_loop_than_p): New function,
2433 using heuristics extracted from adjust_body_cost and
2434 adjust_body_cost_sve.
2435 (aarch64_vector_costs::adjust_body_cost_sve): Remove
2436 advsimd_cycles_per_iter and could_use_advsimd parameters.
2437 Update after changes above.
2438 (aarch64_vector_costs::adjust_body_cost): Update after changes above.
2440 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2442 * config/aarch64/aarch64.c (aarch64_vec_op_count::m_vf_factor):
2443 New member variable.
2444 (aarch64_vec_op_count::aarch64_vec_op_count): Add a parameter for it.
2445 (aarch64_vec_op_count::vf_factor): New function.
2446 (aarch64_vector_costs::aarch64_vector_costs): When costing for
2447 neoverse-512tvb, pass a vf_factor of 2 for the Neoverse V1 version
2449 (aarch64_vector_costs::adjust_body_cost): Read the vf factor
2450 instead of hard-coding 2.
2452 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2454 * config/aarch64/aarch64.c
2455 (aarch64_vec_op_count::rename_cycles_per_iter): New function.
2456 (aarch64_vec_op_count::min_nonpred_cycles_per_iter): Likewise.
2457 (aarch64_vec_op_count::min_pred_cycles_per_iter): Likewise.
2458 (aarch64_vec_op_count::min_cycles_per_iter): Likewise.
2459 (aarch64_vec_op_count::dump): Move earlier in file. Dump the
2460 above properties too.
2461 (aarch64_estimate_min_cycles_per_iter): Delete.
2462 (adjust_body_cost): Use aarch64_vec_op_count::min_cycles_per_iter
2463 instead of aarch64_estimate_min_cycles_per_iter. Rely on the dump
2464 routine to print CPI estimates.
2465 (adjust_body_cost_sve): Likewise. Use the other functions above
2466 instead of doing the work inline.
2468 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2470 * config/aarch64/aarch64.c (aarch64_vec_op_count): Allow default
2472 (aarch64_vec_op_count::base_issue_info): Remove handling of null
2474 (aarch64_vec_op_count::simd_issue_info): Likewise.
2475 (aarch64_vec_op_count::sve_issue_info): Likewise.
2476 (aarch64_vector_costs::m_ops): Turn into a vector.
2477 (aarch64_vector_costs::m_advsimd_ops): Likewise.
2478 (aarch64_vector_costs::aarch64_vector_costs): Add entries to
2479 the vectors based on aarch64_tune_params.
2480 (aarch64_vector_costs::analyze_loop_vinfo): Update the pred_ops
2481 of all entries in m_ops.
2482 (aarch64_vector_costs::add_stmt_cost): Call count_ops for all
2484 (aarch64_estimate_min_cycles_per_iter): Remove issue_info
2485 parameter and get the information from the ops instead.
2486 (aarch64_vector_costs::adjust_body_cost_sve): Take a
2487 aarch64_vec_issue_info instead of a aarch64_vec_op_count.
2488 (aarch64_vector_costs::adjust_body_cost): Update call accordingly.
2489 Exit earlier if m_ops is empty for either cost structure.
2491 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2493 * config/aarch64/aarch64.c (aarch64_vector_costs::m_scalar_ops)
2494 (aarch64_vector_costs::m_sve_ops): Replace with...
2495 (aarch64_vector_costs::m_ops): ...this.
2496 (aarch64_vector_costs::analyze_loop_vinfo): Update accordingly.
2497 (aarch64_vector_costs::adjust_body_cost_sve): Likewise.
2498 (aarch64_vector_costs::aarch64_vector_costs): Likewise.
2499 Initialize m_vec_flags here rather than in add_stmt_cost.
2500 (aarch64_vector_costs::count_ops): Test for scalar reductions too.
2501 Allow vectype to be null.
2502 (aarch64_vector_costs::add_stmt_cost): Call count_ops for scalar
2503 code too. Don't require vectype to be nonnull.
2504 (aarch64_vector_costs::adjust_body_cost): Take the loop_vec_info
2505 and scalar costs as parameters. Use the scalar costs to determine
2506 the cycles per iteration of the scalar loop, then multiply it
2507 by the estimated VF.
2508 (aarch64_vector_costs::finish_cost): Update call accordingly.
2510 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2512 * config/aarch64/aarch64.c (aarch64_dr_type): New function.
2513 (aarch64_vector_costs::count_ops): Use it rather than the
2514 vectype to determine floatness.
2516 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2518 * config/aarch64/aarch64.c (aarch64_sve_in_loop_reduction_latency):
2519 Remove vectype parameter and get floatness from the type of the
2521 (arch64_in_loop_reduction_latency): Likewise.
2522 (aarch64_detect_vector_stmt_subtype): Update caller.
2523 (aarch64_vector_costs::count_ops): Likewise.
2525 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2527 * config/aarch64/aarch64.c (aarch64_sve_op_count): Fold into...
2528 (aarch64_vec_op_count): ...this. Add a constructor.
2529 (aarch64_vec_op_count::vec_flags): New function.
2530 (aarch64_vec_op_count::base_issue_info): Likewise.
2531 (aarch64_vec_op_count::simd_issue_info): Likewise.
2532 (aarch64_vec_op_count::sve_issue_info): Likewise.
2533 (aarch64_vec_op_count::m_issue_info): New member variable.
2534 (aarch64_vec_op_count::m_vec_flags): Likewise.
2535 (aarch64_vector_costs): Add a constructor.
2536 (aarch64_vector_costs::m_sve_ops): Change type to aarch64_vec_op_count.
2537 (aarch64_vector_costs::aarch64_vector_costs): New function.
2538 Initialize m_scalar_ops, m_advsimd_ops and m_sve_ops.
2539 (aarch64_vector_costs::count_ops): Remove vec_flags and
2540 issue_info parameters, using the new aarch64_vec_op_count
2542 (aarch64_vector_costs::add_stmt_cost): Update call accordingly.
2543 (aarch64_sve_op_count::dump): Fold into...
2544 (aarch64_vec_op_count::dump): ..here.
2546 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2548 * config/aarch64/aarch64.c: Include tree-dfa.h.
2549 (aarch64_check_consecutive_mems): New function that takes MEM_EXPR
2550 and MEM_OFFSET into account.
2551 (aarch64_swap_ldrstr_operands): Use it.
2552 (aarch64_operands_ok_for_ldpstp): Likewise. Check that the
2553 address of the second memory doesn't depend on the result of
2556 2021-11-12 Jan Hubicka <hubicka@ucw.cz>
2558 * ipa-pure-const.c (ipa_make_function_pure): Fix exit condition.
2560 2021-11-12 Jan Hubicka <jh@suse.cz>
2562 PR tree-optimization/103175
2563 * ipa-modref.c (modref_lattice::merge): Add sanity check.
2564 (callee_to_caller_flags): Make flags adjustment sane.
2565 (modref_eaf_analysis::analyze_ssa_name): Likewise.
2567 2021-11-12 Jan Hubicka <jh@suse.cz>
2570 * ipa-modref.c (analyze_function, modref_propagate_in_scc): Do
2571 not mark pure/const function if there are side-effects.
2573 2021-11-12 Chung-Lin Tang <cltang@codesourcery.com>
2575 * tree.h (OMP_CLAUSE_MAP_RUNTIME_IMPLICIT_P): New access macro for
2576 'implicit' bit, using 'base.deprecated_flag' field of tree_node.
2577 * tree-pretty-print.c (dump_omp_clause): Add support for printing
2578 implicit attribute in tree dumping.
2579 * gimplify.c (gimplify_adjust_omp_clauses_1):
2580 Set OMP_CLAUSE_MAP_RUNTIME_IMPLICIT_P to 1 if map clause is implicitly
2582 (gimplify_adjust_omp_clauses): Adjust place of adding implicitly created
2583 clauses, from simple append, to starting of list, after non-map clauses.
2584 * omp-low.c (lower_omp_target): Add GOMP_MAP_IMPLICIT bits into kind
2585 values passed to libgomp for implicit maps.
2587 2021-11-12 Jakub Jelinek <jakub@redhat.com>
2589 * omp-builtins.def (BUILT_IN_GOMP_TEAMS): Remove.
2590 (BUILT_IN_GOMP_TEAMS4): New.
2591 * builtin-types.def (BT_FN_VOID_UINT_UINT): Remove.
2592 (BT_FN_BOOL_UINT_UINT_UINT_BOOL): New.
2593 * omp-low.c (lower_omp_teams): Use GOMP_teams4 instead of
2594 GOMP_teams, pass to it also num_teams lower-bound expression
2595 or a dup of upper-bound if it is missing and a flag whether
2596 it is the first call or not.
2598 2021-11-12 Martin Liska <mliska@suse.cz>
2600 PR tree-optimization/102497
2601 * gimple-predicate-analysis.cc (add_pred): Remove unused
2604 2021-11-12 Richard Biener <rguenther@suse.de>
2606 PR tree-optimization/103204
2607 * tree-ssa-sccvn.c (valueize_refs_1): Re-valueize the
2608 top operand after folding in an address.
2610 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
2612 * gimple-range-path.cc (path_range_query::path_range_query): New
2613 ctor without a ranger.
2614 (path_range_query::~path_range_query): Free ranger if necessary.
2615 (path_range_query::range_on_path_entry): Adjust m_ranger for pointer.
2616 (path_range_query::ssa_range_in_phi): Same.
2617 (path_range_query::compute_ranges_in_block): Same.
2618 (path_range_query::compute_imports): Same.
2619 (path_range_query::compute_ranges): Same.
2620 (path_range_query::range_of_stmt): Same.
2621 (path_range_query::compute_outgoing_relations): Same.
2622 * gimple-range-path.h (class path_range_query): New ctor.
2623 * tree-ssa-loop-ch.c (ch_base::copy_headers): Remove gimple_ranger
2624 as path_range_query allocates one.
2625 * tree-ssa-threadbackward.c (class back_threader): Remove m_ranger.
2626 (back_threader::~back_threader): Same.
2628 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
2630 * tree-ssa-threadbackward.c
2631 (back_threader_profitability::profitable_path_p): Remove loop
2632 crossing restriction.
2634 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
2636 * ipa-modref.c (analyze_function): Do pure/const discovery, return
2638 (pass_modref::execute): If pure/const is discovered fixup cfg.
2639 (ignore_edge): Do not ignore pure/const edges.
2640 (modref_propagate_in_scc): Do pure/const discovery, return true if
2641 cdtor was promoted pure/const.
2642 (pass_ipa_modref::execute): If needed remove unreachable functions.
2643 * ipa-pure-const.c (warn_function_noreturn): Fix whitespace.
2644 (warn_function_cold): Likewise.
2645 (skip_function_for_local_pure_const): Move earlier.
2646 (ipa_make_function_const): Break out from ...
2647 (ipa_make_function_pure): Break out from ...
2648 (propagate_pure_const): ... here.
2649 (pass_local_pure_const::execute): Use it.
2650 * ipa-utils.h (ipa_make_function_const): Declare.
2651 (ipa_make_function_pure): Declare.
2652 * passes.def: Move early modref after pure-const.
2654 2021-11-11 David Malcolm <dmalcolm@redhat.com>
2657 * diagnostic-show-locus.c (def_policy): Use def_tabstop.
2659 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2661 * config/aarch64/aarch64-builtins.c (TYPES_COMBINE): Delete.
2662 (TYPES_COMBINEP): Delete.
2663 * config/aarch64/aarch64-simd-builtins.def: Declare type-
2664 qualified builtins for vcombine_* intrinsics.
2665 * config/aarch64/arm_neon.h (vcombine_s8): Remove unnecessary
2667 (vcombine_s16): Likewise.
2668 (vcombine_s32): Likewise.
2669 (vcombine_f32): Likewise.
2670 (vcombine_u8): Use type-qualified builtin and remove casts.
2671 (vcombine_u16): Likewise.
2672 (vcombine_u32): Likewise.
2673 (vcombine_u64): Likewise.
2674 (vcombine_p8): Likewise.
2675 (vcombine_p16): Likewise.
2676 (vcombine_p64): Likewise.
2677 (vcombine_bf16): Remove unnecessary cast.
2678 * config/aarch64/iterators.md (VD_I): New mode iterator.
2679 (VDC_P): New mode iterator.
2681 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2683 * config/aarch64/aarch64-builtins.c (TYPES_LOAD1_U): Define.
2684 (TYPES_LOAD1_P): Define.
2685 (TYPES_STORE1_U): Define.
2686 (TYPES_STORE1P): Rename to...
2687 (TYPES_STORE1_P): This.
2688 (get_mem_type_for_load_store): Add unsigned and poly types.
2689 (aarch64_general_gimple_fold_builtin): Add unsigned and poly
2690 type-qualified builtin declarations.
2691 * config/aarch64/aarch64-simd-builtins.def: Declare type-
2692 qualified builtins for LD1/ST1.
2693 * config/aarch64/arm_neon.h (vld1_p8): Use type-qualified
2694 builtin and remove cast.
2695 (vld1_p16): Likewise.
2696 (vld1_u8): Likewise.
2697 (vld1_u16): Likewise.
2698 (vld1_u32): Likewise.
2699 (vld1q_p8): Likewise.
2700 (vld1q_p16): Likewise.
2701 (vld1q_p64): Likewise.
2702 (vld1q_u8): Likewise.
2703 (vld1q_u16): Likewise.
2704 (vld1q_u32): Likewise.
2705 (vld1q_u64): Likewise.
2706 (vst1_p8): Likewise.
2707 (vst1_p16): Likewise.
2708 (vst1_u8): Likewise.
2709 (vst1_u16): Likewise.
2710 (vst1_u32): Likewise.
2711 (vst1q_p8): Likewise.
2712 (vst1q_p16): Likewise.
2713 (vst1q_p64): Likewise.
2714 (vst1q_u8): Likewise.
2715 (vst1q_u16): Likewise.
2716 (vst1q_u32): Likewise.
2717 (vst1q_u64): Likewise.
2718 * config/aarch64/iterators.md (VALLP_NO_DI): New iterator.
2720 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2722 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
2723 builtins for vector reduction.
2724 * config/aarch64/arm_neon.h (vaddv_u8): Use type-qualified
2725 builtin and remove casts.
2726 (vaddv_u16): Likewise.
2727 (vaddv_u32): Likewise.
2728 (vaddvq_u8): Likewise.
2729 (vaddvq_u16): Likewise.
2730 (vaddvq_u32): Likewise.
2731 (vaddvq_u64): Likewise.
2733 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2735 * config/aarch64/aarch64-simd-builtins.def:
2736 * config/aarch64/arm_neon.h (vpaddq_u8): Use type-qualified
2737 builtin and remove casts.
2738 (vpaddq_u16): Likewise.
2739 (vpaddq_u32): Likewise.
2740 (vpaddq_u64): Likewise.
2741 (vpadd_u8): Likewise.
2742 (vpadd_u16): Likewise.
2743 (vpadd_u32): Likewise.
2744 (vpaddd_u64): Likewise.
2746 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2748 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
2749 builtins for [r]subhn[2].
2750 * config/aarch64/arm_neon.h (vsubhn_s16): Remove unnecessary
2752 (vsubhn_s32): Likewise.
2753 (vsubhn_s64): Likewise.
2754 (vsubhn_u16): Use type-qualified builtin and remove casts.
2755 (vsubhn_u32): Likewise.
2756 (vsubhn_u64): Likewise.
2757 (vrsubhn_s16): Remove unnecessary cast.
2758 (vrsubhn_s32): Likewise.
2759 (vrsubhn_s64): Likewise.
2760 (vrsubhn_u16): Use type-qualified builtin and remove casts.
2761 (vrsubhn_u32): Likewise.
2762 (vrsubhn_u64): Likewise.
2763 (vrsubhn_high_s16): Remove unnecessary cast.
2764 (vrsubhn_high_s32): Likewise.
2765 (vrsubhn_high_s64): Likewise.
2766 (vrsubhn_high_u16): Use type-qualified builtin and remove
2768 (vrsubhn_high_u32): Likewise.
2769 (vrsubhn_high_u64): Likewise.
2770 (vsubhn_high_s16): Remove unnecessary cast.
2771 (vsubhn_high_s32): Likewise.
2772 (vsubhn_high_s64): Likewise.
2773 (vsubhn_high_u16): Use type-qualified builtin and remove
2775 (vsubhn_high_u32): Likewise.
2776 (vsubhn_high_u64): Likewise.
2778 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2780 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
2781 builtins for [r]addhn[2].
2782 * config/aarch64/arm_neon.h (vaddhn_s16): Remove unnecessary
2784 (vaddhn_s32): Likewise.
2785 (vaddhn_s64): Likewise.
2786 (vaddhn_u16): Use type-qualified builtin and remove casts.
2787 (vaddhn_u32): Likewise.
2788 (vaddhn_u64): Likewise.
2789 (vraddhn_s16): Remove unnecessary cast.
2790 (vraddhn_s32): Likewise.
2791 (vraddhn_s64): Likewise.
2792 (vraddhn_u16): Use type-qualified builtin and remove casts.
2793 (vraddhn_u32): Likewise.
2794 (vraddhn_u64): Likewise.
2795 (vaddhn_high_s16): Remove unnecessary cast.
2796 (vaddhn_high_s32): Likewise.
2797 (vaddhn_high_s64): Likewise.
2798 (vaddhn_high_u16): Use type-qualified builtin and remove
2800 (vaddhn_high_u32): Likewise.
2801 (vaddhn_high_u64): Likewise.
2802 (vraddhn_high_s16): Remove unnecessary cast.
2803 (vraddhn_high_s32): Likewise.
2804 (vraddhn_high_s64): Likewise.
2805 (vraddhn_high_u16): Use type-qualified builtin and remove
2807 (vraddhn_high_u32): Likewise.
2808 (vraddhn_high_u64): Likewise.
2810 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2812 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
2813 qualifiers in generator macros for uhsub builtins.
2814 * config/aarch64/arm_neon.h (vhsub_s8): Remove unnecessary
2816 (vhsub_s16): Likewise.
2817 (vhsub_s32): Likewise.
2818 (vhsub_u8): Use type-qualified builtin and remove casts.
2819 (vhsub_u16): Likewise.
2820 (vhsub_u32): Likewise.
2821 (vhsubq_s8): Remove unnecessary cast.
2822 (vhsubq_s16): Likewise.
2823 (vhsubq_s32): Likewise.
2824 (vhsubq_u8): Use type-qualified builtin and remove casts.
2825 (vhsubq_u16): Likewise.
2826 (vhsubq_u32): Likewise.
2828 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2830 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
2831 qualifiers in generator macros for u[r]hadd builtins.
2832 * config/aarch64/arm_neon.h (vhadd_s8): Remove unnecessary
2834 (vhadd_s16): Likewise.
2835 (vhadd_s32): Likewise.
2836 (vhadd_u8): Use type-qualified builtin and remove casts.
2837 (vhadd_u16): Likewise.
2838 (vhadd_u32): Likewise.
2839 (vhaddq_s8): Remove unnecessary cast.
2840 (vhaddq_s16): Likewise.
2841 (vhaddq_s32): Likewise.
2842 (vhaddq_u8): Use type-qualified builtin and remove casts.
2843 (vhaddq_u16): Likewise.
2844 (vhaddq_u32): Likewise.
2845 (vrhadd_s8): Remove unnecessary cast.
2846 (vrhadd_s16): Likewise.
2847 (vrhadd_s32): Likewise.
2848 (vrhadd_u8): Use type-qualified builtin and remove casts.
2849 (vrhadd_u16): Likewise.
2850 (vrhadd_u32): Likewise.
2851 (vrhaddq_s8): Remove unnecessary cast.
2852 (vrhaddq_s16): Likewise.
2853 (vrhaddq_s32): Likewise.
2854 (vrhaddq_u8): Use type-wualified builtin and remove casts.
2855 (vrhaddq_u16): Likewise.
2856 (vrhaddq_u32): Likewise.
2858 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2860 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
2861 qualifiers in generator macros for usub[lw][2] builtins.
2862 * config/aarch64/arm_neon.h (vsubl_s8): Remove unnecessary
2864 (vsubl_s16): Likewise.
2865 (vsubl_s32): Likewise.
2866 (vsubl_u8): Use type-qualified builtin and remove casts.
2867 (vsubl_u16): Likewise.
2868 (vsubl_u32): Likewise.
2869 (vsubl_high_s8): Remove unnecessary cast.
2870 (vsubl_high_s16): Likewise.
2871 (vsubl_high_s32): Likewise.
2872 (vsubl_high_u8): Use type-qualified builtin and remove casts.
2873 (vsubl_high_u16): Likewise.
2874 (vsubl_high_u32): Likewise.
2875 (vsubw_s8): Remove unnecessary casts.
2876 (vsubw_s16): Likewise.
2877 (vsubw_s32): Likewise.
2878 (vsubw_u8): Use type-qualified builtin and remove casts.
2879 (vsubw_u16): Likewise.
2880 (vsubw_u32): Likewise.
2881 (vsubw_high_s8): Remove unnecessary cast.
2882 (vsubw_high_s16): Likewise.
2883 (vsubw_high_s32): Likewise.
2884 (vsubw_high_u8): Use type-qualified builtin and remove casts.
2885 (vsubw_high_u16): Likewise.
2886 (vsubw_high_u32): Likewise.
2888 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2890 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
2891 qualifiers in generator macros for uadd[lw][2] builtins.
2892 * config/aarch64/arm_neon.h (vaddl_s8): Remove unnecessary
2894 (vaddl_s16): Likewise.
2895 (vaddl_s32): Likewise.
2896 (vaddl_u8): Use type-qualified builtin and remove casts.
2897 (vaddl_u16): Likewise.
2898 (vaddl_u32): Likewise.
2899 (vaddl_high_s8): Remove unnecessary cast.
2900 (vaddl_high_s16): Likewise.
2901 (vaddl_high_s32): Likewise.
2902 (vaddl_high_u8): Use type-qualified builtin and remove casts.
2903 (vaddl_high_u16): Likewise.
2904 (vaddl_high_u32): Likewise.
2905 (vaddw_s8): Remove unnecessary cast.
2906 (vaddw_s16): Likewise.
2907 (vaddw_s32): Likewise.
2908 (vaddw_u8): Use type-qualified builtin and remove casts.
2909 (vaddw_u16): Likewise.
2910 (vaddw_u32): Likewise.
2911 (vaddw_high_s8): Remove unnecessary cast.
2912 (vaddw_high_s16): Likewise.
2913 (vaddw_high_s32): Likewise.
2914 (vaddw_high_u8): Use type-qualified builtin and remove casts.
2915 (vaddw_high_u16): Likewise.
2916 (vaddw_high_u32): Likewise.
2918 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2920 * config/aarch64/aarch64-simd-builtins.def: Declare type-
2921 qualified builtins for [R]SHRN[2].
2922 * config/aarch64/arm_neon.h (vshrn_n_u16): Use type-qualified
2923 builtin and remove casts.
2924 (vshrn_n_u32): Likewise.
2925 (vshrn_n_u64): Likewise.
2926 (vrshrn_high_n_u16): Likewise.
2927 (vrshrn_high_n_u32): Likewise.
2928 (vrshrn_high_n_u64): Likewise.
2929 (vrshrn_n_u16): Likewise.
2930 (vrshrn_n_u32): Likewise.
2931 (vrshrn_n_u64): Likewise.
2932 (vshrn_high_n_u16): Likewise.
2933 (vshrn_high_n_u32): Likewise.
2934 (vshrn_high_n_u64): Likewise.
2936 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2938 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
2939 type-qualified builtins for XTN[2].
2940 * config/aarch64/arm_neon.h (vmovn_high_u16): Use type-
2941 qualified builtin and remove casts.
2942 (vmovn_high_u32): Likewise.
2943 (vmovn_high_u64): Likewise.
2944 (vmovn_u16): Likewise.
2945 (vmovn_u32): Likewise.
2946 (vmovn_u64): Likewise.
2948 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2950 * config/aarch64/aarch64-simd-builtins.def: Use poly type
2951 qualifier in builtin generator macros.
2952 * config/aarch64/arm_neon.h (vmul_p8): Use type-qualified
2953 builtin and remove casts.
2954 (vmulq_p8): Likewise.
2955 (vmull_high_p8): Likewise.
2956 (vmull_p8): Likewise.
2958 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2960 * config/aarch64/aarch64-simd-builtins.def: Declare type-
2961 qualified builtin generators for unsigned MLA/MLS intrinsics.
2962 * config/aarch64/arm_neon.h (vmla_n_u16): Use type-qualified
2964 (vmla_n_u32): Likewise.
2965 (vmla_u8): Likewise.
2966 (vmla_u16): Likewise.
2967 (vmla_u32): Likewise.
2968 (vmlaq_n_u16): Likewise.
2969 (vmlaq_n_u32): Likewise.
2970 (vmlaq_u8): Likewise.
2971 (vmlaq_u16): Likewise.
2972 (vmlaq_u32): Likewise.
2973 (vmls_n_u16): Likewise.
2974 (vmls_n_u32): Likewise.
2975 (vmls_u8): Likewise.
2976 (vmls_u16): Likewise.
2977 (vmls_u32): Likewise.
2978 (vmlsq_n_u16): Likewise.
2979 (vmlsq_n_u32): Likewise.
2980 (vmlsq_u8): Likewise.
2981 (vmlsq_u16): Likewise.
2982 (vmlsq_u32): Likewise.
2984 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
2986 * ipa-modref.c (modref_summary::useful_p): Check also for side-effects
2987 with looping const/pure.
2988 (modref_summary_lto::useful_p): Likewise.
2989 (merge_call_side_effects): Merge side effects before early exit
2991 (process_fnspec): Also handle pure functions.
2992 (analyze_call): Do not early exit on looping pure const.
2993 (propagate_unknown_call): Also handle nontrivial SCC as side-effect.
2994 (modref_propagate_in_scc): Update.
2996 2021-11-11 Richard Biener <rguenther@suse.de>
2998 PR tree-optimization/103190
2999 * tree-ssa-reassoc.c (insert_stmt_after): Only assert on asm goto.
3001 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
3003 * gimple-range-path.cc (path_range_query::add_copies_to_imports):
3005 (path_range_query::compute_imports): ...this. Adapt it so it can
3006 be passed the imports bitmap instead of working on m_imports.
3007 (path_range_query::compute_ranges): Call compute_imports in all
3008 cases unless an imports bitmap is passed.
3009 * gimple-range-path.h (path_range_query::compute_imports): New.
3010 (path_range_query::add_copies_to_imports): Remove.
3011 * tree-ssa-threadbackward.c (back_threader::resolve_def): Remove.
3012 (back_threader::find_paths_to_names): Inline resolve_def.
3013 (back_threader::find_paths): Call compute_imports.
3014 (back_threader::resolve_phi): Adjust comment.
3016 2021-11-11 Richard Biener <rguenther@suse.de>
3018 PR tree-optimization/103188
3019 * tree-ssa-loop-ch.c (should_duplicate_loop_header_p):
3020 Remove query parameter, split out check for size
3022 (ch_base::m_ranger, cb_base::m_query): Remove.
3023 (ch_base::copy_headers): Split processing loop into
3024 analysis around which we allocate and use ranger and
3025 transform where we do not.
3026 (pass_ch::execute): Do not allocate/free ranger here.
3027 (pass_ch_vect::execute): Likewise.
3029 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
3031 * ipa-pure-const.c (propagate_pure_const): Self recursion is
3034 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
3036 * cgraph.c (set_noreturn_flag_1): New function.
3037 (cgraph_node::set_noreturn_flag): New member function
3038 * cgraph.h (cgraph_node::set_noreturn_flags): Declare.
3039 * ipa-pure-const.c (pass_local_pure_const::execute): Use it.
3041 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
3042 Richard Biener <rguenther@suse.de>
3044 * tree-ssa-loop-ch.c (entry_loop_condition_is_static): Resolve
3045 statically to the edge remaining in the loop.
3047 2021-11-11 Richard Biener <rguenther@suse.de>
3049 PR middle-end/103181
3050 * tree-eh.c (operation_could_trap_helper_p): Properly
3051 check vector constants for a zero element for integer
3052 division. Separate floating point and integer division code.
3054 2021-11-11 Jakub Jelinek <jakub@redhat.com>
3057 * dwarf2out.c (field_byte_offset): Do the PCC_BITFIELD_TYPE_MATTERS
3058 handling only for DECL_BIT_FIELD_TYPE decls.
3060 2021-11-11 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
3063 * config/aarch64/aarch64.c (aarch64_process_target_attr): Check if
3064 token is arch extension without leading '+' and emit appropriate
3065 diagnostic for the same.
3067 2021-11-11 Jakub Jelinek <jakub@redhat.com>
3069 * tree.h (OMP_CLAUSE_NUM_TEAMS_EXPR): Rename to ...
3070 (OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR): ... this.
3071 (OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR): Define.
3072 * tree.c (omp_clause_num_ops): Increase num ops for
3073 OMP_CLAUSE_NUM_TEAMS to 2.
3074 * tree-pretty-print.c (dump_omp_clause): Print optional lower bound
3075 for OMP_CLAUSE_NUM_TEAMS.
3076 * gimplify.c (gimplify_scan_omp_clauses): Gimplify
3077 OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR if non-NULL.
3078 (optimize_target_teams): Use OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR instead
3079 of OMP_CLAUSE_NUM_TEAMS_EXPR. Handle OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR.
3080 * omp-low.c (lower_omp_teams): Use OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR
3081 instead of OMP_CLAUSE_NUM_TEAMS_EXPR.
3082 * omp-expand.c (expand_teams_call, get_target_arguments): Likewise.
3084 2021-11-11 Richard Biener <rguenther@suse.de>
3086 * cfganal.c (find_pdom): Remove.
3087 (control_dependences::find_control_dependence): Remove
3088 special-casing of entry block, call get_immediate_dominator
3090 * gimple-predicate-analysis.cc (find_pdom): Remove.
3091 (find_dom): Likewise.
3092 (find_control_equiv_block): Call get_immediate_dominator
3094 (compute_control_dep_chain): Likewise.
3095 (predicate::init_from_phi_def): Likewise.
3097 2021-11-11 Richard Biener <rguenther@suse.de>
3099 * cfganal.h (control_dependences::control_dependence_map):
3101 (control_dependences::m_bitmaps): New.
3102 * cfganal.c (control_dependences::set_control_dependence_map_bit):
3104 (control_dependences::clear_control_dependence_bitmap):
3106 (control_dependences::find_control_dependence): Do not
3107 find_edge for the abnormal edge test.
3108 (control_dependences::control_dependences): Instead do not
3109 add abnormal edges to the edge list. Adjust.
3110 (control_dependences::~control_dependences): Likewise.
3111 (control_dependences::get_edges_dependent_on): Likewise.
3112 * function-tests.c: Include bitmap.h.
3114 2021-11-11 Kewen Lin <linkw@linux.ibm.com>
3116 * doc/invoke.texi: Change references to "future cpu" to "power10",
3117 "-mcpu=future" to "-mcpu=power10". Adjust words for float128.
3119 2021-11-11 Cui,Lili <lili.cui@intel.com>
3121 * config/i386/i386-options.c (m_CORE_AVX2): Remove Alderlake
3123 (processor_cost_table): Use alderlake_cost for Alderlake.
3124 * config/i386/i386.c (ix86_sched_init_global): Handle Alderlake.
3125 * config/i386/x86-tune-costs.h (struct processor_costs): Add alderlake
3127 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Alderlake
3129 (ix86_adjust_cost): Handle Alderlake.
3130 * config/i386/x86-tune.def (X86_TUNE_SCHEDULE): Enable for Alderlake.
3131 (X86_TUNE_PARTIAL_REG_DEPENDENCY): Likewise.
3132 (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY): Likewise.
3133 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): Likewise.
3134 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
3135 (X86_TUNE_MEMORY_MISMATCH_STALL): Likewise.
3136 (X86_TUNE_USE_LEAVE): Likewise.
3137 (X86_TUNE_PUSH_MEMORY): Likewise.
3138 (X86_TUNE_USE_INCDEC): Likewise.
3139 (X86_TUNE_INTEGER_DFMODE_MOVES): Likewise.
3140 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
3141 (X86_TUNE_USE_SAHF): Likewise.
3142 (X86_TUNE_USE_BT): Likewise.
3143 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
3144 (X86_TUNE_ONE_IF_CONV_INSN): Likewise.
3145 (X86_TUNE_AVOID_MFENCE): Likewise.
3146 (X86_TUNE_USE_SIMODE_FIOP): Likewise.
3147 (X86_TUNE_EXT_80387_CONSTANTS): Likewise.
3148 (X86_TUNE_SSE_UNALIGNED_LOAD_OPTIMAL): Likewise.
3149 (X86_TUNE_SSE_UNALIGNED_STORE_OPTIMAL): Likewise.
3150 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
3151 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
3152 (X86_TUNE_AVOID_4BYTE_PREFIXES): Likewise.
3153 (X86_TUNE_USE_GATHER): Disable for Alderlake.
3154 (X86_TUNE_AVX256_MOVE_BY_PIECES): Likewise.
3155 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
3157 2021-11-11 liuhongt <hongtao.liu@intel.com>
3160 * config/i386/sse.md (V_128_256): Extend to V8HF/V16HF.
3161 (avxsizesuffix): Ditto.
3163 2021-11-11 Kito Cheng <kito.cheng@sifive.com>
3165 * common/config/riscv/riscv-common.c (riscv_subset_list::to_string): Fix
3166 wrong marco checking.
3168 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3170 PR tree-optimization/102906
3171 * tree-ssa-loop-ch.c (entry_loop_condition_is_static): New.
3172 (should_duplicate_loop_header_p): Call entry_loop_condition_is_static.
3173 (class ch_base): Add m_ranger and m_query.
3174 (ch_base::copy_headers): Pass m_query to
3175 entry_loop_condition_is_static.
3176 (pass_ch::execute): Allocate and deallocate m_ranger and
3178 (pass_ch_vect::execute): Same.
3180 2021-11-10 Andrew Pinski <apinski@marvell.com>
3183 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>):
3184 Use vwcore iterator for the r constraint output string.
3186 2021-11-10 qing zhao <qing.zhao@oracle.com>
3188 * internal-fn.c (expand_DEFERRED_INIT): Apply pattern initialization
3189 only when have_insn_for return true for the mode. Fix a memory leak.
3191 2021-11-10 Christophe Lyon <christophe.lyon@foss.st.com>
3193 * config/arm/arm.c (cortexa9_extra_costs, cortexa8_extra_costs,
3194 cortexa5_extra_costs, cortexa7_extra_costs,
3195 cortexa12_extra_costs, cortexa15_extra_costs, v7m_extra_costs):
3196 Initialize movi, dup and extract costing fields.
3198 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3200 * gimple-range-path.cc (path_range_query::path_range_query): Do
3202 (path_range_query::dump): Change m_path uses to non-pointer.
3203 (path_range_query::defined_outside_path): Same.
3204 (path_range_query::set_path): Same.
3205 (path_range_query::add_copies_to_imports): Same.
3206 (path_range_query::range_of_stmt): Same.
3207 (path_range_query::compute_outgoing_relations): Same.
3208 (path_range_query::compute_ranges): Imports are now optional.
3209 Implement overload that takes an edge.
3210 * gimple-range-path.h (class path_range_query): Make imports
3211 optional for compute_ranges. Add compute_ranges(edge) overload.
3212 Make m_path an auto_vec instead of a pointer and adjust
3215 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3217 * tree-vectorizer.h (struct scalar_cond_masked_key): Add inverted_p.
3218 (default_hash_traits<scalar_conf_masked_key>): Likewise.
3219 * tree-vect-stmts.c (vectorizable_condition): Check if inverse of mask
3221 * tree-vectorizer.c (scalar_cond_masked_key::get_cond_ops_from_tree):
3222 Register mask inverses.
3224 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3226 * tree-vectorizer.c (vectorize_loops): Do local CSE through RPVN upon
3227 successful vectorization.
3229 2021-11-10 Andrew MacLeod <amacleod@redhat.com>
3231 * gimple-range-cache.cc (sbr_vector::grow): New.
3232 (sbr_vector::set_bb_range): Call grow.
3233 (sbr_vector::get_bb_range): Same.
3234 (sbr_vector::bb_range_p): Remove assert.
3236 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3238 * config/aarch64/aarch64-simd.md (*aarch64_topbits_shuffle<mode>_le
3239 ,*aarch64_topbits_shuffle<mode>_be): Remove.
3241 2021-11-10 Jan Hubicka <jh@suse.cz>
3243 * ipa-modref.c: Include tree-eh.h
3244 (modref_summary::modref_summary): Initialize side_effects.
3245 (struct modref_summary_lto): New bool field side_effects.
3246 (modref_summary_lto::modref_summary_lto): Initialize side_effects.
3247 (modref_summary::dump): Dump side_effects.
3248 (modref_summary_lto::dump): Dump side_effects.
3249 (merge_call_side_effects): Merge side effects.
3250 (process_fnspec): Calls to non-const/pure or looping
3251 function is a side effect.
3252 (analyze_call): Self-recursion is a side-effect; handle
3254 (analyze_load): Watch for volatile and throwing memory.
3255 (analyze_store): Likewise.
3256 (analyze_stmt): Watch for volatitle asm.
3257 (analyze_function): Handle side_effects.
3258 (modref_summaries::duplicate): Duplicate side_effects.
3259 (modref_summaries_lto::duplicate): Likewise.
3260 (modref_write): Stream side_effects.
3261 (read_section): Likewise.
3262 (update_signature): Update.
3263 (propagate_unknown_call): Handle side_effects.
3264 (modref_propagate_in_scc): Likewise.
3265 * ipa-modref.h (struct modref_summary): Add side_effects.
3266 * ipa-pure-const.c (special_builtin_state): Rename to ...
3267 (builtin_safe_for_const_function_p): ... this one.
3268 (check_call): Update.
3269 (finite_function_p): Break out from ...
3270 (propagate_pure_const): ... here
3271 * ipa-utils.h (finite_function): Declare.
3273 2021-11-10 Lucas A. M. Magalhães <lamm@linux.ibm.com>
3275 * config.gcc (powerpc*-*-*): Remove -rpath from
3276 --with-advance-toolchain.
3278 2021-11-10 Marek Polacek <polacek@redhat.com>
3281 * attribs.c (struct scoped_attributes): Add a bool member.
3282 (lookup_scoped_attribute_spec): Forward declare.
3283 (register_scoped_attributes): New bool parameter, defaulted to
3285 (handle_ignored_attributes_option): New function.
3286 (free_attr_data): New function.
3287 (init_attributes): Call handle_ignored_attributes_option.
3288 (attr_namespace_ignored_p): New function.
3289 (decl_attributes): Check attr_namespace_ignored_p before
3291 * attribs.h (free_attr_data): Declare.
3292 (register_scoped_attributes): Adjust declaration.
3293 (handle_ignored_attributes_option): Declare.
3294 (canonicalize_attr_name): New function template.
3295 (canonicalize_attr_name): Use it.
3296 * common.opt (Wattributes=): New option with a variable.
3297 * doc/extend.texi: Document #pragma GCC diagnostic ignored_attributes.
3298 * doc/invoke.texi: Document -Wno-attributes=.
3299 * opts.c (common_handle_option) <case OPT_Wattributes_>: Handle.
3300 * plugin.h (register_scoped_attributes): Adjust declaration.
3301 * toplev.c (compile_file): Call free_attr_data.
3303 2021-11-10 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
3305 * config/arm/arm-cpus.in (cortex-a710): New CPU.
3306 * config/arm/arm-tables.opt: Regenerate.
3307 * config/arm/arm-tune.md: Regenerate.
3308 * doc/invoke.texi: Update docs.
3310 2021-11-10 Andre Vieira <andre.simoesdiasvieira@arm.com>
3312 * config/aarch64/aarch64-builtins.c
3313 (aarch64_general_gimple_fold_builtin): Mark argument as unused.
3315 2021-11-10 Martin Liska <mliska@suse.cz>
3317 * lto-wrapper.c (merge_and_complain): Make the first argument
3320 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3322 * config/aarch64/iterators.md (optab): Use fmax_nan instead of
3323 smax_nan and fmin_nan instead of smin_nan.
3324 (maxmin_uns): Rename to...
3325 (fmaxmin): ...this and make the same changes. Remove entries
3326 unrelated to fmax* and fmin*.
3327 * config/aarch64/aarch64.md (<maxmin_uns><mode>3): Rename to...
3328 (<fmaxmin><mode>3): ...this.
3329 * config/aarch64/aarch64-simd.md (aarch64_<maxmin_uns>p<mode>):
3331 (aarch64_<optab>p<mode>): ...this.
3332 (<maxmin_uns><mode>3): Rename to...
3333 (<fmaxmin><mode>3): ...this.
3334 (reduc_<maxmin_uns>_scal_<mode>): Rename to...
3335 (reduc_<optab>_scal_<mode>): ...this and update gen* call.
3336 (aarch64_reduc_<maxmin_uns>_internal<mode>): Rename to...
3337 (aarch64_reduc_<optab>_internal<mode>): ...this.
3338 (aarch64_reduc_<maxmin_uns>_internalv2si): Rename to...
3339 (aarch64_reduc_<optab>_internalv2si): ...this.
3340 * config/aarch64/aarch64-sve.md (<maxmin_uns><mode>3): Rename to...
3341 (<fmaxmin><mode>3): ...this.
3342 * config/aarch64/aarch64-simd-builtins.def (smax_nan, smin_nan)
3344 (fmax_nan, fmin_nan): ...this.
3345 * config/aarch64/arm_neon.h (vmax_f32, vmax_f64, vmaxq_f32, vmaxq_f64)
3346 (vmin_f32, vmin_f64, vminq_f32, vminq_f64, vmax_f16, vmaxq_f16)
3347 (vmin_f16, vminq_f16): Update accordingly.
3349 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3351 * tree-vectorizer.h (vector_costs::finish_cost): Take the
3352 corresponding scalar costs as a parameter.
3353 (finish_cost): Likewise.
3354 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost)
3355 (vect_estimate_min_profitable_iters): Update accordingly.
3356 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Likewise.
3357 * tree-vectorizer.c (vector_costs::finish_cost): Likewise.
3358 * config/aarch64/aarch64.c (aarch64_vector_costs::finish_cost):
3360 * config/rs6000/rs6000.c (rs6000_cost_data::finish_cost): Likewise.
3362 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3364 * tree-vectorizer.h (_loop_vec_info::scalar_costs): New member
3366 (_loop_vec_info::single_scalar_iteration_cost): Delete.
3367 (LOOP_VINFO_SINGLE_SCALAR_ITERATION_COST): Delete.
3368 (vector_costs::total_cost): New function.
3369 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
3370 after above changes.
3371 (_loop_vec_info::~_loop_vec_info): Delete scalar_costs.
3372 (vect_compute_single_scalar_iteration_cost): Store the costs
3373 in loop_vinfo->scalar_costs.
3374 (vect_estimate_min_profitable_iters): Get the scalar cost from
3375 loop_vinfo->scalar_costs.
3377 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3379 * tree-vectorizer.h (vector_costs::better_main_loop_than_p)
3380 (vector_costs::better_epilogue_loop_than_p)
3381 (vector_costs::compare_inside_loop_cost)
3382 (vector_costs::compare_outside_loop_cost): Likewise.
3383 * tree-vectorizer.c (vector_costs::better_main_loop_than_p)
3384 (vector_costs::better_epilogue_loop_than_p)
3385 (vector_costs::compare_inside_loop_cost)
3386 (vector_costs::compare_outside_loop_cost): New functions,
3387 containing code moved from...
3388 * tree-vect-loop.c (vect_better_loop_vinfo_p): ...here.
3390 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3392 * tree-vectorizer.h (_loop_vec_info): Remove vec_outside_cost
3393 and vec_inside_cost.
3394 (vector_costs::outside_cost): New function.
3395 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
3397 (vect_estimate_min_profitable_iters): Likewise.
3398 (vect_better_loop_vinfo_p): Get the inside and outside costs
3399 from the loop_vec_infos' vector_costs.
3401 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3403 * tree-vectorizer.h (vec_info::target_cost_data): Replace with...
3404 (_loop_vec_info::vector_costs): ...this.
3405 (LOOP_VINFO_TARGET_COST_DATA): Delete.
3406 * tree-vectorizer.c (vec_info::vec_info): Remove target_cost_data
3408 (vec_info::~vec_info): Remove corresponding delete.
3409 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Initialize
3410 vector_costs to null.
3411 (_loop_vec_info::~_loop_vec_info): Delete vector_costs.
3412 (vect_analyze_loop_operations): Update after above changes.
3413 (vect_analyze_loop_2): Likewise.
3414 (vect_estimate_min_profitable_iters): Likewise.
3415 * tree-vect-slp.c (vect_slp_analyze_operations): Likewise.
3417 2021-11-10 Jan Hubicka <hubicka@ucw.cz>
3419 * tree-core.h (EAF_DIRECT): Remove.
3420 (EAF_NOCLOBBER): Remove.
3421 (EAF_UNUSED): Remove.
3422 (EAF_NOESCAPE): Remove.
3423 (EAF_NO_DIRECT_CLOBBER): New.
3424 (EAF_NO_INDIRECT_CLOBBER): New.
3425 (EAF_NODIRECTESCAPE): Remove.
3426 (EAF_NO_DIRECT_ESCAPE): New.
3427 (EAF_NO_INDIRECT_ESCAPE): New.
3428 (EAF_NOT_RETURNED): Remove.
3429 (EAF_NOT_RETURNED_INDIRECTLY): New.
3430 (EAF_NOREAD): Remove.
3431 (EAF_NO_DIRECT_READ): New.
3432 (EAF_NO_INDIRECT_READ): New.
3433 * gimple.c (gimple_call_arg_flags): Update for new flags.
3434 (gimple_call_retslot_flags): Update for new flags.
3435 * ipa-modref.c (dump_eaf_flags): Likewise.
3436 (remove_useless_eaf_flags): Likewise.
3437 (deref_flags): Likewise.
3438 (modref_lattice::init): Likewise.
3439 (modref_lattice::merge): Likewise.
3440 (modref_lattice::merge_direct_load): Likewise.
3441 (modref_lattice::merge_direct_store): Likewise.
3442 (modref_eaf_analysis::merge_call_lhs_flags): Likewise.
3443 (callee_to_caller_flags): Likewise.
3444 (modref_eaf_analysis::analyze_ssa_name): Likewise.
3445 (modref_eaf_analysis::propagate): Likewise.
3446 (modref_merge_call_site_flags): Likewise.
3447 * ipa-modref.h (interposable_eaf_flags): Likewise.
3448 * tree-ssa-alias.c: (ref_maybe_used_by_call_p_1) Likewise.
3449 * tree-ssa-structalias.c (handle_call_arg): Likewise.
3450 (handle_rhs_call): Likewise.
3451 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Likewise.
3453 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3455 * tree-vect-slp-patterns.c (complex_mul_pattern::matches): Remove l1node.
3457 2021-11-10 Andre Vieira <andre.simoesdiasvieira@arm.com>
3459 * config/aarch64/aarch64-builtins.c
3460 (aarch64_general_gimple_fold_builtin): Change pointer alignment and
3463 2021-11-10 Jan Hubicka <jh@suse.cz>
3465 * ipa-modref-tree.h (modref_tree::remap_params): Fix off-by-one error.
3467 2021-11-10 H.J. Lu <hongjiu.lu@intel.com>
3468 Hongtao Liu <hongtao.liu@intel.com>
3470 PR middle-end/102566
3471 * match.pd (nop_atomic_bit_test_and_p): New match.
3472 * tree-ssa-ccp.c (convert_atomic_bit_not): New function.
3473 (gimple_nop_atomic_bit_test_and_p): New prototype.
3474 (optimize_atomic_bit_test_and): Transform equivalent, but slighly
3475 different cases to their canonical forms.
3477 2021-11-10 liuhongt <hongtao.liu@intel.com>
3479 PR tree-optimization/103126
3480 * tree-vect-loop.c (neutral_op_for_reduction): Remove static.
3481 * tree-vectorizer.h (neutral_op_for_reduction): Declare.
3482 * tree-if-conv.c : Include tree-vectorizer.h.
3483 (is_cond_scalar_reduction): Handle
3484 BIT_XOR_EXPR/BIT_IOR_EXPR/BIT_AND_EXPR.
3485 (convert_scalar_cond_reduction): Ditto.
3487 2021-11-10 konglin1 <lingling.kong@intel.com>
3489 * config/i386/sse.md (cmul<conj_op><mode>3): add new define_expand.
3490 (cmla<conj_op><mode>4): Likewise
3492 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3494 * tree-ssa-threadedge.c: Do not include
3495 gimple-ssa-evrp-analyze.h.
3496 * value-pointer-equiv.cc: Same.
3498 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3500 * tree-ssa-threadbackward.c
3501 (back_threader::maybe_register_path_dump): Abstract path dumping...
3502 (dump_path): ...here.
3503 (back_threader::resolve_phi): Call dump_path.
3506 2021-11-10 konglin1 <lingling.kong@intel.com>
3508 * config/i386/sse.md (fma_<complexpairopname>_<mode>_pair):
3509 Add new define_insn.
3510 (fma_<mode>_fmaddc_bcst): Add new define_insn_and_split.
3511 (fma_<mode>_fcmaddc_bcst): Likewise
3513 2021-11-10 liuhongt <hongtao.liu@intel.com>
3516 * match.pd: Simplify (trunc)fmax/fmin((extend)a, (extend)b) to
3519 2021-11-10 Andrew Pinski <apinski@marvell.com>
3522 * config/aarch64/aarch64.c (aarch64_evpc_ins): Don't use target
3523 as an input, use original one.
3525 2021-11-10 Sandra Loosemore <sandra@codesourcery.com>
3527 * config/nios2/nios2.c (nios2_can_inline_p): New.
3528 (TARGET_CAN_INLINE_P): Define.
3530 2021-11-09 Bill Schmidt <wschmidt@linux.ibm.com>
3532 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin):
3533 Disable gimple fold for RS6000_BIF_{XVMINDP,XVMINSP,VMINFP} and
3534 RS6000_BIF_{XVMAXDP,XVMAXSP,VMAXFP} when fast-math is not set.
3535 (lxvrse_expand_builtin): Modify the expansion for sign extension.
3536 All extensions are done within VSX registers.
3538 2021-11-09 Andrew Pinski <apinski@marvell.com>
3540 PR tree-optimization/10352
3541 * match.pd: Remove check of TYPE_PRECISION for
3544 2021-11-09 Andrew MacLeod <amacleod@redhat.com>
3546 * function.c (allocate_struct_function): Don't set x_range_query.
3547 * function.h (get_range_query): Move to value-query.h.
3548 * gimple-range.cc (enable_ranger): Check that query is currently NULL.
3549 (disable_ranger): Clear function current query field.
3550 * value-query.cc (get_global_range_query): Relocate to:
3551 * value-query.h (get_global_range_query): Here and inline.
3552 (get_range_query): Relocate here from function.h.
3554 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3556 * tree-ssa-threadbackward.c
3557 (back_threader::maybe_register_path_dump): New.
3558 (back_threader::maybe_register_path): Call maybe_register_path_dump.
3560 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3562 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
3563 Return NULL when unprofitable.
3565 2021-11-09 Martin Jambor <mjambor@suse.cz>
3567 * tree.h (build_debug_expr_decl): Declare.
3568 * tree.c (build_debug_expr_decl): New function.
3569 * cfgexpand.c (avoid_deep_ter_for_debug): Use build_debug_expr_decl
3570 instead of building a DEBUG_EXPR_DECL.
3571 * ipa-param-manipulation.c
3572 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
3573 * omp-simd-clone.c (ipa_simd_modify_stmt_ops): Likewise.
3574 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Likewise.
3575 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
3576 * tree-ssa-reassoc.c (make_new_ssa_for_def): Likewise.
3578 2021-11-09 Bill Schmidt <wschmidt@linux.ibm.com>
3580 * config/rs6000/rs6000-call.c (rs6000_type_string): New function.
3581 (def_builtin): Change debug formatting for easier parsing and
3582 include more information.
3583 (rs6000_init_builtins): Add dump of autogenerated builtins.
3584 (altivec_init_builtins): Dump __builtin_altivec_mask_for_load for
3587 2021-11-09 Marek Polacek <polacek@redhat.com>
3589 * ipa-modref.c (analyze_parms): Fix a typo.
3591 2021-11-09 Kito Cheng <kito.cheng@sifive.com>
3594 * common/config/riscv/riscv-common.c (multi_letter_subset_rank): Remove
3595 assertion for Z*-ext.
3597 2021-11-09 Jan Hubicka <hubicka@ucw.cz>
3599 * ipa-modref.c (analyze_parms): Add past_flags, past_retslot_flags
3600 and past_static_chain; merge past summary with current one.
3601 (analyze_function): Update.
3603 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3605 * doc/invoke.texi (Invoking GCC): Document --param=threader-debug.
3607 2021-11-09 Martin Liska <mliska@suse.cz>
3609 * print-rtl.c (rtx_writer::rtx_writer): Make the compilation
3610 conditional based on
3611 * print-rtl.h (class rtx_writer): Likewise.
3613 2021-11-09 Thomas Schwinge <thomas@codesourcery.com>
3615 * input.h (location_hash): Use 'BUILTINS_LOCATION' as spare value
3616 for 'Deleted'. Turn into a '#define'.
3618 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3620 * gimple-range-path.cc (path_range_query::dump): Clean up.
3621 (path_range_query::compute_ranges): Same.
3622 * value-relation.cc (path_oracle::dump): Same.
3624 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3626 * dumpfile.c (dump_options): Remove TDF_THREADING entry.
3627 * dumpfile.h (enum dump_flag): Remove TDF_THREADING and adjust
3629 * flag-types.h (enum threader_debug): New.
3630 * gimple-range-path.cc (DEBUG_SOLVER): Use param_threader_debug.
3631 * params.opt: Add entry for --param=threader-debug=.
3633 2021-11-09 Thomas Schwinge <thomas@codesourcery.com>
3635 PR middle-end/101204
3637 * diagnostic-spec.h (typedef xint_hash_t): Turn into...
3638 (struct xint_hash_t): ... this.
3639 * doc/gty.texi: Update.
3641 2021-11-09 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
3643 * config/arm/arm-cpus.in (armv9): New define.
3644 (ARMv9a): New group.
3645 (armv9-a): New arch definition.
3646 * config/arm/arm-tables.opt: Regenerate.
3647 * config/arm/arm.h (BASE_ARCH_9A): New arch enum value.
3648 * config/arm/t-aprofile: Added armv9-a and armv9+simd.
3649 * config/arm/t-arm-elf: Added arm9-a, v9_fps and all_v9_archs
3650 to MULTILIB_MATCHES.
3651 * config/arm/t-multilib: Added v9_a_nosimd_variants and
3652 v9_a_simd_variants to MULTILIB_MATCHES.
3653 * doc/invoke.texi: Update docs.
3655 2021-11-09 Martin Jambor <mjambor@suse.cz>
3658 * ipa-param-manipulation.c (replace_with_mapped_expr): Early
3659 return with error_mark_mode when part of expression is mapped to
3661 (ipa_param_body_adjustments::remap_with_debug_expressions): Set
3662 mapped value to NULL if walk_tree returns error_mark_mode.
3664 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3666 * tree-ssa-strlen.c (compare_nonzero_chars): Pass statement
3668 (get_addr_stridx): Same.
3670 (get_range_strlen_dynamic): Same.
3671 (handle_builtin_strlen): Same.
3672 (handle_builtin_strchr): Same.
3673 (handle_builtin_strcpy): Same.
3674 (maybe_diag_stxncpy_trunc): Same.
3675 (handle_builtin_stxncpy_strncat): Same.
3676 (handle_builtin_memcpy): Same.
3677 (handle_builtin_strcat): Same.
3678 (handle_alloc_call): Same.
3679 (handle_builtin_memset): Same.
3680 (handle_builtin_string_cmp): Same.
3681 (handle_pointer_plus): Same.
3682 (count_nonzero_bytes_addr): Same.
3683 (count_nonzero_bytes): Same.
3684 (handle_store): Same.
3685 (fold_strstr_to_strncmp): Same.
3686 (handle_integral_assign): Same.
3687 (check_and_optimize_stmt): Same.
3688 (class strlen_dom_walker): Replace evrp with ranger.
3689 (strlen_dom_walker::before_dom_children): Remove evrp.
3690 (strlen_dom_walker::after_dom_children): Remove evrp.
3691 * gimple-ssa-warn-access.cc (maybe_check_access_sizes):
3692 Restrict sprintf output.
3694 2021-11-09 Martin Liska <mliska@suse.cz>
3696 * genconditions.c (write_one_condition): Add const qualifier
3699 2021-11-09 Jeff Law <jeffreyalaw@gmail.com>
3701 * match.pd: New pattern to simplify (1 << n) & M ==/!= 0 for M
3704 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
3706 * config/rs6000/rs6000-builtin-new.def (VEC_INIT_V16QI): Use
3707 escape-newline support.
3708 (VEC_INIT_V4SI): Likewise.
3709 (VEC_INIT_V8HI): Likewise.
3710 (PACK_V1TI): Likewise.
3712 (VFIRSTMISMATCHOREOSINDEX_V16QI): Likewise.
3713 (VFIRSTMISMATCHOREOSINDEX_V8HI): Likewise.
3714 (VFIRSTMISMATCHOREOSINDEX_V4SI): Likewise.
3718 (FMAF128_ODD): Likewise.
3719 (VSCEQPUO): Likewise.
3721 (VSIEQPF): Likewise.
3722 (VSTDCQP): Likewise.
3723 (PACK_TD): Likewise.
3724 (TABORTDC): Likewise.
3725 (TABORTDCI): Likewise.
3726 (SE_LXVRBX): Likewise.
3727 (SE_LXVRHX): Likewise.
3728 (SE_LXVRWX): Likewise.
3729 (SE_LXVRDX): Likewise.
3730 (VREPLACE_UN_UV2DI): Likewise.
3731 (VREPLACE_UN_UV4SI): Likewise.
3732 (VREPLACE_UN_V2DI): Likewise.
3733 (VREPLACE_ELT_UV2DI): Likewise.
3734 (VREPLACE_ELT_V2DI): Likewise.
3735 (ZE_LXVRBX): Likewise.
3736 (ZE_LXVRHX): Likewise.
3737 (ZE_LXVRWX): Likewise.
3738 (ZE_LXVRDX): Likewise.
3740 (CNTLZDM): Likewise.
3741 (CNTTZDM): Likewise.
3744 (PMXVBF16GER2): Likewise.
3745 (PMXVBF16GER2_INTERNAL): Likewise.
3746 (PMXVBF16GER2NN): Likewise.
3747 (PMXVBF16GER2NN_INTERNAL): Likewise.
3748 (PMXVBF16GER2NP): Likewise.
3749 (PMXVBF16GER2NP_INTERNAL): Likewise.
3750 (PMXVBF16GER2PN): Likewise.
3751 (PMXVBF16GER2PN_INTERNAL): Likewise.
3752 (PMXVBF16GER2PP): Likewise.
3753 (PMXVBF16GER2PP_INTERNAL): Likewise.
3754 (PMXVF16GER2): Likewise.
3755 (PMXVF16GER2_INTERNAL): Likewise.
3756 (PMXVF16GER2NN): Likewise.
3757 (PMXVF16GER2NN_INTERNAL): Likewise.
3758 (PMXVF16GER2NP): Likewise.
3759 (PMXVF16GER2NP_INTERNAL): Likewise.
3760 (PMXVF16GER2PN): Likewise.
3761 (PMXVF16GER2PN_INTERNAL): Likewise.
3762 (PMXVF16GER2PP): Likewise.
3763 (PMXVF16GER2PP_INTERNAL): Likewise.
3764 (PMXVF32GER_INTERNAL): Likewise.
3765 (PMXVF32GERNN): Likewise.
3766 (PMXVF32GERNN_INTERNAL): Likewise.
3767 (PMXVF32GERNP): Likewise.
3768 (PMXVF32GERNP_INTERNAL): Likewise.
3769 (PMXVF32GERPN): Likewise.
3770 (PMXVF32GERPN_INTERNAL): Likewise.
3771 (PMXVF32GERPP): Likewise.
3772 (PMXVF32GERPP_INTERNAL): Likewise.
3773 (PMXVF64GER): Likewise.
3774 (PMXVF64GER_INTERNAL): Likewise.
3775 (PMXVF64GERNN): Likewise.
3776 (PMXVF64GERNN_INTERNAL): Likewise.
3777 (PMXVF64GERNP): Likewise.
3778 (PMXVF64GERNP_INTERNAL): Likewise.
3779 (PMXVF64GERPN): Likewise.
3780 (PMXVF64GERPN_INTERNAL): Likewise.
3781 (PMXVF64GERPP): Likewise.
3782 (PMXVF64GERPP_INTERNAL): Likewise.
3783 (PMXVI16GER2): Likewise.
3784 (PMXVI16GER2_INTERNAL): Likewise.
3785 (PMXVI16GER2PP): Likewise.
3786 (PMXVI16GER2PP_INTERNAL): Likewise.
3787 (PMXVI16GER2S): Likewise.
3788 (PMXVI16GER2S_INTERNAL): Likewise.
3789 (PMXVI16GER2SPP): Likewise.
3790 (PMXVI16GER2SPP_INTERNAL): Likewise.
3791 (PMXVI4GER8): Likewise.
3792 (PMXVI4GER8_INTERNAL): Likewise.
3793 (PMXVI4GER8PP): Likewise.
3794 (PMXVI4GER8PP_INTERNAL): Likewise.
3795 (PMXVI8GER4): Likewise.
3796 (PMXVI8GER4_INTERNAL): Likewise.
3797 (PMXVI8GER4PP): Likewise.
3798 (PMXVI8GER4PP_INTERNAL): Likewise.
3799 (PMXVI8GER4SPP): Likewise.
3800 (PMXVI8GER4SPP_INTERNAL): Likewise.
3801 * config/rs6000/rs6000-gen-builtins.c (MAXLINES): New macro.
3802 (linebuf): Increase size.
3803 (lines): New variable.
3804 (lastline): Likewise.
3805 (real_line_pos): New function.
3806 (diag): Change signature.
3807 (bif_diag): Change signature; support escape-newline handling.
3808 (ovld_diag): Likewise.
3809 (fatal): Move earlier.
3810 (consume_whitespace): Adjust diag call.
3811 (advance_line): Add escape-newline handling; call fatal.
3812 (safe_inc_pos): Adjust diag call.
3813 (match_identifier): Likewise.
3814 (match_integer): Likewise.
3815 (match_to_right_bracket): Call fatal instead of diag; adjust diag
3817 (match_basetype): Adjust diag calls.
3818 (match_bracketed_pair): Likewise.
3819 (match_const_restriction): Likewise.
3820 (match_type): Likewise.
3821 (parse_args): Likewise.
3822 (parse_bif_attrs): Likewise.
3823 (complete_vector_type): Likewise.
3824 (complete_base_type): Likewise.
3825 (parse_prototype): Likewise.
3826 (parse_bif_entry): Likewise.
3827 (parse_bif_stanza): Likewise.
3828 (parse_ovld_entry): Likewise.
3829 (parse_ovld_stanza): Likewise.
3830 (main): Allocate buffers for lines[].
3832 2021-11-08 Eric Botcazou <ebotcazou@adacore.com>
3834 * config/i386/i386.c (legitimize_pic_address): Adjust comment and
3835 use the REG argument on the CM_LARGE_PIC code path as well.
3836 * config/i386/predicates.md (gotoff_operand): Do not treat VxWorks
3837 specially with the large code models.
3839 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
3841 * gimple.c (gimple_call_static_chain_flags): Revert the workaround
3842 allowing interposition since issues with binds_to_local_def were
3845 2021-11-08 Andrew MacLeod <amacleod@redhat.com>
3847 PR tree-optimization/103122
3848 * gimple-range.cc (gimple_ranger::range_of_expr): Request the cache
3849 entry with "calulate new values" set to false.
3851 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
3853 * builtins.c (is_simple_builtin): Add builitin_dwarf_cfa
3854 and builtin_return_address.
3855 (builtin_fnspec): Annotate builtin_return,
3856 bulitin_eh_pointer, builtin_eh_filter, builtin_unwind_resume,
3857 builtin_cxa_end_cleanup, builtin_eh_copy_values,
3858 builtin_frame_address, builtin_apply_args,
3859 builtin_asan_before_dynamic_init, builtin_asan_after_dynamic_init,
3860 builtin_prefetch, builtin_dwarf_cfa, builtin_return_addrss
3862 * ipa-pure-const.c (special_builtin_state): Add builtin_dwarf_cfa
3863 and builtin_return_address.
3865 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
3867 PR tree-optimization/103177
3868 * passes.def: Move uncprop after pure/const and modref.
3870 2021-11-08 Martin Jambor <mjambor@suse.cz>
3874 * tree-inline.c (remap_gimple_stmt): Unshare the expression without
3875 location before invoking remap_with_debug_expressions on it.
3876 * ipa-param-manipulation.c
3877 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
3879 2021-11-08 David Edelsohn <dje.gcc@gmail.com>
3881 * config/rs6000/vsx.md (vsx_splat_v4si_di): Revert "wa"
3884 2021-11-08 Richard Biener <rguenther@suse.de>
3886 * cfgloop.c (verify_loop_structure): Use a temporary BB flag
3887 instead of an sbitmap to cache irreducible state.
3889 2021-11-08 Aldy Hernandez <aldyh@redhat.com>
3891 PR tree-optimization/103120
3892 * gimple-range-path.cc (path_range_query::range_defined_in_block):
3893 Bail if there's a cache entry.
3895 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
3897 * config/rs6000/rs6000.c (rs6000_builtin_reciprocal): Use
3898 rs6000_builtin_decls_x when appropriate.
3899 (add_condition_to_bb): Likewise.
3900 (rs6000_atomic_assign_expand_fenv): Likewise.
3902 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
3904 * config/rs6000/rs6000-call.c (rs6000_new_builtin_decl): New function.
3905 (rs6000_builtin_decl): Call it.
3907 2021-11-08 Martin Liska <mliska@suse.cz>
3909 * cgraph.c (cgraph_node::dump): Dump it from decl.
3911 2021-11-08 Martin Liska <mliska@suse.cz>
3913 PR gcov-profile/100520
3914 * coverage.c (coverage_compute_profile_id): Strip .gk when
3915 compare debug is used.
3916 * system.h (endswith): New function.
3918 2021-11-08 Martin Liska <mliska@suse.cz>
3920 * cgraph.c (cgraph_node::dump): Dump static_chain_decl.
3922 2021-11-08 Thomas Schwinge <thomas@codesourcery.com>
3924 * config/rs6000/rbtree.c: Fix 'Copyright (C) 2020-21' into '2020-2021'
3925 * config/rs6000/rbtree.h: Likewise.
3926 * config/rs6000/rs6000-builtin-new.def: Likewise.
3927 * config/rs6000/rs6000-gen-builtins.c: Likewise.
3928 * config/rs6000/rs6000-overload.def: Likewise.
3930 2021-11-08 Richard Sandiford <richard.sandiford@arm.com>
3932 * config/aarch64/aarch64.c (aarch64_ldn_stn_vectors): New function.
3933 (aarch64_address_cost): Use it instead of testing for CImode and
3936 2021-11-08 Richard Sandiford <richard.sandiford@arm.com>
3938 * genmodes.c (emit_insn_modes_h): Define NUM_MODE_* macros.
3939 * expmed.h (NUM_MODE_INT): Delete in favor of genmodes definitions.
3940 (NUM_MODE_PARTIAL_INT, NUM_MODE_VECTOR_INT): Likewise.
3941 * real.h (real_format_for_mode): Use NUM_MODE_FLOAT and
3942 NUM_MODE_DECIMAL_FLOAT.
3943 (REAL_MODE_FORMAT): Likewise.
3945 2021-11-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
3947 * tree-vect-loop.c (vect_better_loop_vinfo_p): Change how epilogue loop
3950 2021-11-08 Richard Biener <rguenther@suse.de>
3952 * tree-vectorizer.h (vect_create_loop_vinfo): Add main_loop_info
3954 * tree-vect-loop.c (vect_create_loop_vinfo): Likewise. Set
3955 LOOP_VINFO_ORIG_LOOP_INFO and conditionalize set of
3956 LOOP_VINFO_NITERS_ASSUMPTIONS.
3957 (vect_analyze_loop_1): Adjust.
3958 (vect_analyze_loop): Move loop constraint setting and
3959 SCEV/niter reset here from vect_create_loop_vinfo to perform
3961 (vect_analyze_loop_form): Move dumping of symbolic niters
3962 here from vect_create_loop_vinfo.
3964 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
3966 * ipa-modref-tree.h (enum modref_special_parms): New enum.
3967 (struct modref_access_node): update for special parms.
3968 (struct modref_ref_node): Likewise.
3969 (struct modref_parm_map): Likewise.
3970 (struct modref_tree): Likewise.
3971 * ipa-modref.c (dump_access): Likewise.
3972 (get_access): Detect static chain.
3973 (parm_map_for_arg): Take tree as arg instead of
3975 (merge_call_side_effects): Compute map for static chain.
3976 (process_fnspec): Update.
3977 (struct escape_point): Remove retslot_arg and static_chain_arg.
3978 (analyze_parms): Update.
3979 (compute_parm_map): Update.
3980 (propagate_unknown_call): Update.
3981 (modref_propagate_in_scc): Update.
3982 (modref_merge_call_site_flags): Update.
3983 (ipa_merge_modref_summary_after_inlining): Update.
3984 * tree-ssa-alias.c (modref_may_conflict): Handle static chain.
3985 * ipa-modref-tree.c (test_merge): Update.
3987 2021-11-08 Haochen Gui <guihaoc@gcc.gnu.org>
3989 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin): Disable
3990 gimple fold for VSX_BUILTIN_XVMINDP, ALTIVEC_BUILTIN_VMINFP,
3991 VSX_BUILTIN_XVMAXDP, ALTIVEC_BUILTIN_VMAXFP when fast-math is not
3994 2021-11-08 liuhongt <hongtao.liu@intel.com>
3996 PR tree-optimization/103077
3997 * doc/invoke.texi (Options That Control Optimization):
3998 Update documentation for -ftree-loop-vectorize and
3999 -ftree-slp-vectorize which are enabled by default at -02.
4001 2021-11-08 liuhongt <hongtao.liu@intel.com>
4004 * match.pd (Simplifcation (trunc)copysign((extend)a, (extend)b)
4005 to .COPYSIGN (a, b)): Add !HONOR_SNANS.
4007 2021-11-08 liuhongt <hongtao.liu@intel.com>
4010 * match.pd: Simplify
4011 (trunc)fma ((extend)a, (extend)b, (extend)c) to IFN_FMA (a, b,
4012 c) under flag_unsafe_math_optimizations.
4014 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
4017 * params.opt (modref-max-depth): Add range.
4018 (modref-max-adjustments): Fix range.
4020 2021-11-07 Aldy Hernandez <aldyh@redhat.com>
4022 * tree-pass.h (make_pass_vrp_threader): Remove.
4023 * tree-ssa-threadbackward.c
4024 (back_threader_profitability::profitable_path_p): Remove
4025 ASSERT_EXPR references.
4026 * tree-ssa-threadedge.c (jt_state::register_equivs_stmt): Same.
4027 * tree-vrp.c (vrp_folder::simplify_casted_conds): Same.
4028 (execute_vrp): Same.
4029 (class hybrid_threader): Remove.
4030 (hybrid_threader::hybrid_threader): Remove.
4031 (hybrid_threader::~hybrid_threader): Remove.
4032 (hybrid_threader::before_dom_children): Remove.
4033 (hybrid_threader::after_dom_children): Remove.
4034 (execute_vrp_threader): Remove.
4035 (class pass_vrp_threader): Remove.
4036 (make_pass_vrp_threader): Remove.
4038 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
4040 * gimple.c (gimple_call_arg_flags): Use interposable_eaf_flags.
4041 (gimple_call_retslot_flags): Likewise.
4042 (gimple_call_static_chain_flags): Likewise.
4043 * ipa-modref.c (remove_useless_eaf_flags): Do not remove everything for
4045 (modref_summary::useful_p): Likewise.
4046 (modref_summary_lto::useful_p): Likewise.
4047 (analyze_parms): Do not give up on NOVOPS.
4048 (analyze_function): When dumping report chnages in EAF flags
4049 between IPA and local pass.
4050 (modref_merge_call_site_flags): Compute implicit eaf flags
4051 based on callee ecf_flags and fnspec; if the function does not
4052 bind to current defs use interposable_eaf_flags.
4053 (modref_propagate_flags_in_scc): Update.
4054 * ipa-modref.h (interposable_eaf_flags): New function.
4056 2021-11-07 Bill Schmidt <wschmidt@linux.ibm.com>
4058 * config/rs6000/rs6000-call.c (rs6000_expand_new_builtin): New
4060 (rs6000_invalid_new_builtin): New function.
4061 (rs6000_expand_builtin): Call rs6000_expand_new_builtin.
4062 (rs6000_expand_ldst_mask): New function.
4063 (new_cpu_expand_builtin): Likewise.
4064 (elemrev_icode): Likewise.
4065 (ldv_expand_builtin): Likewise.
4066 (lxvrse_expand_builtin): Likewise.
4067 (lxvrze_expand_builtin): Likewise.
4068 (stv_expand_builtin): Likewise.
4069 (new_mma_expand_builtin): Likewise.
4070 (new_htm_spr_num): Likewise.
4071 (new_htm_expand_builtin): Likewise.
4072 (rs6000_expand_new_builtin): Likewise.
4073 (rs6000_init_builtins): Initialize altivec_builtin_mask_for_load.
4075 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
4077 * ipa-modref.c (modref_lattice): Add do_dataflow,
4078 changed and propagate_to fields.
4079 (modref_lattice::release): Free propagate_to
4080 (modref_lattice::merge): Do not give up early on unknown
4082 (modref_lattice::merge_deref): Likewise.
4083 (modref_eaf_analysis): Update toplevel comment.
4084 (modref_eaf_analysis::analyze_ssa_name): Record postponned ssa names;
4085 do optimistic dataflow initialization.
4086 (modref_eaf_analysis::merge_with_ssa_name): Build dataflow graph.
4087 (modref_eaf_analysis::propagate): New member function.
4088 (analyze_parms): Update to new API of modref_eaf_analysis.
4090 2021-11-06 Jan Hubicka <hubicka@ucw.cz>
4092 * cgraph.h (cgraph_node::can_be_discarded_p): Do not
4093 return true on functions from other partition.
4095 2021-11-06 David Edelsohn <dje.gcc@gmail.com>
4097 * config/rs6000/vsx.md (vsx_splat_v4si): Change constraints to "wa".
4098 (vsx_splat_v4si_di): Change constraint to "wa".
4100 2021-11-06 Aldy Hernandez <aldyh@redhat.com>
4102 PR tree-optimization/103061
4103 * value-relation.cc (path_oracle::path_oracle): Initialize
4105 (path_oracle::killing_def): Set m_killed_defs.
4106 (path_oracle::query_relation): Do not look at the root oracle for
4108 * value-relation.h (class path_oracle): Add m_killed_defs.
4110 2021-11-06 Aldy Hernandez <aldyh@redhat.com>
4112 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
4113 Remove gotos and other cleanups.
4115 2021-11-05 Jan Hubicka <hubicka@ucw.cz>
4118 * ipa-modref-tree.h (modref_tree::insert): Do nothing for
4119 paradoxical and zero sized accesses.
4121 2021-11-05 Jan Hubicka <hubicka@ucw.cz>
4124 * ipa-modref-tree.h (struct modref_access_node): Avoid left shift
4127 2021-11-05 Iain Sandoe <iain@sandoe.co.uk>
4129 * config/darwin.h (ASM_GENERATE_INTERNAL_LABEL): Add LTRAMP
4130 to the list of symbol prefixes that must be made linker-
4133 2021-11-05 Iain Sandoe <iain@sandoe.co.uk>
4135 * config.host: Add support for aarch64-*-darwin.
4136 * config/aarch64/host-aarch64-darwin.c: New file.
4137 * config/aarch64/x-darwin: New file.
4139 2021-11-05 Andrew MacLeod <amacleod@redhat.com>
4141 PR tree-optimization/103093
4142 * gimple-range-gori.cc (range_def_chain::get_imports): Remove assert.
4144 2021-11-05 Andrew MacLeod <amacleod@redhat.com>
4146 PR tree-optimization/102943
4147 * gimple-range-cache.cc (class update_list): New.
4148 (update_list::add): Replace add_to_update.
4149 (update_list::pop): New.
4150 (ranger_cache::ranger_cache): Adjust.
4151 (ranger_cache::~ranger_cache): Adjust.
4152 (ranger_cache::add_to_update): Delete.
4153 (ranger_cache::propagate_cache): Adjust to new class.
4154 (ranger_cache::propagate_updated_value): Ditto.
4155 (ranger_cache::fill_block_cache): Ditto.
4156 * gimple-range-cache.h (class ranger_cache): Adjust to update class.
4158 2021-11-05 Richard Biener <rguenther@suse.de>
4160 * tree-vect-loop.c (vect_analyze_loop): Remove obsolete
4161 comment and expand on another one. Combine nested if.
4163 2021-11-05 John David Anglin <danglin@gcc.gnu.org>
4166 * config/pa/pa-modes.def: Add OImode integer type.
4167 * config/pa/pa.c (pa_scalar_mode_supported_p): Allow TImode
4169 * config/pa/pa.h (MIN_UNITS_PER_WORD) Define to MIN_UNITS_PER_WORD
4170 to UNITS_PER_WORD if IN_LIBGCC2.
4171 * config/pa/pa.md (addti3, addvti3, subti3, subvti3, negti2,
4172 negvti2, ashlti3, shrpd_internal): New patterns.
4173 Change some multi instruction types to multi.
4175 2021-11-05 Jakub Jelinek <jakub@redhat.com>
4178 * config/i386/i386.h
4179 (stringop_algs::stringop_strategy::stringop_strategy): Make the ctor
4182 2021-11-05 Wilco Dijkstra <wdijkstr@arm.com>
4185 * config/aarch64/aarch64.c (aarch64_mov_operand_p): Strip the salt
4187 * config/aarch64/constraints.md: Support const in Usw.
4189 2021-11-05 John David Anglin <danglin@gcc.gnu.org>
4191 * config/pa/pa.h (PREFERRED_DEBUGGING_TYPE): Define to DWARF2_DEBUG.
4192 * config/pa/pa64-hpux.h (PREFERRED_DEBUGGING_TYPE): Remove define.
4194 2021-11-05 Richard Biener <rguenther@suse.de>
4196 * tree-vectorizer.h (vec_info_shared::n_stmts): Add.
4197 (LOOP_VINFO_N_STMTS): Likewise.
4198 (vec_info_for_bb): Remove unused function.
4199 * tree-vectorizer.c (vec_info_shared::vec_info_shared):
4200 Initialize n_stmts member.
4201 * tree-vect-loop.c: Remove INCLUDE_FUNCTIONAL.
4202 (vect_create_loop_vinfo): Do not set loop->aux.
4203 (vect_analyze_loop_2): Do not get n_stmts as argument,
4204 instead use LOOP_VINFO_N_STMTS. Set LOOP_VINFO_VECTORIZABLE_P
4206 (vect_analyze_loop_1): Remove callback, get the mode iterator
4207 and autodetected_vector_mode as argument, advancing the
4208 iterator and initializing autodetected_vector_mode here.
4209 (vect_analyze_loop): Split analysis loop into two, first
4210 processing main loops only and then epilogues.
4212 2021-11-05 Martin Jambor <mjambor@suse.cz>
4214 * ipa-prop.c (compute_complex_assign_jump_func): Remove
4215 unnecessary check for RECORD_TYPE.
4217 2021-11-05 Gerald Pfeifer <gerald@pfeifer.com>
4219 * doc/install.texi2html: Do not generate old.html any longer.
4221 2021-11-05 Martin Liska <mliska@suse.cz>
4224 * opts.c (finish_options): Reset flag_gtoggle when it is used.
4226 2021-11-05 Jakub Jelinek <jakub@redhat.com>
4229 * dwarf2out.c (add_const_value_attribute): Add MODE argument, use it
4230 in CONST_WIDE_INT handling. Adjust recursive calls.
4231 (add_location_or_const_value_attribute): Pass DECL_MODE (decl) to
4232 new add_const_value_attribute argument.
4233 (tree_add_const_value_attribute): Pass TYPE_MODE (type) to new
4234 add_const_value_attribute argument.
4236 2021-11-05 Rasmus Villemoes <rasmus.villemoes@prevas.dk>
4238 * config/vx-common.h: Test value of TARGET_VXWORKS7 rather
4241 2021-11-05 Richard Biener <rguenther@suse.de>
4243 * tree-vectorizer.h (struct vect_loop_form_info): New.
4244 (vect_analyze_loop_form): Adjust.
4245 (vect_create_loop_vinfo): New.
4246 * tree-parloops.c (gather_scalar_reductions): Adjust for
4247 vect_analyze_loop_form API change.
4248 * tree-vect-loop.c: Include <functional>.
4249 (vect_analyze_loop_form_1): Rename to vect_analyze_loop_form,
4250 take struct vect_loop_form_info as output parameter and adjust.
4251 (vect_analyze_loop_form): Rename to vect_create_loop_vinfo and
4252 split out call to the original vect_analyze_loop_form_1.
4253 (vect_reanalyze_as_main_loop): Rename to...
4254 (vect_analyze_loop_1): ... this, factor out the call to
4255 vect_analyze_loop_form and generalize to be able to use it twice ...
4256 (vect_analyze_loop): ... here. Perform vect_analyze_loop_form
4259 2021-11-05 Xionghu Luo <luoxhu@linux.ibm.com>
4262 * config/rs6000/fusion.md: Regenerate.
4263 * config/rs6000/genfusion.pl: Fix incorrect clobber constraint.
4265 2021-11-04 Andreas Krebbel <krebbel@linux.ibm.com>
4267 * config/s390/s390.h (STACK_CHECK_MOVING_SP): New macro
4270 2021-11-04 Tamar Christina <tamar.christina@arm.com>
4272 * config/aarch64/aarch64-builtins.c
4273 (aarch64_general_gimple_fold_builtin): Add ashl, sshl, ushl, ashr,
4274 ashr_simd, lshr, lshr_simd.
4275 * config/aarch64/aarch64-simd-builtins.def (lshr): Use USHIFTIMM.
4276 * config/aarch64/arm_neon.h (vshr_n_u8, vshr_n_u16, vshr_n_u32,
4277 vshrq_n_u8, vshrq_n_u16, vshrq_n_u32, vshrq_n_u64): Fix type hack.
4279 2021-11-04 Tamar Christina <tamar.christina@arm.com>
4281 * match.pd: New negate+shift pattern.
4283 2021-11-04 Andrew MacLeod <amacleod@redhat.com>
4285 PR tree-optimization/103079
4286 * gimple-range-gori.cc (gimple_range_calc_op1): Treat undefined as
4288 (gimple_range_calc_op2): Ditto.
4290 2021-11-04 Martin Jambor <mjambor@suse.cz>
4293 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
4294 members remap_with_debug_expressions, m_dead_ssa_debug_equiv,
4295 m_dead_stmt_debug_equiv and prepare_debug_expressions. Added
4296 parameter to mark_dead_statements.
4297 * ipa-param-manipulation.c: Include tree-phinodes.h and cfgexpand.h.
4298 (ipa_param_body_adjustments::mark_dead_statements): New parameter
4299 debugstack, push into it all SSA names used in debug statements,
4300 produce m_dead_ssa_debug_equiv mapping for the removed param.
4301 (replace_with_mapped_expr): New function.
4302 (ipa_param_body_adjustments::remap_with_debug_expressions): Likewise.
4303 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
4304 (ipa_param_body_adjustments::common_initialization): Gather and
4305 procecc SSA which will be removed but are in debug statements. Simplify.
4306 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
4308 * tree-inline.c (remap_gimple_stmt): Create a debug bind when possible
4309 when avoiding a copy of an unnecessary statement. Remap removed SSA
4310 names in existing debug statements.
4311 (tree_function_versioning): Do not create DEBUG_EXPR_DECL for removed
4312 parameters if we have already done so.
4314 2021-11-04 Jan Hubicka <hubicka@ucw.cz>
4317 * gimple.c (gimple_call_static_chain_flags): Handle case when
4318 nested function does not bind locally.
4320 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4322 * config/aarch64/aarch64.c (aarch64_function_value): Generate
4323 a register rtx for Neon vector-tuple modes.
4324 (aarch64_layout_arg): Likewise.
4326 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4328 * lower-subreg.c (simple_move): Prevent decomposition if
4329 modes are not tieable.
4331 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4332 Richard Sandiford <richard.sandiford@arm.com>
4334 * config/aarch64/aarch64-builtins.c (v2x8qi_UP): Define.
4335 (v2x4hi_UP): Likewise.
4336 (v2x4hf_UP): Likewise.
4337 (v2x4bf_UP): Likewise.
4338 (v2x2si_UP): Likewise.
4339 (v2x2sf_UP): Likewise.
4340 (v2x1di_UP): Likewise.
4341 (v2x1df_UP): Likewise.
4342 (v2x16qi_UP): Likewise.
4343 (v2x8hi_UP): Likewise.
4344 (v2x8hf_UP): Likewise.
4345 (v2x8bf_UP): Likewise.
4346 (v2x4si_UP): Likewise.
4347 (v2x4sf_UP): Likewise.
4348 (v2x2di_UP): Likewise.
4349 (v2x2df_UP): Likewise.
4350 (v3x8qi_UP): Likewise.
4351 (v3x4hi_UP): Likewise.
4352 (v3x4hf_UP): Likewise.
4353 (v3x4bf_UP): Likewise.
4354 (v3x2si_UP): Likewise.
4355 (v3x2sf_UP): Likewise.
4356 (v3x1di_UP): Likewise.
4357 (v3x1df_UP): Likewise.
4358 (v3x16qi_UP): Likewise.
4359 (v3x8hi_UP): Likewise.
4360 (v3x8hf_UP): Likewise.
4361 (v3x8bf_UP): Likewise.
4362 (v3x4si_UP): Likewise.
4363 (v3x4sf_UP): Likewise.
4364 (v3x2di_UP): Likewise.
4365 (v3x2df_UP): Likewise.
4366 (v4x8qi_UP): Likewise.
4367 (v4x4hi_UP): Likewise.
4368 (v4x4hf_UP): Likewise.
4369 (v4x4bf_UP): Likewise.
4370 (v4x2si_UP): Likewise.
4371 (v4x2sf_UP): Likewise.
4372 (v4x1di_UP): Likewise.
4373 (v4x1df_UP): Likewise.
4374 (v4x16qi_UP): Likewise.
4375 (v4x8hi_UP): Likewise.
4376 (v4x8hf_UP): Likewise.
4377 (v4x8bf_UP): Likewise.
4378 (v4x4si_UP): Likewise.
4379 (v4x4sf_UP): Likewise.
4380 (v4x2di_UP): Likewise.
4381 (v4x2df_UP): Likewise.
4382 (TYPES_GETREGP): Delete.
4383 (TYPES_SETREGP): Likewise.
4384 (TYPES_LOADSTRUCT_U): Define.
4385 (TYPES_LOADSTRUCT_P): Likewise.
4386 (TYPES_LOADSTRUCT_LANE_U): Likewise.
4387 (TYPES_LOADSTRUCT_LANE_P): Likewise.
4388 (TYPES_STORE1P): Move for consistency.
4389 (TYPES_STORESTRUCT_U): Define.
4390 (TYPES_STORESTRUCT_P): Likewise.
4391 (TYPES_STORESTRUCT_LANE_U): Likewise.
4392 (TYPES_STORESTRUCT_LANE_P): Likewise.
4393 (aarch64_simd_tuple_types): Define.
4394 (aarch64_lookup_simd_builtin_type): Handle tuple type lookup.
4395 (aarch64_init_simd_builtin_functions): Update frontend lookup
4396 for builtin functions after handling arm_neon.h pragma.
4397 (register_tuple_type): Manually set modes of single-integer
4398 tuple types. Record tuple types.
4399 * config/aarch64/aarch64-modes.def
4400 (ADV_SIMD_D_REG_STRUCT_MODES): Define D-register tuple modes.
4401 (ADV_SIMD_Q_REG_STRUCT_MODES): Define Q-register tuple modes.
4402 (SVE_MODES): Give single-vector modes priority over vector-
4404 (VECTOR_MODES_WITH_PREFIX): Set partial-vector mode order to
4405 be after all single-vector modes.
4406 * config/aarch64/aarch64-simd-builtins.def: Update builtin
4407 generator macros to reflect modifications to the backend
4409 * config/aarch64/aarch64-simd.md (aarch64_simd_ld2<mode>):
4410 Use vector-tuple mode iterator and rename to...
4411 (aarch64_simd_ld2<vstruct_elt>): This.
4412 (aarch64_simd_ld2r<mode>): Use vector-tuple mode iterator and
4414 (aarch64_simd_ld2r<vstruct_elt>): This.
4415 (aarch64_vec_load_lanesoi_lane<mode>): Use vector-tuple mode
4416 iterator and rename to...
4417 (aarch64_vec_load_lanes<mode>_lane<vstruct_elt>): This.
4418 (vec_load_lanesoi<mode>): Use vector-tuple mode iterator and
4420 (vec_load_lanes<mode><vstruct_elt>): This.
4421 (aarch64_simd_st2<mode>): Use vector-tuple mode iterator and
4423 (aarch64_simd_st2<vstruct_elt>): This.
4424 (aarch64_vec_store_lanesoi_lane<mode>): Use vector-tuple mode
4425 iterator and rename to...
4426 (aarch64_vec_store_lanes<mode>_lane<vstruct_elt>): This.
4427 (vec_store_lanesoi<mode>): Use vector-tuple mode iterator and
4429 (vec_store_lanes<mode><vstruct_elt>): This.
4430 (aarch64_simd_ld3<mode>): Use vector-tuple mode iterator and
4432 (aarch64_simd_ld3<vstruct_elt>): This.
4433 (aarch64_simd_ld3r<mode>): Use vector-tuple mode iterator and
4435 (aarch64_simd_ld3r<vstruct_elt>): This.
4436 (aarch64_vec_load_lanesci_lane<mode>): Use vector-tuple mode
4437 iterator and rename to...
4438 (vec_load_lanesci<mode>): This.
4439 (aarch64_simd_st3<mode>): Use vector-tuple mode iterator and
4441 (aarch64_simd_st3<vstruct_elt>): This.
4442 (aarch64_vec_store_lanesci_lane<mode>): Use vector-tuple mode
4443 iterator and rename to...
4444 (vec_store_lanesci<mode>): This.
4445 (aarch64_simd_ld4<mode>): Use vector-tuple mode iterator and
4447 (aarch64_simd_ld4<vstruct_elt>): This.
4448 (aarch64_simd_ld4r<mode>): Use vector-tuple mode iterator and
4450 (aarch64_simd_ld4r<vstruct_elt>): This.
4451 (aarch64_vec_load_lanesxi_lane<mode>): Use vector-tuple mode
4452 iterator and rename to...
4453 (vec_load_lanesxi<mode>): This.
4454 (aarch64_simd_st4<mode>): Use vector-tuple mode iterator and
4456 (aarch64_simd_st4<vstruct_elt>): This.
4457 (aarch64_vec_store_lanesxi_lane<mode>): Use vector-tuple mode
4458 iterator and rename to...
4459 (vec_store_lanesxi<mode>): This.
4460 (mov<mode>): Define for Neon vector-tuple modes.
4461 (aarch64_ld1x3<VALLDIF:mode>): Use vector-tuple mode iterator
4463 (aarch64_ld1x3<vstruct_elt>): This.
4464 (aarch64_ld1_x3_<mode>): Use vector-tuple mode iterator and
4466 (aarch64_ld1_x3_<vstruct_elt>): This.
4467 (aarch64_ld1x4<VALLDIF:mode>): Use vector-tuple mode iterator
4469 (aarch64_ld1x4<vstruct_elt>): This.
4470 (aarch64_ld1_x4_<mode>): Use vector-tuple mode iterator and
4472 (aarch64_ld1_x4_<vstruct_elt>): This.
4473 (aarch64_st1x2<VALLDIF:mode>): Use vector-tuple mode iterator
4475 (aarch64_st1x2<vstruct_elt>): This.
4476 (aarch64_st1_x2_<mode>): Use vector-tuple mode iterator and
4478 (aarch64_st1_x2_<vstruct_elt>): This.
4479 (aarch64_st1x3<VALLDIF:mode>): Use vector-tuple mode iterator
4481 (aarch64_st1x3<vstruct_elt>): This.
4482 (aarch64_st1_x3_<mode>): Use vector-tuple mode iterator and
4484 (aarch64_st1_x3_<vstruct_elt>): This.
4485 (aarch64_st1x4<VALLDIF:mode>): Use vector-tuple mode iterator
4487 (aarch64_st1x4<vstruct_elt>): This.
4488 (aarch64_st1_x4_<mode>): Use vector-tuple mode iterator and
4490 (aarch64_st1_x4_<vstruct_elt>): This.
4491 (*aarch64_mov<mode>): Define for vector-tuple modes.
4492 (*aarch64_be_mov<mode>): Likewise.
4493 (aarch64_ld<VSTRUCT:nregs>r<VALLDIF:mode>): Use vector-tuple
4494 mode iterator and rename to...
4495 (aarch64_ld<nregs>r<vstruct_elt>): This.
4496 (aarch64_ld2<mode>_dreg): Use vector-tuple mode iterator and
4498 (aarch64_ld2<vstruct_elt>_dreg): This.
4499 (aarch64_ld3<mode>_dreg): Use vector-tuple mode iterator and
4501 (aarch64_ld3<vstruct_elt>_dreg): This.
4502 (aarch64_ld4<mode>_dreg): Use vector-tuple mode iterator and
4504 (aarch64_ld4<vstruct_elt>_dreg): This.
4505 (aarch64_ld<VSTRUCT:nregs><VDC:mode>): Use vector-tuple mode
4506 iterator and rename to...
4507 (aarch64_ld<nregs><vstruct_elt>): Use vector-tuple mode
4508 iterator and rename to...
4509 (aarch64_ld<VSTRUCT:nregs><VQ:mode>): Use vector-tuple mode
4510 (aarch64_ld1x2<VQ:mode>): Delete.
4511 (aarch64_ld1x2<VDC:mode>): Use vector-tuple mode iterator and
4513 (aarch64_ld1x2<vstruct_elt>): This.
4514 (aarch64_ld<VSTRUCT:nregs>_lane<VALLDIF:mode>): Use vector-
4515 tuple mode iterator and rename to...
4516 (aarch64_ld<nregs>_lane<vstruct_elt>): This.
4517 (aarch64_get_dreg<VSTRUCT:mode><VDC:mode>): Delete.
4518 (aarch64_get_qreg<VSTRUCT:mode><VQ:mode>): Likewise.
4519 (aarch64_st2<mode>_dreg): Use vector-tuple mode iterator and
4521 (aarch64_st2<vstruct_elt>_dreg): This.
4522 (aarch64_st3<mode>_dreg): Use vector-tuple mode iterator and
4524 (aarch64_st3<vstruct_elt>_dreg): This.
4525 (aarch64_st4<mode>_dreg): Use vector-tuple mode iterator and
4527 (aarch64_st4<vstruct_elt>_dreg): This.
4528 (aarch64_st<VSTRUCT:nregs><VDC:mode>): Use vector-tuple mode
4529 iterator and rename to...
4530 (aarch64_st<nregs><vstruct_elt>): This.
4531 (aarch64_st<VSTRUCT:nregs><VQ:mode>): Use vector-tuple mode
4532 iterator and rename to aarch64_st<nregs><vstruct_elt>.
4533 (aarch64_st<VSTRUCT:nregs>_lane<VALLDIF:mode>): Use vector-
4534 tuple mode iterator and rename to...
4535 (aarch64_st<nregs>_lane<vstruct_elt>): This.
4536 (aarch64_set_qreg<VSTRUCT:mode><VQ:mode>): Delete.
4537 (aarch64_simd_ld1<mode>_x2): Use vector-tuple mode iterator
4539 (aarch64_simd_ld1<vstruct_elt>_x2): This.
4540 * config/aarch64/aarch64.c (aarch64_advsimd_struct_mode_p):
4541 Refactor to include new vector-tuple modes.
4542 (aarch64_classify_vector_mode): Add cases for new vector-
4544 (aarch64_advsimd_partial_struct_mode_p): Define.
4545 (aarch64_advsimd_full_struct_mode_p): Likewise.
4546 (aarch64_advsimd_vector_array_mode): Likewise.
4547 (aarch64_sve_data_mode): Change location in file.
4548 (aarch64_array_mode): Handle case of Neon vector-tuple modes.
4549 (aarch64_hard_regno_nregs): Handle case of partial Neon
4551 (aarch64_classify_address): Refactor to include handling of
4552 Neon vector-tuple modes.
4553 (aarch64_print_operand): Print "d" for "%R" for a partial
4554 Neon vector structure.
4555 (aarch64_expand_vec_perm_1): Use new vector-tuple mode.
4556 (aarch64_modes_tieable_p): Prevent tieing Neon partial struct
4557 modes with scalar machines modes larger than 8 bytes.
4558 (aarch64_can_change_mode_class): Don't allow changes between
4559 partial and full Neon vector-structure modes.
4560 * config/aarch64/arm_neon.h (vst2_lane_f16): Use updated
4561 builtin and remove boiler-plate code for opaque mode.
4562 (vst2_lane_f32): Likewise.
4563 (vst2_lane_f64): Likewise.
4564 (vst2_lane_p8): Likewise.
4565 (vst2_lane_p16): Likewise.
4566 (vst2_lane_p64): Likewise.
4567 (vst2_lane_s8): Likewise.
4568 (vst2_lane_s16): Likewise.
4569 (vst2_lane_s32): Likewise.
4570 (vst2_lane_s64): Likewise.
4571 (vst2_lane_u8): Likewise.
4572 (vst2_lane_u16): Likewise.
4573 (vst2_lane_u32): Likewise.
4574 (vst2_lane_u64): Likewise.
4575 (vst2q_lane_f16): Likewise.
4576 (vst2q_lane_f32): Likewise.
4577 (vst2q_lane_f64): Likewise.
4578 (vst2q_lane_p8): Likewise.
4579 (vst2q_lane_p16): Likewise.
4580 (vst2q_lane_p64): Likewise.
4581 (vst2q_lane_s8): Likewise.
4582 (vst2q_lane_s16): Likewise.
4583 (vst2q_lane_s32): Likewise.
4584 (vst2q_lane_s64): Likewise.
4585 (vst2q_lane_u8): Likewise.
4586 (vst2q_lane_u16): Likewise.
4587 (vst2q_lane_u32): Likewise.
4588 (vst2q_lane_u64): Likewise.
4589 (vst3_lane_f16): Likewise.
4590 (vst3_lane_f32): Likewise.
4591 (vst3_lane_f64): Likewise.
4592 (vst3_lane_p8): Likewise.
4593 (vst3_lane_p16): Likewise.
4594 (vst3_lane_p64): Likewise.
4595 (vst3_lane_s8): Likewise.
4596 (vst3_lane_s16): Likewise.
4597 (vst3_lane_s32): Likewise.
4598 (vst3_lane_s64): Likewise.
4599 (vst3_lane_u8): Likewise.
4600 (vst3_lane_u16): Likewise.
4601 (vst3_lane_u32): Likewise.
4602 (vst3_lane_u64): Likewise.
4603 (vst3q_lane_f16): Likewise.
4604 (vst3q_lane_f32): Likewise.
4605 (vst3q_lane_f64): Likewise.
4606 (vst3q_lane_p8): Likewise.
4607 (vst3q_lane_p16): Likewise.
4608 (vst3q_lane_p64): Likewise.
4609 (vst3q_lane_s8): Likewise.
4610 (vst3q_lane_s16): Likewise.
4611 (vst3q_lane_s32): Likewise.
4612 (vst3q_lane_s64): Likewise.
4613 (vst3q_lane_u8): Likewise.
4614 (vst3q_lane_u16): Likewise.
4615 (vst3q_lane_u32): Likewise.
4616 (vst3q_lane_u64): Likewise.
4617 (vst4_lane_f16): Likewise.
4618 (vst4_lane_f32): Likewise.
4619 (vst4_lane_f64): Likewise.
4620 (vst4_lane_p8): Likewise.
4621 (vst4_lane_p16): Likewise.
4622 (vst4_lane_p64): Likewise.
4623 (vst4_lane_s8): Likewise.
4624 (vst4_lane_s16): Likewise.
4625 (vst4_lane_s32): Likewise.
4626 (vst4_lane_s64): Likewise.
4627 (vst4_lane_u8): Likewise.
4628 (vst4_lane_u16): Likewise.
4629 (vst4_lane_u32): Likewise.
4630 (vst4_lane_u64): Likewise.
4631 (vst4q_lane_f16): Likewise.
4632 (vst4q_lane_f32): Likewise.
4633 (vst4q_lane_f64): Likewise.
4634 (vst4q_lane_p8): Likewise.
4635 (vst4q_lane_p16): Likewise.
4636 (vst4q_lane_p64): Likewise.
4637 (vst4q_lane_s8): Likewise.
4638 (vst4q_lane_s16): Likewise.
4639 (vst4q_lane_s32): Likewise.
4640 (vst4q_lane_s64): Likewise.
4641 (vst4q_lane_u8): Likewise.
4642 (vst4q_lane_u16): Likewise.
4643 (vst4q_lane_u32): Likewise.
4644 (vst4q_lane_u64): Likewise.
4645 (vtbl3_s8): Likewise.
4646 (vtbl3_u8): Likewise.
4647 (vtbl3_p8): Likewise.
4648 (vtbl4_s8): Likewise.
4649 (vtbl4_u8): Likewise.
4650 (vtbl4_p8): Likewise.
4651 (vld1_u8_x3): Likewise.
4652 (vld1_s8_x3): Likewise.
4653 (vld1_u16_x3): Likewise.
4654 (vld1_s16_x3): Likewise.
4655 (vld1_u32_x3): Likewise.
4656 (vld1_s32_x3): Likewise.
4657 (vld1_u64_x3): Likewise.
4658 (vld1_s64_x3): Likewise.
4659 (vld1_f16_x3): Likewise.
4660 (vld1_f32_x3): Likewise.
4661 (vld1_f64_x3): Likewise.
4662 (vld1_p8_x3): Likewise.
4663 (vld1_p16_x3): Likewise.
4664 (vld1_p64_x3): Likewise.
4665 (vld1q_u8_x3): Likewise.
4666 (vld1q_s8_x3): Likewise.
4667 (vld1q_u16_x3): Likewise.
4668 (vld1q_s16_x3): Likewise.
4669 (vld1q_u32_x3): Likewise.
4670 (vld1q_s32_x3): Likewise.
4671 (vld1q_u64_x3): Likewise.
4672 (vld1q_s64_x3): Likewise.
4673 (vld1q_f16_x3): Likewise.
4674 (vld1q_f32_x3): Likewise.
4675 (vld1q_f64_x3): Likewise.
4676 (vld1q_p8_x3): Likewise.
4677 (vld1q_p16_x3): Likewise.
4678 (vld1q_p64_x3): Likewise.
4679 (vld1_u8_x2): Likewise.
4680 (vld1_s8_x2): Likewise.
4681 (vld1_u16_x2): Likewise.
4682 (vld1_s16_x2): Likewise.
4683 (vld1_u32_x2): Likewise.
4684 (vld1_s32_x2): Likewise.
4685 (vld1_u64_x2): Likewise.
4686 (vld1_s64_x2): Likewise.
4687 (vld1_f16_x2): Likewise.
4688 (vld1_f32_x2): Likewise.
4689 (vld1_f64_x2): Likewise.
4690 (vld1_p8_x2): Likewise.
4691 (vld1_p16_x2): Likewise.
4692 (vld1_p64_x2): Likewise.
4693 (vld1q_u8_x2): Likewise.
4694 (vld1q_s8_x2): Likewise.
4695 (vld1q_u16_x2): Likewise.
4696 (vld1q_s16_x2): Likewise.
4697 (vld1q_u32_x2): Likewise.
4698 (vld1q_s32_x2): Likewise.
4699 (vld1q_u64_x2): Likewise.
4700 (vld1q_s64_x2): Likewise.
4701 (vld1q_f16_x2): Likewise.
4702 (vld1q_f32_x2): Likewise.
4703 (vld1q_f64_x2): Likewise.
4704 (vld1q_p8_x2): Likewise.
4705 (vld1q_p16_x2): Likewise.
4706 (vld1q_p64_x2): Likewise.
4707 (vld1_s8_x4): Likewise.
4708 (vld1q_s8_x4): Likewise.
4709 (vld1_s16_x4): Likewise.
4710 (vld1q_s16_x4): Likewise.
4711 (vld1_s32_x4): Likewise.
4712 (vld1q_s32_x4): Likewise.
4713 (vld1_u8_x4): Likewise.
4714 (vld1q_u8_x4): Likewise.
4715 (vld1_u16_x4): Likewise.
4716 (vld1q_u16_x4): Likewise.
4717 (vld1_u32_x4): Likewise.
4718 (vld1q_u32_x4): Likewise.
4719 (vld1_f16_x4): Likewise.
4720 (vld1q_f16_x4): Likewise.
4721 (vld1_f32_x4): Likewise.
4722 (vld1q_f32_x4): Likewise.
4723 (vld1_p8_x4): Likewise.
4724 (vld1q_p8_x4): Likewise.
4725 (vld1_p16_x4): Likewise.
4726 (vld1q_p16_x4): Likewise.
4727 (vld1_s64_x4): Likewise.
4728 (vld1_u64_x4): Likewise.
4729 (vld1_p64_x4): Likewise.
4730 (vld1q_s64_x4): Likewise.
4731 (vld1q_u64_x4): Likewise.
4732 (vld1q_p64_x4): Likewise.
4733 (vld1_f64_x4): Likewise.
4734 (vld1q_f64_x4): Likewise.
4735 (vld2_s64): Likewise.
4736 (vld2_u64): Likewise.
4737 (vld2_f64): Likewise.
4738 (vld2_s8): Likewise.
4739 (vld2_p8): Likewise.
4740 (vld2_p64): Likewise.
4741 (vld2_s16): Likewise.
4742 (vld2_p16): Likewise.
4743 (vld2_s32): Likewise.
4744 (vld2_u8): Likewise.
4745 (vld2_u16): Likewise.
4746 (vld2_u32): Likewise.
4747 (vld2_f16): Likewise.
4748 (vld2_f32): Likewise.
4749 (vld2q_s8): Likewise.
4750 (vld2q_p8): Likewise.
4751 (vld2q_s16): Likewise.
4752 (vld2q_p16): Likewise.
4753 (vld2q_p64): Likewise.
4754 (vld2q_s32): Likewise.
4755 (vld2q_s64): Likewise.
4756 (vld2q_u8): Likewise.
4757 (vld2q_u16): Likewise.
4758 (vld2q_u32): Likewise.
4759 (vld2q_u64): Likewise.
4760 (vld2q_f16): Likewise.
4761 (vld2q_f32): Likewise.
4762 (vld2q_f64): Likewise.
4763 (vld3_s64): Likewise.
4764 (vld3_u64): Likewise.
4765 (vld3_f64): Likewise.
4766 (vld3_s8): Likewise.
4767 (vld3_p8): Likewise.
4768 (vld3_s16): Likewise.
4769 (vld3_p16): Likewise.
4770 (vld3_s32): Likewise.
4771 (vld3_u8): Likewise.
4772 (vld3_u16): Likewise.
4773 (vld3_u32): Likewise.
4774 (vld3_f16): Likewise.
4775 (vld3_f32): Likewise.
4776 (vld3_p64): Likewise.
4777 (vld3q_s8): Likewise.
4778 (vld3q_p8): Likewise.
4779 (vld3q_s16): Likewise.
4780 (vld3q_p16): Likewise.
4781 (vld3q_s32): Likewise.
4782 (vld3q_s64): Likewise.
4783 (vld3q_u8): Likewise.
4784 (vld3q_u16): Likewise.
4785 (vld3q_u32): Likewise.
4786 (vld3q_u64): Likewise.
4787 (vld3q_f16): Likewise.
4788 (vld3q_f32): Likewise.
4789 (vld3q_f64): Likewise.
4790 (vld3q_p64): Likewise.
4791 (vld4_s64): Likewise.
4792 (vld4_u64): Likewise.
4793 (vld4_f64): Likewise.
4794 (vld4_s8): Likewise.
4795 (vld4_p8): Likewise.
4796 (vld4_s16): Likewise.
4797 (vld4_p16): Likewise.
4798 (vld4_s32): Likewise.
4799 (vld4_u8): Likewise.
4800 (vld4_u16): Likewise.
4801 (vld4_u32): Likewise.
4802 (vld4_f16): Likewise.
4803 (vld4_f32): Likewise.
4804 (vld4_p64): Likewise.
4805 (vld4q_s8): Likewise.
4806 (vld4q_p8): Likewise.
4807 (vld4q_s16): Likewise.
4808 (vld4q_p16): Likewise.
4809 (vld4q_s32): Likewise.
4810 (vld4q_s64): Likewise.
4811 (vld4q_u8): Likewise.
4812 (vld4q_u16): Likewise.
4813 (vld4q_u32): Likewise.
4814 (vld4q_u64): Likewise.
4815 (vld4q_f16): Likewise.
4816 (vld4q_f32): Likewise.
4817 (vld4q_f64): Likewise.
4818 (vld4q_p64): Likewise.
4819 (vld2_dup_s8): Likewise.
4820 (vld2_dup_s16): Likewise.
4821 (vld2_dup_s32): Likewise.
4822 (vld2_dup_f16): Likewise.
4823 (vld2_dup_f32): Likewise.
4824 (vld2_dup_f64): Likewise.
4825 (vld2_dup_u8): Likewise.
4826 (vld2_dup_u16): Likewise.
4827 (vld2_dup_u32): Likewise.
4828 (vld2_dup_p8): Likewise.
4829 (vld2_dup_p16): Likewise.
4830 (vld2_dup_p64): Likewise.
4831 (vld2_dup_s64): Likewise.
4832 (vld2_dup_u64): Likewise.
4833 (vld2q_dup_s8): Likewise.
4834 (vld2q_dup_p8): Likewise.
4835 (vld2q_dup_s16): Likewise.
4836 (vld2q_dup_p16): Likewise.
4837 (vld2q_dup_s32): Likewise.
4838 (vld2q_dup_s64): Likewise.
4839 (vld2q_dup_u8): Likewise.
4840 (vld2q_dup_u16): Likewise.
4841 (vld2q_dup_u32): Likewise.
4842 (vld2q_dup_u64): Likewise.
4843 (vld2q_dup_f16): Likewise.
4844 (vld2q_dup_f32): Likewise.
4845 (vld2q_dup_f64): Likewise.
4846 (vld2q_dup_p64): Likewise.
4847 (vld3_dup_s64): Likewise.
4848 (vld3_dup_u64): Likewise.
4849 (vld3_dup_f64): Likewise.
4850 (vld3_dup_s8): Likewise.
4851 (vld3_dup_p8): Likewise.
4852 (vld3_dup_s16): Likewise.
4853 (vld3_dup_p16): Likewise.
4854 (vld3_dup_s32): Likewise.
4855 (vld3_dup_u8): Likewise.
4856 (vld3_dup_u16): Likewise.
4857 (vld3_dup_u32): Likewise.
4858 (vld3_dup_f16): Likewise.
4859 (vld3_dup_f32): Likewise.
4860 (vld3_dup_p64): Likewise.
4861 (vld3q_dup_s8): Likewise.
4862 (vld3q_dup_p8): Likewise.
4863 (vld3q_dup_s16): Likewise.
4864 (vld3q_dup_p16): Likewise.
4865 (vld3q_dup_s32): Likewise.
4866 (vld3q_dup_s64): Likewise.
4867 (vld3q_dup_u8): Likewise.
4868 (vld3q_dup_u16): Likewise.
4869 (vld3q_dup_u32): Likewise.
4870 (vld3q_dup_u64): Likewise.
4871 (vld3q_dup_f16): Likewise.
4872 (vld3q_dup_f32): Likewise.
4873 (vld3q_dup_f64): Likewise.
4874 (vld3q_dup_p64): Likewise.
4875 (vld4_dup_s64): Likewise.
4876 (vld4_dup_u64): Likewise.
4877 (vld4_dup_f64): Likewise.
4878 (vld4_dup_s8): Likewise.
4879 (vld4_dup_p8): Likewise.
4880 (vld4_dup_s16): Likewise.
4881 (vld4_dup_p16): Likewise.
4882 (vld4_dup_s32): Likewise.
4883 (vld4_dup_u8): Likewise.
4884 (vld4_dup_u16): Likewise.
4885 (vld4_dup_u32): Likewise.
4886 (vld4_dup_f16): Likewise.
4887 (vld4_dup_f32): Likewise.
4888 (vld4_dup_p64): Likewise.
4889 (vld4q_dup_s8): Likewise.
4890 (vld4q_dup_p8): Likewise.
4891 (vld4q_dup_s16): Likewise.
4892 (vld4q_dup_p16): Likewise.
4893 (vld4q_dup_s32): Likewise.
4894 (vld4q_dup_s64): Likewise.
4895 (vld4q_dup_u8): Likewise.
4896 (vld4q_dup_u16): Likewise.
4897 (vld4q_dup_u32): Likewise.
4898 (vld4q_dup_u64): Likewise.
4899 (vld4q_dup_f16): Likewise.
4900 (vld4q_dup_f32): Likewise.
4901 (vld4q_dup_f64): Likewise.
4902 (vld4q_dup_p64): Likewise.
4903 (vld2_lane_u8): Likewise.
4904 (vld2_lane_u16): Likewise.
4905 (vld2_lane_u32): Likewise.
4906 (vld2_lane_u64): Likewise.
4907 (vld2_lane_s8): Likewise.
4908 (vld2_lane_s16): Likewise.
4909 (vld2_lane_s32): Likewise.
4910 (vld2_lane_s64): Likewise.
4911 (vld2_lane_f16): Likewise.
4912 (vld2_lane_f32): Likewise.
4913 (vld2_lane_f64): Likewise.
4914 (vld2_lane_p8): Likewise.
4915 (vld2_lane_p16): Likewise.
4916 (vld2_lane_p64): Likewise.
4917 (vld2q_lane_u8): Likewise.
4918 (vld2q_lane_u16): Likewise.
4919 (vld2q_lane_u32): Likewise.
4920 (vld2q_lane_u64): Likewise.
4921 (vld2q_lane_s8): Likewise.
4922 (vld2q_lane_s16): Likewise.
4923 (vld2q_lane_s32): Likewise.
4924 (vld2q_lane_s64): Likewise.
4925 (vld2q_lane_f16): Likewise.
4926 (vld2q_lane_f32): Likewise.
4927 (vld2q_lane_f64): Likewise.
4928 (vld2q_lane_p8): Likewise.
4929 (vld2q_lane_p16): Likewise.
4930 (vld2q_lane_p64): Likewise.
4931 (vld3_lane_u8): Likewise.
4932 (vld3_lane_u16): Likewise.
4933 (vld3_lane_u32): Likewise.
4934 (vld3_lane_u64): Likewise.
4935 (vld3_lane_s8): Likewise.
4936 (vld3_lane_s16): Likewise.
4937 (vld3_lane_s32): Likewise.
4938 (vld3_lane_s64): Likewise.
4939 (vld3_lane_f16): Likewise.
4940 (vld3_lane_f32): Likewise.
4941 (vld3_lane_f64): Likewise.
4942 (vld3_lane_p8): Likewise.
4943 (vld3_lane_p16): Likewise.
4944 (vld3_lane_p64): Likewise.
4945 (vld3q_lane_u8): Likewise.
4946 (vld3q_lane_u16): Likewise.
4947 (vld3q_lane_u32): Likewise.
4948 (vld3q_lane_u64): Likewise.
4949 (vld3q_lane_s8): Likewise.
4950 (vld3q_lane_s16): Likewise.
4951 (vld3q_lane_s32): Likewise.
4952 (vld3q_lane_s64): Likewise.
4953 (vld3q_lane_f16): Likewise.
4954 (vld3q_lane_f32): Likewise.
4955 (vld3q_lane_f64): Likewise.
4956 (vld3q_lane_p8): Likewise.
4957 (vld3q_lane_p16): Likewise.
4958 (vld3q_lane_p64): Likewise.
4959 (vld4_lane_u8): Likewise.
4960 (vld4_lane_u16): Likewise.
4961 (vld4_lane_u32): Likewise.
4962 (vld4_lane_u64): Likewise.
4963 (vld4_lane_s8): Likewise.
4964 (vld4_lane_s16): Likewise.
4965 (vld4_lane_s32): Likewise.
4966 (vld4_lane_s64): Likewise.
4967 (vld4_lane_f16): Likewise.
4968 (vld4_lane_f32): Likewise.
4969 (vld4_lane_f64): Likewise.
4970 (vld4_lane_p8): Likewise.
4971 (vld4_lane_p16): Likewise.
4972 (vld4_lane_p64): Likewise.
4973 (vld4q_lane_u8): Likewise.
4974 (vld4q_lane_u16): Likewise.
4975 (vld4q_lane_u32): Likewise.
4976 (vld4q_lane_u64): Likewise.
4977 (vld4q_lane_s8): Likewise.
4978 (vld4q_lane_s16): Likewise.
4979 (vld4q_lane_s32): Likewise.
4980 (vld4q_lane_s64): Likewise.
4981 (vld4q_lane_f16): Likewise.
4982 (vld4q_lane_f32): Likewise.
4983 (vld4q_lane_f64): Likewise.
4984 (vld4q_lane_p8): Likewise.
4985 (vld4q_lane_p16): Likewise.
4986 (vld4q_lane_p64): Likewise.
4987 (vqtbl2_s8): Likewise.
4988 (vqtbl2_u8): Likewise.
4989 (vqtbl2_p8): Likewise.
4990 (vqtbl2q_s8): Likewise.
4991 (vqtbl2q_u8): Likewise.
4992 (vqtbl2q_p8): Likewise.
4993 (vqtbl3_s8): Likewise.
4994 (vqtbl3_u8): Likewise.
4995 (vqtbl3_p8): Likewise.
4996 (vqtbl3q_s8): Likewise.
4997 (vqtbl3q_u8): Likewise.
4998 (vqtbl3q_p8): Likewise.
4999 (vqtbl4_s8): Likewise.
5000 (vqtbl4_u8): Likewise.
5001 (vqtbl4_p8): Likewise.
5002 (vqtbl4q_s8): Likewise.
5003 (vqtbl4q_u8): Likewise.
5004 (vqtbl4q_p8): Likewise.
5005 (vqtbx2_s8): Likewise.
5006 (vqtbx2_u8): Likewise.
5007 (vqtbx2_p8): Likewise.
5008 (vqtbx2q_s8): Likewise.
5009 (vqtbx2q_u8): Likewise.
5010 (vqtbx2q_p8): Likewise.
5011 (vqtbx3_s8): Likewise.
5012 (vqtbx3_u8): Likewise.
5013 (vqtbx3_p8): Likewise.
5014 (vqtbx3q_s8): Likewise.
5015 (vqtbx3q_u8): Likewise.
5016 (vqtbx3q_p8): Likewise.
5017 (vqtbx4_s8): Likewise.
5018 (vqtbx4_u8): Likewise.
5019 (vqtbx4_p8): Likewise.
5020 (vqtbx4q_s8): Likewise.
5021 (vqtbx4q_u8): Likewise.
5022 (vqtbx4q_p8): Likewise.
5023 (vst1_s64_x2): Likewise.
5024 (vst1_u64_x2): Likewise.
5025 (vst1_f64_x2): Likewise.
5026 (vst1_s8_x2): Likewise.
5027 (vst1_p8_x2): Likewise.
5028 (vst1_s16_x2): Likewise.
5029 (vst1_p16_x2): Likewise.
5030 (vst1_s32_x2): Likewise.
5031 (vst1_u8_x2): Likewise.
5032 (vst1_u16_x2): Likewise.
5033 (vst1_u32_x2): Likewise.
5034 (vst1_f16_x2): Likewise.
5035 (vst1_f32_x2): Likewise.
5036 (vst1_p64_x2): Likewise.
5037 (vst1q_s8_x2): Likewise.
5038 (vst1q_p8_x2): Likewise.
5039 (vst1q_s16_x2): Likewise.
5040 (vst1q_p16_x2): Likewise.
5041 (vst1q_s32_x2): Likewise.
5042 (vst1q_s64_x2): Likewise.
5043 (vst1q_u8_x2): Likewise.
5044 (vst1q_u16_x2): Likewise.
5045 (vst1q_u32_x2): Likewise.
5046 (vst1q_u64_x2): Likewise.
5047 (vst1q_f16_x2): Likewise.
5048 (vst1q_f32_x2): Likewise.
5049 (vst1q_f64_x2): Likewise.
5050 (vst1q_p64_x2): Likewise.
5051 (vst1_s64_x3): Likewise.
5052 (vst1_u64_x3): Likewise.
5053 (vst1_f64_x3): Likewise.
5054 (vst1_s8_x3): Likewise.
5055 (vst1_p8_x3): Likewise.
5056 (vst1_s16_x3): Likewise.
5057 (vst1_p16_x3): Likewise.
5058 (vst1_s32_x3): Likewise.
5059 (vst1_u8_x3): Likewise.
5060 (vst1_u16_x3): Likewise.
5061 (vst1_u32_x3): Likewise.
5062 (vst1_f16_x3): Likewise.
5063 (vst1_f32_x3): Likewise.
5064 (vst1_p64_x3): Likewise.
5065 (vst1q_s8_x3): Likewise.
5066 (vst1q_p8_x3): Likewise.
5067 (vst1q_s16_x3): Likewise.
5068 (vst1q_p16_x3): Likewise.
5069 (vst1q_s32_x3): Likewise.
5070 (vst1q_s64_x3): Likewise.
5071 (vst1q_u8_x3): Likewise.
5072 (vst1q_u16_x3): Likewise.
5073 (vst1q_u32_x3): Likewise.
5074 (vst1q_u64_x3): Likewise.
5075 (vst1q_f16_x3): Likewise.
5076 (vst1q_f32_x3): Likewise.
5077 (vst1q_f64_x3): Likewise.
5078 (vst1q_p64_x3): Likewise.
5079 (vst1_s8_x4): Likewise.
5080 (vst1q_s8_x4): Likewise.
5081 (vst1_s16_x4): Likewise.
5082 (vst1q_s16_x4): Likewise.
5083 (vst1_s32_x4): Likewise.
5084 (vst1q_s32_x4): Likewise.
5085 (vst1_u8_x4): Likewise.
5086 (vst1q_u8_x4): Likewise.
5087 (vst1_u16_x4): Likewise.
5088 (vst1q_u16_x4): Likewise.
5089 (vst1_u32_x4): Likewise.
5090 (vst1q_u32_x4): Likewise.
5091 (vst1_f16_x4): Likewise.
5092 (vst1q_f16_x4): Likewise.
5093 (vst1_f32_x4): Likewise.
5094 (vst1q_f32_x4): Likewise.
5095 (vst1_p8_x4): Likewise.
5096 (vst1q_p8_x4): Likewise.
5097 (vst1_p16_x4): Likewise.
5098 (vst1q_p16_x4): Likewise.
5099 (vst1_s64_x4): Likewise.
5100 (vst1_u64_x4): Likewise.
5101 (vst1_p64_x4): Likewise.
5102 (vst1q_s64_x4): Likewise.
5103 (vst1q_u64_x4): Likewise.
5104 (vst1q_p64_x4): Likewise.
5105 (vst1_f64_x4): Likewise.
5106 (vst1q_f64_x4): Likewise.
5107 (vst2_s64): Likewise.
5108 (vst2_u64): Likewise.
5109 (vst2_f64): Likewise.
5110 (vst2_s8): Likewise.
5111 (vst2_p8): Likewise.
5112 (vst2_s16): Likewise.
5113 (vst2_p16): Likewise.
5114 (vst2_s32): Likewise.
5115 (vst2_u8): Likewise.
5116 (vst2_u16): Likewise.
5117 (vst2_u32): Likewise.
5118 (vst2_f16): Likewise.
5119 (vst2_f32): Likewise.
5120 (vst2_p64): Likewise.
5121 (vst2q_s8): Likewise.
5122 (vst2q_p8): Likewise.
5123 (vst2q_s16): Likewise.
5124 (vst2q_p16): Likewise.
5125 (vst2q_s32): Likewise.
5126 (vst2q_s64): Likewise.
5127 (vst2q_u8): Likewise.
5128 (vst2q_u16): Likewise.
5129 (vst2q_u32): Likewise.
5130 (vst2q_u64): Likewise.
5131 (vst2q_f16): Likewise.
5132 (vst2q_f32): Likewise.
5133 (vst2q_f64): Likewise.
5134 (vst2q_p64): Likewise.
5135 (vst3_s64): Likewise.
5136 (vst3_u64): Likewise.
5137 (vst3_f64): Likewise.
5138 (vst3_s8): Likewise.
5139 (vst3_p8): Likewise.
5140 (vst3_s16): Likewise.
5141 (vst3_p16): Likewise.
5142 (vst3_s32): Likewise.
5143 (vst3_u8): Likewise.
5144 (vst3_u16): Likewise.
5145 (vst3_u32): Likewise.
5146 (vst3_f16): Likewise.
5147 (vst3_f32): Likewise.
5148 (vst3_p64): Likewise.
5149 (vst3q_s8): Likewise.
5150 (vst3q_p8): Likewise.
5151 (vst3q_s16): Likewise.
5152 (vst3q_p16): Likewise.
5153 (vst3q_s32): Likewise.
5154 (vst3q_s64): Likewise.
5155 (vst3q_u8): Likewise.
5156 (vst3q_u16): Likewise.
5157 (vst3q_u32): Likewise.
5158 (vst3q_u64): Likewise.
5159 (vst3q_f16): Likewise.
5160 (vst3q_f32): Likewise.
5161 (vst3q_f64): Likewise.
5162 (vst3q_p64): Likewise.
5163 (vst4_s64): Likewise.
5164 (vst4_u64): Likewise.
5165 (vst4_f64): Likewise.
5166 (vst4_s8): Likewise.
5167 (vst4_p8): Likewise.
5168 (vst4_s16): Likewise.
5169 (vst4_p16): Likewise.
5170 (vst4_s32): Likewise.
5171 (vst4_u8): Likewise.
5172 (vst4_u16): Likewise.
5173 (vst4_u32): Likewise.
5174 (vst4_f16): Likewise.
5175 (vst4_f32): Likewise.
5176 (vst4_p64): Likewise.
5177 (vst4q_s8): Likewise.
5178 (vst4q_p8): Likewise.
5179 (vst4q_s16): Likewise.
5180 (vst4q_p16): Likewise.
5181 (vst4q_s32): Likewise.
5182 (vst4q_s64): Likewise.
5183 (vst4q_u8): Likewise.
5184 (vst4q_u16): Likewise.
5185 (vst4q_u32): Likewise.
5186 (vst4q_u64): Likewise.
5187 (vst4q_f16): Likewise.
5188 (vst4q_f32): Likewise.
5189 (vst4q_f64): Likewise.
5190 (vst4q_p64): Likewise.
5191 (vtbx4_s8): Likewise.
5192 (vtbx4_u8): Likewise.
5193 (vtbx4_p8): Likewise.
5194 (vld1_bf16_x2): Likewise.
5195 (vld1q_bf16_x2): Likewise.
5196 (vld1_bf16_x3): Likewise.
5197 (vld1q_bf16_x3): Likewise.
5198 (vld1_bf16_x4): Likewise.
5199 (vld1q_bf16_x4): Likewise.
5200 (vld2_bf16): Likewise.
5201 (vld2q_bf16): Likewise.
5202 (vld2_dup_bf16): Likewise.
5203 (vld2q_dup_bf16): Likewise.
5204 (vld3_bf16): Likewise.
5205 (vld3q_bf16): Likewise.
5206 (vld3_dup_bf16): Likewise.
5207 (vld3q_dup_bf16): Likewise.
5208 (vld4_bf16): Likewise.
5209 (vld4q_bf16): Likewise.
5210 (vld4_dup_bf16): Likewise.
5211 (vld4q_dup_bf16): Likewise.
5212 (vst1_bf16_x2): Likewise.
5213 (vst1q_bf16_x2): Likewise.
5214 (vst1_bf16_x3): Likewise.
5215 (vst1q_bf16_x3): Likewise.
5216 (vst1_bf16_x4): Likewise.
5217 (vst1q_bf16_x4): Likewise.
5218 (vst2_bf16): Likewise.
5219 (vst2q_bf16): Likewise.
5220 (vst3_bf16): Likewise.
5221 (vst3q_bf16): Likewise.
5222 (vst4_bf16): Likewise.
5223 (vst4q_bf16): Likewise.
5224 (vld2_lane_bf16): Likewise.
5225 (vld2q_lane_bf16): Likewise.
5226 (vld3_lane_bf16): Likewise.
5227 (vld3q_lane_bf16): Likewise.
5228 (vld4_lane_bf16): Likewise.
5229 (vld4q_lane_bf16): Likewise.
5230 (vst2_lane_bf16): Likewise.
5231 (vst2q_lane_bf16): Likewise.
5232 (vst3_lane_bf16): Likewise.
5233 (vst3q_lane_bf16): Likewise.
5234 (vst4_lane_bf16): Likewise.
5235 (vst4q_lane_bf16): Likewise.
5236 * config/aarch64/geniterators.sh: Modify iterator regex to
5237 match new vector-tuple modes.
5238 * config/aarch64/iterators.md (insn_count): Extend mode
5239 attribute with vector-tuple type information.
5241 (Vendreg): Likewise.
5244 (VSTRUCT_2D): New mode iterator.
5245 (VSTRUCT_2DNX): Likewise.
5246 (VSTRUCT_2DX): Likewise.
5247 (VSTRUCT_2Q): Likewise.
5248 (VSTRUCT_2QD): Likewise.
5249 (VSTRUCT_3D): Likewise.
5250 (VSTRUCT_3DNX): Likewise.
5251 (VSTRUCT_3DX): Likewise.
5252 (VSTRUCT_3Q): Likewise.
5253 (VSTRUCT_3QD): Likewise.
5254 (VSTRUCT_4D): Likewise.
5255 (VSTRUCT_4DNX): Likewise.
5256 (VSTRUCT_4DX): Likewise.
5257 (VSTRUCT_4Q): Likewise.
5258 (VSTRUCT_4QD): Likewise.
5259 (VSTRUCT_D): Likewise.
5260 (VSTRUCT_Q): Likewise.
5261 (VSTRUCT_QD): Likewise.
5262 (VSTRUCT_ELT): New mode attribute.
5263 (vstruct_elt): Likewise.
5264 * genmodes.c (VECTOR_MODE): Add default prefix and order
5266 (VECTOR_MODE_WITH_PREFIX): Define.
5267 (make_vector_mode): Add mode prefix and order parameters.
5269 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5271 * expmed.c (extract_bit_field_1): Ensure modes are tieable.
5273 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5275 * expr.c (emit_group_load_1): Remove historic workaround.
5277 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5279 * config/aarch64/aarch64-builtins.c (aarch64_init_simd_builtins):
5280 Factor out main loop to...
5281 (aarch64_init_simd_builtin_functions): This new function.
5282 (register_tuple_type): Define.
5283 (aarch64_scalar_builtin_type_p): Define.
5284 (handle_arm_neon_h): Define.
5285 * config/aarch64/aarch64-c.c (aarch64_pragma_aarch64): Handle
5286 pragma for arm_neon.h.
5287 * config/aarch64/aarch64-protos.h (aarch64_advsimd_struct_mode_p):
5289 (handle_arm_neon_h): Likewise.
5290 * config/aarch64/aarch64.c (aarch64_advsimd_struct_mode_p):
5291 Remove static modifier.
5292 * config/aarch64/arm_neon.h (target): Remove Neon vector
5293 structure type definitions.
5295 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
5297 PR tree-optimization/102943
5298 * gimple-range-path.cc (path_range_query::range_on_path_entry):
5299 Prefer range_of_expr unless there are no statements in the BB.
5301 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
5303 PR tree-optimization/102943
5304 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
5305 Avoid duplicate calculation of paths.
5307 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
5309 PR tree-optimization/102943
5310 * gimple-range-path.cc (path_range_query::compute_phi_relations):
5311 Only compute relations for SSA names in the import list.
5312 (path_range_query::compute_outgoing_relations): Same.
5313 * gimple-range-path.h (path_range_query::import_p): New.
5315 2021-11-04 Richard Biener <rguenther@suse.de>
5317 PR rtl-optimization/103075
5318 * simplify-rtx.c (exact_int_to_float_conversion_p): Return
5319 false for a VOIDmode operand.
5321 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
5323 * config/aarch64/aarch64.c (aarch64_vector_costs): Make member
5324 variables private and add "m_" to their names. Remove is_loop.
5325 (aarch64_record_potential_advsimd_unrolling): Replace with...
5326 (aarch64_vector_costs::record_potential_advsimd_unrolling): ...this.
5327 (aarch64_analyze_loop_vinfo): Replace with...
5328 (aarch64_vector_costs::analyze_loop_vinfo): ...this.
5329 Move initialization of (m_)vec_flags to add_stmt_cost.
5330 (aarch64_analyze_bb_vinfo): Delete.
5331 (aarch64_count_ops): Replace with...
5332 (aarch64_vector_costs::count_ops): ...this.
5333 (aarch64_vector_costs::add_stmt_cost): Set m_vec_flags,
5334 using m_costing_for_scalar to test whether we're costing
5335 scalar or vector code.
5336 (aarch64_adjust_body_cost_sve): Replace with...
5337 (aarch64_vector_costs::adjust_body_cost_sve): ...this.
5338 (aarch64_adjust_body_cost): Replace with...
5339 (aarch64_vector_costs::adjust_body_cost): ...this.
5340 (aarch64_vector_costs::finish_cost): Use m_vinfo instead of is_loop.
5342 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
5344 * target.def (targetm.vectorize.init_cost): Replace with...
5345 (targetm.vectorize.create_costs): ...this.
5346 (targetm.vectorize.add_stmt_cost): Delete.
5347 (targetm.vectorize.finish_cost): Likewise.
5348 (targetm.vectorize.destroy_cost_data): Likewise.
5349 * doc/tm.texi.in (TARGET_VECTORIZE_INIT_COST): Replace with...
5350 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
5351 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
5352 (TARGET_VECTORIZE_FINISH_COST): Likewise.
5353 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
5354 * doc/tm.texi: Regenerate.
5355 * tree-vectorizer.h (vec_info::vec_info): Remove target_cost_data
5357 (vec_info::target_cost_data): Change from a void * to a vector_costs *.
5358 (vector_costs): New class.
5359 (init_cost): Take a vec_info and return a vector_costs.
5360 (dump_stmt_cost): Remove data parameter.
5361 (add_stmt_cost): Replace vinfo and data parameters with a vector_costs.
5362 (add_stmt_costs): Likewise.
5363 (finish_cost): Replace data parameter with a vector_costs.
5364 (destroy_cost_data): Delete.
5365 * tree-vectorizer.c (dump_stmt_cost): Remove data argument and
5367 (vec_info::vec_info): Remove the target_cost_data parameter and
5368 initialize the member variable to null instead.
5369 (vec_info::~vec_info): Delete target_cost_data instead of calling
5371 (vector_costs::add_stmt_cost): New function.
5372 (vector_costs::finish_cost): Likewise.
5373 (vector_costs::record_stmt_cost): Likewise.
5374 (vector_costs::adjust_cost_for_freq): Likewise.
5375 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
5376 call to vec_info::vec_info.
5377 (vect_compute_single_scalar_iteration_cost): Update after above
5378 changes to costing interface.
5379 (vect_analyze_loop_operations): Likewise.
5380 (vect_estimate_min_profitable_iters): Likewise.
5381 (vect_analyze_loop_2): Initialize LOOP_VINFO_TARGET_COST_DATA
5382 at the start_over point, where it needs to be recreated after
5383 trying without slp. Update retry code accordingly.
5384 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Update call
5385 to vec_info::vec_info.
5386 (vect_slp_analyze_operation): Update after above changes to costing
5388 (vect_bb_vectorization_profitable_p): Likewise.
5389 * targhooks.h (default_init_cost): Replace with...
5390 (default_vectorize_create_costs): ...this.
5391 (default_add_stmt_cost): Delete.
5392 (default_finish_cost, default_destroy_cost_data): Likewise.
5393 * targhooks.c (default_init_cost): Replace with...
5394 (default_vectorize_create_costs): ...this.
5395 (default_add_stmt_cost): Delete, moving logic to vector_costs instead.
5396 (default_finish_cost, default_destroy_cost_data): Delete.
5397 * config/aarch64/aarch64.c (aarch64_vector_costs): Inherit from
5398 vector_costs. Add a constructor.
5399 (aarch64_init_cost): Replace with...
5400 (aarch64_vectorize_create_costs): ...this.
5401 (aarch64_add_stmt_cost): Replace with...
5402 (aarch64_vector_costs::add_stmt_cost): ...this. Use record_stmt_cost
5403 to adjust the cost for inner loops.
5404 (aarch64_finish_cost): Replace with...
5405 (aarch64_vector_costs::finish_cost): ...this.
5406 (aarch64_destroy_cost_data): Delete.
5407 (TARGET_VECTORIZE_INIT_COST): Replace with...
5408 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
5409 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
5410 (TARGET_VECTORIZE_FINISH_COST): Likewise.
5411 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
5412 * config/i386/i386.c (ix86_vector_costs): New structure.
5413 (ix86_init_cost): Replace with...
5414 (ix86_vectorize_create_costs): ...this.
5415 (ix86_add_stmt_cost): Replace with...
5416 (ix86_vector_costs::add_stmt_cost): ...this. Use adjust_cost_for_freq
5417 to adjust the cost for inner loops.
5418 (ix86_finish_cost, ix86_destroy_cost_data): Delete.
5419 (TARGET_VECTORIZE_INIT_COST): Replace with...
5420 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
5421 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
5422 (TARGET_VECTORIZE_FINISH_COST): Likewise.
5423 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
5424 * config/rs6000/rs6000.c (TARGET_VECTORIZE_INIT_COST): Replace with...
5425 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
5426 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
5427 (TARGET_VECTORIZE_FINISH_COST): Likewise.
5428 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
5429 (rs6000_cost_data): Inherit from vector_costs.
5430 Add a constructor. Drop loop_info, cost and costing_for_scalar
5431 in favor of the corresponding vector_costs member variables.
5432 Add "m_" to the names of the remaining member variables and
5434 (rs6000_density_test): Replace with...
5435 (rs6000_cost_data::density_test): ...this.
5436 (rs6000_init_cost): Replace with...
5437 (rs6000_vectorize_create_costs): ...this.
5438 (rs6000_update_target_cost_per_stmt): Replace with...
5439 (rs6000_cost_data::update_target_cost_per_stmt): ...this.
5440 (rs6000_add_stmt_cost): Replace with...
5441 (rs6000_cost_data::add_stmt_cost): ...this. Use adjust_cost_for_freq
5442 to adjust the cost for inner loops.
5443 (rs6000_adjust_vect_cost_per_loop): Replace with...
5444 (rs6000_cost_data::adjust_vect_cost_per_loop): ...this.
5445 (rs6000_finish_cost): Replace with...
5446 (rs6000_cost_data::finish_cost): ...this. Group loop code
5447 into a single if statement and pass the loop_vinfo down to
5449 (rs6000_destroy_cost_data): Delete.
5451 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
5453 PR tree-optimization/103062
5454 PR tree-optimization/103062
5455 * value-pointer-equiv.cc (ssa_equiv_stack::ssa_equiv_stack):
5456 Increase size of allocation by 1.
5457 (ssa_equiv_stack::push_replacement): Grow as needed.
5458 (ssa_equiv_stack::get_replacement): Same.
5459 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
5460 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Remove delete.
5461 (pointer_equiv_analyzer::set_global_equiv): Grow as needed.
5462 (pointer_equiv_analyzer::get_equiv): Same.
5463 (pointer_equiv_analyzer::get_equiv_expr): Remove const.
5464 * value-pointer-equiv.h (class pointer_equiv_analyzer): Remove
5465 const markers. Use auto_vec instead of tree *.
5467 2021-11-04 Richard Biener <rguenther@suse.de>
5469 * tree-ssa-sccvn.c (vn_nary_op_insert_into): Remove always
5470 true parameter and inline valueization.
5471 (vn_nary_op_lookup_1): Inline valueization from ...
5472 (vn_nary_op_compute_hash): ... here and remove it here.
5473 * tree-ssa-pre.c (phi_translate_1): Do not valueize
5474 before vn_nary_lookup_pieces.
5475 (get_representative_for): Mark created SSA representatives
5478 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
5480 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): Assert
5481 that the operand has a vector mode. Use subreg_lowpart_offset
5482 to test whether an index corresponds to the low part.
5484 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
5486 * read-rtl.c: Remove dead !GENERATOR_FILE block.
5487 * read-rtl-function.c (function_reader::consolidate_singletons):
5488 Generate canonical CONST_VECTORs.
5490 2021-11-04 liuhongt <hongtao.liu@intel.com>
5493 * config/i386/predicates.md (reg_or_notreg_operand): Rename to ..
5494 (regmem_or_bitnot_regmem_operand): .. and extend to handle
5496 * config/i386/sse.md (*<avx512>_vpternlog<mode>_1): Force_reg
5497 the operands which are required to be register_operand.
5498 (*<avx512>_vpternlog<mode>_2): Ditto.
5499 (*<avx512>_vpternlog<mode>_3): Ditto.
5500 (*<avx512>_vternlog<mode>_all): Disallow embeded broadcast for
5501 vector HFmodes since it's not a real AVX512FP16 instruction.
5503 2021-11-04 liuhongt <hongtao.liu@intel.com>
5506 * match.pd: simplify (trunc)copysign((extend)a, (extend)b) to
5507 .COPYSIGN (a,b) when a and b are same type as the truncation
5508 type and has less precision than extend type.
5510 2021-11-04 Richard Biener <rguenther@suse.de>
5512 * doc/generic.texi: Update TARGET_MEM_REF and MEM_REF
5515 2021-11-04 Hongyu Wang <hongyu.wang@intel.com>
5517 * config/i386/sse.md (VI2_AVX512VNNIBW): New mode iterator.
5518 (VI1_AVX512VNNI): Likewise.
5519 (SDOT_VPDP_SUF): New mode_attr.
5522 (sdot_prod<mode>): Use VI2_AVX512F iterator, expand to
5523 vpdpwssd when VNNI targets available.
5524 (usdot_prod<mode>): New expander for vector QImode.
5526 2021-11-04 Hongyu Wang <hongyu.wang@intel.com>
5528 * config/i386/amxtileintrin.h (_tile_loadd_internal): Add
5529 parentheses to base and stride.
5530 (_tile_stream_loadd_internal): Likewise.
5531 (_tile_stored_internal): Likewise.
5533 2021-11-03 Maciej W. Rozycki <macro@embecosm.com>
5535 * config/riscv/riscv.c (riscv_class_max_nregs): Swap the
5536 arguments to `reg_class_subset_p'.
5538 2021-11-03 Joseph Myers <joseph@codesourcery.com>
5541 * fold-const.c (fold_init): New function.
5542 * fold-const.h (fold_init): New prototype.
5544 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5546 * params.opt (param_vrp2_mode): Make ranger the default for VRP2.
5548 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5550 * gimple-range-fold.cc (fold_using_range::range_of_phi): Don't import
5551 a range from edge if arg == phidef.
5553 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5555 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Test
5556 for constant before any other processing.
5558 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5560 * flag-types.h (RANGER_DEBUG_ALL): Fix values.
5562 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5564 * gimple-range.cc (gimple_ranger::gimple_ranger): Initialize current_bb.
5565 (gimple_ranger::range_of_expr): Pick up range_on_entry when there is
5566 no explcit context and current_bb is set.
5567 (gimple_ranger::fold_stmt): New.
5568 * gimple-range.h (current_bb, fold_stmt): New.
5569 * tree-vrp.c (rvrp_folder::fold_stmt): Call ranger's fold_stmt.
5571 2021-11-03 Richard Biener <rguenther@suse.de>
5573 PR tree-optimization/102970
5574 * tree-ssa-pre.c (phi_translate_1): Drop clique and base
5575 when translating a MEM_REF over a backedge.
5577 2021-11-03 Philipp Tomsich <philipp.tomsich@vrull.eu>
5579 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New Ampere-1 core.
5580 * config/aarch64/aarch64-tune.md: Regenerate.
5581 * config/aarch64/aarch64-cost-tables.h: Add extra costs for Ampere-1.
5582 * config/aarch64/aarch64.c: Add tuning structures for Ampere-1.
5583 * doc/invoke.texi: Add documentation for Ampere-1 core.
5585 2021-11-03 Wilco Dijkstra <wdijkstr@arm.com>
5587 * config/aarch64/aarch64.md (movsi): Add alternative for GOT accesses.
5589 (ldr_got_small_<mode>): Remove pattern.
5590 (ldr_got_small_sidi): Likewise.
5591 * config/aarch64/aarch64.c (aarch64_load_symref_appropriately): Keep
5592 GOT accesses as moves.
5593 (aarch64_print_operand): Correctly print got_lo12 in L specifier.
5594 (aarch64_mov_operand_p): Make GOT accesses valid move operands.
5595 * config/aarch64/constraints.md: Add new constraint Usw for GOT access.
5597 2021-11-03 Martin Liska <mliska@suse.cz>
5599 * gcov.c (read_line): Remove dead variable.
5601 2021-11-03 Martin Liska <mliska@suse.cz>
5604 * ipa-fnsummary.c (edge_predicate_pool): Rename predicate class to ipa_predicate.
5605 (ipa_fn_summary::account_size_time): Likewise.
5606 (edge_set_predicate): Likewise.
5607 (set_hint_predicate): Likewise.
5608 (add_freqcounting_predicate): Likewise.
5609 (evaluate_conditions_for_known_args): Likewise.
5610 (evaluate_properties_for_edge): Likewise.
5611 (remap_freqcounting_preds_after_dup): Likewise.
5612 (ipa_fn_summary_t::duplicate): Likewise.
5613 (set_cond_stmt_execution_predicate): Likewise.
5614 (set_switch_stmt_execution_predicate): Likewise.
5615 (compute_bb_predicates): Likewise.
5616 (will_be_nonconstant_expr_predicate): Likewise.
5617 (will_be_nonconstant_predicate): Likewise.
5618 (phi_result_unknown_predicate): Likewise.
5619 (predicate_for_phi_result): Likewise.
5620 (analyze_function_body): Likewise.
5621 (compute_fn_summary): Likewise.
5622 (summarize_calls_size_and_time): Likewise.
5623 (estimate_calls_size_and_time): Likewise.
5624 (ipa_call_context::estimate_size_and_time): Likewise.
5625 (remap_edge_summaries): Likewise.
5626 (remap_freqcounting_predicate): Likewise.
5627 (ipa_merge_fn_summary_after_inlining): Likewise.
5628 (ipa_update_overall_fn_summary): Likewise.
5629 (read_ipa_call_summary): Likewise.
5630 (inline_read_section): Likewise.
5631 * ipa-fnsummary.h (struct ipa_freqcounting_predicate): Likewise.
5632 * ipa-predicate.c (predicate::add_clause): Likewise.
5633 (ipa_predicate::add_clause): Likewise.
5634 (predicate::or_with): Likewise.
5635 (ipa_predicate::or_with): Likewise.
5636 (predicate::evaluate): Likewise.
5637 (ipa_predicate::evaluate): Likewise.
5638 (predicate::probability): Likewise.
5639 (ipa_predicate::probability): Likewise.
5640 (dump_condition): Likewise.
5641 (dump_clause): Likewise.
5642 (predicate::dump): Likewise.
5643 (ipa_predicate::dump): Likewise.
5644 (predicate::debug): Likewise.
5645 (ipa_predicate::debug): Likewise.
5646 (predicate::remap_after_duplication): Likewise.
5647 (ipa_predicate::remap_after_duplication): Likewise.
5648 (predicate::remap_after_inlining): Likewise.
5649 (ipa_predicate::remap_after_inlining): Likewise.
5650 (predicate::stream_in): Likewise.
5651 (ipa_predicate::stream_in): Likewise.
5652 (predicate::stream_out): Likewise.
5653 (ipa_predicate::stream_out): Likewise.
5654 (add_condition): Likewise.
5655 * ipa-predicate.h (class predicate): Likewise.
5656 (class ipa_predicate): Likewise.
5657 (add_condition): Likewise.
5659 2021-11-03 Richard Biener <rguenther@suse.de>
5661 * bitmap.h (bitmap_bit_p): Change the return type to bool.
5662 * bitmap.c (bitmap_bit_p): Likewise.
5663 * sbitmap.h (bitmap_bit_p): Likewise.
5664 (bitmap_set_bit): Return whether the bit changed.
5665 (bitmap_clear_bit): Likewise.
5666 * tree-ssa.c (verify_vssa): Make use of the changed state
5667 from bitmap_set_bit.
5669 2021-11-03 Richard Biener <rguenther@suse.de>
5671 PR middle-end/103033
5672 * internal-fn.c (expand_DEFERRED_INIT): Elide the
5673 native_interpret_expr path in favor of folding the
5674 VIEW_CONVERT_EXPR generated when punning the RHS.
5676 2021-11-03 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
5678 * config/s390/s390.c (s390_loop_unroll_adjust): In case of early
5681 2021-11-03 Jan Hubicka <hubicka@ucw.cz>
5684 * ipa-modref.c (callee_to_caller_flags): New function.
5685 (modref_eaf_analysis::analyze_ssa_name): Use it.
5686 (ipa_merge_modref_summary_after_inlining): Fix whitespace.
5688 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5690 * ipa-modref.c (modref_eaf_analysis::analyze_ssa_name): Revert
5693 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
5694 Uroš Bizjak <ubizjak@gmail.com>
5696 * config/i386/i386.md (<any_rotate>ti3): Provide expansion for
5697 rotations by non-constant amounts.
5699 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5701 * ipa-modref.c: Fix anonymous namespace placement.
5702 (class modref_eaf_analysis): New class.
5703 (analyze_ssa_name_flags): Turn to ...
5704 (modref_eaf_analysis::analyze_ssa_name): ... this one.
5705 (merge_call_lhs_flags): Turn to ...
5706 (modref_eaf_analysis::merge_call_lhs_flags): .. this one
5707 (modref_eaf_analysis::merge_with_ssa_name): New member function.
5708 (record_escape_points): Turn to ...
5709 (modref_eaf_analysis::record_escape_points): ... this one.
5710 (analyze_parms): Updat
5711 (ipa_merge_modref_summary_after_inlining): Move to the end of file.
5713 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5715 * gimple.c (gimple_call_static_chain_flags): New function.
5716 * gimple.h (gimple_call_static_chain_flags): Declare
5717 * ipa-modref.c (modref_summary::modref_summary): Initialize
5719 (modref_summary_lto::modref_summary_lto): Likewise.
5720 (modref_summary::useful_p): Test static_chain_flags.
5721 (modref_summary_lto::useful_p): Likewise.
5722 (struct modref_summary_lto): Add static_chain_flags.
5723 (modref_summary::dump): Dump static_chain_flags.
5724 (modref_summary_lto::dump): Likewise.
5725 (struct escape_point): Add static_cahin_arg.
5726 (analyze_ssa_name_flags): Use gimple_call_static_chain_flags.
5727 (analyze_parms): Handle static chains.
5728 (modref_summaries::duplicate): Duplicate static_chain_flags.
5729 (modref_summaries_lto::duplicate): Likewise.
5730 (modref_write): Stream static_chain_flags.
5731 (read_section): Likewise.
5732 (modref_merge_call_site_flags): Handle static_chain_flags.
5733 * ipa-modref.h (struct modref_summary): Add static_chain_flags.
5734 * tree-ssa-structalias.c (handle_rhs_call): Use
5735 gimple_static_chain_flags.
5737 2021-11-02 Richard Biener <rguenther@suse.de>
5739 PR tree-optimization/103029
5740 * tree-vect-loop-manip.c (vect_loop_versioning): Ensure
5741 the PHI nodes in the loop maintain their original operand
5744 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5746 * tree-core.h (EAF_NOT_RETURNED_DIRECTLY): New flag.
5747 (EAF_NOREAD): Renumber.
5748 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOT_RETURNED_DIRECTLY.
5749 (remove_useless_eaf_flags): Handle EAF_NOT_RETURNED_DIRECTLY
5750 (deref_flags): Likewise.
5751 (modref_lattice::init): Likewise.
5752 (modref_lattice::merge): Likewise.
5753 (merge_call_lhs_flags): Likewise.
5754 (analyze_ssa_name_flags): Likewise.
5755 (modref_merge_call_site_flags): Likewise.
5756 * tree-ssa-structalias.c (handle_call_arg): Likewise.
5758 2021-11-02 Maciej W. Rozycki <macro@embecosm.com>
5760 * config/riscv/riscv.c (riscv_rtx_costs): Correct a CONST_INT_P
5761 check and remove an unused local variable with shNadd/shNadd.uw
5764 2021-11-02 Tamar Christina <tamar.christina@arm.com>
5766 PR tree-optimization/103007
5767 * tree-vect-slp-patterns.c (complex_fms_pattern::matches): Add elem
5770 2021-11-02 Richard Biener <rguenther@suse.de>
5772 PR middle-end/103038
5773 * fold-const.c (native_interpret_expr): Handle OFFSET_TYPE.
5774 (can_native_interpret_type_p): Likewise.
5775 * internal-fn.c (expand_DEFERRED_INIT): View-convert the
5776 RHS if the LHS is an SSA name.
5778 2021-11-02 Richard Sandiford <richard.sandiford@arm.com>
5780 * langhooks.h (lang_hooks_for_types::simulate_record_decl): New hook.
5781 * langhooks-def.h (lhd_simulate_record_decl): Declare.
5782 (LANG_HOOKS_SIMULATE_RECORD_DECL): Define.
5783 (LANG_HOOKS_FOR_TYPES_INITIALIZER): Include it.
5784 * langhooks.c (lhd_simulate_record_decl): New function.
5786 2021-11-02 Jakub Jelinek <jakub@redhat.com>
5789 * config/i386/i386.c (ix86_vector_mode_supported_p): Reject vector
5790 modes with TImode inner mode if 32-bit.
5792 2021-11-02 liuhongt <hongtao.liu@intel.com>
5794 * doc/sourcebuild.texi (vect_slp_v4qi_store_unalign,
5795 vect_slp_v2hi_store_unalign, vect_slp_v4hi_store_unalign,
5796 vect_slp_v4si_store_unalign): Document efficient target.
5797 (vect_slp_v4qi_store_unalign_1, vect_slp_v8qi_store_unalign_1,
5798 vect_slp_v16qi_store_unalign_1): Ditto.
5799 (vect_slp_v2hi_store_align,vect_slp_v2qi_store_align,
5800 vect_slp_v2si_store_align, vect_slp_v4qi_store_align): Ditto.
5801 (struct_4char_block_move, struct_8char_block_move,
5802 struct_16char_block_move): Ditto.
5804 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
5805 Jakub Jelinek <jakub@redhat.com>
5808 * config/i386/i386-expand.c (ix86_expand_v1ti_to_ti,
5809 ix86_expand_ti_to_v1ti): New helper functions.
5810 (ix86_expand_v1ti_shift): Check if the amount operand is an
5811 integer constant, and expand as a TImode shift if it isn't.
5812 (ix86_expand_v1ti_rotate): Check if the amount operand is an
5813 integer constant, and expand as a TImode rotate if it isn't.
5814 (ix86_expand_v1ti_ashiftrt): New function to expand arithmetic
5815 right shifts of V1TImode quantities.
5816 * config/i386/i386-protos.h (ix86_expand_v1ti_ashift): Prototype.
5817 * config/i386/sse.md (ashlv1ti3, lshrv1ti3): Change constraints
5818 to QImode general_operand, and let the helper functions lower
5819 shifts by non-constant operands, as TImode shifts. Make
5820 conditional on TARGET_64BIT.
5821 (ashrv1ti3): New expander calling ix86_expand_v1ti_ashiftrt.
5822 (rotlv1ti3, rotrv1ti3): Change shift operand to QImode.
5823 Make conditional on TARGET_64BIT.
5825 2021-11-02 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
5827 * config/s390/s390.md ("*cc_to_int", "tabort", "*tabort_1",
5828 "*tabort_1_plus"): Remove operands non-null check.
5830 2021-11-01 Martin Liska <mliska@suse.cz>
5832 * opt-functions.awk: Add new sanity checking.
5833 * optc-gen.awk: Add new argument to integer_range_info.
5834 * params.opt: Update 2 params which have negative IntegerRange.
5836 2021-11-01 qing zhao <qing.zhao@oracle.com>
5838 * gimplify.c (gimplify_decl_expr): Do not add call to
5839 __builtin_clear_padding when a variable is a gimple register
5840 or it might not have padding.
5841 (gimplify_init_constructor): Likewise.
5843 2021-11-01 Tamar Christina <tamar.christina@arm.com>
5845 * config/arm/aarch-common-protos.h (struct vector_cost_table): Add
5846 movi, dup and extract costing fields.
5847 * config/aarch64/aarch64-cost-tables.h (qdf24xx_extra_costs,
5848 thunderx_extra_costs, thunderx2t99_extra_costs,
5849 thunderx3t110_extra_costs, tsv110_extra_costs, a64fx_extra_costs): Use
5851 * config/arm/aarch-cost-tables.h (generic_extra_costs,
5852 cortexa53_extra_costs, cortexa57_extra_costs, cortexa76_extra_costs,
5853 exynosm1_extra_costs, xgene1_extra_costs): Likewise
5854 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>): Add r->w dup.
5855 * config/aarch64/aarch64.c (aarch64_rtx_costs): Add extra costs.
5857 2021-11-01 Tamar Christina <tamar.christina@arm.com>
5859 * cse.c (add_to_set): New.
5860 (find_sets_in_insn): Register constants in sets.
5861 (canonicalize_insn): Use auto_vec instead.
5862 (cse_insn): Try materializing using vec_dup.
5863 * rtl.h (simplify_context::simplify_gen_vec_select,
5864 simplify_gen_vec_select): New.
5865 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): New.
5867 2021-11-01 David Malcolm <dmalcolm@redhat.com>
5869 * common.opt (fdiagnostics-escape-format=): New.
5870 (diagnostics_escape_format): New enum.
5871 (DIAGNOSTICS_ESCAPE_FORMAT_UNICODE): New enum value.
5872 (DIAGNOSTICS_ESCAPE_FORMAT_BYTES): Likewise.
5873 * diagnostic-format-json.cc (json_end_diagnostic): Add
5874 "escape-source" attribute.
5875 * diagnostic-show-locus.c
5876 (exploc_with_display_col::exploc_with_display_col): Replace
5877 "tabstop" param with a cpp_char_column_policy and add an "aspect"
5878 param. Use these to compute m_display_col accordingly.
5879 (struct char_display_policy): New struct.
5880 (layout::m_policy): New field.
5881 (layout::m_escape_on_output): New field.
5882 (def_policy): New function.
5883 (make_range): Update for changes to exploc_with_display_col ctor.
5884 (default_print_decoded_ch): New.
5885 (width_per_escaped_byte): New.
5886 (escape_as_bytes_width): New.
5887 (escape_as_bytes_print): New.
5888 (escape_as_unicode_width): New.
5889 (escape_as_unicode_print): New.
5891 (layout::layout): Initialize new fields. Update m_exploc ctor
5892 call for above change to ctor.
5893 (layout::maybe_add_location_range): Update for changes to
5894 exploc_with_display_col ctor.
5895 (layout::calculate_x_offset_display): Update for change to
5897 (layout::print_source_line): Pass policy
5898 to cpp_display_width_computation. Capture cpp_decoded_char when
5899 calling process_next_codepoint. Move printing of source code to
5900 m_policy.m_print_cb.
5901 (line_label::line_label): Pass in policy rather than context.
5902 (layout::print_any_labels): Update for change to line_label ctor.
5903 (get_affected_range): Pass in policy rather than context, updating
5904 calls to location_compute_display_column accordingly.
5905 (get_printed_columns): Likewise, also for cpp_display_width.
5906 (correction::correction): Pass in policy rather than tabstop.
5907 (correction::compute_display_cols): Pass m_policy rather than
5908 m_tabstop to cpp_display_width.
5909 (correction::m_tabstop): Replace with...
5910 (correction::m_policy): ...this.
5911 (line_corrections::line_corrections): Pass in policy rather than
5913 (line_corrections::m_context): Replace with...
5914 (line_corrections::m_policy): ...this.
5915 (line_corrections::add_hint): Update to use m_policy rather than
5917 (line_corrections::add_hint): Likewise.
5918 (layout::print_trailing_fixits): Likewise.
5919 (selftest::test_display_widths): New.
5920 (selftest::test_layout_x_offset_display_utf8): Update to use
5921 policy rather than tabstop.
5922 (selftest::test_one_liner_labels_utf8): Add test of escaping
5924 (selftest::test_diagnostic_show_locus_one_liner_utf8): Update to
5925 use policy rather than tabstop.
5926 (selftest::test_overlapped_fixit_printing): Likewise.
5927 (selftest::test_overlapped_fixit_printing_utf8): Likewise.
5928 (selftest::test_overlapped_fixit_printing_2): Likewise.
5929 (selftest::test_tab_expansion): Likewise.
5930 (selftest::test_escaping_bytes_1): New.
5931 (selftest::test_escaping_bytes_2): New.
5932 (selftest::diagnostic_show_locus_c_tests): Call the new tests.
5933 * diagnostic.c (diagnostic_initialize): Initialize
5934 context->escape_format.
5935 (convert_column_unit): Update to use default character width policy.
5936 (selftest::test_diagnostic_get_location_text): Likewise.
5937 * diagnostic.h (enum diagnostics_escape_format): New enum.
5938 (diagnostic_context::escape_format): New field.
5939 * doc/invoke.texi (-fdiagnostics-escape-format=): New option.
5940 (-fdiagnostics-format=): Add "escape-source" attribute to examples
5941 of JSON output, and document it.
5942 * input.c (location_compute_display_column): Pass in "policy"
5943 rather than "tabstop", passing to
5944 cpp_byte_column_to_display_column.
5945 (selftest::test_cpp_utf8): Update to use cpp_char_column_policy.
5946 * input.h (class cpp_char_column_policy): New forward decl.
5947 (location_compute_display_column): Pass in "policy" rather than
5949 * opts.c (common_handle_option): Handle
5950 OPT_fdiagnostics_escape_format_.
5951 * selftest.c (temp_source_file::temp_source_file): New ctor
5952 overload taking a size_t.
5953 * selftest.h (temp_source_file::temp_source_file): Likewise.
5955 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
5957 * dbgcnt.def: Add debug counter for back_thread[12] and
5958 back_threadfull[12].
5959 * passes.def: Pass "first" argument to each back threading pass.
5960 * tree-ssa-threadbackward.c (back_threader::back_threader): Add
5962 (back_threader::debug_counter): New.
5963 (back_threader::maybe_register_path): Call debug_counter.
5965 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
5967 * tree-ssa-threadbackward.c (BT_NONE): New.
5970 (back_threader::back_threader): Add flags.
5971 Move loop initialization here.
5972 (back_threader::~back_threader): New.
5973 (back_threader::find_taken_edge_switch): Change solver and ranger
5975 (back_threader::find_taken_edge_cond): Same.
5976 (back_threader::find_paths_to_names): Same.
5977 (back_threader::find_paths): Same.
5978 (back_threader::dump): Same.
5979 (try_thread_blocks): Merge into thread_blocks.
5980 (back_threader::thread_blocks): New.
5981 (do_early_thread_jumps): Merge into thread_blocks.
5982 (do_thread_jumps): Merge into thread_blocks.
5983 (back_threader::thread_through_all_blocks): Remove.
5985 2021-11-01 Andrew MacLeod <amacleod@redhat.com>
5987 PR tree-optimization/103003
5988 * value-relation.cc (dom_oracle::register_relation): If the 2
5989 ssa names are the same, don't register any relation.
5991 2021-11-01 Dan Li <ashimida@linux.alibaba.com>
5993 * config/aarch64/aarch64.c (aarch64_expand_epilogue): Remove
5994 redundant check for calls_eh_return.
5995 * config/aarch64/aarch64.md (*do_return): Likewise.
5997 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
5999 * cfghooks.c (cfg_hook_duplicate_loop_to_header_edge): Rename
6000 duplicate_loop_to_header_edge to
6001 duplicate_loop_body_to_header_edge.
6002 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
6003 * cfghooks.h (struct cfg_hooks): Likewise.
6004 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
6005 * cfgloopmanip.c (duplicate_loop_body_to_header_edge): Likewise.
6006 (clone_loop_to_header_edge): Likewise.
6007 * cfgloopmanip.h (duplicate_loop_body_to_header_edge): Likewise.
6008 * cfgrtl.c (struct cfg_hooks): Likewise.
6009 * doc/loop.texi: Likewise.
6010 * loop-unroll.c (unroll_loop_constant_iterations): Likewise.
6011 (unroll_loop_runtime_iterations): Likewise.
6012 (unroll_loop_stupid): Likewise.
6013 (apply_opt_in_copies): Likewise.
6014 * tree-cfg.c (struct cfg_hooks): Likewise.
6015 * tree-ssa-loop-ivcanon.c (try_unroll_loop_completely): Likewise.
6016 (try_peel_loop): Likewise.
6017 * tree-ssa-loop-manip.c (copy_phi_node_args): Likewise.
6018 (gimple_duplicate_loop_body_to_header_edge): Likewise.
6019 (tree_transform_and_unroll_loop): Likewise.
6020 * tree-ssa-loop-manip.h (gimple_duplicate_loop_body_to_header_edge):
6023 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
6025 * cfgloopmanip.c (loop_version): Refactor loopify to
6026 loop_version. Move condition generation after loopify.
6028 * cfgloopmanip.h (loopify): Delete.
6030 2021-10-31 Jan Hubicka <hubicka@ucw.cz>
6032 * ipa-fnsummary.c: Include tree-dfa.h.
6033 (points_to_local_or_readonly_memory_p): Return true on return
6035 * ipa-modref.c (analyze_ssa_name_flags): Fix handling of copy
6038 2021-10-30 Tobias Burnus <tobias@codesourcery.com>
6040 PR middle-end/102972
6041 * omp-low.c (omp_runtime_api_call): Use DECL_ASSEMBLER_NAME to get
6042 internal Fortran name; new permit_num_teams arg to permit
6043 omp_get_num_teams and omp_get_team_num.
6044 (scan_omp_1_stmt): Update call to it, add missing call for
6045 reverse offload, and check for strictly nested API calls in teams.
6047 2021-10-30 Jakub Jelinek <jakub@redhat.com>
6049 * gimplify.c (gimplify_omp_for): Diagnose threadprivate iterators.
6051 2021-10-29 David Malcolm <dmalcolm@redhat.com>
6053 * selftest.c (assert_streq): Add newlines when emitting non-equal
6056 2021-10-29 David Malcolm <dmalcolm@redhat.com>
6058 * Makefile.in: Fix syntax for reference to LIBDEPS in
6061 2021-10-29 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
6063 * doc/install.texi: Bump required minimum DejaGnu version.
6065 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
6066 Andrew MacLeod <amacleod@redhat.com>
6068 * value-relation.cc (path_oracle::killing_def): Add a
6069 self-equivalence so we don't look to the root oracle.
6071 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
6073 * passes.def: Replace the pass_thread_jumps before VRP* with
6074 pass_thread_jumps_full. Remove all pass_vrp_threader instances.
6075 * tree-ssa-threadbackward.c (pass_data_thread_jumps_full):
6076 Remove hyphen from "thread-full" name.
6078 2021-10-29 Andrew MacLeod <amacleod@redhat.com>
6080 PR tree-optimization/102983
6081 * gimple-range-cache.h (propagate_updated_value): Make public.
6082 * gimple-range.cc (gimple_ranger::range_of_stmt): Propagate exports
6083 when processing gcond stmts.
6085 2021-10-29 Jan Hubicka <hubicka@ucw.cz>
6087 * gimple.c (gimple_call_retslot_flags): New function.
6088 * gimple.h (gimple_call_retslot_flags): Declare.
6089 * ipa-modref.c: Include tree-cfg.h.
6090 (struct escape_entry): Turn parm_index to signed.
6091 (modref_summary_lto::modref_summary_lto): Add retslot_flags.
6092 (modref_summary::modref_summary): Initialize retslot_flags.
6093 (struct modref_summary_lto): Likewise.
6094 (modref_summary::useful_p): Check retslot_flags.
6095 (modref_summary_lto::useful_p): Likewise.
6096 (modref_summary::dump): Dump retslot_flags.
6097 (modref_summary_lto::dump): Likewise.
6098 (struct escape_point): Add hidden_args enum.
6099 (analyze_ssa_name_flags): Ignore return slot return;
6100 use gimple_call_retslot_flags.
6101 (record_escape_points): Break out from ...
6102 (analyze_parms): ... here; handle retslot_flags.
6103 (modref_summaries::duplicate): Duplicate retslot_flags.
6104 (modref_summaries_lto::duplicate): Likewise.
6105 (modref_write_escape_summary): Stream parm_index as signed.
6106 (modref_read_escape_summary): Likewise.
6107 (modref_write): Stream retslot_flags.
6108 (read_section): Likewise.
6109 (struct escape_map): Fix typo in comment.
6110 (update_escape_summary_1): Fix whitespace.
6111 (ipa_merge_modref_summary_after_inlining): Drop retslot_flags.
6112 (modref_merge_call_site_flags): Merge retslot_flags.
6113 * ipa-modref.h (struct modref_summary): Add retslot_flags.
6114 * tree-ssa-structalias.c (handle_rhs_call): Handle retslot_flags.
6116 2021-10-29 Tamar Christina <tamar.christina@arm.com>
6118 PR tree-optimization/102977
6119 * tree-vect-slp-patterns.c (vect_match_call_p): Remove.
6120 (vect_detect_pair_op): Add crosslane check.
6121 (vect_match_call_complex_mla): Remove.
6122 (class complex_mul_pattern): Update comment.
6123 (complex_mul_pattern::matches): Update detection.
6124 (class complex_fma_pattern): Remove.
6125 (complex_fma_pattern::matches): Remove.
6126 (complex_fma_pattern::recognize): Remove.
6127 (complex_fma_pattern::build): Remove.
6128 (class complex_fms_pattern): Update comment.
6129 (complex_fms_pattern::matches): Remove.
6130 (complex_operations_pattern::recognize): Remove complex_fma_pattern
6132 2021-10-29 Jakub Jelinek <jakub@redhat.com>
6134 * gimple-fold.c (gimple_fold_builtin_memset): Copy over location from
6137 2021-10-29 Xionghu Luo <luoxhu@linux.ibm.com>
6140 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const): Add
6141 patterns match and emit for VSX xxpermdi.
6143 2021-10-29 liuhongt <hongtao.liu@intel.com>
6146 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF): New
6148 (V16HF_FTYPE_V16HF): Ditto.
6149 (V32HF_FTYPE_V32HF): Ditto.
6150 (V8HF_FTYPE_V8HF_ROUND): Ditto.
6151 (V16HF_FTYPE_V16HF_ROUND): Ditto.
6152 (V32HF_FTYPE_V32HF_ROUND): Ditto.
6153 * config/i386/i386-builtin.def ( IX86_BUILTIN_FLOORPH,
6154 IX86_BUILTIN_CEILPH, IX86_BUILTIN_TRUNCPH,
6155 IX86_BUILTIN_FLOORPH256, IX86_BUILTIN_CEILPH256,
6156 IX86_BUILTIN_TRUNCPH256, IX86_BUILTIN_FLOORPH512,
6157 IX86_BUILTIN_CEILPH512, IX86_BUILTIN_TRUNCPH512): New builtin.
6158 * config/i386/i386-builtins.c
6159 (ix86_builtin_vectorized_function): Enable vectorization for
6160 HFmode FLOOR/CEIL/TRUNC operation.
6161 * config/i386/i386-expand.c (ix86_expand_args_builtin): Handle
6163 * config/i386/sse.md (rint<mode>2, nearbyint<mode>2): Extend
6166 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
6167 Andrew MacLeod <amacleod@redhat.com>
6169 * value-relation.cc (path_oracle::killing_def): Walk the
6170 equivalency list and remove SSA from any equivalencies.
6172 2021-10-28 Stafford Horne <shorne@gmail.com>
6174 * config/or1k/or1k.h (PROFILE_HOOK): Add return address argument
6177 2021-10-28 Jakub Jelinek <jakub@redhat.com>
6179 PR tree-optimization/102951
6180 * fold-const.h (address_compare): Declare.
6181 * fold-const.c (address_compare): New function.
6182 * match.pd (cmp (convert1?@2 addr@0) (convert2? addr@1)): Use
6183 address_compare helper.
6184 (minmax cmp (convert1?@2 addr@0) (convert2?@3 addr@1)): New
6187 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
6189 * vr-values.c (simplify_using_ranges::fold_cond): Change fold message.
6191 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
6193 PR tree-optimization/102940
6194 * tree-vrp.c (execute_ranger_vrp): Reset scev.
6196 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
6198 * config/nios2/linux.h (MUSL_DYNAMIC_LINKER): Add musl linker
6200 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
6202 * configure: Regenerate.
6203 * configure.ac: Use CPPFLAGS_FOR_BUILD for GMPINC
6205 2021-10-28 Eric Botcazou <ebotcazou@adacore.com>
6207 * doc/invoke.texi (%X): Remove obsolete reference to -Wl.
6209 2021-10-28 Richard Biener <rguenther@suse.de>
6212 * fold-const.c (fold_convert_const): Avoid int to float
6213 constant folding with -frounding-math and inexact result.
6214 * simplify-rtx.c (simplify_const_unary_operation): Likewise
6215 for both float and unsigned_float.
6217 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
6219 * tree-ssa-threadbackward.c
6220 (back_threader::find_taken_edge_switch): Use find_case_label_range
6221 instead of find_taken_edge.
6223 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
6225 * tree-ssa-threadbackward.c (class back_threader_registry):
6226 Inherit from back_jt_path_registry.
6227 (back_threader_registry::thread_through_all_blocks): Remove.
6228 (back_threader_registry::register_path): Remove
6229 m_lowlevel_registry prefix.
6231 2021-10-28 Richard Biener <rguenther@suse.de>
6234 * fold-const.c (fold_convert_const_real_from_real): Honor
6235 -frounding-math if the conversion is not exact.
6236 * simplify-rtx.c (simplify_const_unary_operation): Do not
6237 simplify FLOAT_TRUNCATE with sign dependent rounding.
6239 2021-10-28 Richard Biener <rguenther@suse.de>
6241 PR tree-optimization/102949
6242 * tree-vect-stmts.c (ensure_base_align): Look at the
6243 dr_info of a group leader and assert we are looking at
6244 one with analyzed alignment.
6246 2021-10-28 Kewen Lin <linkw@linux.ibm.com>
6249 * config/rs6000/rs6000.c (rs6000_builtin_vectorization_cost): Consider
6250 V1T1 mode for unaligned load and store.
6252 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
6254 * config/riscv/riscv.md (zero_extendsidi2_internal): Allow ZBB
6257 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
6259 * config/riscv/arch-canonicalize (CANONICAL_ORDER): Add `i` to
6262 2021-10-28 Alexandre Oliva <oliva@adacore.com>
6264 * common.opt (fharden-compares): New.
6265 (fharden-conditional-branches): New.
6266 * doc/invoke.texi: Document new options.
6267 * gimple-harden-conditionals.cc: New.
6268 * Makefile.in (OBJS): Build it.
6269 * passes.def: Add new passes.
6270 * tree-pass.h (make_pass_harden_compares): Declare.
6271 (make_pass_harden_conditional_branches): Declare.
6273 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
6276 * config/rs6000/altivec.md: Add vsx register constraints.
6277 * config/rs6000/vsx.md (vsx_xxsel<mode>): Delete.
6278 (vsx_xxsel<mode>2): Likewise.
6279 (vsx_xxsel<mode>3): Likewise.
6280 (vsx_xxsel<mode>4): Likewise.
6282 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
6285 * config/rs6000/altivec.md (*altivec_vsel<mode>): Change to ...
6286 (altivec_vsel<mode>): ... this and update define.
6287 (*altivec_vsel<mode>_uns): Delete.
6288 (altivec_vsel<mode>2): New define_insn.
6289 (altivec_vsel<mode>3): Likewise.
6290 (altivec_vsel<mode>4): Likewise.
6291 * config/rs6000/rs6000-call.c (altivec_expand_vec_sel_builtin): New.
6292 (altivec_expand_builtin): Call altivec_expand_vec_sel_builtin to expand
6294 * config/rs6000/rs6000.c (rs6000_emit_vector_cond_expr): Use bit-wise
6295 selection instead of per element.
6296 * config/rs6000/vector.md:
6297 * config/rs6000/vsx.md (*vsx_xxsel<mode>): Change to ...
6298 (vsx_xxsel<mode>): ... this and update define.
6299 (*vsx_xxsel<mode>_uns): Delete.
6300 (vsx_xxsel<mode>2): New define_insn.
6301 (vsx_xxsel<mode>3): Likewise.
6302 (vsx_xxsel<mode>4): Likewise.
6304 2021-10-28 Hongyu Wang <hongyu.wang@intel.com>
6306 * config/i386/i386.c (use_rsqrt_p): Add mode parameter, enable
6307 HFmode rsqrt without TARGET_SSE_MATH.
6308 (ix86_optab_supported_p): Refactor rint, adjust floor, ceil,
6309 btrunc condition to be restricted by -ftrapping-math, adjust
6310 use_rsqrt_p function call.
6311 * config/i386/i386.md (rcphf2): New define_insn.
6312 (rsqrthf2): Likewise.
6313 * config/i386/sse.md (div<mode>3): Change VF2H to VF2.
6314 (div<mode>3): New expander for HF mode.
6315 (rsqrt<mode>2): Likewise.
6316 (*avx512fp16_vmrcpv8hf2): New define_insn for rpad pass.
6317 (*avx512fp16_vmrsqrtv8hf2): Likewise.
6319 2021-10-27 Saagar Jha <saagar@saagarjha.com>
6321 * config.gcc: Adjust for Darwin21.
6322 * config/darwin-c.c (macosx_version_as_macro): Likewise.
6323 * config/darwin-driver.c (validate_macosx_version_min):
6325 (darwin_find_version_from_kernel): Likewise.
6327 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
6329 * gimple-range-path.cc
6330 (path_range_query::range_defined_in_block): Call killing_def.
6332 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
6334 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Dump
6335 operands as well as relation.
6336 * gimple-range-path.cc
6337 (path_range_query::compute_ranges_in_block): Compute PHI relations
6338 first. Compute outgoing relations at the end.
6339 (path_range_query::compute_ranges): Remove call to compute_relations.
6340 (path_range_query::compute_relations): Remove.
6341 (path_range_query::maybe_register_phi_relation): New.
6342 (path_range_query::compute_phi_relations): Abstract out
6343 registering one PHI relation to...
6344 (path_range_query::compute_outgoing_relations): ...here.
6345 * gimple-range-path.h (class path_range_query): Remove
6347 Add maybe_register_phi_relation.
6349 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
6350 Andrew MacLeod <amacleod@redhat.com>
6352 * value-relation.cc (path_oracle::killing_def): Kill second
6355 2021-10-27 Martin Jambor <mjambor@suse.cz>
6357 * ipa-cp.c (good_cloning_opportunity_p): Decide whether to use
6358 profile feedback depending on their local availability.
6360 2021-10-27 Martin Jambor <mjambor@suse.cz>
6362 * params.opt (param_ipa_cp_profile_count_base): New parameter.
6363 * doc/invoke.texi (Optimize Options): Add entry for
6364 ipa-cp-profile-count-base.
6365 * ipa-cp.c (max_count): Replace with base_count, replace all
6366 occurrences too, unless otherwise stated.
6367 (ipcp_cloning_candidate_p): identify mostly-directly called
6368 functions based on their counts, not max_count.
6369 (compare_edge_profile_counts): New function.
6370 (ipcp_propagate_stage): Instead of setting max_count, find the
6371 appropriate edge count in a sorted vector of counts of eligible
6372 edges and make it the base_count.
6374 2021-10-27 Martin Jambor <mjambor@suse.cz>
6376 * ipa-cp.c (struct caller_statistics): New fields rec_count_sum,
6377 n_nonrec_calls and itself, document all fields.
6378 (init_caller_stats): Initialize the above new fields.
6379 (gather_caller_stats): Gather self-recursive counts and calls number.
6380 (get_info_about_necessary_edges): Gather counts of self-recursive and
6381 other edges bringing in the requested value separately.
6382 (dump_profile_updates): Rework to dump info about a single node only.
6383 (lenient_count_portion_handling): New function.
6384 (struct gather_other_count_struct): New type.
6385 (gather_count_of_non_rec_edges): New function.
6386 (struct desc_incoming_count_struct): New type.
6387 (analyze_clone_icoming_counts): New function.
6388 (adjust_clone_incoming_counts): Likewise.
6389 (update_counts_for_self_gen_clones): Likewise.
6390 (update_profiling_info): Rewritten.
6391 (update_specialized_profile): Adjust call to dump_profile_updates.
6392 (create_specialized_node): Do not update profiling info.
6393 (decide_about_value): New parameter self_gen_clones, either push new
6394 clones into it or updat their profile counts. For self-recursively
6395 generated values, use a portion of the node count instead of count
6396 from self-recursive edges to estimate goodness.
6397 (decide_whether_version_node): Gather clones for self-generated values
6398 in a new vector, update their profiles at once at the end.
6400 2021-10-27 Richard Biener <rguenther@suse.de>
6402 * tree-vectorizer.c (vect_transform_loops): New function,
6404 (try_vectorize_loop_1): ... here. Simplify as epilogues
6405 are now fully handled in the split part.
6407 2021-10-27 Jakub Jelinek <jakub@redhat.com>
6409 * omp-expand.c (expand_omp_for_init_counts): Handle non-rectangular
6410 iterators with pointer types.
6411 (expand_omp_for_init_vars, extract_omp_for_update_vars): Likewise.
6413 2021-10-26 Martin Sebor <msebor@redhat.com>
6415 PR tree-optimization/102238
6416 PR tree-optimization/102919
6417 * gimple-ssa-sprintf.c (get_string_length): Add an argument.
6418 (array_elt_at_offset): Move to pointer-query.
6419 (set_aggregate_size_and_offset): New function.
6420 (field_at_offset): Move to pointer-query.
6421 (get_origin_and_offset): Rename...
6422 (get_origin_and_offset_r): this. Add an argument. Make aggregate
6423 handling more robust.
6424 (get_origin_and_offset): New.
6425 (alias_offset): Add an argument.
6426 (format_string): Use subobject size determined by get_origin_and_offset.
6427 * pointer-query.cc (field_at_offset): Move from gimple-ssa-sprintf.c.
6428 Improve/correct handling of aggregates.
6429 (array_elt_at_offset): Same.
6430 * pointer-query.h (field_at_offset): Declare.
6431 (array_elt_at_offset): Declare.
6433 2021-10-26 Martin Sebor <msebor@redhat.com>
6435 * builtins.c (check_strncat_sizes): Pass access_data ctor additional
6437 (expand_builtin_memcmp): Move code to gimple-ssa-warn-access.cc.
6438 (expand_builtin_fork_or_exec): Same.
6439 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Pass
6440 compute_objsize additional arguments.
6441 (inbounds_memaccess_p): Same.
6442 (array_bounds_checker::check_array_bounds): Add an assert. Stash
6443 statement in a member.
6444 (check_array_bounds_dom_walker::before_dom_children): Same.
6445 * gimple-array-bounds.h (array_bounds_checker::m_stmt): New member.
6446 * gimple-ssa-sprintf.c (get_destination_size): Add an argument.
6447 (handle_printf_call): Pass a new argument.
6448 * gimple-ssa-warn-access.cc (get_size_range): Add an argument.
6449 (check_access): Add an argument and pass it along to callees.
6450 (check_read_access): Make a member function.
6451 (pass_waccess::check_strcat): Pass access_data ctor additional
6453 (pass_waccess::check_strncat): Same.
6454 (pass_waccess::check_stxcpy): Same.
6455 (pass_waccess::check_stxncpy): Same.
6456 (pass_waccess::check_strncmp): Same.
6457 (pass_waccess::check_read_access): Same.
6458 (pass_waccess::check_builtin): Same.
6459 (pass_waccess::maybe_check_access_sizes): Same.
6460 (pass_waccess::maybe_check_dealloc_call): Same.
6461 * gimple-ssa-warn-access.h (check_read_access): Declare a new
6463 * pointer-query.cc (compute_objsize_r): Add an argument.
6464 (gimple_call_return_array): Same.
6465 (gimple_call_alloc_size): Same.
6466 (access_ref::access_ref): Same.
6467 (access_ref::get_ref): Same.
6468 (pointer_query::get_ref): Same.
6469 (handle_min_max_size): Pass an arguments to callees.
6470 (handle_array_ref): Add an argument.
6471 (handle_mem_ref): Same.
6472 (compute_objsize): Same.
6473 * pointer-query.h (struct access_ref): Adjust signatures.
6474 (struct access_data): Same.
6475 (gimple_call_alloc_size): Add an argument.
6476 (gimple_parm_array_size): Same.
6477 (compute_objsize): Same.
6478 * tree-ssa-strlen.c (strlen_pass::adjust_last_stmt): Pass an additional
6479 argument to compute_objsize.
6480 (strlen_pass::maybe_warn_overflow): Same.
6481 (maybe_diag_stxncpy_trunc): Same.
6483 2021-10-26 Martin Sebor <msebor@redhat.com>
6485 PR middle-end/102453
6486 * gimple-ssa-warn-access.cc (pass_waccess::check_atomic_builtin): New.
6487 (pass_waccess::check_atomic_builtin): Call it.
6489 2021-10-26 Vladimir N. Makarov <vmakarov@redhat.com>
6491 PR rtl-optimization/102842
6492 * lra-constraints.c (match_reload): Ignore out in checking values
6494 (curr_insn_transform): Collect outputs before doing reloads of operands.
6496 2021-10-26 Paul A. Clarke <pc@us.ibm.com>
6499 * config/rs6000/x86intrin.h: Move some included headers to new
6500 headers. Include new immintrin.h instead of those headers.
6501 * config/rs6000/immintrin.h: New.
6502 * config/rs6000/x86gprintrin.h: New.
6503 * config.gcc (powerpc*-*-*): Add new headers to extra_headers.
6505 2021-10-26 Richard Biener <rguenther@suse.de>
6507 * tree-vectorizer.h (vect_create_addr_base_for_vector_ref):
6508 Remove byte_offset parameter.
6509 (vect_create_data_ref_ptr): Likewise.
6510 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
6512 (vect_create_data_ref_ptr): Likewise.
6513 * tree-vect-stmts.c (vectorizable_store): Adjust.
6514 (vectorizable_load): Likewise.
6516 2021-10-26 Richard Biener <rguenther@suse.de>
6518 PR tree-optimization/96109
6519 * tree-vectorizer.h (dr_misalignment): Add optional offset
6521 * tree-vect-data-refs.c (dr_misalignment): Likewise. Remove
6522 offset applied for negative stride accesses.
6523 (vect_enhance_data_refs_alignment): Compute negative stride
6524 access offset and pass it to dr_misalignment.
6525 * tree-vect-stmts.c (get_negative_load_store_type): Pass
6526 negative offset to dr_misalignment.
6527 (get_group_load_store_type): Likewise.
6528 (get_load_store_type): Likewise.
6529 (vectorizable_store): Remove asserts about alignment.
6530 (vectorizable_load): Likewise.
6532 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
6534 PR tree-optimization/102897
6535 * tree-ssa-forwprop.c (simplify_permutation): Remove a wrong assertion.
6537 2021-10-26 Richard Biener <rguenther@suse.de>
6539 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
6540 Take offset in bytes.
6541 (vect_create_data_ref_ptr): Likewise.
6542 * tree-vect-loop-manip.c (get_misalign_in_elems): Multiply
6543 offset by element size.
6544 (vect_create_cond_for_align_checks): Likewise.
6545 * tree-vect-stmts.c (get_negative_load_store_type): Likewise.
6546 (vectorizable_load): Remove duplicate leftover from merge
6549 2021-10-26 Roger Sayle <roger@nextmovesoftware.com>
6551 * config/i386/i386-expand.c (ix86_expand_v1ti_shift): New helper
6552 function to expand V1TI mode logical shifts by integer constants.
6553 (ix86_expand_v1ti_rotate): New helper function to expand V1TI
6554 mode rotations by integer constants.
6555 * config/i386/i386-protos.h (ix86_expand_v1ti_shift,
6556 ix86_expand_v1ti_rotate): Prototype new functions here.
6557 * config/i386/sse.md (ashlv1ti3, lshrv1ti3, rotlv1ti3, rotrv1ti3):
6558 New TARGET_SSE2 expanders to implement V1TI shifts and rotations.
6560 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
6562 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
6563 Avoid threading circular paths.
6565 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
6567 * tree-ssa-threadbackward.c (back_threader::resolve_phi):
6568 Attempt to resolve all incoming paths to a PHI.
6569 (back_threader::resolve_def): Always return true for PHIs.
6571 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
6573 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
6574 Always try to resolve path without looking back.
6575 * tree-ssa-threadupdate.c (dump_jump_thread): Indidicate whether
6576 edge is a back edge.
6578 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
6580 PR tree-optimization/102789
6581 * tree-vect-loop-manip.c (vect_update_inits_of_drs): Do not
6582 update inits of simd_lane_access.
6584 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6586 * tree-vrp.c (vrp_simplify_cond_using_ranges): Add return type and
6587 move to vr-values.c.
6588 (simplify_casted_conds): Move to vrp_folder class.
6589 (execute_vrp): Call via vrp_folder now.
6590 * vr-values.c (simplify_cond_using_ranges_1): Call simplify_casted_cond.
6591 (simplify_using_ranges::simplify_casted_cond): Relocate from tree-vrp.c.
6592 * vr-values.h (simplify_casted_cond): Add prototype.
6594 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6596 * tree-vrp.c (rvrp_folder::fold_stmt): If simplification fails, try
6599 2021-10-25 Paul A. Clarke <pc@us.ibm.com>
6601 * config/rs6000/smmintrin.h (_mm_testz_si128): Add "extern" to
6603 (_mm_testc_si128): Likewise.
6604 (_mm_testnzc_si128): Likewise.
6605 (_mm_blend_ps): Likewise.
6606 (_mm_blendv_ps): Likewise.
6607 (_mm_blend_pd): Likewise.
6608 (_mm_blendv_pd): Likewise.
6609 (_mm_ceil_pd): Likewise.
6610 (_mm_ceil_sd): Likewise.
6611 (_mm_ceil_ps): Likewise.
6612 (_mm_ceil_ss): Likewise.
6613 (_mm_floor_pd): Likewise.
6614 (_mm_floor_sd): Likewise.
6615 (_mm_floor_ps): Likewise.
6616 (_mm_floor_ss): Likewise.
6617 (_mm_minpos_epu16): Likewise.
6618 (_mm_mul_epi32): Likewise.
6619 (_mm_cvtepi8_epi16): Likewise.
6620 (_mm_packus_epi32): Likewise.
6621 (_mm_cmpgt_epi64): Likewise.
6623 2021-10-25 Roger Sayle <roger@nextmovesoftware.com>
6625 * simplify-rtx.c (simplify_binary_operation_1) [SS_ASHIFT]: Simplify
6626 shifts of the mode's smin_value and smax_value when the bit count
6627 operand doesn't have side-effects.
6628 [US_ASHIFT]: Likewise, simplify shifts of the mode's umax_value
6629 when the bit count operand doesn't have side-effects.
6630 (simplify_const_binary_operation) [SS_ASHIFT, US_ASHIFT]: Perform
6631 compile-time evaluation of saturating left shifts with constant
6634 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6636 * gimple-range.cc (gimple_ranger::export_global_ranges): Remove check
6639 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6641 * flag-types.h (enum ranger_debug): Adjust values.
6642 * params.opt (ranger_debug): Ditto.
6644 2021-10-25 Martin Jambor <mjambor@suse.cz>
6646 PR tree-optimization/102886
6647 * tree-sra.c (totally_scalarize_subtree): Fix the out of
6650 2021-10-25 Andrew Pinski <apinski@marvell.com>
6652 * tree-ssa-dce.c (simple_dce_from_worklist):
6653 Check stmt_unremovable_because_of_non_call_eh_p also
6654 before removing the statement.
6656 2021-10-25 Richard Biener <rguenther@suse.de>
6658 PR tree-optimization/102905
6659 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
6660 Use vect_supportable_dr_alignment again to determine whether
6661 an access is supported when not aligned.
6663 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6665 * config/riscv/riscv.c (riscv_rtx_costs): Handle cost model
6668 2021-10-25 Jim Wilson <jimw@sifive.com>
6669 Kito Cheng <kito.cheng@sifive.com>
6670 Jia-Wei Chen <jiawei@iscas.ac.cn>
6671 Shi-Hua Liao <shihua@iscas.ac.cn>
6673 * config/riscv/bitmanip.md (shiftm1): New.
6674 (*bset<mode>): Ditto.
6675 (*bset<mode>_mask): Ditto.
6676 (*bset<mode>_1): Ditto.
6677 (*bset<mode>_1_mask): Ditto.
6678 (*bseti<mode>): Ditto.
6679 (*bclr<mode>): Ditto.
6680 (*bclri<mode>): Ditto.
6681 (*binv<mode>): Ditto.
6682 (*binvi<mode>): Ditto.
6683 (*bext<mode>): Ditto.
6685 * config/riscv/predicates.md (splittable_const_int_operand):
6687 (single_bit_mask_operand): New.
6688 (not_single_bit_mask_operand): Ditto.
6689 (const31_operand): Ditto.
6690 (const63_operand): Ditto.
6691 * config/riscv/riscv.c (riscv_build_integer_1): Handle bseti.
6692 (riscv_output_move): Ditto.
6693 (riscv_print_operand): Handle new operand type: T and S.
6694 * config/riscv/riscv.h (SINGLE_BIT_MASK_OPERAND): New.
6696 2021-10-25 Jim Wilson <jimw@sifive.com>
6698 * config/riscv/riscv.c (riscv_build_integer_1): Build integer
6701 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6703 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
6705 (riscv_rtx_costs): Ditto.
6707 2021-10-25 Jim Wilson <jimw@sifive.com>
6708 Kito Cheng <kito.cheng@sifive.com>
6709 Jia-Wei Chen <jiawei@iscas.ac.cn>
6711 * config/riscv/bitmanip.md (bitmanip_bitwise): New.
6712 (bitmanip_minmax): New.
6713 (clz_ctz_pcnt): New.
6714 (bitmanip_optab): New.
6715 (bitmanip_insn): New.
6716 (*<optab>_not<mode>): New.
6717 (*xor_not<mode>): New.
6718 (<bitmanip_optab>si2): New.
6719 (*<bitmanip_optab>disi2): New.
6720 (<bitmanip_optab>di2): New.
6721 (*zero_extendhi<GPR:mode>2_bitmanip): New.
6722 (*extend<SHORT:mode><SUPERQI:mode>2_zbb): New.
6723 (*zero_extendhi<GPR:mode>2_zbb): New.
6726 (rotrsi3_sext): New.
6729 (rotlsi3_sext): New.
6730 (bswap<mode>2): New.
6731 (<bitmanip_optab><mode>3): New.
6732 * config/riscv/riscv.md (type): Add rotate.
6733 (zero_extendhi<GPR:mode>2): Change to define_expand pattern.
6734 (*zero_extendhi<GPR:mode>2): New.
6735 (extend<SHORT:mode><SUPERQI:mode>2): Change to define_expand pattern.
6736 (*extend<SHORT:mode><SUPERQI:mode>2): New.
6738 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6740 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
6742 (riscv_rtx_costs): Ditto.
6744 2021-10-25 Jim Wilson <jimw@sifive.com>
6745 Kito Cheng <kito.cheng@sifive.com>
6746 Jia-Wei Chen <jiawei@iscas.ac.cn>
6748 * config/riscv/bitmanip.md (*zero_extendsidi2_bitmanip): New.
6753 (riscv_rtx_costs): Ditto.
6754 * config/riscv/riscv.md: Include bitmanip.md
6755 (type): Add bitmanip bype.
6756 (zero_extendsidi2): Change to define_expand pattern.
6757 (*zero_extendsidi2_internal): New.
6758 (zero_extendsidi2_shifted): Disable for ZBA.
6760 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6762 * common/config/riscv/riscv-common.c (riscv_ext_version_table):
6763 Add zba, zbb, zbc and zbs.
6764 (riscv_ext_flag_table): Ditto.
6765 * config/riscv/riscv-opts.h (MASK_ZBA): New.
6769 (TARGET_ZBA): Ditto.
6770 (TARGET_ZBB): Ditto.
6771 (TARGET_ZBC): Ditto.
6772 (TARGET_ZBS): Ditto.
6773 * config/riscv/riscv.opt (riscv_zb_subext): New.
6775 2021-10-25 liuhongt <hongtao.liu@intel.com>
6778 * match.pd: Simplify (_Float16) sqrtf((float) a) to .SQRT(a)
6779 when direct_internal_fn_supported_p, similar for sqrt/sqrtl.
6781 2021-10-25 Richard Biener <rguenther@suse.de>
6783 PR tree-optimization/102920
6784 * tree-ssa-sccvn.h (expressions_equal_p): Add argument
6785 controlling VN_TOP matching behavior.
6786 * tree-ssa-sccvn.c (expressions_equal_p): Likewise.
6787 (vn_phi_eq): Do not optimistically match VN_TOP.
6789 2021-10-25 konglin1 <lingling.kong@intel.com>
6791 * config/i386/sse.md (fma_<mode>_fadd_fmul): Add new
6792 define_insn_and_split.
6793 (fma_<mode>_fadd_fcmul):Likewise
6794 (fma_<complexopname>_<mode>_fma_zero):Likewise
6796 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
6798 * config/pa/pa-d.c (pa_d_handle_target_float_abi): Don't check
6799 TARGET_DISABLE_FPREGS.
6800 * config/pa/pa.c (fix_range): Use MASK_SOFT_FLOAT instead of
6801 MASK_DISABLE_FPREGS.
6802 (hppa_rtx_costs): Don't check TARGET_DISABLE_FPREGS. Adjust
6803 cost of hardware integer multiplication.
6804 (pa_conditional_register_usage): Don't check TARGET_DISABLE_FPREGS.
6805 * config/pa/pa.h (INT14_OK_STRICT): Likewise.
6806 * config/pa/pa.md: Don't check TARGET_DISABLE_FPREGS. Check
6807 TARGET_SOFT_FLOAT in patterns that use xmpyu instruction.
6808 * config/pa/pa.opt (mdisable-fpregs): Change target mask to
6809 SOFT_FLOAT. Revise comment.
6810 (msoft-float): New option.
6812 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
6814 * config/pa/pa.md: Don't use 'G' constraint in integer move patterns.
6816 2021-10-24 Gerald Pfeifer <gerald@pfeifer.com>
6818 * doc/install.texi (Specific): Remove obsolete details
6819 around GNU/Linux on Itanium.
6820 (Specific): Remove reference to Windows for Itanium.
6822 2021-10-23 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
6824 * config/i386/x86-tune-sched-bd.c (dispatch_group): Commentary
6827 2021-10-23 Jan Hubicka <hubicka@ucw.cz>
6829 * tree-ssa-structalias.c (compute_points_to_sets): Cleanup.
6831 2021-10-23 Roger Sayle <roger@nextmovesoftware.com>
6833 * config/i386/sse.md (<any_logic>v1ti3): New define_insn to
6834 implement V1TImode AND, IOR and XOR on TARGET_SSE2 (and above).
6835 (one_cmplv1ti2): New define expand.
6837 2021-10-22 Eric Gallager <egallager@gcc.gnu.org>
6840 * Makefile.in: Handle dvidir and install-dvi target.
6841 * configure: Regenerate.
6842 * configure.ac: Add install-dvi to target_list.
6844 2021-10-22 Gerald Pfeifer <gerald@pfeifer.com>
6846 * doc/install.texi (Binaries): Convert mingw-w64.org to https.
6849 2021-10-22 Richard Biener <rguenther@suse.de>
6851 PR tree-optimization/102893
6852 * tree-ssa-dce.c (find_obviously_necessary_stmts): Fix the
6853 test for an exit edge.
6855 2021-10-22 Aldy Hernandez <aldyh@redhat.com>
6856 Andrew MacLeod <amacleod@redhat.com>
6858 * gimple-range-path.cc (path_range_query::compute_phi_relations):
6859 Kill any global relations we may know before registering a new
6861 * value-relation.cc (path_oracle::killing_def): New.
6862 * value-relation.h (path_oracle::killing_def): New.
6864 2021-10-22 Richard Biener <rguenther@suse.de>
6867 * tree-ssa-sccvn.c (vn_phi_insert): For undefined SSA args
6869 (vn_phi_lookup): Likewise.
6871 2021-10-21 H.J. Lu <hjl.tools@gmail.com>
6874 * doc/invoke.texi: Document -fcf-protection requires i686 or
6877 2021-10-21 Eric Botcazou <ebotcazou@adacore.com>
6879 PR middle-end/102764
6880 * cfgexpand.c (expand_gimple_basic_block): Robustify latest change.
6882 2021-10-21 Jonathan Wright <jonathan.wright@arm.com>
6884 * config/aarch64/arm_neon.h (__STRUCTN): Delete function
6885 macro and all invocations.
6887 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
6889 * doc/invoke.texi (ranger-debug): Document.
6890 * flag-types.h (enum ranger_debug): New.
6891 (enum evrp_mode): Remove debug values.
6892 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Use new debug flag.
6893 * gimple-range-gori.cc (gori_compute::gori_compute): Ditto.
6894 * gimple-range.cc (gimple_ranger::gimple_ranger): Ditto.
6895 * gimple-ssa-evrp.c (hybrid_folder::choose_value): Ditto.
6896 (execute_early_vrp): Use evrp-mode directly.
6897 * params.opt (enum evrp_mode): Remove debug values.
6898 (ranger-debug): New.
6899 (ranger-logical-depth): Relocate to be in alphabetical order.
6901 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
6903 * doc/invoke.texi: (vrp1-mode, vrp2-mode): Document.
6904 * flag-types.h: (enum vrp_mode): New.
6905 * params.opt: (vrp1-mode, vrp2-mode): New.
6906 * tree-vrp.c (vrp_pass_num): New.
6907 (pass_vrp::pass_vrp): Set pass number.
6908 (pass_vrp::execute): Choose which VRP mode to execute.
6910 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
6912 * gimple-ssa-evrp.c (class rvrp_folder): Move to tree-vrp.c.
6913 (execute_early_vrp): For ranger only mode, invoke ranger_vrp.
6914 * tree-vrp.c (class rvrp_folder): Relocate here.
6915 (execute_ranger_vrp): New.
6916 * tree-vrp.h (execute_ranger_vrp): Export.
6918 2021-10-21 Martin Liska <mliska@suse.cz>
6922 * opts.c (finish_options): Process flag_var_tracking* options
6923 here as they can be adjusted by optimize attribute.
6924 Process also flag_syntax_only and flag_gtoggle.
6925 * toplev.c (process_options): Remove it here.
6926 * common.opt: Make debug_nonbind_markers_p as PerFunction
6927 attribute as it depends on optimization level.
6929 2021-10-21 Martin Jambor <mjambor@suse.cz>
6931 PR tree-optimization/102505
6932 * tree-sra.c (totally_scalarize_subtree): Check that the
6933 encountered field fits within the acces we would like to put it
6936 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
6938 * tree-ssa-threadbackward.c
6939 (back_threader::maybe_register_path): Remove circular paths check.
6941 2021-10-21 Richard Biener <rguenther@suse.de>
6943 * toplev.c (process_options): Move the initial debug_hooks
6945 (toplev::main): ... before the call of the post_options
6948 2021-10-21 Richard Biener <rguenther@suse.de>
6950 PR tree-optimization/102847
6951 * tree-vect-stmts.c (vect_model_load_cost): Add the scalar
6952 load cost in the prologue for VMAT_INVARIANT.
6954 2021-10-21 Richard Biener <rguenther@suse.de>
6956 PR tree-optimization/102847
6957 * tree-vect-stmts.c (vect_model_load_cost): Explicitely
6958 handle VMAT_INVARIANT as a splat in the prologue.
6960 2021-10-21 Hongyu Wang <hongyu.wang@intel.com>
6963 * config/i386/i386.c (ix86_get_ssemov): Adjust HFmode vector
6964 move to use the same logic as HImode.
6966 2021-10-21 Richard Biener <rguenther@suse.de>
6968 * tree-vect-slp.c (vect_build_slp_tree_1): Remove
6969 superfluous gimple_call_nothrow_p check.
6971 2021-10-21 Andrew Pinski <apinski@marvell.com>
6973 * tree-cfg.c (maybe_remove_writeonly_store): Add dce_ssa_names argument.
6974 Mark the ssa-name of the rhs as one to be removed.
6975 (execute_fixup_cfg): Update call to maybe_remove_writeonly_store.
6976 Call simple_dce_from_worklist at the end to a simple dce.
6978 2021-10-21 Andrew Pinski <apinski@marvell.com>
6980 * tree-cfg.c (maybe_remove_writeonly_store): New function
6981 factored out from ...
6982 (execute_fixup_cfg): Here. Call maybe_remove_writeonly_store.
6984 2021-10-21 Andrew Pinski <apinski@marvell.com>
6986 * tree-cfg.c (execute_fixup_cfg): Remove comment
6987 about standalone pass.
6989 2021-10-21 Andrew Pinski <apinski@marvell.com>
6991 * tree-cfg.c (execute_fixup_cfg): Output when the statement
6992 is removed when it is a write only var.
6994 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
6996 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
6997 Avoid threading circular paths.
6999 2021-10-20 Alex Coplan <alex.coplan@arm.com>
7001 * calls.c (initialize_argument_information): Remove some dead
7002 code, remove handling for function_arg returning const_int.
7003 * doc/tm.texi: Delete documentation for unused target hooks.
7004 * doc/tm.texi.in: Likewise.
7005 * target.def (load_bounds_for_arg): Delete.
7006 (store_bounds_for_arg): Delete.
7007 (load_returned_bounds): Delete.
7008 (store_returned_bounds): Delete.
7009 * targhooks.c (default_load_bounds_for_arg): Delete.
7010 (default_store_bounds_for_arg): Delete.
7011 (default_load_returned_bounds): Delete.
7012 (default_store_returned_bounds): Delete.
7013 * targhooks.h (default_load_bounds_for_arg): Delete.
7014 (default_store_bounds_for_arg): Delete.
7015 (default_load_returned_bounds): Delete.
7016 (default_store_returned_bounds): Delete.
7018 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
7020 * doc/extend.texi (Basic Asm): Clarify that asm is not an
7022 * doc/invoke.texi (-fno-asm): Fix description for C++.
7024 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
7026 * doc/install.texi: Remove link to old.html
7028 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7030 * config/aarch64/aarch64-simd.md (*aarch64_cmtst_same_<mode>): New.
7032 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7034 * config/aarch64/aarch64-simd.md (*aarch64_narrow_trunc<mode>): New.
7036 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7038 * config/aarch64/aarch64-simd.md (aarch64_simd_ashr<mode>): Add case cmp
7040 * config/aarch64/constraints.md (D1): New.
7042 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7044 * config/aarch64/aarch64-simd.md
7045 (*aarch64_<srn_op>topbits_shuffle<mode>_le): New.
7046 (*aarch64_topbits_shuffle<mode>_le): New.
7047 (*aarch64_<srn_op>topbits_shuffle<mode>_be): New.
7048 (*aarch64_topbits_shuffle<mode>_be): New.
7049 * config/aarch64/predicates.md
7050 (aarch64_simd_shift_imm_vec_exact_top): New.
7052 2021-10-20 Tamar Christina <tamar.christina@arm.com>
7054 * config/aarch64/aarch64-simd.md (*aarch64_<srn_op>shrn<mode>_vect,
7055 *aarch64_<srn_op>shrn<mode>2_vect_le,
7056 *aarch64_<srn_op>shrn<mode>2_vect_be): New.
7057 * config/aarch64/iterators.md (srn_op): New.
7059 2021-10-20 Chung-Lin Tang <cltang@codesourcery.com>
7061 * omp-low.c (omp_copy_decl_2): For !ctx, use record_vars to add new copy
7063 (scan_sharing_clauses): Place copy of OMP_CLAUSE_IN_REDUCTION decl in
7064 ctx->outer instead of ctx.
7066 2021-10-20 Martin Liska <mliska@suse.cz>
7069 2021-10-19 Martin Liska <mliska@suse.cz>
7072 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
7073 * system.h (strip_whilespaces): New function.
7075 2021-10-20 Martin Liska <mliska@suse.cz>
7078 2021-10-19 Martin Liska <mliska@suse.cz>
7081 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
7084 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
7086 * config/aarch64/aarch64.c (aarch64_case_values_threshold):
7087 Change to 8 with -Os, 11 otherwise.
7089 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
7091 * config/aarch64/aarch64.c (neoversev1_tunings):
7092 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
7093 (neoversen2_tunings): Likewise.
7095 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
7098 * config/aarch64/aarch64.md (frint_pattern): Update comment.
7099 * config/aarch64/aarch64-simd-builtins.def: Change frintn to roundeven.
7100 * config/aarch64/arm_fp16.h: Change frintn to roundeven.
7101 * config/aarch64/arm_neon.h: Likewise.
7102 * config/aarch64/iterators.md (frint_pattern): Use roundeven for FRINTN.
7104 2021-10-20 Martin Liska <mliska@suse.cz>
7106 * config/arm/arm.c (arm_unwind_emit_sequence): Do not declare
7107 already declared global variable.
7108 (arm_unwind_emit_set): Use out_file as function argument.
7109 (arm_unwind_emit): Likewise.
7110 * config/darwin.c (machopic_output_data_section_indirection): Likewise.
7111 (machopic_output_stub_indirection): Likewise.
7112 (machopic_output_indirection): Likewise.
7113 (machopic_finish): Likewise.
7114 * config/i386/i386.c (ix86_asm_output_function_label): Likewise.
7115 * config/i386/winnt.c (i386_pe_seh_unwind_emit): Likewise.
7116 * config/ia64/ia64.c (process_epilogue): Likewise.
7117 (process_cfa_adjust_cfa): Likewise.
7118 (process_cfa_register): Likewise.
7119 (process_cfa_offset): Likewise.
7120 (ia64_asm_unwind_emit): Likewise.
7121 * config/s390/s390.c (s390_asm_output_function_label): Likewise.
7123 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
7125 * config/aarch64/aarch64-builtins.c
7126 (aarch64_general_gimple_fold_builtin):
7127 lower vld1 and vst1 variants of the neon builtins
7128 * config/aarch64/aarch64-protos.h:
7129 (aarch64_general_gimple_fold_builtin): Add gsi parameter.
7130 * config/aarch64/aarch64.c (aarch64_general_gimple_fold_builtin):
7133 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
7135 * match.pd: Generate IFN_TRUNC.
7137 2021-10-20 Richard Biener <rguenther@suse.de>
7139 PR tree-optimization/102853
7140 * tree-data-ref.c (split_constant_offset_1): Bail out
7141 immediately if the expression traps on overflow.
7143 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7145 * tree-ssa-threadbackward.c (back_threader::~back_threader): Remove.
7147 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7149 * tree-ssa-threadupdate.c (back_jt_path_registry::adjust_paths_after_duplication):
7150 Remove superflous debugging message.
7151 (back_jt_path_registry::duplicate_thread_path): Same.
7153 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7155 * tree-ssa-threadbackward.c (back_threader_registry::back_threader_registry):
7157 (back_threader_registry::register_path): Remove m_threaded_paths.
7159 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7161 PR tree-optimization/102814
7162 * doc/invoke.texi: Document --param=max-fsm-thread-length.
7163 * params.opt: Add --param=max-fsm-thread-length.
7164 * tree-ssa-threadbackward.c
7165 (back_threader_profitability::profitable_path_p): Fail on paths
7166 longer than max-fsm-thread-length.
7168 2021-10-20 Eric Botcazou <ebotcazou@adacore.com>
7170 PR middle-end/102764
7171 * cfgexpand.c (expand_gimple_basic_block): Disregard a final debug
7172 statement to reset the current location for the outgoing edges.
7174 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
7175 Richard Biener <rguenther@suse.de>
7177 * tree-ssa-threadupdate.c (cancel_thread): Dump threading reason
7178 on the same line as the threading cancellation.
7179 (jt_path_registry::cancel_invalid_paths): Avoid rotating loops.
7180 Avoid threading through loop headers where the path remains in the
7183 2021-10-20 Siddhesh Poyarekar <siddhesh@gotplt.org>
7185 * tree-object-size.c (unknown): Make into a function. Adjust
7187 (unknown_object_size): Simplify implementation.
7189 2021-10-20 Hongtao Liu <hongtao.liu@intel.com>
7190 Kewen Lin <linkw@linux.ibm.com>
7192 * doc/sourcebuild.texi (Effective-Target Keywords): Document
7193 vect_slp_v2qi_store, vect_slp_v4qi_store, vect_slp_v8qi_store,
7194 vect_slp_v16qi_store, vect_slp_v2hi_store,
7195 vect_slp_v4hi_store, vect_slp_v2si_store, vect_slp_v4si_store.
7197 2021-10-19 Jonathan Wakely <jwakely@redhat.com>
7199 * doc/extend.texi (Basic PowerPC Built-in Functions): Fix typo.
7201 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
7205 * config/rs6000/emmintrin.h: Guard POWER8 intrinsics.
7206 * config/rs6000/pmmintrin.h: Same.
7207 * config/rs6000/smmintrin.h: Same.
7208 * config/rs6000/tmmintrin.h: Same.
7210 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
7212 * config.gcc (extra_headers): Add nmmintrin.h.
7214 2021-10-19 Richard Biener <rguenther@suse.de>
7216 * tree-vectorizer.h (vect_supportable_dr_alignment): Add
7217 misalignment parameter.
7218 * tree-vect-data-refs.c (vect_get_peeling_costs_all_drs):
7219 Do not change DR_MISALIGNMENT in place, instead pass the
7220 adjusted misalignment to vect_supportable_dr_alignment.
7221 (vect_peeling_supportable): Likewise.
7222 (vect_peeling_hash_get_lowest_cost): Adjust.
7223 (vect_enhance_data_refs_alignment): Likewise.
7224 (vect_vfa_access_size): Likewise.
7225 (vect_supportable_dr_alignment): Add misalignment
7226 parameter and simplify.
7227 * tree-vect-stmts.c (get_negative_load_store_type): Adjust.
7228 (get_group_load_store_type): Likewise.
7229 (get_load_store_type): Likewise.
7231 2021-10-19 Clément Chigot <clement.chigot@atos.net>
7233 * config/rs6000/rs6000.c (rs6000_xcoff_file_end): Move
7234 __tls_get_addr reference to .text csect.
7236 2021-10-19 Martin Liska <mliska@suse.cz>
7239 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
7242 2021-10-19 Richard Biener <rguenther@suse.de>
7244 * tree-vectorizer.h (vect_get_store_cost): Adjust signature.
7245 (vect_get_load_cost): Likewise.
7246 * tree-vect-data-refs.c (vect_get_data_access_cost): Get
7247 alignment support scheme and misalignment as arguments
7249 (vect_get_peeling_costs_all_drs): Compute that info here
7250 and note that we shouldn't need to.
7251 * tree-vect-stmts.c (vect_model_store_cost): Get
7252 alignment support scheme and misalignment as arguments.
7253 (vect_get_store_cost): Likewise.
7254 (vect_model_load_cost): Likewise.
7255 (vect_get_load_cost): Likewise.
7256 (vectorizable_store): Pass down alignment support scheme
7257 and misalignment to costing.
7258 (vectorizable_load): Likewise.
7260 2021-10-19 Richard Biener <rguenther@suse.de>
7262 * tree-vect-stmts.c (get_negative_load_store_type): Add
7263 offset output parameter and initialize it.
7264 (get_group_load_store_type): Likewise.
7265 (get_load_store_type): Likewise.
7266 (vectorizable_store): Use offset as computed by
7267 get_load_store_type.
7268 (vectorizable_load): Likewise.
7270 2021-10-19 Richard Biener <rguenther@suse.de>
7272 PR tree-optimization/102827
7273 * tree-if-conv.c (predicate_statements): Add pe parameter
7274 and use that edge to insert invariant stmts on.
7275 (combine_blocks): Pass through pe.
7276 (tree_if_conversion): Compute the edge to insert invariant
7277 stmts on and pass it along.
7279 2021-10-19 Roger Sayle <roger@nextmovesoftware.com>
7282 * config/bfin/bfin.md (addsubv2hi3, subaddv2hi3, ssaddsubv2hi3,
7283 sssubaddv2hi3): Swap the order of operators in vec_concat.
7285 2021-10-19 Xionghu Luo <luoxhu@linux.ibm.com>
7287 * config/rs6000/altivec.md (*altivec_vmrghb_internal): Delete.
7288 (altivec_vmrghb_direct): New.
7289 (*altivec_vmrghh_internal): Delete.
7290 (altivec_vmrghh_direct): New.
7291 (*altivec_vmrghw_internal): Delete.
7292 (altivec_vmrghw_direct_<mode>): New.
7293 (altivec_vmrghw_direct): Delete.
7294 (*altivec_vmrglb_internal): Delete.
7295 (altivec_vmrglb_direct): New.
7296 (*altivec_vmrglh_internal): Delete.
7297 (altivec_vmrglh_direct): New.
7298 (*altivec_vmrglw_internal): Delete.
7299 (altivec_vmrglw_direct_<mode>): New.
7300 (altivec_vmrglw_direct): Delete.
7301 * config/rs6000/rs6000-p8swap.c (rtx_is_swappable_p): Adjust.
7302 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const):
7304 * config/rs6000/vsx.md (vsx_xxmrghw_<mode>): Adjust.
7305 (vsx_xxmrglw_<mode>): Adjust.
7307 2021-10-19 Aldy Hernandez <aldyh@redhat.com>
7309 * passes.def: Change threading comment before pass_ccp pass.
7311 2021-10-19 Haochen Gui <guihaoc@gcc.gnu.org>
7313 * config/rs6000/rs6000-call.c (altivec_expand_lxvr_builtin):
7314 Modify the expansion for sign extension. All extensions are done
7315 within VSX registers.
7317 2021-10-19 Richard Biener <rguenther@suse.de>
7319 * tree-vect-stmts.c (get_group_load_store_type): Add
7320 misalignment output parameter and initialize it.
7321 (get_group_load_store_type): Likewise.
7322 (vectorizable_store): Remove now redundant queries.
7323 (vectorizable_load): Likewise.
7325 2021-10-19 Richard Biener <rguenther@suse.de>
7327 * tree-vectorizer.h (vect_supportable_dr_alignment): Remove
7328 check_aligned argument.
7329 * tree-vect-data-refs.c (vect_supportable_dr_alignment):
7331 (vect_peeling_hash_insert): Add supportable_if_not_aligned
7332 argument and do not call vect_supportable_dr_alignment here.
7333 (vect_peeling_supportable): Adjust.
7334 (vect_enhance_data_refs_alignment): Compute whether the
7335 access is supported with different alignment here and
7336 pass that down to vect_peeling_hash_insert.
7337 (vect_vfa_access_size): Adjust.
7338 * tree-vect-stmts.c (vect_get_store_cost): Likewise.
7339 (vect_get_load_cost): Likewise.
7340 (get_negative_load_store_type): Likewise.
7341 (get_group_load_store_type): Likewise.
7342 (get_load_store_type): Likewise.
7344 2021-10-19 Martin Liska <mliska@suse.cz>
7347 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
7348 * system.h (strip_whilespaces): New function.
7350 2021-10-19 dianhong xu <dianhong.xu@intel.com>
7352 * config/i386/avx512fp16intrin.h:
7353 (_mm512_set1_pch): New intrinsic.
7354 * config/i386/avx512fp16vlintrin.h:
7355 (_mm256_set1_pch): New intrinsic.
7356 (_mm_set1_pch): Ditto.
7358 2021-10-18 Andrew MacLeod <amacleod@redhat.com>
7360 PR tree-optimization/102796
7361 * gimple-range.cc (gimple_ranger::range_on_edge): Process EH edges
7362 normally. Return get_tree_range for non gimple_range_ssa_p names.
7363 (gimple_ranger::range_of_stmt): Use get_tree_range for non
7364 gimple_range_ssa_p names.
7366 2021-10-18 Uroš Bizjak <ubizjak@gmail.com>
7369 * config/i386/i386.c (ix86_print_operand_address):
7370 Error out for non-address_operand asm operands.
7372 2021-10-18 Richard Biener <rguenther@suse.de>
7374 * tree-vect-data-refs.c (vect_peeling_hash_insert): Do
7375 not auto-convert dr_alignment_support to bool.
7376 (vect_peeling_supportable): Likewise.
7377 (vect_enhance_data_refs_alignment): Likewise.
7378 (vect_supportable_dr_alignment): Commonize read/write case.
7379 * tree-vect-stmts.c (vect_get_store_cost): Use
7380 dr_alignment_support, not int, for the vect_supportable_dr_alignment
7382 (vect_get_load_cost): Likewise.
7384 2021-10-18 Siddhesh Poyarekar <siddhesh@gotplt.org>
7386 * tree-object-size.c (object_sizes_execute): Consolidate LHS
7387 null check and do it early.
7389 2021-10-18 Richard Biener <rguenther@suse.de>
7391 * tree-vect-stmts.c (vectorizable_store): Use the
7392 computed alignment scheme instead of querying
7395 2021-10-18 Richard Biener <rguenther@suse.de>
7397 * tree-vect-stmts.c (vectorizable_store): Do not recompute
7398 alignment scheme already determined by get_load_store_type.
7400 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
7402 * tree-ssa-threadbackward.c (class pass_thread_jumps_full):
7403 Clone corresponding pass.
7405 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
7407 * combine.c (recog_for_combine): For an unrecognized move/set of
7408 a constant, try force_const_mem to place it in the constant pool.
7409 * cse.c (constant_pool_entries_cost, constant_pool_entries_regcost):
7410 Delete global variables (that are no longer assigned a cost value).
7411 (cse_insn): Simplify logic for deciding whether to place a folded
7412 constant in the constant pool using force_const_mem.
7413 (cse_main): Remove zero initialization of constant_pool_entries_cost
7414 and constant_pool_entries_regcost.
7415 * config/i386/i386.c (ix86_rtx_costs): Make memory accesses
7416 fractionally more expensive, when optimizing for speed.
7418 2021-10-18 Martin Liska <mliska@suse.cz>
7420 PR gcov-profile/102746
7421 PR gcov-profile/102747
7422 * gcov.c (main): Return return_code.
7423 (output_gcov_file): Mark return_code when error happens.
7424 (generate_results): Likewise.
7425 (read_graph_file): Likewise.
7426 (read_count_file): Likewise.
7428 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
7430 * config/bfin/bfin.md (define_constants): Remove UNSPEC_ONES.
7431 (define_insn "ones"): Replace UNSPEC_ONES with a truncate of
7432 a popcount, allowing compile-time evaluation/simplification.
7433 (popcountsi2, popcounthi2): New expanders using a "ones" insn.
7435 2021-10-18 Richard Biener <rguenther@suse.de>
7437 PR tree-optimization/102788
7438 * tree-vect-patterns.c (vect_init_pattern_stmt): Allow
7440 (vect_pattern_recog_1): Likewise.
7441 (vect_recog_bool_pattern): Continue matching the pattern
7442 even if we do not have a vector type for a conversion
7445 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
7447 * simplify-rtx.c (simplify_const_unary_operation) [SS_NEG, SS_ABS]:
7448 Evalute SS_NEG and SS_ABS of a constant argument.
7450 2021-10-18 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
7453 * gimple-match-head.c (try_conditional_simplification): Add case for single operand.
7454 * internal-fn.def: Add entry for COND_NEG internal function.
7455 * internal-fn.c (FOR_EACH_CODE_MAPPING): Add entry for
7456 NEGATE_EXPR, COND_NEG mapping.
7457 * optabs.def: Add entry for cond_neg_optab.
7458 * match.pd (UNCOND_UNARY, COND_UNARY): New operator lists.
7459 (vec_cond COND (foo A) B) -> (IFN_COND_FOO COND A B): New pattern.
7460 (vec_cond COND B (foo A)) -> (IFN_COND_FOO ~COND A B): Likewise.
7462 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
7464 * tree-ssa-strlen.c (class strlen_pass): Rename from
7466 (handle_builtin_stxncpy_strncat): Move to strlen_pass.
7467 (handle_assign): Same.
7468 (adjust_last_stmt): Same.
7469 (maybe_warn_overflow): Same.
7470 (handle_builtin_strlen): Same.
7471 (handle_builtin_strchr): Same.
7472 (handle_builtin_strcpy): Same.
7473 (handle_builtin_strncat): Same.
7474 (handle_builtin_stxncpy_strncat): Same.
7475 (handle_builtin_memcpy): Same.
7476 (handle_builtin_strcat): Same.
7477 (handle_alloc_call): Same.
7478 (handle_builtin_memset): Same.
7479 (handle_builtin_memcmp): Same.
7480 (get_len_or_size): Same.
7481 (strxcmp_eqz_result): Same.
7482 (handle_builtin_string_cmp): Same.
7483 (handle_pointer_plus): Same.
7484 (count_nonzero_bytes_addr): Same.
7485 (count_nonzero_bytes): Same.
7486 (handle_store): Same.
7487 (strlen_check_and_optimize_call): Same.
7488 (handle_integral_assign): Same.
7489 (check_and_optimize_stmt): Same.
7490 (printf_strlen_execute): Rename strlen_dom_walker to strlen_pass.
7492 2021-10-18 Richard Biener <rguenther@suse.de>
7494 PR tree-optimization/102798
7495 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
7496 Only copy points-to info to newly generated SSA names.
7498 2021-10-18 Martin Liska <mliska@suse.cz>
7500 * dbgcnt.c (dbg_cnt_process_opt): Remove unused but set variable.
7501 * gcov.c (get_cycles_count): Likewise.
7502 * lto-compress.c (lto_compression_zlib): Likewise.
7503 (lto_uncompression_zlib): Likewise.
7504 * targhooks.c (default_pch_valid_p): Likewise.
7506 2021-10-17 Aldy Hernandez <aldyh@redhat.com>
7508 * tree-pass.h (make_pass_thread_jumps_full): New.
7509 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Inline.
7510 (try_thread_blocks): Add resolve and speed arguments.
7511 (pass_thread_jumps::execute): Inline.
7512 (do_early_thread_jumps): New.
7513 (do_thread_jumps): New.
7514 (make_pass_thread_jumps): Move.
7515 (pass_early_thread_jumps::gate): Inline.
7516 (pass_early_thread_jumps::execute): Inline.
7517 (class pass_thread_jumps_full): New.
7519 2021-10-16 Piotr Kubaj <pkubaj@FreeBSD.org>
7521 * configure.ac: Treat powerpc64*-*-freebsd* the same as
7522 powerpc64-*-freebsd*.
7523 * configure: Regenerate.
7525 2021-10-16 H.J. Lu <hjl.tools@gmail.com>
7527 * value-query.cc (get_ssa_name_ptr_info_nonnull): Change
7528 set_ptr_nonull to set_ptr_nonnull in comments.
7530 2021-10-16 Jan Hubicka <hubicka@ucw.cz>
7532 PR tree-optimization/102720
7533 * tree-ssa-structalias.c (compute_points_to_sets): Fix producing
7534 of call used and clobbered sets.
7536 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
7538 * config/darwin.h (LINK_COMMAND_SPEC_A): Update 'r' handling to
7539 skip gomp and itm when r or nodefaultlibs is given.
7540 (DSYMUTIL_SPEC): Do not call dsymutil for '-r' link lines.
7541 Update ordering of exclusions, remove duplicate 'v' addition
7542 (collect2 will add this from the main command line).
7544 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
7546 * config/darwin-driver.c (darwin_driver_init): Revise comments, handle
7547 filelist and framework options in specs instead of code.
7548 * config/darwin.h (SUBTARGET_DRIVER_SELF_SPECS): Update to handle link
7549 specs that are really driver ones.
7550 (DARWIN_CC1_SPEC): Likewise.
7551 (CPP_SPEC): Likewise.
7552 (SYSROOT_SPEC): Append space.
7553 (LINK_SYSROOT_SPEC): Remove most driver link specs.
7554 (STANDARD_STARTFILE_PREFIX_2): Update link-related specs.
7555 (STARTFILE_SPEC): Likewise.
7556 (ASM_MMACOSX_VERSION_MIN_SPEC): Fix line wrap.
7557 (ASM_SPEC): Update driver-related specs.
7558 (ASM_FINAL_SPEC): Likewise.
7559 * config/darwin.opt: Remove now unused option aliases.
7560 * config/i386/darwin.h (EXTRA_ASM_OPTS): Ensure space after opt.
7561 (ASM_SPEC): Update driver-related specs.
7563 2021-10-15 Roger Sayle <roger@nextmovesoftware.com>
7565 * config/i386/i386.c (ix86_hardreg_mov_ok): For vector modes,
7566 allow standard_sse_constant_p immediate constants.
7568 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
7570 * config.gcc: Add tm-dwarf2.h to tm_d-file.
7572 2021-10-15 Andrew MacLeod <amacleod@redhat.com>
7574 * gimple-range-fold.h (gimple_range_ssa_p): Don't process names
7575 that occur in abnormal phis.
7576 * gimple-range.cc (gimple_ranger::range_on_edge): Return false for
7577 abnormal and EH edges.
7578 * gimple-ssa-evrp.c (rvrp_folder::value_of_expr): Ditto.
7579 (rvrp_folder::value_on_edge): Ditto.
7580 (rvrp_folder::value_of_stmt): Ditto.
7581 (hybrid_folder::value_of_expr): Ditto for ranger queries.
7582 (hybrid_folder::value_on_edge): Ditto.
7583 (hybrid_folder::value_of_stmt): Ditto.
7584 * value-query.cc (gimple_range_global): Always return a range if
7585 the type is supported.
7587 2021-10-15 John David Anglin <danglin@gcc.gnu.org>
7589 * config/pa/pa.md: Consistently use "rG" constraint for copy
7590 instruction in move patterns.
7592 2021-10-15 Andrew Stubbs <ams@codesourcery.com>
7594 * config/gcn/gcn-hsa.h (S_FIJI): Set unconditionally.
7597 * config/gcn/gcn.c: Hard code SRAM ECC settings for old architectures.
7598 * config/gcn/mkoffload.c (ELFABIVERSION_AMDGPU_HSA): Rename to ...
7599 (ELFABIVERSION_AMDGPU_HSA_V3): ... this.
7600 (ELFABIVERSION_AMDGPU_HSA_V4): New.
7601 (SET_SRAM_ECC_UNSUPPORTED): New.
7602 (copy_early_debug_info): Create elf flags to match the other objects.
7603 (main): Just let the attribute flags pass through.
7605 2021-10-15 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
7607 * tree-loop-distribution.c (reduction_var_overflows_first):
7608 Pass the type of reduction_var as first argument as it is also
7609 done for the load type.
7610 (loop_distribution::transform_reduction_loop): Add missing
7611 TREE_TYPE while determining precission of reduction_var.
7613 2021-10-15 Richard Biener <rguenther@suse.de>
7615 * defaults.h (PREFERRED_DEBUGGING_TYPE): Choose DWARF2_DEBUG
7617 * toplev.c (process_options): Warn when STABS debugging is
7618 enabled but not the preferred format.
7619 * config/pa/som.h (PREFERRED_DEBUGGING_TYPE): Define to
7621 * config/pdp11/pdp11.h (PREFERRED_DEBUGGING_TYPE): Likewise.
7623 2021-10-15 Richard Biener <rguenther@suse.de>
7626 * tree-inline.c (copy_bb): Avoid underflowing nargs.
7628 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
7630 * config/i386/i386-expand.c (ix86_expand_vec_perm): Convert
7631 HFmode input operand to HImode.
7632 (ix86_vectorize_vec_perm_const): Likewise.
7633 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1_hf):
7635 (*avx512f_permvar_truncv8siv8hi_1_hf):
7638 2021-10-15 Richard Biener <rguenther@suse.de>
7640 PR middle-end/102682
7641 * expmed.c (store_bit_field_1): Ensure a LHS subreg would
7642 not create a paradoxical subreg.
7644 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
7646 * config/i386/i386-expand.c (ix86_expand_vector_init):
7647 For half_vector concat for HFmode, handle them like HImode.
7649 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7651 * tree-ssa-threadbackward.c (class back_threader): Add m_resolve.
7652 (back_threader::back_threader): Same.
7653 (back_threader::resolve_phi): Try to solve without looking back if
7655 (back_threader::find_paths_to_names): Same.
7656 (try_thread_blocks): Pass resolve argument to back threader.
7657 (pass_early_thread_jumps::execute): Same.
7659 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7661 * doc/invoke.texi: Remove max-fsm-thread-length,
7662 max-fsm-thread-paths, and fsm-maximum-phi-arguments.
7664 * tree-ssa-threadbackward.c (back_threader::back_threader): Remove
7666 (back_threader_registry::back_threader_registry): Same.
7667 (back_threader_profitability::profitable_path_p): Remove
7668 param_max_fsm_thread-length.
7669 (back_threader_registry::register_path): Remove
7670 m_max_allowable_paths.
7672 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7674 * tree-ssa-threadbackward.c (class back_threader): Make m_imports
7676 (back_threader::~back_threader): Do not release m_path.
7678 2021-10-14 Andrew MacLeod <amacleod@redhat.com>
7680 PR tree-optimization/102738
7681 * vr-values.c (simplify_using_ranges::simplify): Handle RSHIFT_EXPR.
7683 2021-10-14 Kwok Cheung Yeung <kcy@codesourcery.com>
7685 * omp-general.c (omp_check_context_selector): Move from c-omp.c.
7686 (omp_mark_declare_variant): Move from c-omp.c.
7687 (omp_context_name_list_prop): Update for Fortran strings.
7688 * omp-general.h (omp_check_context_selector): New prototype.
7689 (omp_mark_declare_variant): New prototype.
7691 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
7693 * config/rs6000/rs6000.c (rs6000_density_test): Move early
7694 exit test further up the function.
7696 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
7698 * config/arm/arm.c (arm_add_stmt_cost): Delete.
7699 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
7701 2021-10-14 Martin Jambor <mjambor@suse.cz>
7703 * doc/invoke.texi (Optimize Options): Add entry for
7704 ipa-cp-recursive-freq-factor.
7706 2021-10-14 Tamar Christina <tamar.christina@arm.com>
7708 * match.pd: New rule.
7710 2021-10-14 Jan Hubicka <hubicka@ucw.cz>
7713 * ipa-modref-tree.h (modref_access_node::update2):
7714 Also check that parm_offset is unchanged.
7715 (modref_ref_node::insert_access): Fix updating of
7718 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7720 * tree-ssa-threadbackward.c (back_threader::resolve_phi): Add
7723 2021-10-14 Richard Biener <rguenther@suse.de>
7725 PR tree-optimization/102659
7726 * tree-if-conv.c (if_convertible_gimple_assign_stmt_p): Also
7727 rewrite pointer typed undefined overflow operations.
7728 (predicate_statements): Likewise. Make sure to emit invariant
7729 conversions in the preheader.
7730 * tree-vectorizer.c (vect_loop_vectorized_call): Look through
7731 non-empty preheaders.
7732 * tree-data-ref.c (dr_analyze_indices): Strip useless
7733 conversions to the MEM_REF base type.
7735 2021-10-14 Martin Liska <mliska@suse.cz>
7737 * common.opt: Stop using AUTODETECT_VALUE
7738 and use EnabledBy where possible.
7739 * opts.c: Enable OPT_fvar_tracking with optimize >= 1.
7740 * toplev.c (AUTODETECT_VALUE): Remove macro.
7741 (process_options): Simplify by using EnabledBy and
7742 OPT_fvar_tracking. Use OPTION_SET_P macro instead of
7745 2021-10-14 Jonathan Wright <jonathan.wright@arm.com>
7747 * config/aarch64/arm_neon.h (vld1_s8_x3): Use signed type for
7749 (vld1_s32_x3): Likewise.
7751 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7753 PR tree-optimization/102736
7754 PR tree-optimization/102736
7755 * gimple-range-path.cc (path_range_query::range_on_path_entry):
7756 Assert that the requested range is defined outside the path.
7757 (path_range_query::ssa_range_in_phi): Do not call
7758 range_on_path_entry for SSA names that are defined within the
7761 2021-10-14 Iain Sandoe <iain@sandoe.co.uk>
7763 * config/darwin-driver.c (darwin_find_version_from_kernel):
7764 Quote internal identifiers and avoid contractions in
7766 (darwin_default_min_version): Likewise.
7767 (darwin_driver_init): Likewise.
7769 2021-10-14 Martin Jambor <mjambor@suse.cz>
7771 * params.opt (ipa-cp-recursive-freq-factor): New.
7772 * ipa-cp.c (ipcp_value): Switch to inline initialization. New members
7773 scc_no, self_recursion_generated_level, same_scc and
7774 self_recursion_generated_p.
7775 (ipcp_lattice::add_value): Replaced parameter unlimited with
7776 same_lat_gen_level, usit it determine limit of values and store it to
7778 (ipcp_lattice<valtype>::print): Dump the new fileds.
7779 (allocate_and_init_ipcp_value): Take same_lat_gen_level as a new
7780 parameter and store it to the new value.
7781 (self_recursively_generated_p): Removed.
7782 (propagate_vals_across_arith_jfunc): Use self_recursion_generated_p
7783 instead of self_recursively_generated_p, store self generation level
7785 (value_topo_info<valtype>::add_val): Set scc_no.
7786 (value_topo_info<valtype>::propagate_effects): Multiply frequencies of
7787 recursively feeding values and self generated values by appropriate
7790 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
7792 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Remove
7793 redundant test for flag_vect_cost_model.
7795 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7797 * bitmap.c (debug): New overloaded function for auto_bitmaps.
7798 * bitmap.h (debug): Same.
7800 2021-10-14 Hongyu Wang <hongyu.wang@intel.com>
7802 * config/i386/avx512fp16intrin.h (_mm512_mask_fcmadd_pch):
7803 Adjust builtin call.
7804 (_mm512_mask3_fcmadd_pch): Likewise.
7805 (_mm512_mask_fmadd_pch): Likewise
7806 (_mm512_mask3_fmadd_pch): Likewise
7807 (_mm512_mask_fcmadd_round_pch): Likewise
7808 (_mm512_mask3_fcmadd_round_pch): Likewise
7809 (_mm512_mask_fmadd_round_pch): Likewise
7810 (_mm512_mask3_fmadd_round_pch): Likewise
7811 (_mm_mask_fcmadd_sch): Likewise
7812 (_mm_mask3_fcmadd_sch): Likewise
7813 (_mm_mask_fmadd_sch): Likewise
7814 (_mm_mask3_fmadd_sch): Likewise
7815 (_mm_mask_fcmadd_round_sch): Likewise
7816 (_mm_mask3_fcmadd_round_sch): Likewise
7817 (_mm_mask_fmadd_round_sch): Likewise
7818 (_mm_mask3_fmadd_round_sch): Likewise
7819 (_mm_fcmadd_round_sch): Likewise
7820 * config/i386/avx512fp16vlintrin.h (_mm_mask_fmadd_pch):
7821 Adjust builtin call.
7822 (_mm_mask3_fmadd_pch): Likewise
7823 (_mm256_mask_fmadd_pch): Likewise
7824 (_mm256_mask3_fmadd_pch): Likewise
7825 (_mm_mask_fcmadd_pch): Likewise
7826 (_mm_mask3_fcmadd_pch): Likewise
7827 (_mm256_mask_fcmadd_pch): Likewise
7828 (_mm256_mask3_fcmadd_pch): Likewise
7829 * config/i386/i386-builtin.def: Add mask3 builtin for complex
7830 fma, and adjust mask_builtin to corresponding expander.
7831 * config/i386/i386-expand.c (ix86_expand_round_builtin):
7832 Skip eraseing embedded rounding for expanders that emits
7834 * config/i386/sse.md (complexmove): New mode_attr.
7835 (<avx512>_fmaddc_<mode>_mask1<round_expand_name>): New expander.
7836 (<avx512>_fcmaddc_<mode>_mask1<round_expand_name>): Likewise.
7837 (avx512fp16_fmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
7838 (avx512fp16_fcmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
7839 (avx512fp16_fcmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
7840 (avx512fp16_fmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
7841 * config/i386/subst.md (round_embedded_complex): New subst.
7843 2021-10-14 John David Anglin <danglin@gcc.gnu.org>
7845 * config/pa/pa.md (cbranchsf4): Disable if TARGET_SOFT_FLOAT.
7846 (cbranchdf4): Likewise.
7847 Add missing move patterns for TARGET_SOFT_FLOAT.
7849 2021-10-13 Roger Sayle <roger@nextmovesoftware.com>
7851 * config/i386/i386-expand.c (ix86_expand_vector_move): Use a
7852 pseudo intermediate when moving a SUBREG into a hard register,
7853 by checking ix86_hardreg_mov_ok.
7854 (ix86_expand_vector_extract): Store zero-extended SImode
7855 intermediate in a pseudo, then set target using a SUBREG_PROMOTED
7857 * config/i386/sse.md (mov<VMOVE>_internal): Prevent CSE creating
7858 complex (SUBREG) sets of (vector) hard registers before reload, by
7859 checking ix86_hardreg_mov_ok.
7861 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
7863 * ctfc.h (enum ctf_dtu_d_union_enum): Remove redundant comma.
7865 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
7867 * dwarf2ctf.c (gen_ctf_array_type): Fix typo in comment.
7869 2021-10-13 Martin Sebor <msebor@redhat.com>
7871 PR middle-end/102630
7872 * pointer-query.cc (compute_objsize_r): Handle named address spaces.
7874 2021-10-13 Iain Sandoe <iain@sandoe.co.uk>
7876 * collect2.c (is_lto_object_file): Release simple-object
7877 resources, close files.
7879 2021-10-13 Andre Vieira <andre.simoesdiasvieira@arm.com>
7881 * config/arm/arm.c (thumb2_legitimate_address_p): Use VALID_MVE_MODE
7882 when checking mve addressing modes.
7883 (mve_vector_mem_operand): Fix the way we handle pre, post and offset
7885 (arm_print_operand): Fix printing of POST_ and PRE_MODIFY.
7886 * config/arm/mve.md: Use mve_memory_operand predicate everywhere where
7887 there is a single Ux constraint.
7889 2021-10-13 John David Anglin <danglin@gcc.gnu.org>
7891 * config/pa/pa.md (muldi3): Add support for inlining 64-bit
7892 multiplication on 32-bit PA 1.1 and 2.0 targets.
7894 2021-10-13 Martin Liska <mliska@suse.cz>
7896 PR gcov-profile/90364
7897 * coverage.c (build_info): Emit checksum to the global variable.
7898 (build_info_type): Add new field for checksum.
7899 (coverage_obj_finish): Pass object_checksum.
7900 (coverage_init): Use 0 as checksum for .gcno files.
7901 * gcov-dump.c (dump_gcov_file): Dump also new checksum field.
7902 * gcov.c (read_graph_file): Read also checksum.
7903 * doc/invoke.texi: Document the behaviour change.
7905 2021-10-13 Richard Biener <rguenther@suse.de>
7907 * gimple-iterator.h (gsi_iterator_update): Add GSI_LAST_NEW_STMT,
7908 start at integer value 2.
7909 * gimple-iterator.c (gsi_insert_seq_nodes_before): Update
7910 the iterator for GSI_LAST_NEW_STMT.
7911 (gsi_insert_seq_nodes_after): Likewise.
7912 * tree-if-conv.c (predicate_statements): Use GSI_LAST_NEW_STMT.
7913 * tree-ssa.c (execute_update_addresses_taken): Correct bogus
7914 arguments to gsi_replace.
7916 2021-10-13 Martin Liska <mliska@suse.cz>
7919 * common.opt: Use EnabledBy instead of detection in
7920 finish_options and process_options.
7921 * opts.c (finish_options): Remove handling of
7922 x_flag_unroll_all_loops.
7923 * toplev.c (process_options): Likewise for flag_web and
7924 flag_rename_registers.
7926 2021-10-13 Richard Biener <rguenther@suse.de>
7928 PR tree-optimization/102659
7929 * tree-if-conv.c (need_to_rewrite_undefined): New flag.
7930 (if_convertible_gimple_assign_stmt_p): Mark the loop for
7931 rewrite when stmts with undefined behavior on integer
7933 (combine_blocks): Predicate also when we need to rewrite stmts.
7934 (predicate_statements): Rewrite affected stmts to something
7935 with well-defined behavior on overflow.
7936 (tree_if_conversion): Initialize need_to_rewrite_undefined.
7938 2021-10-13 Richard Biener <rguenther@suse.de>
7941 * ipa-sra.c (ptr_parm_has_nonarg_uses): Fix volatileness
7944 2021-10-13 Jose E. Marchesi <jose.marchesi@oracle.com>
7946 * dwarf2ctf.c: Fix typo in comment.
7948 2021-10-12 Bill Schmidt <wschmidt@linux.ibm.com>
7951 * config/rs6000/altivec.h (vec_cpsgn): Swap operand order.
7952 * config/rs6000/rs6000-overload.def (VEC_COPYSIGN): Use SKIP to
7953 avoid generating an automatic #define of vec_cpsgn. Use the
7954 correct built-in for V4SFmode that doesn't depend on VSX.
7956 2021-10-12 Uroš Bizjak <ubizjak@gmail.com>
7960 * config/i386/i386.md (*add<mode>_1_slp): Rewrite as
7961 define_insn_and_split pattern. Add alternative 1 and split it
7962 post reload to insert operand 1 into the low part of operand 0.
7963 (*sub<mode>_1_slp): Ditto.
7964 (*and<mode>_1_slp): Ditto.
7965 (*<any_or:code><mode>_1_slp): Ditto.
7966 (*ashl<mode>3_1_slp): Ditto.
7967 (*<any_shiftrt:insn><mode>3_1_slp): Ditto.
7968 (*<any_rotate:insn><mode>3_1_slp): Ditto.
7969 (*neg<mode>_1_slp): New insn_and_split pattern.
7970 (*one_cmpl<mode>_1_slp): Ditto.
7972 2021-10-12 David Edelsohn <dje.gcc@gmail.com>
7974 * doc/install.texi: Update MinGW and mingw-64 Binaries
7977 2021-10-12 Daniel Le Duc Khoi Nguyen <greenrecyclebin@gmail.com>
7979 * doc/extend.texi (Common Variable Attributes): Fix typos in
7980 alloc_size documentation.
7982 2021-10-12 Richard Biener <rguenther@suse.de>
7984 PR tree-optimization/102696
7985 * tree-vect-slp.c (vect_build_slp_tree_2): Properly mark
7986 the tree fatally failed when we reject a BIT_FIELD_REF.
7988 2021-10-12 Richard Biener <rguenther@suse.de>
7990 PR tree-optimization/102572
7991 * tree-vect-stmts.c (vect_build_gather_load_calls): When
7992 gathering the vectorized defs for the mask pass in the
7993 desired mask vector type so invariants will be handled
7996 2021-10-12 Tamar Christina <tamar.christina@arm.com>
7998 * config/aarch64/aarch64-sve.md (*fcm<cmp_op><mode>_bic_combine,
7999 *fcm<cmp_op><mode>_nor_combine, *fcmuo<mode>_bic_combine,
8000 *fcmuo<mode>_nor_combine): New.
8002 2021-10-12 Eric Botcazou <ebotcazou@adacore.com>
8005 * config/sparc/sparc-modes.def (OI): New integer mode.
8007 2021-10-12 Jakub Jelinek <jakub@redhat.com>
8009 * gimple-fold.h (clear_padding_type_may_have_padding_p): Declare.
8010 * gimple-fold.c (clear_padding_type_may_have_padding_p): No longer
8013 2021-10-12 Jakub Jelinek <jakub@redhat.com>
8015 * tree-vectorizer.h (loop_cost_model): New function.
8016 (unlimited_cost_model): Use it.
8017 * tree-vect-loop.c (vect_analyze_loop_costing): Use loop_cost_model
8018 call instead of flag_vect_cost_model.
8019 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment): Likewise.
8020 (vect_prune_runtime_alias_test_list): Likewise. Also use it instead
8021 of flag_simd_cost_model.
8023 2021-10-12 liuhongt <hongtao.liu@intel.com>
8026 * config/i386/i386-expand.c (emit_reduc_half): Handle
8028 * config/i386/mmx.md (reduc_<code>_scal_v4qi): New expander.
8029 (reduc_plus_scal_v4qi): Ditto.
8031 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8033 * config/rs6000/smmintrin.h (_mm_cmpeq_epi64, _mm_cmpgt_epi64,
8034 _mm_mullo_epi32, _mm_mul_epi32, _mm_packus_epi32): New.
8035 * config/rs6000/nmmintrin.h: Copy from i386, tweak to suit.
8037 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8039 * config/rs6000/smmintrin.h (_mm_cvtepi8_epi16, _mm_cvtepi8_epi32,
8040 _mm_cvtepi8_epi64, _mm_cvtepi16_epi32, _mm_cvtepi16_epi64,
8041 _mm_cvtepi32_epi64, _mm_cvtepu8_epi16, _mm_cvtepu8_epi32,
8042 _mm_cvtepu8_epi64, _mm_cvtepu16_epi32, _mm_cvtepu16_epi64,
8043 _mm_cvtepu32_epi64): New.
8045 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8047 * config/rs6000/smmintrin.h (_mm_test_all_zeros,
8048 _mm_test_all_ones, _mm_test_mix_ones_zeros): Rewrite as macro.
8050 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
8052 * config/rs6000/smmintrin.h (_mm_min_epi8, _mm_min_epu16,
8053 _mm_min_epi32, _mm_min_epu32, _mm_max_epi8, _mm_max_epu16,
8054 _mm_max_epi32, _mm_max_epu32): New.
8056 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
8058 * ipa-modref-tree.h (struct modref_access_node): Revert
8060 (struct modref_ref_node): Likewise.
8062 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
8064 * ipa-modref-tree.h (modref_tree::global_access_p): New member
8067 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
8068 ignore_stores_eaf_flags): Move to ipa-modref.h
8069 (remove_useless_eaf_flags): Remove early exit on NOCLOBBER.
8070 (modref_summary::global_memory_read_p): New member function.
8071 (modref_summary::global_memory_written_p): New member function.
8072 * ipa-modref.h (modref_summary::global_memory_read_p,
8073 modref_summary::global_memory_written_p): Declare.
8074 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
8075 ignore_stores_eaf_flags): move here.
8076 * tree-ssa-structalias.c: Include ipa-modref-tree.h, ipa-modref.h
8078 (handle_rhs_call): Rewrite.
8079 (handle_call_arg): New function.
8080 (determine_global_memory_access): New function.
8081 (handle_const_call): Remove
8082 (handle_pure_call): Remove
8083 (find_func_aliases_for_call): Update use of handle_rhs_call.
8084 (compute_points_to_sets): Handle global memory acccesses
8087 2021-10-11 Diane Meirowitz <diane.meirowitz@oracle.com>
8089 * doc/invoke.texi: Add link to UndefinedBehaviorSanitizer
8090 documentation, mention UBSAN_OPTIONS, similar to what is done
8091 for AddressSanitizer.
8093 2021-10-11 Richard Biener <rguenther@suse.de>
8095 PR middle-end/102683
8096 * internal-fn.c (expand_DEFERRED_INIT): Check for mode
8097 availability before building an integer type for storage
8100 2021-10-11 Richard Biener <rguenther@suse.de>
8102 PR middle-end/101480
8103 * gimple.c (gimple_call_fnspec): Do not mark operator new/delete
8106 2021-10-11 Martin Liska <mliska@suse.cz>
8108 * common.opt: Remove Init(2) for some options.
8109 * toplev.c (process_options): Do not use AUTODETECT_VALUE, but
8110 use rather OPTION_SET_P.
8112 2021-10-11 Martin Liska <mliska@suse.cz>
8114 * common.opt: Remove usage of IRA_REGION_AUTODETECT.
8115 * flag-types.h (enum ira_region): Likewise.
8116 * toplev.c (process_options): Use OPTION_SET_P instead of
8117 IRA_REGION_AUTODETECT.
8119 2021-10-11 Jakub Jelinek <jakub@redhat.com>
8121 * omp-low.c (omp_runtime_api_call): Handle omp_get_max_teams,
8122 omp_[sg]et_teams_thread_limit and omp_set_num_teams.
8124 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
8126 * config/s390/s390-protos.h (s390_rawmemchr): Add prototype.
8127 * config/s390/s390.c (s390_rawmemchr): New function.
8128 * config/s390/s390.md (rawmemchr<SINT:mode>): New expander.
8129 * config/s390/vector.md (@vec_vfees<mode>): Basically a copy of
8130 the pattern vfees<mode> from vx-builtins.md.
8131 * config/s390/vx-builtins.md (*vfees<mode>): Remove.
8133 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
8135 * builtins.c (get_memory_rtx): Change to external linkage.
8136 * builtins.h (get_memory_rtx): Add function prototype.
8137 * doc/md.texi (rawmemchr<mode>): Document.
8138 * internal-fn.c (expand_RAWMEMCHR): Define.
8139 * internal-fn.def (RAWMEMCHR): Add.
8140 * optabs.def (rawmemchr_optab): Add.
8141 * tree-loop-distribution.c (find_single_drs): Change return code
8142 behaviour by also returning true if no single store was found
8144 (loop_distribution::classify_partition): Respect the new return
8145 code behaviour of function find_single_drs.
8146 (loop_distribution::execute): Call new function
8147 transform_reduction_loop in order to replace rawmemchr or strlen
8148 like loops by calls into builtins.
8149 (generate_reduction_builtin_1): New function.
8150 (generate_rawmemchr_builtin): New function.
8151 (generate_strlen_builtin_1): New function.
8152 (generate_strlen_builtin): New function.
8153 (generate_strlen_builtin_using_rawmemchr): New function.
8154 (reduction_var_overflows_first): New function.
8155 (determine_reduction_stmt_1): New function.
8156 (determine_reduction_stmt): New function.
8157 (loop_distribution::transform_reduction_loop): New function.
8159 2021-10-11 Martin Liska <mliska@suse.cz>
8161 * tree.c (cl_option_hasher::hash): Use cl_optimization_hash
8162 and remove legacy hashing code.
8164 2021-10-11 Kito Cheng <kito.cheng@sifive.com>
8167 * builtins.c (maybe_emit_call_builtin___clear_cache): Allow
8168 CONST_INT for BEGIN and END, and use gcc_assert rather than
8171 2021-10-10 Jakub Jelinek <jakub@redhat.com>
8174 * var-tracking.c (add_stores): For cselib_sp_derived_value_p values
8175 use MO_VAL_SET if loc is not sp.
8177 2021-10-10 Andrew Pinski <apinski@marvell.com>
8179 PR tree-optimization/102622
8180 * match.pd: Swap the order of a?pow2cst:0 and a?-1:0 transformations.
8181 Swap the order of a?0:pow2cst and a?0:-1 transformations.
8183 2021-10-09 liuhongt <hongtao.liu@intel.com>
8186 * config/i386/i386-expand.c (ix86_valid_mask_cmp_mode): Handle
8188 (ix86_use_mask_cmp_p): Ditto.
8189 (ix86_expand_sse_movcc): Ditto.
8190 * config/i386/i386.md (setcc_hf_mask): New define_insn.
8191 (movhf_mask): Ditto.
8192 (UNSPEC_MOVCC_MASK): New unspec.
8193 * config/i386/sse.md (UNSPEC_PCMP): Move to i386.md.
8195 2021-10-08 Vladimir N. Makarov <vmakarov@redhat.com>
8197 PR rtl-optimization/102627
8198 * lra-constraints.c (split_reg): Use at least natural mode of hard reg.
8200 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
8202 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Grow
8205 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
8207 * value-range.cc (irange::debug): New.
8208 * value-range.h (irange::debug): New.
8210 2021-10-08 Richard Sandiford <richard.sandiford@arm.com>
8212 PR tree-optimization/102385
8213 * predict.h (change_edge_frequency): Declare.
8214 * predict.c (change_edge_frequency): New function.
8215 * tree-ssa-loop-manip.h (tree_transform_and_unroll_loop): Remove
8217 (tree_unroll_loop): Likewise.
8218 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Update accordingly.
8219 * tree-predcom.c (pcom_worker::tree_predictive_commoning_loop):
8221 * tree-ssa-loop-prefetch.c (loop_prefetch_arrays): Likewise.
8222 * tree-ssa-loop-manip.c (tree_unroll_loop): Likewise.
8223 (tree_transform_and_unroll_loop): Likewise. Use single_dom_exit
8224 to retrieve the exit edges. Make all the old profile update code
8225 conditional on !single_loop_p -- the case it was written for --
8226 and use a different approach for the single-loop case.
8228 2021-10-08 Martin Liska <mliska@suse.cz>
8230 * config/alpha/alpha.c (alpha_option_override): Use new macro
8232 * config/arc/arc.c (arc_override_options): Likewise.
8233 * config/arm/arm.c (arm_option_override): Likewise.
8234 * config/bfin/bfin.c (bfin_load_pic_reg): Likewise.
8235 * config/c6x/c6x.c (c6x_option_override): Likewise.
8236 * config/csky/csky.c: Likewise.
8237 * config/darwin.c (darwin_override_options): Likewise.
8238 * config/frv/frv.c (frv_option_override): Likewise.
8239 * config/i386/djgpp.h: Likewise.
8240 * config/i386/i386.c (ix86_stack_protect_guard): Likewise.
8241 (ix86_max_noce_ifcvt_seq_cost): Likewise.
8242 * config/ia64/ia64.c (ia64_option_override): Likewise.
8243 (ia64_override_options_after_change): Likewise.
8244 * config/m32c/m32c.c (m32c_option_override): Likewise.
8245 * config/m32r/m32r.c (m32r_init): Likewise.
8246 * config/m68k/m68k.c (m68k_option_override): Likewise.
8247 * config/microblaze/microblaze.c (microblaze_option_override): Likewise.
8248 * config/mips/mips.c (mips_option_override): Likewise.
8249 * config/nios2/nios2.c (nios2_option_override): Likewise.
8250 * config/nvptx/nvptx.c (nvptx_option_override): Likewise.
8251 * config/pa/pa.c (pa_option_override): Likewise.
8252 * config/riscv/riscv.c (riscv_option_override): Likewise.
8253 * config/rs6000/aix71.h: Likewise.
8254 * config/rs6000/aix72.h: Likewise.
8255 * config/rs6000/aix73.h: Likewise.
8256 * config/rs6000/rs6000.c (darwin_rs6000_override_options): Likewise.
8257 (rs6000_override_options_after_change): Likewise.
8258 (rs6000_linux64_override_options): Likewise.
8259 (glibc_supports_ieee_128bit): Likewise.
8260 (rs6000_option_override_internal): Likewise.
8261 (rs6000_file_start): Likewise.
8262 (rs6000_darwin_file_start): Likewise.
8263 * config/rs6000/rtems.h: Likewise.
8264 * config/rs6000/sysv4.h: Likewise.
8265 * config/rs6000/vxworks.h (SUB3TARGET_OVERRIDE_OPTIONS): Likewise.
8266 * config/s390/s390.c (s390_option_override): Likewise.
8267 * config/sh/linux.h: Likewise.
8268 * config/sh/netbsd-elf.h (while): Likewise.
8269 * config/sh/sh.c (sh_option_override): Likewise.
8270 * config/sol2.c (solaris_override_options): Likewise.
8271 * config/sparc/sparc.c (sparc_option_override): Likewise.
8272 * config/tilegx/tilegx.c (tilegx_option_override): Likewise.
8273 * config/visium/visium.c (visium_option_override): Likewise.
8274 * config/vxworks.c (vxworks_override_options): Likewise.
8275 * lto-opts.c (lto_write_options): Likewise.
8276 * omp-expand.c (expand_omp_simd): Likewise.
8277 * omp-general.c (omp_max_vf): Likewise.
8278 * omp-offload.c (oacc_xform_loop): Likewise.
8279 * opts.h (OPTION_SET_P): Likewise.
8280 * targhooks.c (default_max_noce_ifcvt_seq_cost): Likewise.
8281 * toplev.c (process_options): Likewise.
8282 * tree-predcom.c: Likewise.
8283 * tree-sra.c (analyze_all_variable_accesses): Likewise.
8285 2021-10-08 liuhongt <hongtao.liu@intel.com>
8288 * config/i386/i386.c (ix86_optab_supported_p):
8289 Return true for HFmode.
8290 * match.pd: Simplify (_Float16) ceil ((double) x) to
8291 __builtin_ceilf16 (a) when a is _Float16 type and
8292 direct_internal_fn_supported_p.
8294 2021-10-08 liuhongt <hongtao.liu@intel.com>
8297 * config/i386/i386-expand.c (emit_reduc_half): Hanlde V4HImode.
8298 * config/i386/mmx.md (reduc_plus_scal_v4hi): New.
8299 (reduc_<code>_scal_v4hi): New.
8301 2021-10-08 liuhongt <hongtao.liu@intel.com>
8303 * common.opt (ftree-vectorize): Add Var(flag_tree_vectorize).
8304 * doc/invoke.texi (Options That Control Optimization): Update
8306 * opts.c (default_options_table): Enable auto-vectorization at
8307 O2 with very-cheap cost model.
8308 (finish_options): Use cheap cost model for
8309 explicit -ftree{,-loop}-vectorize.
8311 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
8313 * ctfc.c (ctfc_delete_container): Free hash table contents.
8315 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
8317 * toplev.c (process_options): Do not warn for GNU GIMPLE.
8319 2021-10-07 Siddhesh Poyarekar <siddhesh@gotplt.org>
8321 * tree-object-size.c (addr_object_size,
8322 compute_builtin_object_size): Drop PDECL and POFF arguments.
8323 (addr_object_size): Adjust calls.
8324 * tree-object-size.h (compute_builtin_object_size): Drop PDECL
8327 2021-10-07 Roger Sayle <roger@nextmovesoftware.com>
8329 * rtl.def (SMUL_HIGHPART, UMUL_HIGHPART): New RTX codes for
8330 representing signed and unsigned high-part multiplication resp.
8331 * simplify-rtx.c (simplify_binary_operation_1) [SMUL_HIGHPART,
8332 UMUL_HIGHPART]: Simplify high-part multiplications by zero.
8333 [SS_PLUS, US_PLUS, SS_MINUS, US_MINUS, SS_MULT, US_MULT,
8334 SS_DIV, US_DIV]: Similar simplifications for saturating
8336 (simplify_const_binary_operation) [SS_PLUS, US_PLUS, SS_MINUS,
8337 US_MINUS, SS_MULT, US_MULT, SMUL_HIGHPART, UMUL_HIGHPART]:
8338 Implement compile-time evaluation for constant operands.
8339 * dwarf2out.c (mem_loc_descriptor): Skip SMUL_HIGHPART and
8341 * doc/rtl.texi (smul_highpart, umul_highpart): Document RTX codes.
8342 * doc/md.texi (smul@var{m}3_highpart, umul@var{m3}_highpart):
8343 Mention the new smul_highpart and umul_highpart RTX codes.
8344 * doc/invoke.texi: Silence @xref "compilation" warnings.
8346 2021-10-07 Martin Jambor <mjambor@suse.cz>
8349 * ipa-prop.c (ipa_edge_args_sum_t::duplicate): Also handle the
8350 case when the source reference description corresponds to a
8351 referance taken in a function src->caller is inlined to.
8353 2021-10-07 Jan Hubicka <hubicka@ucw.cz>
8356 * ipa-modref-tree.h (modref_access_node::contains_p): Handle offsets
8358 (modref_access_node::try_merge_with): Add sanity check that there
8359 are no redundant entries in the list.
8361 2021-10-07 Richard Biener <rguenther@suse.de>
8363 PR tree-optimization/102608
8364 * tree-ssa-sccvn.c (visit_stmt): Drop .DEFERRED_INIT to
8367 2021-10-07 Martin Liska <mliska@suse.cz>
8369 * toplev.c (toplev::main): Make
8370 save_opt_decoded_options a pointer type
8371 * toplev.h: Likewise.
8373 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
8375 * config/gcn/gcn-valu.md (gather<mode>_insn_2offsets<exec>): Apply
8376 HAVE_GCN_ASM_GLOBAL_LOAD_FIXED.
8377 (scatter<mode>_insn_2offsets<exec_scatter>): Likewise.
8379 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
8381 * config/gcn/gcn-hsa.h (SRAMOPT): Include the whole option string.
8382 Adjust for new -msram-ecc=any behaviour.
8383 (ASM_SPEC): Adjust -mxnack and -msram-ecc usage.
8384 * config/gcn/gcn.c (output_file_start): Implement -msram-ecc=any.
8385 * config/gcn/mkoffload.c (EF_AMDGPU_XNACK): Rename to ...
8386 (EF_AMDGPU_XNACK_V3): ... this.
8387 (EF_AMDGPU_SRAM_ECC): Rename to ...
8388 (EF_AMDGPU_SRAM_ECC_V3): ... this.
8389 (EF_AMDGPU_FEATURE_XNACK_V4): New.
8390 (EF_AMDGPU_FEATURE_XNACK_UNSUPPORTED_V4): New.
8391 (EF_AMDGPU_FEATURE_XNACK_ANY_V4): New.
8392 (EF_AMDGPU_FEATURE_XNACK_OFF_V4): New.
8393 (EF_AMDGPU_FEATURE_XNACK_ON_V4): New.
8394 (EF_AMDGPU_FEATURE_SRAMECC_V4): New.
8395 (EF_AMDGPU_FEATURE_SRAMECC_UNSUPPORTED_V4): New.
8396 (EF_AMDGPU_FEATURE_SRAMECC_ANY_V4): New.
8397 (EF_AMDGPU_FEATURE_SRAMECC_OFF_V4): New.
8398 (EF_AMDGPU_FEATURE_SRAMECC_ON_V4): New.
8399 (SET_XNACK_ON): New.
8400 (SET_XNACK_OFF): New.
8402 (SET_SRAM_ECC_ON): New.
8403 (SET_SRAM_ECC_ANY): New.
8404 (SET_SRAM_ECC_OFF): New.
8405 (TEST_SRAM_ECC_ANY): New.
8406 (TEST_SRAM_ECC_ON): New.
8407 (main): Implement HSACOv4 and -msram-ecc=any.
8409 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
8411 * config.in: Regenerate.
8412 * config/gcn/gcn-hsa.h (X_FIJI): New macro.
8416 (A_FIJI): Rename to ...
8418 (A_900): Rename to ...
8420 (A_906): Rename to ...
8422 (A_908): Rename to ...
8424 (SRAMOPT): New macro.
8425 (ASM_SPEC): Adjust xnack option usage.
8426 * config/gcn/gcn.c (output_file_start): Adjust amdgcn_target usage.
8427 * configure: Regenerate.
8428 * configure.ac: Detect LLVM assembler dialect.
8430 2021-10-07 Richard Biener <rguenther@suse.de>
8432 * tree-pretty-print.c (dump_generic_node): Do not elide
8433 printing '&' when dumping with -gimple.
8435 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
8437 * gimple-range-cache.cc (non_null_ref::adjust_range): Call new
8439 * gimple-range-fold.cc (adjust_pointer_diff_expr): Ditto.
8440 (adjust_imagpart_expr): Ditto.
8441 * value-range.cc (irange::irange_intersect): Call new routine if
8442 RHS is a single pair.
8443 (irange::intersect): New wide_int version.
8444 * value-range.h (class irange): New prototype.
8446 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
8448 * gimple-range-edge.cc (gimple_outgoing_range::gimple_outgoing_range):
8449 Add parameter to limit size when recognizing switches.
8450 (gimple_outgoing_range::edge_range_p): Check size limit.
8451 * gimple-range-edge.h (gimple_outgoing_range): Add size field.
8452 * gimple-range-gori.cc (gori_map::calculate_gori): Ignore switches
8453 that exceed the size limit.
8454 (gori_compute::gori_compute): Add initializer.
8455 * params.opt (evrp-switch-limit): New.
8456 * doc/invoke.texi: Update docs.
8458 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
8460 * value-range.h (irange::set_varying): Use TYPE_MIN_VALUE and
8461 TYPE_MAX_VALUE instead of creating new trees when possible.
8463 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
8465 * gimple-range-cache.cc (non_null_ref::adjust_range): Check for
8466 zero and non-zero more efficently.
8468 2021-10-06 Richard Biener <rguenther@suse.de>
8471 * dumpfile.h (TDF_GIMPLE_VAL): New.
8472 (dump_flag): Re-order and adjust TDF_* flags. Make
8473 the enum uint32_t. Use std::underlying_type in the
8475 (optgroup_flag): Likewise for the operator overloads.
8476 * tree-pretty-print.c (dump_generic_node): Wrap ADDR_EXPR
8477 in _Literal if TDF_GIMPLE_VAL.
8478 * gimple-pretty-print.c (dump_gimple_assign): Add
8479 TDF_GIMPLE_VAL to flags when dumping operands where only
8480 is_gimple_val are allowed.
8481 (dump_gimple_cond): Likewise.
8483 2021-10-06 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
8485 * gimple-isel.cc (gimple_expand_vec_cond_expr): Remove redundant if
8488 2021-10-05 qing zhao <qing.zhao@oracle.com>
8490 PR middle-end/102359
8491 * gimplify.c (gimplify_decl_expr): Not add initialization for an
8492 auto variable when it has been initialized by frontend.
8494 2021-10-05 Aldy Hernandez <aldyh@redhat.com>
8496 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
8499 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
8501 * common/config/avr/avr-common.c (avr_handle_option): Mark
8502 argument as ATTRIBUTE_UNUSED.
8504 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
8506 * config/lm32/uclinux-elf.h (LINK_GCC_C_SEQUENCE_SPEC):
8507 Undefine before redefinition.
8509 2021-10-05 Richard Biener <rguenther@suse.de>
8511 * toplev.c (no_backend): Remove global var.
8512 (process_options): Pass in no_backend, move post_options
8513 langhook call to toplev::main.
8514 (do_compile): Pass in no_backend, move process_options call
8516 (toplev::run_self_tests): Check no_backend at the caller.
8517 (toplev::main): Call post_options and process_options
8518 split out from do_compile, do self-tests only if
8519 no_backend is initialized.
8521 2021-10-05 Richard Biener <rguenther@suse.de>
8523 * tree-cfg.c (dump_function_to_file): Dump the UID of the
8524 function as part of the name when requested.
8525 * tree-pretty-print.c (dump_function_name): Dump the UID when
8526 requested and the langhook produced the actual name.
8528 2021-10-05 Richard Biener <rguenther@suse.de>
8530 PR middle-end/102587
8531 PR middle-end/102285
8532 * internal-fn.c (expand_DEFERRED_INIT): Fall back to
8533 zero-initialization as last resort, use the constant
8534 size as given by the DEFERRED_INIT argument to build
8537 2021-10-04 Marek Polacek <polacek@redhat.com>
8540 * doc/invoke.texi: Document -Warray-compare.
8542 2021-10-04 Richard Biener <rguenther@suse.de>
8544 * gimplify.c (is_var_need_auto_init): DECL_HARD_REGISTER
8545 variables are not to be initialized.
8547 2021-10-04 Richard Biener <rguenther@suse.de>
8549 * expr.h (non_mem_decl_p): Declare.
8550 (mem_ref_refers_to_non_mem_p): Likewise.
8551 * expr.c (non_mem_decl_p): Export.
8552 (mem_ref_refers_to_non_mem_p): Likewise.
8553 * internal-fn.c (expand_DEFERRED_INIT): Do not expand the LHS
8554 but check the base with mem_ref_refers_to_non_mem_p
8557 2021-10-04 Richard Biener <rguenther@suse.de>
8559 PR tree-optimization/102570
8560 * tree-ssa-sccvn.h (vn_reference_op_struct): Document
8561 we are using clique for the internal function code.
8562 * tree-ssa-sccvn.c (vn_reference_op_eq): Compare the
8563 internal function code.
8564 (print_vn_reference_ops): Print the internal function code.
8565 (vn_reference_op_compute_hash): Hash it.
8566 (copy_reference_ops_from_call): Record it.
8567 (visit_stmt): Remove the restriction around internal function
8569 (fully_constant_vn_reference_p): Use fold_const_call and handle
8571 (vn_reference_eq): Compare call return types.
8572 * tree-ssa-pre.c (create_expression_by_pieces): Handle
8573 generating calls to internal functions.
8574 (compute_avail): Remove the restriction around internal function
8577 2021-10-04 Aldy Hernandez <aldyh@redhat.com>
8579 PR tree-optimization/102560
8580 * gimple-ssa-warn-alloca.c (alloca_call_type): Remove static
8581 marker for invalid_range.
8583 2021-10-04 Richard Biener <rguenther@suse.de>
8585 PR middle-end/102587
8586 * internal-fn.c (expand_DEFERRED_INIT): Guard register
8587 initialization path an avoid initializing VLA registers
8590 2021-10-04 Eric Botcazou <ebotcazou@adacore.com>
8592 * config/rs6000/vxworks.h (TARGET_INIT_LIBFUNCS): Delete.
8594 2021-10-03 Martin Liska <mliska@suse.cz>
8596 * toplev.c (toplev::main): Check opt_index if it is a part
8599 2021-10-02 Aldy Hernandez <aldyh@redhat.com>
8601 PR tree-optimization/102563
8602 * range-op.cc (operator_lshift::op1_range): Do not clobber
8605 2021-10-02 Martin Liska <mliska@suse.cz>
8607 * toplev.c (toplev::main): save_decoded_options[0] is program
8608 name and so it should be skipped.
8610 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
8612 PR tree-optimization/102546
8613 * range-op.cc (operator_lshift::op1_range): Teach range-ops that
8614 X << Y is non-zero implies X is also non-zero.
8616 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8618 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
8620 * config/aarch64/aarch64-tune.md: Regenerate.
8621 * doc/invoke.texi: Update docs.
8623 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8625 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
8627 * config/aarch64/aarch64-tune.md: Regenerate.
8628 * doc/invoke.texi: Update docs.
8630 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8632 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
8634 * config/aarch64/aarch64-tune.md: Regenerate.
8635 * doc/invoke.texi: Update docs.
8637 2021-10-01 Martin Sebor <msebor@redhat.com>
8640 * doc/invoke.texi (-Waddress): Update.
8641 * gengtype.c (write_types): Avoid -Waddress.
8642 * poly-int.h (POLY_SET_COEFF): Avoid using null.
8644 2021-10-01 John David Anglin <danglin@gcc.gnu.org>
8647 * config/pa/pa.c (pa_option_override): Default to dwarf version 4
8650 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8652 * config/aarch64/aarch64.h (AARCH64_FL_V9): Update value.
8654 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
8656 * gimple-range-path.cc (path_range_query::compute_ranges): Use
8658 * gimple-range-path.h (class path_range_query): Remove shadowed
8660 (path_range_query::get_path_oracle): New.
8662 2021-10-01 Jakub Jelinek <jakub@redhat.com>
8663 Richard Biener <rguenther@suse.de>
8666 * doc/invoke.texi (-fsanitize=integer-divide-by-zero): Remove
8667 INT_MIN / -1 division detection from here ...
8668 (-fsanitize=signed-integer-overflow): ... and add it here.
8670 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8672 * config/aarch64/aarch64-arches.def (AARCH64_ARCH): Added
8674 * config/aarch64/aarch64.h (AARCH64_FL_V9): New.
8675 (AARCH64_FL_FOR_ARCH9): New flags for Armv9-A.
8676 (AARCH64_ISA_V9): New ISA flag.
8677 * doc/invoke.texi: Update docs.
8679 2021-10-01 Martin Liska <mliska@suse.cz>
8681 * toplev.c (toplev::main): Save decoded optimization options.
8682 * toplev.h (save_opt_decoded_options): New.
8683 * doc/extend.texi: Be more clear about optimize and target
8686 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
8688 * explow.c: Include langhooks.h.
8689 (set_stack_check_libfunc): Build a proper function type.
8691 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
8694 * config/i386/i386.c (legitimate_pic_address_disp_p): For PE-COFF do
8695 not return true for external weak function symbols in medium model.
8697 2021-10-01 Jakub Jelinek <jakub@redhat.com>
8699 * tree.h (OMP_CLAUSE_ORDER_REPRODUCIBLE): Define.
8700 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_ORDER>: Print
8701 reproducible: for OMP_CLAUSE_ORDER_REPRODUCIBLE.
8702 * omp-general.c (omp_extract_for_data): If OMP_CLAUSE_ORDER is seen
8703 without OMP_CLAUSE_ORDER_UNCONSTRAINED, overwrite sched_kind to
8704 OMP_CLAUSE_SCHEDULE_STATIC.
8706 2021-10-01 Richard Biener <rguenther@suse.de>
8708 PR middle-end/102518
8709 * tree-inline.c (setup_one_parameter): Avoid substituting
8710 an invariant into contexts where a GIMPLE register is not valid.
8712 2021-09-30 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8714 * config/arm/arm-cpus.in: Add Cortex-R52+ CPU.
8715 * config/arm/arm-tables.opt: Regenerate.
8716 * config/arm/arm-tune.md: Regenerate.
8717 * doc/invoke.texi: Update docs.
8719 2021-09-30 Uroš Bizjak <ubizjak@gmail.com>
8722 * config/i386/i386.md
8723 (sign_extend:WIDE (any_logic:NARROW (memory, immediate)) splitters):
8726 2021-09-30 Tobias Burnus <tobias@codesourcery.com>
8728 * omp-low.c (omp_runtime_api_call): Add omp_aligned_{,c}alloc and
8729 omp_{c,re}alloc, fix omp_alloc/omp_free.
8731 2021-09-30 Martin Liska <mliska@suse.cz>
8733 * defaults.h (ASM_OUTPUT_ASCII): Do not hide global variable
8734 asm_out_file and stream directly to MYFILE.
8736 2021-09-30 Richard Biener <rguenther@suse.de>
8738 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
8739 Restore and fix condition under which we apply npeel to
8740 the DRs misalignment value.
8742 2021-09-30 Richard Biener <rguenther@suse.de>
8744 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
8745 Fix npeel check for variable amount of peeling.
8747 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
8749 * lto-wrapper.c (run_gcc): Plug snprintf overflow.
8751 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
8753 * gimple-range.cc (gimple_ranger::debug): New.
8754 * gimple-range.h (class gimple_ranger): Add debug.
8756 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
8758 PR middle-end/102519
8759 * tree-vrp.c (hybrid_threader::~hybrid_threader): Free m_query.
8761 2021-09-29 Indu Bhagat <indu.bhagat@oracle.com>
8764 * btfout.c (GTY): Add GTY (()) albeit for cosmetic only purpose.
8765 (btf_finalize): Empty the hash_map btf_var_ids.
8767 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
8769 * tree-vrp.c (thread_through_all_blocks): Return bool.
8770 (execute_vrp_threader): Return TODO_* flags.
8771 (pass_data_vrp_threader): Set todo_flags_finish to 0.
8773 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
8775 * timevar.def (TV_TREE_VRP_THREADER): New.
8776 * tree-vrp.c: Use TV_TREE_VRP_THREADER for VRP threader pass.
8778 2021-09-29 David Faust <david.faust@oracle.com>
8780 * config.gcc (bpf-*-*): Do not overwrite extra_headers.
8782 2021-09-29 Jonathan Wright <jonathan.wright@arm.com>
8784 * config/aarch64/aarch64-builtins.c (TYPES_BINOP_PPU): Define
8785 new type qualifier enum.
8786 (TYPES_TERNOP_SSSU): Likewise.
8787 (TYPES_TERNOP_PPPU): Likewise.
8788 * config/aarch64/aarch64-simd-builtins.def: Define PPU, SSU,
8789 PPPU and SSSU builtin generator macros for qtbl1 and qtbx1
8791 * config/aarch64/arm_neon.h (vqtbl1_p8): Use type-qualified
8792 builtin and remove casts.
8793 (vqtbl1_s8): Likewise.
8794 (vqtbl1q_p8): Likewise.
8795 (vqtbl1q_s8): Likewise.
8796 (vqtbx1_s8): Likewise.
8797 (vqtbx1_p8): Likewise.
8798 (vqtbx1q_s8): Likewise.
8799 (vqtbx1q_p8): Likewise.
8800 (vtbl1_p8): Likewise.
8801 (vtbl2_p8): Likewise.
8802 (vtbx2_p8): Likewise.
8804 2021-09-29 Richard Biener <rguenther@suse.de>
8806 * tree-vect-data-refs.c (vect_dr_misalign_for_aligned_access):
8808 (vect_update_misalignment_for_peel): Use it to update
8809 misaligned to the value necessary for an aligned access.
8810 (vect_get_peeling_costs_all_drs): Likewise.
8811 (vect_enhance_data_refs_alignment): Likewise.
8813 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
8815 * config/aarch64/aarch64.c (aarch64_expand_cpymem): Count number of
8816 emitted operations and adjust heuristic for code size.
8818 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
8820 * config/aarch64/aarch64.c (aarch64_expand_setmem): Count number of
8821 emitted operations and adjust heuristic for code size.
8823 2021-09-29 Jakub Jelinek <jakub@redhat.com>
8825 PR middle-end/102504
8826 * gimplify.c (gimplify_scan_omp_clauses): Use omp_check_private even
8827 in OMP_SCOPE clauses, not just on worksharing construct clauses.
8829 2021-09-28 Geng Qi <gengqi@linux.alibaba.com>
8831 * config/riscv/riscv.md (mulv<mode>4): Call gen_smul<mode>3_highpart.
8832 (<u>mulditi3): Call <su>muldi3_highpart.
8833 (<u>muldi3_highpart): Rename to <su>muldi3_highpart.
8834 (<u>mulsidi3): Call <su>mulsi3_highpart.
8835 (<u>mulsi3_highpart): Rename to <su>mulsi3_highpart.
8837 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
8839 * config/darwin.h (DSYMUTIL_SPEC): Recognize D sources.
8841 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
8843 * config/rs6000/darwin.h (FIXED_R13): Add for PPC64.
8844 (FIRST_SAVED_GP_REGNO): Save from R13 even when it is one
8847 2021-09-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
8849 * config/aarch64/aarch64.h (AARCH64_FL_LS64): Define
8850 (AARCH64_FL_V8_7): Likewise.
8851 (AARCH64_FL_FOR_ARCH8_7): Likewise.
8852 * config/aarch64/aarch64-arches.def (armv8.7-a): Define.
8853 * config/aarch64/aarch64-option-extensions.def (ls64): Define.
8854 * doc/invoke.texi: Document the above.
8856 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
8858 * dbgcnt.c (dbg_cnt_counter): New.
8859 * dbgcnt.h (dbg_cnt_counter): New.
8860 * dumpfile.c (dump_options): Add entry for TDF_THREADING.
8861 * dumpfile.h (enum dump_flag): Add TDF_THREADING.
8862 * gimple-range-path.cc (DEBUG_SOLVER): Use TDF_THREADING.
8863 * tree-ssa-threadupdate.c (dump_jump_thread_path): Dump out
8866 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
8868 * cfgcleanup.c (pass_jump::execute): Check
8869 flag_expensive_optimizations.
8870 (pass_jump_after_combine::gate): Same.
8871 * doc/invoke.texi (-fthread-jumps): Enable for -O1.
8872 * opts.c (default_options_table): Enable -fthread-jumps at -O1.
8873 * tree-ssa-threadupdate.c
8874 (fwd_jt_path_registry::remove_jump_threads_including): Bail unless
8877 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
8879 * tree-ssa-reassoc.c (biased_names): New global.
8880 (propagate_bias_p): New function.
8881 (loop_carried_phi): Remove.
8882 (propagate_rank): Propagate bias along single uses.
8883 (get_rank): Update biased_names when needed.
8885 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
8887 * passes.def (pass_reassoc): Rename parameter to early_p.
8888 * tree-ssa-reassoc.c (reassoc_bias_loop_carried_phi_ranks_p):
8890 (phi_rank): Don't bias loop-carried phi ranks
8891 before vectorization pass.
8892 (execute_reassoc): Add bias_loop_carried_phi_ranks_p parameter.
8893 (pass_reassoc::pass_reassoc): Add bias_loop_carried_phi_ranks_p
8895 (pass_reassoc::set_param): Set bias_loop_carried_phi_ranks_p
8897 (pass_reassoc::execute): Pass bias_loop_carried_phi_ranks_p to
8899 (pass_reassoc::bias_loop_carried_phi_ranks_p): New member.
8901 2021-09-28 Jakub Jelinek <jakub@redhat.com>
8904 * config/i386/i386.c (standard_80387_constant_p): Don't recognize
8905 special 80387 instruction XFmode constants if flag_rounding_math.
8907 2021-09-28 Richard Biener <rguenther@suse.de>
8909 PR tree-optimization/100112
8910 * tree-ssa-sccvn.c (visit_reference_op_load): Record the
8911 referece into the hashtable twice in case last_vuse is
8912 different from the original vuse on the stmt.
8914 2021-09-28 Jakub Jelinek <jakub@redhat.com>
8916 PR middle-end/102492
8917 * gimplify.c (gimplify_adjust_omp_clauses_1): Don't call the
8918 omp_finish_clause langhook on implicitly added OMP_CLAUSE_PRIVATE
8919 clauses on SIMD constructs.
8921 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
8923 PR tree-optimization/102511
8924 * gimple-range-path.cc (path_range_query::range_on_path_entry):
8925 Return VARYING when nothing found.
8927 2021-09-28 Hongyu Wang <hongyu.wang@intel.com>
8930 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): Add
8932 (VALID_SSE2_REG_VHF_MODE): Add V4HFmode and V2HFmode.
8933 (VALID_MMX_REG_MODE): Add V4HFmode.
8934 (SSE_REG_MODE_P): Replace VALID_AVX512FP16_REG_MODE with
8935 vector mode condition.
8936 * config/i386/i386.c (classify_argument): Parse V4HF/V2HF
8938 (function_arg_32): Add V4HFmode.
8939 (function_arg_advance_32): Likewise.
8940 * config/i386/i386.md (mode): Add V4HF/V2HF.
8941 (MODE_SIZE): Likewise.
8942 * config/i386/mmx.md (MMXMODE): Add V4HF mode.
8943 (V_32): Add V2HF mode.
8944 (VHF_32_64): New mode iterator.
8945 (*mov<mode>_internal): Adjust sse alternatives to support
8947 (*mov<mode>_internal): Adjust sse alternatives to support
8949 (<insn><mode>3): New define_insn for add/sub/mul/div.
8951 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
8953 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Check
8955 (pass_early_thread_jumps::gate): Same.
8956 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
8957 Return if !flag_thread_jumps.
8958 * tree-ssa-threadupdate.c
8959 (jt_path_registry::register_jump_thread): Assert that
8960 flag_thread_jumps is true.
8962 2021-09-28 liuhongt <hongtao.liu@intel.com>
8965 (simplify_context::simplify_binary_operation_1): Relax
8966 condition of simplifying (vec_concat:M (vec_select op0
8967 index0)(vec_select op1 index1)) to allow different modes
8968 between op0 and M, but have same inner mode.
8970 2021-09-28 liuhongt <hongtao.liu@intel.com>
8972 * config/i386/i386-expand.c (emit_reduc_half): Handle
8973 V8HF/V16HF/V32HFmode.
8974 * config/i386/sse.md (REDUC_SSE_PLUS_MODE): Add V8HF.
8975 (REDUC_SSE_SMINMAX_MODE): Ditto.
8976 (REDUC_PLUS_MODE): Add V16HF and V32HF.
8977 (REDUC_SMINMAX_MODE): Ditto.
8979 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
8981 * gimple-range-path.cc
8982 (path_range_query::precompute_ranges_in_block): Rename to...
8983 (path_range_query::compute_ranges_in_block): ...this.
8984 (path_range_query::precompute_ranges): Rename to...
8985 (path_range_query::compute_ranges): ...this.
8986 (path_range_query::precompute_relations): Rename to...
8987 (path_range_query::compute_relations): ...this.
8988 (path_range_query::precompute_phi_relations): Rename to...
8989 (path_range_query::compute_phi_relations): ...this.
8990 * gimple-range-path.h: Rename precompute* to compute*.
8991 * tree-ssa-threadbackward.c
8992 (back_threader::find_taken_edge_switch): Same.
8993 (back_threader::find_taken_edge_cond): Same.
8994 * tree-ssa-threadedge.c
8995 (hybrid_jt_simplifier::compute_ranges_from_state): Same.
8996 (hybrid_jt_state::register_equivs_stmt): Inline...
8997 * tree-ssa-threadedge.h: ...here.
8999 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9001 * tree-vrp.c (lhs_of_dominating_assert): Remove.
9002 (class vrp_jt_state): Remove.
9003 (class vrp_jt_simplifier): Remove.
9004 (vrp_jt_simplifier::simplify): Remove.
9005 (class vrp_jump_threader): Remove.
9006 (vrp_jump_threader::vrp_jump_threader): Remove.
9007 (vrp_jump_threader::~vrp_jump_threader): Remove.
9008 (vrp_jump_threader::before_dom_children): Remove.
9009 (vrp_jump_threader::after_dom_children): Remove.
9011 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9013 * passes.def (pass_vrp_threader): New.
9014 * tree-pass.h (make_pass_vrp_threader): Add make_pass_vrp_threader.
9015 * tree-ssa-threadedge.c (hybrid_jt_state::register_equivs_stmt): New.
9016 (hybrid_jt_simplifier::hybrid_jt_simplifier): New.
9017 (hybrid_jt_simplifier::simplify): New.
9018 (hybrid_jt_simplifier::compute_ranges_from_state): New.
9019 * tree-ssa-threadedge.h (class hybrid_jt_state): New.
9020 (class hybrid_jt_simplifier): New.
9021 * tree-vrp.c (execute_vrp): Remove ASSERT_EXPR based jump
9023 (class hybrid_threader): New.
9024 (hybrid_threader::hybrid_threader): New.
9025 (hybrid_threader::~hybrid_threader): New.
9026 (hybrid_threader::before_dom_children): New.
9027 (hybrid_threader::after_dom_children): New.
9028 (execute_vrp_threader): New.
9029 (class pass_vrp_threader): New.
9030 (make_pass_vrp_threader): New.
9032 2021-09-27 Martin Liska <mliska@suse.cz>
9034 * output.h (enum section_flag): New.
9035 (SECTION_FORGET): Remove.
9036 (SECTION_ENTSIZE): Make it (1UL << 8) - 1.
9037 (SECTION_STYLE_MASK): Define it based on other enum
9039 * varasm.c (switch_to_section): Remove unused handling of
9042 2021-09-27 Martin Liska <mliska@suse.cz>
9044 * common.opt: Add new variable flag_default_complex_method.
9045 * opts.c (finish_options): Handle flags related to
9046 x_flag_complex_method.
9047 * toplev.c (process_options): Remove option handling related
9048 to flag_complex_method.
9050 2021-09-27 Richard Biener <rguenther@suse.de>
9052 PR middle-end/102450
9053 * gimple-fold.c (gimple_fold_builtin_memory_op): Avoid using
9054 type_for_size, instead use int_mode_for_size.
9056 2021-09-27 Andrew Pinski <apinski@marvell.com>
9059 * gimplify.c (gimplify_save_expr): Return early
9060 if the type of val is error_mark_node.
9062 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9064 * tree-ssanames.c (ssa_name_has_boolean_range): Use
9067 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
9069 * gimple-ssa-evrp-analyze.h (class evrp_range_analyzer): Remove
9070 vrp_visit_cond_stmt.
9071 * tree-ssa-dom.c (cprop_operand): Convert to range_query API.
9072 (cprop_into_stmt): Same.
9073 (dom_opt_dom_walker::optimize_stmt): Same.
9075 2021-09-27 Richard Biener <rguenther@suse.de>
9077 PR tree-optimization/97351
9078 PR tree-optimization/97352
9079 PR tree-optimization/82426
9080 * tree-vectorizer.h (dr_misalignment): Add vector type
9082 (aligned_access_p): Likewise.
9083 (known_alignment_for_access_p): Likewise.
9084 (vect_supportable_dr_alignment): Likewise.
9085 (vect_known_alignment_in_bytes): Likewise. Refactor.
9086 (DR_MISALIGNMENT): Remove.
9087 (vect_update_shared_vectype): Likewise.
9088 * tree-vect-data-refs.c (dr_misalignment): Refactor, handle
9089 a vector type with larger alignment requirement and apply
9090 the negative step adjustment here.
9091 (vect_calculate_target_alignment): Remove.
9092 (vect_compute_data_ref_alignment): Get explicit vector type
9093 argument, do not apply a negative step alignment adjustment
9095 (vect_slp_analyze_node_alignment): Re-analyze alignment
9096 when we re-visit the DR with a bigger desired alignment but
9097 keep more precise results from smaller alignments.
9098 * tree-vect-slp.c (vect_update_shared_vectype): Remove.
9099 (vect_slp_analyze_node_operations_1): Do not update the
9100 shared vector type on stmts.
9101 * tree-vect-stmts.c (vect_analyze_stmt): Push/pop the
9102 vector type of an SLP node to the representative stmt-info.
9103 (vect_transform_stmt): Likewise.
9105 2021-09-27 liuhongt <hongtao.liu@intel.com>
9108 2021-09-09 liuhongt <hongtao.liu@intel.com>
9111 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
9112 (reduc_plus_scal_v4sf): .. this, New define_expand.
9113 (reduc_plus_scal_v2df): .. and this, New define_expand.
9115 2021-09-26 liuhongt <hongtao.liu@intel.com>
9117 * doc/extend.texi (Half-Precision): Remove storage only
9118 description for _Float16 w/o avx512fp16.
9120 2021-09-25 Dimitar Dimitrov <dimitar@dinux.eu>
9122 * config/pru/constraints.md (Rrio): New constraint.
9123 * config/pru/predicates.md (regio_operand): New predicate.
9124 * config/pru/pru-pragma.c (pru_register_pragmas): Register
9125 the __regio_symbol address space.
9126 * config/pru/pru-protos.h (pru_symref2ioregno): Declaration.
9127 * config/pru/pru.c (pru_symref2ioregno): New helper function.
9128 (pru_legitimate_address_p): Remove.
9129 (pru_addr_space_legitimate_address_p): Use the address space
9131 (pru_nongeneric_pointer_addrspace): New helper function.
9132 (pru_insert_attributes): New function to validate __regio_symbol
9134 (TARGET_INSERT_ATTRIBUTES): New macro.
9135 (TARGET_LEGITIMATE_ADDRESS_P): Remove.
9136 (TARGET_ADDR_SPACE_LEGITIMATE_ADDRESS_P): New macro.
9137 * config/pru/pru.h (enum reg_class): Add REGIO_REGS class.
9138 * config/pru/pru.md (*regio_readsi): New pattern to read I/O
9140 (*regio_nozext_writesi): New pattern to write to I/O registers.
9141 (*regio_zext_write_r30<EQS0:mode>): Ditto.
9142 * doc/extend.texi: Document the new PRU Named Address Space.
9144 2021-09-24 Patrick Palka <ppalka@redhat.com>
9148 * real.c (encode_ieee_double): Avoid unwanted sign extension.
9149 (encode_ieee_quad): Likewise.
9151 2021-09-24 Vladimir Makarov <vmakarov@redhat.com>
9153 PR rtl-optimization/102147
9154 * ira-build.c (ira_conflict_vector_profitable_p): Make
9155 profitability calculation independent of host compiler pointer and
9158 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
9160 * gimple-range-path.cc (path_range_query::path_range_query):
9161 Move debugging header...
9162 (path_range_query::precompute_ranges): ...here.
9163 (path_range_query::internal_range_of_expr): Do not call
9164 range_on_path_entry if NAME is defined in the current block.
9166 2021-09-24 Richard Biener <rguenther@suse.de>
9168 * cfghooks.c (verify_flow_info): Verify unallocated BB and
9169 edge flags are not set.
9171 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
9173 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
9175 (jt_path_registry::register_jump_thread): Call
9176 cancel_invalid_paths.
9177 * tree-ssa-threadupdate.h (class jt_path_registry): Add
9178 cancel_invalid_paths.
9180 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
9182 PR tree-optimization/102400
9183 * tree-ssa-sccvn.c (vn_reference_insert_pieces): Initialize
9184 result_vdef to zero value.
9186 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
9188 PR tree-optimization/102451
9189 * tree-ssa-dse.c (delete_dead_or_redundant_call): Record bb of stmt
9192 2021-09-24 Hongyu Wang <hongyu.wang@intel.com>
9194 * config/i386/sse.md (cond_<insn><mode>): Extend to support
9196 (cond_mul<mode>): Likewise.
9197 (cond_div<mode>): Likewise.
9198 (cond_<code><mode>): Likewise.
9199 (cond_fma<mode>): Likewise.
9200 (cond_fms<mode>): Likewise.
9201 (cond_fnma<mode>): Likewise.
9202 (cond_fnms<mode>): Likewise.
9204 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
9206 PR tree-optimization/102463
9207 * gimple-range-fold.cc (fold_using_range::relation_fold_and_or): If
9208 there is no range-ops handler, don't look for a relation.
9210 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
9212 * gimple-range-cache.cc (ranger_cache::ranger_cache): Take
9213 non-executable_edge flag as parameter.
9214 * gimple-range-cache.h (ranger_cache): Adjust prototype.
9215 * gimple-range-gori.cc (gori_compute::gori_compute): Take
9216 non-executable_edge flag as parameter.
9217 (gori_compute::outgoing_edge_range_p): Check new flag.
9218 * gimple-range-gori.h (gori_compute): Adjust prototype.
9219 * gimple-range.cc (gimple_ranger::gimple_ranger): Create new flag.
9220 (gimple_ranger::range_on_edge): Check new flag.
9221 * gimple-range.h (gimple_ranger::non_executable_edge_flag): New.
9222 * gimple-ssa-evrp.c (rvrp_folder): Pass ranger flag to simplifer.
9223 (hybrid_folder::hybrid_folder): Set ranger non-executable flag value.
9224 (hybrid_folder::fold_stmt): Set flag value in the simplifer.
9225 * vr-values.c (simplify_using_ranges::set_and_propagate_unexecutable):
9226 Use not_executable flag if provided inmstead of EDGE_EXECUTABLE.
9227 (simplify_using_ranges::simplify_switch_using_ranges): Clear
9228 EDGE_EXECUTABLE like it originally did.
9229 (simplify_using_ranges::cleanup_edges_and_switches): Clear any
9230 NON_EXECUTABLE flags.
9231 (simplify_using_ranges::simplify_using_ranges): Adjust.
9232 * vr-values.h (class simplify_using_ranges): Adjust.
9233 (simplify_using_ranges::set_range_query): Add non-executable flag param.
9235 2021-09-23 Bill Schmidt <wschmidt@linux.ibm.com>
9238 * config/rs6000/rs6000-call.c (rs6000_aggregate_candidate): Detect
9239 zero-width bit fields and return indicator.
9240 (rs6000_discover_homogeneous_aggregate): Diagnose when the
9241 presence of a zero-width bit field changes parameter passing in
9244 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
9246 * gimple-range-fold.cc (fold_using_range::range_of_phi):
9247 Remove dominator check.
9249 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
9251 * gimple-range-path.cc (path_range_query::precompute_relations):
9252 Hoist edge calculations before using EDGE_SUCC.
9254 2021-09-23 Jonathan Wakely <jwakely@redhat.com>
9256 * configure.ac: Fix --with-multilib-list description.
9257 * configure: Regenerate.
9259 2021-09-23 Richard Biener <rguenther@suse.de>
9261 PR tree-optimization/102448
9262 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
9263 Clear alignment info copied from DR_PTR_INFO.
9265 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9267 * config/i386/i386-expand.c (ix86_use_mask_cmp_p): Enable
9269 * config/i386/sse.md (sseintvecmodelower): Add HF vector modes.
9270 (<avx512>_store<mode>_mask): Extend to support HF vector modes.
9271 (vec_cmp<mode><avx512fmaskmodelower>): Likewise.
9272 (vcond_mask_<mode><avx512fmaskmodelower>): Likewise.
9273 (vcond<mode><mode>): New expander.
9274 (vcond<mode><sseintvecmodelower>): Likewise.
9275 (vcond<sseintvecmodelower><mode>): Likewise.
9276 (vcondu<mode><sseintvecmodelower>): Likewise.
9278 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9280 * config/i386/sse.md (extend<ssePHmodelower><mode>2):
9282 (extendv4hf<mode>2): Likewise.
9283 (extendv2hfv2df2): Likewise.
9284 (trunc<mode><ssePHmodelower>2): Likewise.
9285 (avx512fp16_vcvt<castmode>2ph_<mode>): Rename to ...
9286 (trunc<mode>v4hf2): ... this, and drop constraints.
9287 (avx512fp16_vcvtpd2ph_v2df): Rename to ...
9288 (truncv2dfv2hf2): ... this, and likewise.
9290 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9292 * config/i386/sse.md (float<floatunssuffix><mode><ssePHmodelower>2):
9294 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>):
9296 (float<floatunssuffix><mode>v4hf2): ... this, and drop constraints.
9297 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Rename to ...
9298 (float<floatunssuffix>v2div2hf2): ... this, and likewise.
9300 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9302 * config/i386/i386.md (fix<fixunssuffix>_trunchf<mode>2): New expander.
9303 (fixuns_trunchfhi2): Likewise.
9304 (*fixuns_trunchfsi2zext): New define_insn.
9305 * config/i386/sse.md (ssePHmodelower): New mode_attr.
9306 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
9307 New expander for same element vector fix_truncate.
9308 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
9309 Likewise for V4HF to V4SI/V4DI fix_truncate.
9310 (fix<fixunssuffix>_truncv2hfv2di2):
9311 Likeise for V2HF to V2DI fix_truncate.
9313 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9315 * config/i386/i386.md (<code>hf3): New expander.
9317 2021-09-23 liuhongt <hongtao.liu@intel.com>
9319 * config/i386/sse.md (FMAMODEM): extend to handle FP16.
9320 (VFH_SF_AVX512VL): Extend to handle HFmode.
9321 (VF_SF_AVX512VL): Deleted.
9323 2021-09-23 liuhongt <hongtao.liu@intel.com>
9325 * config/i386/i386.md (rinthf2): New expander.
9326 (nearbyinthf2): New expander.
9328 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
9330 * tree-ssa-dom.c (class dom_jump_threader_simplifier): Rename...
9331 (class dom_jt_state): ...this and provide virtual overrides.
9332 (dom_jt_state::register_equiv): New.
9333 (class dom_jt_simplifier): Rename from
9334 dom_jump_threader_simplifier.
9335 (dom_jump_threader_simplifier::simplify): Rename...
9336 (dom_jt_simplifier::simplify): ...to this.
9337 (pass_dominator::execute): Use dom_jt_simplifier and
9339 * tree-ssa-threadedge.c (jump_threader::jump_threader):
9341 (jt_state::register_equivs_stmt): Abstract out...
9342 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
9344 (jump_threader::thread_around_empty_blocks): Update state.
9345 (jump_threader::thread_through_normal_block): Same.
9346 (jt_state::jt_state): Remove.
9347 (jt_state::push): Remove pass specific bits. Keep block vector
9349 (jt_state::append_path): New.
9350 (jt_state::pop): Remove pass specific bits.
9351 (jt_state::register_equiv): Same.
9352 (jt_state::record_ranges_from_stmt): Same.
9353 (jt_state::register_equivs_on_edge): Same. Rename...
9354 (jt_state::register_equivs_edge): ...to this.
9355 (jt_state::dump): New.
9356 (jt_state::debug): New.
9357 (jump_threader_simplifier::simplify): Remove.
9358 (jt_state::get_path): New.
9359 * tree-ssa-threadedge.h (class jt_simplifier): Make into a base
9360 class. Expose common functionality as virtual methods.
9361 (class jump_threader_simplifier): Same. Rename...
9362 (class jt_simplifier): ...to this.
9363 * tree-vrp.c (class vrp_jump_threader_simplifier): Rename...
9364 (class vrp_jt_simplifier): ...to this. Provide pass specific
9366 (class vrp_jt_state): New.
9367 (vrp_jump_threader_simplifier::simplify): Rename...
9368 (vrp_jt_simplifier::simplify): ...to this. Inline code from
9369 what used to be the base class.
9370 (vrp_jump_threader::vrp_jump_threader): Use vrp_jt_state and
9373 2021-09-22 Tobias Burnus <tobias@codesourcery.com>
9376 * doc/invoke.texi (-Wno-missing-include-dirs.): Document Fortran
9379 2021-09-22 Roger Sayle <roger@nextmovesoftware.com>
9380 Richard Biener <rguenther@suse.de>
9382 * match.pd (negation simplifications): Implement some negation
9383 folding transformations from fold-const.c's fold_negate_expr.
9384 * tree-ssa-sccvn.c (vn_nary_build_or_lookup_1): Add a SIMPLIFY
9385 argument, to control whether the op should be simplified prior
9386 to looking up/assigning a value number.
9387 (vn_nary_build_or_lookup): Update call to vn_nary_build_or_lookup_1.
9388 (vn_nary_simplify): Likewise.
9389 (visit_nary_op): Likewise, but when constructing a NEGATE_EXPR
9390 now call vn_nary_build_or_lookup_1 disabling simplification.
9392 2021-09-22 Jiufu Guo <guojiufu@linux.ibm.com>
9394 PR tree-optimization/102087
9395 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
9396 Update bound/cmp/control for niter.
9398 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
9400 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
9401 Move check for non-empty BB here.
9402 (fur_source::register_outgoing_edges): ...from here.
9404 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
9406 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
9407 Remove call to improve_range_with_equivs.
9408 (path_range_query::improve_range_with_equivs): Remove
9409 * gimple-range-path.h: Remove improve_range_with_equivs.
9411 2021-09-22 dianhong xu <dianhong.xu@intel.com>
9413 * config/i386/avx512fp16intrin.h:
9414 (_mm512_mask_blend_ph): New intrinsic.
9415 (_mm512_permutex2var_ph): Ditto.
9416 (_mm512_permutexvar_ph): Ditto.
9417 * config/i386/avx512fp16vlintrin.h:
9418 (_mm256_mask_blend_ph): New intrinsic.
9419 (_mm256_permutex2var_ph): Ditto.
9420 (_mm256_permutexvar_ph): Ditto.
9421 (_mm_mask_blend_ph): Ditto.
9422 (_mm_permutex2var_ph): Ditto.
9423 (_mm_permutexvar_ph): Ditto.
9425 2021-09-22 dianhong xu <dianhong.xu@intel.com>
9427 * config/i386/avx512fp16intrin.h: Add new intrinsics.
9428 (_mm512_conj_pch): New intrinsic.
9429 (_mm512_mask_conj_pch): Ditto.
9430 (_mm512_maskz_conj_pch): Ditto.
9431 * config/i386/avx512fp16vlintrin.h: Add new intrinsics.
9432 (_mm256_conj_pch): New intrinsic.
9433 (_mm256_mask_conj_pch): Ditto.
9434 (_mm256_maskz_conj_pch): Ditto.
9435 (_mm_conj_pch): Ditto.
9436 (_mm_mask_conj_pch): Ditto.
9437 (_mm_maskz_conj_pch): Ditto.
9439 2021-09-22 dianhong xu <dianhong.xu@intel.com>
9441 * config/i386/avx512fp16intrin.h (_MM512_REDUCE_OP): New macro
9442 (_mm512_reduce_add_ph): New intrinsic.
9443 (_mm512_reduce_mul_ph): Ditto.
9444 (_mm512_reduce_min_ph): Ditto.
9445 (_mm512_reduce_max_ph): Ditto.
9446 * config/i386/avx512fp16vlintrin.h
9447 (_MM256_REDUCE_OP/_MM_REDUCE_OP): New macro.
9448 (_mm256_reduce_add_ph): New intrinsic.
9449 (_mm256_reduce_mul_ph): Ditto.
9450 (_mm256_reduce_min_ph): Ditto.
9451 (_mm256_reduce_max_ph): Ditto.
9452 (_mm_reduce_add_ph): Ditto.
9453 (_mm_reduce_mul_ph): Ditto.
9454 (_mm_reduce_min_ph): Ditto.
9455 (_mm_reduce_max_ph): Ditto.
9457 2021-09-22 dianhong xu <dianhong.xu@intel.com>
9459 * config/i386/avx512fp16intrin.h (__m512h_u, __m256h_u,
9460 __m128h_u): New typedef.
9461 (_mm512_load_ph): New intrinsic.
9462 (_mm256_load_ph): Ditto.
9463 (_mm_load_ph): Ditto.
9464 (_mm512_loadu_ph): Ditto.
9465 (_mm256_loadu_ph): Ditto.
9466 (_mm_loadu_ph): Ditto.
9467 (_mm512_store_ph): Ditto.
9468 (_mm256_store_ph): Ditto.
9469 (_mm_store_ph): Ditto.
9470 (_mm512_storeu_ph): Ditto.
9471 (_mm256_storeu_ph): Ditto.
9472 (_mm_storeu_ph): Ditto.
9473 (_mm512_abs_ph): Ditto.
9474 * config/i386/avx512fp16vlintrin.h
9475 (_mm_abs_ph): Ditto.
9476 (_mm256_abs_ph): Ditto.
9478 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
9480 * config/s390/tpf.md (prologue_tpf, epilogue_tpf): Add cc clobber.
9482 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
9485 * config/s390/s390.c (s390_expand_insv): Emit a normal move if it
9486 is actually a full copy of the source operand into the target.
9487 Don't emit a strict low part move if source and target mode match.
9489 2021-09-22 Jakub Jelinek <jakub@redhat.com>
9491 PR middle-end/102415
9492 * omp-expand.c (expand_omp_single): If region->exit is NULL,
9493 assert region->entry is GIMPLE_OMP_SCOPE region and return.
9495 2021-09-22 Jakub Jelinek <jakub@redhat.com>
9497 * tree.h (OMP_CLAUSE_ALLOCATE_ALIGN): Define.
9498 * tree.c (omp_clause_num_ops): Change number of OMP_CLAUSE_ALLOCATE
9499 arguments from 2 to 3.
9500 * tree-pretty-print.c (dump_omp_clause): Print allocator() around
9501 allocate clause allocator and print align if present.
9502 * omp-low.c (scan_sharing_clauses): Force allocate_map entry even
9503 for omp_default_mem_alloc if align modifier is present. If align
9504 modifier is present, use TREE_LIST to encode both allocator and
9506 (lower_private_allocate, lower_rec_input_clauses, create_task_copyfn):
9507 Handle align modifier on allocator clause if present.
9509 2021-09-22 liuhongt <hongtao.liu@intel.com>
9511 * config/i386/i386.md (define_attr "isa"): Add
9513 (define_attr "enabled"): Correspond fma_or_avx512vl to
9514 TARGET_FMA || TARGET_AVX512VL.
9515 * config/i386/mmx.md (fmav2sf4): Extend to AVX512 fma.
9520 2021-09-22 liuhongt <hongtao.liu@intel.com>
9522 * config/i386/i386.md (cstorehf3): New define_expand.
9524 2021-09-22 liuhongt <hongtao.liu@intel.com>
9526 * config/i386/i386.md (<rounding_insn>hf2): New expander.
9527 (sse4_1_round<mode>2): Extend from MODEF to MODEFH.
9528 * config/i386/sse.md (*sse4_1_round<ssescalarmodesuffix>):
9529 Extend from VF_128 to VFH_128.
9531 2021-09-22 liuhongt <hongtao.liu@intel.com>
9533 * config/i386/i386-features.c (i386-features.c): Handle
9535 * config/i386/i386.md (sqrthf2): New expander.
9536 (*sqrthf2): New define_insn.
9537 * config/i386/sse.md
9538 (*<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
9541 2021-09-22 liuhongt <hongtao.liu@intel.com>
9543 * config/i386/avx512fp16intrin.h (_mm_mask_fcmadd_sch):
9545 (_mm_mask3_fcmadd_sch): Likewise.
9546 (_mm_maskz_fcmadd_sch): Likewise.
9547 (_mm_fcmadd_sch): Likewise.
9548 (_mm_mask_fmadd_sch): Likewise.
9549 (_mm_mask3_fmadd_sch): Likewise.
9550 (_mm_maskz_fmadd_sch): Likewise.
9551 (_mm_fmadd_sch): Likewise.
9552 (_mm_mask_fcmadd_round_sch): Likewise.
9553 (_mm_mask3_fcmadd_round_sch): Likewise.
9554 (_mm_maskz_fcmadd_round_sch): Likewise.
9555 (_mm_fcmadd_round_sch): Likewise.
9556 (_mm_mask_fmadd_round_sch): Likewise.
9557 (_mm_mask3_fmadd_round_sch): Likewise.
9558 (_mm_maskz_fmadd_round_sch): Likewise.
9559 (_mm_fmadd_round_sch): Likewise.
9560 (_mm_fcmul_sch): Likewise.
9561 (_mm_mask_fcmul_sch): Likewise.
9562 (_mm_maskz_fcmul_sch): Likewise.
9563 (_mm_fmul_sch): Likewise.
9564 (_mm_mask_fmul_sch): Likewise.
9565 (_mm_maskz_fmul_sch): Likewise.
9566 (_mm_fcmul_round_sch): Likewise.
9567 (_mm_mask_fcmul_round_sch): Likewise.
9568 (_mm_maskz_fcmul_round_sch): Likewise.
9569 (_mm_fmul_round_sch): Likewise.
9570 (_mm_mask_fmul_round_sch): Likewise.
9571 (_mm_maskz_fmul_round_sch): Likewise.
9572 * config/i386/i386-builtin.def: Add corresponding new builtins.
9573 * config/i386/sse.md
9574 (avx512fp16_fmaddcsh_v8hf_maskz<round_expand_name>): New expander.
9575 (avx512fp16_fcmaddcsh_v8hf_maskz<round_expand_name>): Ditto.
9576 (avx512fp16_fma_<complexopname>sh_v8hf<mask_scalarcz_name><round_scalarcz_name>):
9578 (avx512fp16_<complexopname>sh_v8hf_mask<round_name>): Ditto.
9579 (avx512fp16_<complexopname>sh_v8hf<mask_scalarc_name><round_scalarcz_name>):
9581 * config/i386/subst.md (mask_scalarcz_name): New.
9582 (mask_scalarc_name): Ditto.
9583 (mask_scalarc_operand3): Ditto.
9584 (mask_scalarcz_operand4): Ditto.
9585 (round_scalarcz_name): Ditto.
9586 (round_scalarc_mask_operand3): Ditto.
9587 (round_scalarcz_mask_operand4): Ditto.
9588 (round_scalarc_mask_op3): Ditto.
9589 (round_scalarcz_mask_op4): Ditto.
9590 (round_scalarcz_constraint): Ditto.
9591 (round_scalarcz_nimm_predicate): Ditto.
9592 (mask_scalarcz): Ditto.
9593 (mask_scalarc): Ditto.
9594 (round_scalarcz): Ditto.
9596 2021-09-22 liuhongt <hongtao.liu@intel.com>
9598 * config/i386/avx512fp16intrin.h (_mm512_fcmadd_pch):
9600 (_mm512_mask_fcmadd_pch): Likewise.
9601 (_mm512_mask3_fcmadd_pch): Likewise.
9602 (_mm512_maskz_fcmadd_pch): Likewise.
9603 (_mm512_fmadd_pch): Likewise.
9604 (_mm512_mask_fmadd_pch): Likewise.
9605 (_mm512_mask3_fmadd_pch): Likewise.
9606 (_mm512_maskz_fmadd_pch): Likewise.
9607 (_mm512_fcmadd_round_pch): Likewise.
9608 (_mm512_mask_fcmadd_round_pch): Likewise.
9609 (_mm512_mask3_fcmadd_round_pch): Likewise.
9610 (_mm512_maskz_fcmadd_round_pch): Likewise.
9611 (_mm512_fmadd_round_pch): Likewise.
9612 (_mm512_mask_fmadd_round_pch): Likewise.
9613 (_mm512_mask3_fmadd_round_pch): Likewise.
9614 (_mm512_maskz_fmadd_round_pch): Likewise.
9615 (_mm512_fcmul_pch): Likewise.
9616 (_mm512_mask_fcmul_pch): Likewise.
9617 (_mm512_maskz_fcmul_pch): Likewise.
9618 (_mm512_fmul_pch): Likewise.
9619 (_mm512_mask_fmul_pch): Likewise.
9620 (_mm512_maskz_fmul_pch): Likewise.
9621 (_mm512_fcmul_round_pch): Likewise.
9622 (_mm512_mask_fcmul_round_pch): Likewise.
9623 (_mm512_maskz_fcmul_round_pch): Likewise.
9624 (_mm512_fmul_round_pch): Likewise.
9625 (_mm512_mask_fmul_round_pch): Likewise.
9626 (_mm512_maskz_fmul_round_pch): Likewise.
9627 * config/i386/avx512fp16vlintrin.h (_mm_fmadd_pch):
9629 (_mm_mask_fmadd_pch): Likewise.
9630 (_mm_mask3_fmadd_pch): Likewise.
9631 (_mm_maskz_fmadd_pch): Likewise.
9632 (_mm256_fmadd_pch): Likewise.
9633 (_mm256_mask_fmadd_pch): Likewise.
9634 (_mm256_mask3_fmadd_pch): Likewise.
9635 (_mm256_maskz_fmadd_pch): Likewise.
9636 (_mm_fcmadd_pch): Likewise.
9637 (_mm_mask_fcmadd_pch): Likewise.
9638 (_mm_mask3_fcmadd_pch): Likewise.
9639 (_mm_maskz_fcmadd_pch): Likewise.
9640 (_mm256_fcmadd_pch): Likewise.
9641 (_mm256_mask_fcmadd_pch): Likewise.
9642 (_mm256_mask3_fcmadd_pch): Likewise.
9643 (_mm256_maskz_fcmadd_pch): Likewise.
9644 (_mm_fmul_pch): Likewise.
9645 (_mm_mask_fmul_pch): Likewise.
9646 (_mm_maskz_fmul_pch): Likewise.
9647 (_mm256_fmul_pch): Likewise.
9648 (_mm256_mask_fmul_pch): Likewise.
9649 (_mm256_maskz_fmul_pch): Likewise.
9650 (_mm_fcmul_pch): Likewise.
9651 (_mm_mask_fcmul_pch): Likewise.
9652 (_mm_maskz_fcmul_pch): Likewise.
9653 (_mm256_fcmul_pch): Likewise.
9654 (_mm256_mask_fcmul_pch): Likewise.
9655 (_mm256_maskz_fcmul_pch): Likewise.
9656 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF_V8HF_V8HF,
9657 V8HF_FTYPE_V16HF_V16HF_V16HF, V16HF_FTYPE_V16HF_V16HF_V16HF_UQI,
9658 V32HF_FTYPE_V32HF_V32HF_V32HF_INT,
9659 V32HF_FTYPE_V32HF_V32HF_V32HF_UHI_INT): Add new builtin types.
9660 * config/i386/i386-builtin.def: Add new builtins.
9661 * config/i386/i386-expand.c: Handle new builtin types.
9662 * config/i386/subst.md (SUBST_CV): New.
9663 (maskc_name): Ditto.
9664 (maskc_operand3): Ditto.
9666 (sdc_maskz_name): Ditto.
9667 (sdc_mask_op4): Ditto.
9668 (sdc_mask_op5): Ditto.
9669 (sdc_mask_mode512bit_condition): Ditto.
9671 (round_maskc_operand3): Ditto.
9672 (round_sdc_mask_operand4): Ditto.
9673 (round_maskc_op3): Ditto.
9674 (round_sdc_mask_op4): Ditto.
9675 (round_saeonly_sdc_mask_operand5): Ditto.
9676 * config/i386/sse.md (unspec): Add complex fma unspecs.
9677 (avx512fmaskcmode): New.
9678 (UNSPEC_COMPLEX_F_C_MA): Ditto.
9679 (UNSPEC_COMPLEX_F_C_MUL): Ditto.
9680 (complexopname): Ditto.
9681 (<avx512>_fmaddc_<mode>_maskz<round_expand_name>): New expander.
9682 (<avx512>_fcmaddc_<mode>_maskz<round_expand_name>): Ditto.
9683 (fma_<complexopname>_<mode><sdc_maskz_name><round_name>): New
9685 (<avx512>_<complexopname>_<mode>_mask<round_name>): Ditto.
9686 (<avx512>_<complexopname>_<mode><maskc_name><round_name>): Ditto.
9688 2021-09-22 Kewen Lin <linkw@linux.ibm.com>
9690 * config/rs6000/rs6000.opt (rs6000-density-pct-threshold,
9691 rs6000-density-size-threshold, rs6000-density-penalty,
9692 rs6000-density-load-pct-threshold,
9693 rs6000-density-load-num-threshold): New parameter.
9694 * config/rs6000/rs6000.c (rs6000_density_test): Adjust with
9695 corresponding parameters.
9697 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9699 * gimple-range-path.cc (path_range_query::defined_outside_path):
9701 (path_range_query::range_on_path_entry): New.
9702 (path_range_query::internal_range_of_expr): Resolve unknowns
9704 (path_range_query::improve_range_with_equivs): New.
9705 (path_range_query::ssa_range_in_phi): Resolve unknowns with
9707 * gimple-range-path.h (class path_range_query): Add
9708 defined_outside_path, range_on_path_entry, and
9709 improve_range_with_equivs.
9711 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9713 * gimple-range-path.cc (path_range_query::add_to_imports): New.
9714 (path_range_query::add_copies_to_imports): New.
9715 (path_range_query::precompute_ranges): Call
9716 add_copies_to_imports.
9717 * gimple-range-path.h (class path_range_query): Add prototypes
9718 for add_copies_to_imports and add_to_imports.
9720 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9722 * gimple-range-path.cc (path_range_query::range_defined_in_block):
9723 Remove useless code.
9725 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9727 * gimple-range-fold.h (class fur_source): Make oracle protected.
9728 * gimple-range-path.cc (path_range_query::path_range_query): Add
9729 resolve argument. Initialize oracle.
9730 (path_range_query::~path_range_query): Delete oracle.
9731 (path_range_query::range_of_stmt): Adapt to use relations.
9732 (path_range_query::precompute_ranges): Pre-compute relations.
9733 (class jt_fur_source): New
9734 (jt_fur_source::jt_fur_source): New.
9735 (jt_fur_source::register_relation): New.
9736 (jt_fur_source::query_relation): New.
9737 (path_range_query::precompute_relations): New.
9738 (path_range_query::precompute_phi_relations): New.
9739 * gimple-range-path.h (path_range_query): Add resolve argument.
9740 Add oracle, precompute_relations, precompute_phi_relations.
9741 * tree-ssa-threadbackward.c (back_threader::back_threader): Pass
9742 resolve argument to solver.
9744 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9746 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
9747 Rename postfold_gcond_edges to register_outgoing_edges and
9749 (fold_using_range::postfold_gcond_edges): Rename...
9750 (fur_source::register_outgoing_edges): ...to this.
9751 * gimple-range-fold.h (postfold_gcond_edges): Rename to
9752 register_outgoing_edges and move to fur_source.
9754 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9756 * gimple-range-fold.cc (fold_using_range::range_of_phi): Check
9757 dom_info_available_p.
9759 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9761 * gimple-range-cache.cc (non_null_ref::non_null_ref): Use create
9762 and quick_grow_cleared instead of safe_grow_cleared.
9764 2021-09-21 Thomas Schwinge <thomas@codesourcery.com>
9767 * omp-oacc-neuter-broadcast.cc (oacc_do_neutering): Evaluate
9770 2021-09-21 Richard Earnshaw <rearnsha@arm.com>
9772 * configure.ac: Detect when the assembler supports new-style
9773 architecture extensions.
9774 * common/config/arm/arm-common.c (arm_rewrite_mcpu): Return
9775 the full CPU string if the assembler can grok it.
9776 (arm_rewrite_march): Likewise but for the architecture.
9777 * config.in: Regenerate.
9778 * configure: Regenerate.
9780 2021-09-21 Richard Biener <rguenther@suse.de>
9782 PR tree-optimization/102421
9783 * tree-vect-loop.c (vect_dissolve_slp_only_groups): Copy and
9784 adjust alignment info.
9786 2021-09-21 Kewen Lin <linkw@linux.ibm.com>
9788 * ipa-fnsummary.c (ipa_fn_summary_write): Remove inconsistent
9789 bitfield stream out.
9791 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
9793 * gimple-range-fold.cc (fold_using_range::range_of_phi): Ignore
9794 undefined edges, apply an equivalence if appropriate.
9795 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Return
9796 UNDEFINED if EDGE_EXECUTABLE is not set.
9797 * gimple-range.cc (gimple_ranger::gimple_ranger): Set all edges
9798 as EXECUTABLE upon startup.
9799 (gimple_ranger::range_on_edge): Return UNDEFINED for edges without
9800 EDGE_EXECUTABLE set.
9801 * vr-values.c (set_and_propagate_unexecutable): New.
9802 (simplify_using_ranges::fold_cond): Call set_and_propagate.
9803 (simplify_using_ranges::simplify_switch_using_ranges): Ditto.
9804 * vr-values.h: Add prototype.
9806 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
9808 * value-relation.cc (equiv_oracle::register_initial_def): New.
9809 (equiv_oracle::register_relation): Call register_initial_def.
9810 (equiv_oracle::add_equiv_to_block): New. Split register_relation.
9811 (relation_oracle::register_stmt): Check def block of PHI arguments.
9812 * value-relation.h (equiv_oracle): Add new prototypes.
9814 2021-09-20 Matthias Kretz <m.kretz@gsi.de>
9816 * cppbuiltin.c (define_builtin_macros_for_compilation_flags):
9817 Define __RECIPROCAL_MATH__, __NO_SIGNED_ZEROS__,
9818 __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__, and
9819 __ROUNDING_MATH__ according to their corresponding flags.
9820 * doc/cpp.texi: Document __RECIPROCAL_MATH__,
9821 __NO_SIGNED_ZEROS__, __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__,
9822 and __ROUNDING_MATH__.
9824 2021-09-20 Richard Biener <rguenther@suse.de>
9826 * tree-vect-stmts.c (vectorizable_load): Use the vectype
9829 2021-09-20 Richard Biener <rguenther@suse.de>
9831 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
9832 Do not compute alignment of the vectorized access here.
9834 2021-09-20 Richard Biener <rguenther@suse.de>
9836 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
9837 Store -1 for runtime alias peeling iterations.
9839 2021-09-20 Richard Biener <rguenther@suse.de>
9841 * config.gcc: Obsolete hppa[12]*-*-hpux10* and hppa[12]*-*-hpux11*.
9843 2021-09-20 Thomas Schwinge <thomas@codesourcery.com>
9845 * input.c (string_concat_db::record_string_concatenation)
9846 (string_concat_db::get_string_concatenation): Skip for
9847 'RESERVED_LOCATION_P'.
9849 2021-09-20 Richard Biener <rguenther@suse.de>
9851 PR tree-optimization/65206
9852 * tree-data-ref.h (struct data_reference): Add alt_indices,
9854 * tree-data-ref.c (free_data_ref): Release alt_indices.
9855 (dr_analyze_indices): Work on struct indices and get DR_REF as tree.
9856 (create_data_ref): Adjust.
9857 (initialize_data_dependence_relation): Split into head
9858 and tail. When the base objects fail to match up try
9859 again with pointer-based analysis of indices.
9860 * tree-vectorizer.c (vec_info_shared::check_datarefs): Do
9861 not compare the lazily computed alternate set of indices.
9863 2021-09-20 Iain Sandoe <iain@sandoe.co.uk>
9865 * gcc.c: Test for execute OK when we find the
9866 programs for assembler linker and dsymutil and those
9867 were specified at configure-time.
9869 2021-09-19 Martin Sebor <msebor@redhat.com>
9871 PR middle-end/102403
9872 * gimple-predicate-analysis.cc (predicate::init_from_control_deps):
9873 Correct a function pre/postcondition.
9875 2021-09-19 Martin Sebor <msebor@redhat.com>
9877 PR middle-end/102243
9878 * tree-ssa-strlen.c (get_range): Handle null cfun.
9880 2021-09-19 Iain Sandoe <iain@sandoe.co.uk>
9882 * config/darwin.h (LINK_COMMAND_SPEC_A): Use Darwin10
9883 unwinder shim as a convenience library.
9885 2021-09-19 Andrew Pinski <apinski@marvell.com>
9887 * doc/install.texi: Add note about
9888 binutils 2.35 is required for LTO usage.
9890 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
9892 * tree-ssa-threadbackward.c
9893 (back_threader_registry::register_path): Use push_edge.
9894 * tree-ssa-threadedge.c
9895 (jump_threader::thread_around_empty_blocks): Same.
9896 (jump_threader::thread_through_normal_block): Same.
9897 (jump_threader::thread_across_edge): Same. Also, use auto_bitmap.
9899 * tree-ssa-threadupdate.c
9900 (jt_path_registry::allocate_thread_edge): Remove.
9901 (jt_path_registry::push_edge): New.
9902 (dump_jump_thread_path): Make static.
9903 * tree-ssa-threadupdate.h (allocate_thread_edge): Remove.
9906 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
9908 * gimple-range-path.cc (path_range_query::path_range_query): Add
9910 (path_range_query::dump): Remove extern declaration of dump_ranger.
9911 * gimple-range-trace.cc (dump_ranger): Add DEBUG_FUNCTION marker.
9912 * gimple-range-trace.h (dump_ranger): Add prototype.
9914 2021-09-19 John Ericson <git@JohnEricson.me>
9916 * gcc.c (find_a_program): New function, factored out of...
9917 (find_a_file): Here.
9918 (execute): Use find_a_program when looking for programs rather
9921 2021-09-19 Matwey V. Kornilov <matwey.kornilov@gmail.com>
9923 * config/avr/avr-mcus.def: Add atmega324pb.
9924 * doc/avr-mmcu.texi: Corresponding changes.
9926 2021-09-19 Roger Sayle <roger@nextmovesoftware.com>
9929 * match.pd (cmp @0 REAL_CST@1): When @0 is also REAL_CST, apply
9930 the same transformations as to @1. For comparisons against NaN,
9931 don't check HONOR_SNANS but confirm that neither operand is a
9934 2021-09-19 Benjamin Peterson <benjamin@locrian.net>
9936 * attribs.c (make_unique_name): Delete.
9937 * attribs.h (make_unique_name): Delete.
9939 2021-09-19 Andrew Pinski <apinski@marvell.com>
9941 * lra-constraints.c (check_and_process_move): Assert
9942 that dclass and sclass are greater than or equal to NO_REGS.
9944 2021-09-18 Jakub Jelinek <jakub@redhat.com>
9946 * tree.h (OMP_CLAUSE_ORDER_UNCONSTRAINED): Define.
9947 * tree-pretty-print.c (dump_omp_clause): Print unconstrained:
9948 for OMP_CLAUSE_ORDER_UNCONSTRAINED.
9950 2021-09-18 liuhongt <hongtao.liu@intel.com>
9952 * config/i386/i386-features.c (remove_partial_avx_dependency):
9953 Restrict TARGET_USE_VECTOR_FP_CONVERTS and
9954 TARGET_USE_VECTOR_CONVERTS to conversion instructions only.
9956 2021-09-18 Jakub Jelinek <jakub@redhat.com>
9958 * gimplify.c (omp_default_clause): For C/C++ default({,first}private),
9959 if file/namespace scope variable doesn't have predetermined sharing,
9960 treat it as if there was default(none).
9962 2021-09-18 liuhongt <hongtao.liu@intel.com>
9964 * config/i386/avx512fp16intrin.h (_mm_fmadd_sh):
9966 (_mm_mask_fmadd_sh): Likewise.
9967 (_mm_mask3_fmadd_sh): Likewise.
9968 (_mm_maskz_fmadd_sh): Likewise.
9969 (_mm_fmadd_round_sh): Likewise.
9970 (_mm_mask_fmadd_round_sh): Likewise.
9971 (_mm_mask3_fmadd_round_sh): Likewise.
9972 (_mm_maskz_fmadd_round_sh): Likewise.
9973 (_mm_fnmadd_sh): Likewise.
9974 (_mm_mask_fnmadd_sh): Likewise.
9975 (_mm_mask3_fnmadd_sh): Likewise.
9976 (_mm_maskz_fnmadd_sh): Likewise.
9977 (_mm_fnmadd_round_sh): Likewise.
9978 (_mm_mask_fnmadd_round_sh): Likewise.
9979 (_mm_mask3_fnmadd_round_sh): Likewise.
9980 (_mm_maskz_fnmadd_round_sh): Likewise.
9981 (_mm_fmsub_sh): Likewise.
9982 (_mm_mask_fmsub_sh): Likewise.
9983 (_mm_mask3_fmsub_sh): Likewise.
9984 (_mm_maskz_fmsub_sh): Likewise.
9985 (_mm_fmsub_round_sh): Likewise.
9986 (_mm_mask_fmsub_round_sh): Likewise.
9987 (_mm_mask3_fmsub_round_sh): Likewise.
9988 (_mm_maskz_fmsub_round_sh): Likewise.
9989 (_mm_fnmsub_sh): Likewise.
9990 (_mm_mask_fnmsub_sh): Likewise.
9991 (_mm_mask3_fnmsub_sh): Likewise.
9992 (_mm_maskz_fnmsub_sh): Likewise.
9993 (_mm_fnmsub_round_sh): Likewise.
9994 (_mm_mask_fnmsub_round_sh): Likewise.
9995 (_mm_mask3_fnmsub_round_sh): Likewise.
9996 (_mm_maskz_fnmsub_round_sh): Likewise.
9997 * config/i386/i386-builtin-types.def
9998 (V8HF_FTYPE_V8HF_V8HF_V8HF_UQI_INT): New builtin type.
9999 * config/i386/i386-builtin.def: Add new builtins.
10000 * config/i386/i386-expand.c: Handle new builtin type.
10001 * config/i386/sse.md (fmai_vmfmadd_<mode><round_name>):
10002 Ajdust to support FP16.
10003 (fmai_vmfmsub_<mode><round_name>): Ditto.
10004 (fmai_vmfnmadd_<mode><round_name>): Ditto.
10005 (fmai_vmfnmsub_<mode><round_name>): Ditto.
10006 (*fmai_fmadd_<mode>): Ditto.
10007 (*fmai_fmsub_<mode>): Ditto.
10008 (*fmai_fnmadd_<mode><round_name>): Ditto.
10009 (*fmai_fnmsub_<mode><round_name>): Ditto.
10010 (avx512f_vmfmadd_<mode>_mask<round_name>): Ditto.
10011 (avx512f_vmfmadd_<mode>_mask3<round_name>): Ditto.
10012 (avx512f_vmfmadd_<mode>_maskz<round_expand_name>): Ditto.
10013 (avx512f_vmfmadd_<mode>_maskz_1<round_name>): Ditto.
10014 (*avx512f_vmfmsub_<mode>_mask<round_name>): Ditto.
10015 (avx512f_vmfmsub_<mode>_mask3<round_name>): Ditto.
10016 (*avx512f_vmfmsub_<mode>_maskz_1<round_name>): Ditto.
10017 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
10018 (*avx512f_vmfnmsub_<mode>_mask3<round_name>): Ditto.
10019 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
10020 (*avx512f_vmfnmadd_<mode>_mask<round_name>): Renamed to ...
10021 (avx512f_vmfnmadd_<mode>_mask<round_name>) ... this, and
10022 adjust to support FP16.
10023 (avx512f_vmfnmadd_<mode>_mask3<round_name>): Ditto.
10024 (avx512f_vmfnmadd_<mode>_maskz_1<round_name>): Ditto.
10025 (avx512f_vmfnmadd_<mode>_maskz<round_expand_name>): New
10028 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
10030 * config/i386/sse.md (avx512fmaskmodelower): Extend to support
10032 (maskload<mode><avx512fmaskmodelower>): Ditto.
10033 (maskstore<mode><avx512fmaskmodelower>): Ditto.
10035 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
10037 * config/i386/i386-expand.c (ix86_expand_fp_absneg_operator):
10039 (ix86_expand_copysign): Ditto.
10040 (ix86_expand_xorsign): Ditto.
10041 * config/i386/i386.c (ix86_build_const_vector): Handle HF vector
10043 (ix86_build_signbit_mask): Ditto.
10044 (ix86_can_change_mode_class): Ditto.
10045 * config/i386/i386.md
10046 (SSEMODEF): Add HFmode.
10047 (ssevecmodef): Ditto.
10048 (<code>hf2): New define_expand.
10049 (*<code>hf2_1): New define_insn_and_split.
10050 (copysign<mode>): Extend to support HFmode under AVX512FP16.
10051 (xorsign<mode>): Ditto.
10052 * config/i386/sse.md (VFB): New mode iterator.
10053 (VFB_128_256): Ditto.
10055 (sseintvecmode2): Support HF vector mode.
10056 (<code><mode>2): Use new mode iterator.
10057 (*<code><mode>2): Ditto.
10058 (copysign<mode>3): Ditto.
10059 (xorsign<mode>3): Ditto.
10060 (<code><mode>3<mask_name>): Ditto.
10061 (<code><mode>3<mask_name>): Ditto.
10062 (<sse>_andnot<mode>3<mask_name>): Adjust for HF vector mode.
10063 (<sse>_andnot<mode>3<mask_name>): Ditto.
10064 (*<code><mode>3<mask_name>): Ditto.
10065 (*<code><mode>3<mask_name>): Ditto.
10067 2021-09-18 liuhongt <hongtao.liu@intel.com>
10069 * config/i386/avx512fp16intrin.h (_mm512_mask_fmadd_ph):
10071 (_mm512_mask3_fmadd_ph): Likewise.
10072 (_mm512_maskz_fmadd_ph): Likewise.
10073 (_mm512_fmadd_round_ph): Likewise.
10074 (_mm512_mask_fmadd_round_ph): Likewise.
10075 (_mm512_mask3_fmadd_round_ph): Likewise.
10076 (_mm512_maskz_fmadd_round_ph): Likewise.
10077 (_mm512_fnmadd_ph): Likewise.
10078 (_mm512_mask_fnmadd_ph): Likewise.
10079 (_mm512_mask3_fnmadd_ph): Likewise.
10080 (_mm512_maskz_fnmadd_ph): Likewise.
10081 (_mm512_fnmadd_round_ph): Likewise.
10082 (_mm512_mask_fnmadd_round_ph): Likewise.
10083 (_mm512_mask3_fnmadd_round_ph): Likewise.
10084 (_mm512_maskz_fnmadd_round_ph): Likewise.
10085 (_mm512_fmsub_ph): Likewise.
10086 (_mm512_mask_fmsub_ph): Likewise.
10087 (_mm512_mask3_fmsub_ph): Likewise.
10088 (_mm512_maskz_fmsub_ph): Likewise.
10089 (_mm512_fmsub_round_ph): Likewise.
10090 (_mm512_mask_fmsub_round_ph): Likewise.
10091 (_mm512_mask3_fmsub_round_ph): Likewise.
10092 (_mm512_maskz_fmsub_round_ph): Likewise.
10093 (_mm512_fnmsub_ph): Likewise.
10094 (_mm512_mask_fnmsub_ph): Likewise.
10095 (_mm512_mask3_fnmsub_ph): Likewise.
10096 (_mm512_maskz_fnmsub_ph): Likewise.
10097 (_mm512_fnmsub_round_ph): Likewise.
10098 (_mm512_mask_fnmsub_round_ph): Likewise.
10099 (_mm512_mask3_fnmsub_round_ph): Likewise.
10100 (_mm512_maskz_fnmsub_round_ph): Likewise.
10101 * config/i386/avx512fp16vlintrin.h (_mm256_fmadd_ph):
10103 (_mm256_mask_fmadd_ph): Likewise.
10104 (_mm256_mask3_fmadd_ph): Likewise.
10105 (_mm256_maskz_fmadd_ph): Likewise.
10106 (_mm_fmadd_ph): Likewise.
10107 (_mm_mask_fmadd_ph): Likewise.
10108 (_mm_mask3_fmadd_ph): Likewise.
10109 (_mm_maskz_fmadd_ph): Likewise.
10110 (_mm256_fnmadd_ph): Likewise.
10111 (_mm256_mask_fnmadd_ph): Likewise.
10112 (_mm256_mask3_fnmadd_ph): Likewise.
10113 (_mm256_maskz_fnmadd_ph): Likewise.
10114 (_mm_fnmadd_ph): Likewise.
10115 (_mm_mask_fnmadd_ph): Likewise.
10116 (_mm_mask3_fnmadd_ph): Likewise.
10117 (_mm_maskz_fnmadd_ph): Likewise.
10118 (_mm256_fmsub_ph): Likewise.
10119 (_mm256_mask_fmsub_ph): Likewise.
10120 (_mm256_mask3_fmsub_ph): Likewise.
10121 (_mm256_maskz_fmsub_ph): Likewise.
10122 (_mm_fmsub_ph): Likewise.
10123 (_mm_mask_fmsub_ph): Likewise.
10124 (_mm_mask3_fmsub_ph): Likewise.
10125 (_mm_maskz_fmsub_ph): Likewise.
10126 (_mm256_fnmsub_ph): Likewise.
10127 (_mm256_mask_fnmsub_ph): Likewise.
10128 (_mm256_mask3_fnmsub_ph): Likewise.
10129 (_mm256_maskz_fnmsub_ph): Likewise.
10130 (_mm_fnmsub_ph): Likewise.
10131 (_mm_mask_fnmsub_ph): Likewise.
10132 (_mm_mask3_fnmsub_ph): Likewise.
10133 (_mm_maskz_fnmsub_ph): Likewise.
10134 * config/i386/i386-builtin.def: Add corresponding new builtins.
10135 * config/i386/sse.md
10136 (<avx512>_fmadd_<mode>_maskz<round_expand_name>): Adjust to
10137 support HF vector modes.
10138 (<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name><round_name>):
10140 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
10141 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
10142 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
10143 (<avx512>_fmadd_<mode>_mask<round_name>): Ditto.
10144 (<avx512>_fmadd_<mode>_mask3<round_name>): Ditto.
10145 (<avx512>_fmsub_<mode>_maskz<round_expand_name>): Ditto.
10146 (<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name><round_name>):
10148 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
10149 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
10150 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
10151 (<avx512>_fmsub_<mode>_mask<round_name>): Ditto.
10152 (<avx512>_fmsub_<mode>_mask3<round_name>): Ditto.
10153 (<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name><round_name>):
10155 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
10156 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
10157 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
10158 (<avx512>_fnmadd_<mode>_mask<round_name>): Ditto.
10159 (<avx512>_fnmadd_<mode>_mask3<round_name>): Ditto.
10160 (<avx512>_fnmsub_<mode>_maskz<round_expand_name>): Ditto.
10161 (<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name><round_name>):
10163 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
10164 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
10165 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
10166 (<avx512>_fnmsub_<mode>_mask<round_name>): Ditto.
10167 (<avx512>_fnmsub_<mode>_mask3<round_name>): Ditto.
10169 2021-09-18 liuhongt <hongtao.liu@intel.com>
10171 * config/i386/avx512fp16intrin.h (_mm512_fmaddsub_ph):
10173 (_mm512_mask_fmaddsub_ph): Likewise.
10174 (_mm512_mask3_fmaddsub_ph): Likewise.
10175 (_mm512_maskz_fmaddsub_ph): Likewise.
10176 (_mm512_fmaddsub_round_ph): Likewise.
10177 (_mm512_mask_fmaddsub_round_ph): Likewise.
10178 (_mm512_mask3_fmaddsub_round_ph): Likewise.
10179 (_mm512_maskz_fmaddsub_round_ph): Likewise.
10180 (_mm512_mask_fmsubadd_ph): Likewise.
10181 (_mm512_mask3_fmsubadd_ph): Likewise.
10182 (_mm512_maskz_fmsubadd_ph): Likewise.
10183 (_mm512_fmsubadd_round_ph): Likewise.
10184 (_mm512_mask_fmsubadd_round_ph): Likewise.
10185 (_mm512_mask3_fmsubadd_round_ph): Likewise.
10186 (_mm512_maskz_fmsubadd_round_ph): Likewise.
10187 * config/i386/avx512fp16vlintrin.h (_mm256_fmaddsub_ph):
10189 (_mm256_mask_fmaddsub_ph): Likewise.
10190 (_mm256_mask3_fmaddsub_ph): Likewise.
10191 (_mm256_maskz_fmaddsub_ph): Likewise.
10192 (_mm_fmaddsub_ph): Likewise.
10193 (_mm_mask_fmaddsub_ph): Likewise.
10194 (_mm_mask3_fmaddsub_ph): Likewise.
10195 (_mm_maskz_fmaddsub_ph): Likewise.
10196 (_mm256_fmsubadd_ph): Likewise.
10197 (_mm256_mask_fmsubadd_ph): Likewise.
10198 (_mm256_mask3_fmsubadd_ph): Likewise.
10199 (_mm256_maskz_fmsubadd_ph): Likewise.
10200 (_mm_fmsubadd_ph): Likewise.
10201 (_mm_mask_fmsubadd_ph): Likewise.
10202 (_mm_mask3_fmsubadd_ph): Likewise.
10203 (_mm_maskz_fmsubadd_ph): Likewise.
10204 * config/i386/i386-builtin.def: Add corresponding new builtins.
10205 * config/i386/sse.md (VFH_SF_AVX512VL): New mode iterator.
10206 * (<avx512>_fmsubadd_<mode>_maskz<round_expand_name>): New expander.
10207 * (<avx512>_fmaddsub_<mode>_maskz<round_expand_name>): Use
10209 * (<sd_mask_codefor>fma_fmaddsub_<mode><sd_maskz_name><round_name>):
10211 * (<avx512>_fmaddsub_<mode>_mask<round_name>): Ditto.
10212 * (<avx512>_fmaddsub_<mode>_mask3<round_name>): Ditto.
10213 * (<sd_mask_codefor>fma_fmsubadd_<mode><sd_maskz_name><round_name>):
10215 * (<avx512>_fmsubadd_<mode>_mask<round_name>): Ditto.
10216 * (<avx512>_fmsubadd_<mode>_mask3<round_name>): Ditto.
10218 2021-09-18 liuhongt <hongtao.liu@intel.com>
10221 * config/i386/i386.c (ix86_print_operand): Handle
10222 V8HF/V16HF/V32HFmode.
10223 * config/i386/i386.h (VALID_BCST_MODE_P): Add HFmode.
10224 * config/i386/sse.md (avx512bcst): Remove.
10226 2021-09-17 Martin Sebor <msebor@redhat.com>
10228 * Makefile.in (OBJS): Add gimple-predicate-analysis.o.
10229 * tree-ssa-uninit.c (max_phi_args): Move to gimple-predicate-analysis.
10230 (MASK_SET_BIT, MASK_TEST_BIT, MASK_EMPTY): Same.
10231 (check_defs): Add comment.
10232 (can_skip_redundant_opnd): Update comment.
10233 (compute_uninit_opnds_pos): Adjust to namespace change.
10234 (find_pdom): Move to gimple-predicate-analysis.cc.
10236 (struct uninit_undef_val_t): New.
10237 (is_non_loop_exit_postdominating): Move to gimple-predicate-analysis.cc.
10238 (find_control_equiv_block): Same.
10239 (MAX_NUM_CHAINS, MAX_CHAIN_LEN, MAX_POSTDOM_CHECK): Same.
10240 (MAX_SWITCH_CASES): Same.
10241 (compute_control_dep_chain): Same.
10242 (find_uninit_use): Use predicate analyzer.
10243 (struct pred_info): Move to gimple-predicate-analysis.
10244 (convert_control_dep_chain_into_preds): Same.
10245 (find_predicates): Same.
10246 (collect_phi_def_edges): Same.
10247 (warn_uninitialized_phi): Use predicate analyzer.
10248 (find_def_preds): Move to gimple-predicate-analysis.
10249 (dump_pred_info): Same.
10250 (dump_pred_chain): Same.
10251 (dump_predicates): Same.
10252 (destroy_predicate_vecs): Remove.
10253 (execute_late_warn_uninitialized): New.
10254 (get_cmp_code): Move to gimple-predicate-analysis.
10255 (is_value_included_in): Same.
10256 (value_sat_pred_p): Same.
10257 (find_matching_predicate_in_rest_chains): Same.
10258 (is_use_properly_guarded): Same.
10259 (prune_uninit_phi_opnds): Same.
10260 (find_var_cmp_const): Same.
10261 (use_pred_not_overlap_with_undef_path_pred): Same.
10262 (pred_equal_p): Same.
10263 (is_neq_relop_p): Same.
10264 (is_neq_zero_form_p): Same.
10265 (pred_expr_equal_p): Same.
10266 (is_pred_expr_subset_of): Same.
10267 (is_pred_chain_subset_of): Same.
10268 (is_included_in): Same.
10269 (is_superset_of): Same.
10270 (pred_neg_p): Same.
10271 (simplify_pred): Same.
10272 (simplify_preds_2): Same.
10273 (simplify_preds_3): Same.
10274 (simplify_preds_4): Same.
10275 (simplify_preds): Same.
10277 (push_to_worklist): Same.
10278 (get_pred_info_from_cmp): Same.
10279 (is_degenerated_phi): Same.
10280 (normalize_one_pred_1): Same.
10281 (normalize_one_pred): Same.
10282 (normalize_one_pred_chain): Same.
10283 (normalize_preds): Same.
10284 (can_one_predicate_be_invalidated_p): Same.
10285 (can_chain_union_be_invalidated_p): Same.
10286 (uninit_uses_cannot_happen): Same.
10287 (pass_late_warn_uninitialized::execute): Define.
10288 * gimple-predicate-analysis.cc: New file.
10289 * gimple-predicate-analysis.h: New file.
10291 2021-09-17 Julian Brown <julian@codesourcery.com>
10293 * config/gcn/gcn.c (gimple.h): Include.
10294 (gcn_fork_join): Emit barrier for worker-level joins.
10295 * omp-oacc-neuter-broadcast.cc (find_local_vars_to_propagate): Add
10296 writes_gang_private bitmap parameter. Set bit for blocks
10297 containing gang-private variable writes.
10298 (worker_single_simple): Don't emit barrier after predicated block.
10299 (worker_single_copy): Don't emit barrier if we're not broadcasting
10300 anything and the block contains no gang-private writes.
10301 (neuter_worker_single): Don't predicate blocks that only contain
10302 NOPs or internal marker functions. Pass has_gang_private_write
10303 argument to worker_single_copy.
10304 (oacc_do_neutering): Add writes_gang_private bitmap handling.
10306 2021-09-17 Julian Brown <julian@codesourcery.com>
10308 * config/gcn/gcn-protos.h
10309 (gcn_goacc_create_worker_broadcast_record): Update prototype.
10310 * config/gcn/gcn-tree.c (gcn_goacc_get_worker_red_decl): Use
10311 preallocated block of LDS memory. Do not cache/share decls for
10312 reduction temporaries between invocations.
10313 (gcn_goacc_reduction_teardown): Unshare VAR on second use.
10314 (gcn_goacc_create_worker_broadcast_record): Add OFFSET parameter
10315 and return temporary LDS space at that offset. Return pointer in
10317 * config/gcn/gcn.c (acc_lds_size, gang_private_hwm, lds_allocs):
10319 (ACC_LDS_SIZE): Define as acc_lds_size.
10320 (gcn_init_machine_status): Don't initialise lds_allocated,
10321 lds_allocs, reduc_decls fields of machine function struct.
10322 (gcn_option_override): Handle default size for gang-private
10323 variables and -mgang-private-size option.
10324 (gcn_expand_prologue): Use LDS_SIZE instead of LDS_SIZE-1 when
10325 initialising M0_REG.
10326 (gcn_shared_mem_layout): New function.
10327 (gcn_print_lds_decl): Update comment. Use global lds_allocs map and
10328 gang_private_hwm variable.
10329 (TARGET_GOACC_SHARED_MEM_LAYOUT): Define target hook.
10330 * config/gcn/gcn.h (machine_function): Remove lds_allocated,
10331 lds_allocs, reduc_decls. Add reduction_base, reduction_limit.
10332 * config/gcn/gcn.opt (gang_private_size_opt): New global.
10333 (mgang-private-size=): New option.
10334 * doc/tm.texi.in (TARGET_GOACC_SHARED_MEM_LAYOUT): Place
10335 documentation hook.
10336 * doc/tm.texi: Regenerate.
10337 * omp-oacc-neuter-broadcast.cc (targhooks.h, diagnostic-core.h):
10339 (build_sender_ref): Handle sender_decl being pointer.
10340 (worker_single_copy): Add PLACEMENT and ISOLATE_BROADCASTS
10341 parameters. Pass placement argument to
10342 create_worker_broadcast_record hook invocations. Handle
10343 sender_decl being pointer and isolate_broadcasts inserting extra
10345 (blk_offset_map_t): Add typedef.
10346 (neuter_worker_single): Add BLK_OFFSET_MAP parameter. Pass
10347 preallocated range to worker_single_copy call.
10348 (dfs_broadcast_reachable_1): New function.
10349 (idx_decl_pair_t, used_range_vec_t): New typedefs.
10350 (sort_size_descending): New function.
10351 (addr_range): New class.
10352 (splay_tree_compare_addr_range, splay_tree_free_key)
10353 (first_fit_range, merge_ranges_1, merge_ranges): New functions.
10354 (execute_omp_oacc_neuter_broadcast): Rename to...
10355 (oacc_do_neutering): ... this. Add BOUNDS_LO, BOUNDS_HI
10356 parameters. Arrange layout of shared memory for broadcast
10358 (execute_omp_oacc_neuter_broadcast): New function.
10359 (pass_omp_oacc_neuter_broadcast::gate): Remove num_workers==1
10360 handling from here. Enable pass for all OpenACC routines in order
10361 to call shared memory-layout hook.
10362 * target.def (create_worker_broadcast_record): Add OFFSET
10364 (shared_mem_layout): New hook.
10366 2021-09-17 Julian Brown <julian@codesourcery.com>
10367 Thomas Schwinge <thomas@codesourcery.com>
10369 * omp-oacc-neuter-broadcast.cc
10370 (pass_omp_oacc_neuter_broadcast::gate): Disable if num_workers is
10372 (execute_omp_oacc_neuter_broadcast): Adjust.
10374 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
10376 * value-relation.cc (class equiv_chain): Move to header file.
10377 (path_oracle::path_oracle): New.
10378 (path_oracle::~path_oracle): New.
10379 (path_oracle::register_relation): New.
10380 (path_oracle::query_relation): New.
10381 (path_oracle::reset_path): New.
10382 (path_oracle::dump): New.
10383 * value-relation.h (class equiv_chain): Move to here.
10384 (class path_oracle): New.
10386 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
10388 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a DOM
10390 * gimple-range-fold.cc (fur_depend::register_relation): Use
10391 register_stmt/edge routines.
10392 * value-relation.cc (equiv_chain::find): Relocate from equiv_oracle.
10393 (equiv_oracle::equiv_oracle): Create self equivalence cache.
10394 (equiv_oracle::~equiv_oracle): Release same.
10395 (equiv_oracle::equiv_set): Return entry from self equiv cache if there
10396 are no equivalences.
10397 (equiv_oracle::find_equiv_block): Move list find to equiv_chain.
10398 (equiv_oracle::register_relation): Rename from register_equiv.
10399 (relation_chain_head::find_relation): Relocate from dom_oracle.
10400 (relation_oracle::register_stmt): New.
10401 (relation_oracle::register_edge): New.
10402 (dom_oracle::*): Rename from relation_oracle.
10403 (dom_oracle::register_relation): Adjust to call equiv_oracle.
10404 (dom_oracle::set_one_relation): Split from register_relation.
10405 (dom_oracle::register_transitives): Consolidate 2 methods.
10406 (dom_oracle::find_relation_block): Move core to relation_chain.
10407 (dom_oracle::query_relation): Rename from find_relation_dom and adjust.
10408 * value-relation.h (class relation_oracle): New pure virtual base.
10409 (class equiv_oracle): Inherit from relation_oracle and adjust.
10410 (class dom_oracle): Rename from old relation_oracle and adjust.
10412 2021-09-17 Martin Sebor <msebor@redhat.com>
10414 PR middle-end/102200
10415 * pointer-query.cc (access_ref::inform_access): Handle MIN/MAX_EXPR.
10416 (handle_min_max_size): Change argument. Store original SSA_NAME for
10417 operands to potentially distinct (sub)objects.
10418 (compute_objsize_r): Adjust call to the above.
10420 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
10422 * config/rs6000/rs6000.c (rs6000-builtins.h): New include.
10423 (rs6000_new_builtin_vectorized_function): New function.
10424 (rs6000_new_builtin_md_vectorized_function): Likewise.
10425 (rs6000_builtin_vectorized_function): Call
10426 rs6000_new_builtin_vectorized_function.
10427 (rs6000_builtin_md_vectorized_function): Call
10428 rs6000_new_builtin_md_vectorized_function.
10430 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
10432 * config/rs6000/rs6000-builtin-new.def (ASSEMBLE_ACC): Add mmaint flag.
10433 (ASSEMBLE_PAIR): Likewise.
10434 (BUILD_ACC): Likewise.
10435 (DISASSEMBLE_ACC): Likewise.
10436 (DISASSEMBLE_PAIR): Likewise.
10437 (PMXVBF16GER2): Likewise.
10438 (PMXVBF16GER2NN): Likewise.
10439 (PMXVBF16GER2NP): Likewise.
10440 (PMXVBF16GER2PN): Likewise.
10441 (PMXVBF16GER2PP): Likewise.
10442 (PMXVF16GER2): Likewise.
10443 (PMXVF16GER2NN): Likewise.
10444 (PMXVF16GER2NP): Likewise.
10445 (PMXVF16GER2PN): Likewise.
10446 (PMXVF16GER2PP): Likewise.
10447 (PMXVF32GER): Likewise.
10448 (PMXVF32GERNN): Likewise.
10449 (PMXVF32GERNP): Likewise.
10450 (PMXVF32GERPN): Likewise.
10451 (PMXVF32GERPP): Likewise.
10452 (PMXVF64GER): Likewise.
10453 (PMXVF64GERNN): Likewise.
10454 (PMXVF64GERNP): Likewise.
10455 (PMXVF64GERPN): Likewise.
10456 (PMXVF64GERPP): Likewise.
10457 (PMXVI16GER2): Likewise.
10458 (PMXVI16GER2PP): Likewise.
10459 (PMXVI16GER2S): Likewise.
10460 (PMXVI16GER2SPP): Likewise.
10461 (PMXVI4GER8): Likewise.
10462 (PMXVI4GER8PP): Likewise.
10463 (PMXVI8GER4): Likewise.
10464 (PMXVI8GER4PP): Likewise.
10465 (PMXVI8GER4SPP): Likewise.
10466 (XVBF16GER2): Likewise.
10467 (XVBF16GER2NN): Likewise.
10468 (XVBF16GER2NP): Likewise.
10469 (XVBF16GER2PN): Likewise.
10470 (XVBF16GER2PP): Likewise.
10471 (XVF16GER2): Likewise.
10472 (XVF16GER2NN): Likewise.
10473 (XVF16GER2NP): Likewise.
10474 (XVF16GER2PN): Likewise.
10475 (XVF16GER2PP): Likewise.
10476 (XVF32GER): Likewise.
10477 (XVF32GERNN): Likewise.
10478 (XVF32GERNP): Likewise.
10479 (XVF32GERPN): Likewise.
10480 (XVF32GERPP): Likewise.
10481 (XVF64GER): Likewise.
10482 (XVF64GERNN): Likewise.
10483 (XVF64GERNP): Likewise.
10484 (XVF64GERPN): Likewise.
10485 (XVF64GERPP): Likewise.
10486 (XVI16GER2): Likewise.
10487 (XVI16GER2PP): Likewise.
10488 (XVI16GER2S): Likewise.
10489 (XVI16GER2SPP): Likewise.
10490 (XVI4GER8): Likewise.
10491 (XVI4GER8PP): Likewise.
10492 (XVI8GER4): Likewise.
10493 (XVI8GER4PP): Likewise.
10494 (XVI8GER4SPP): Likewise.
10495 (XXMFACC): Likewise.
10496 (XXMTACC): Likewise.
10497 (XXSETACCZ): Likewise.
10498 (ASSEMBLE_PAIR_V): Likewise.
10499 (BUILD_PAIR): Likewise.
10500 (DISASSEMBLE_PAIR_V): Likewise.
10503 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_mma_builtin):
10504 Handle RS6000_BIF_LXVP and RS6000_BIF_STXVP.
10505 * config/rs6000/rs6000-gen-builtins.c (attrinfo): Add ismmaint.
10506 (parse_bif_attrs): Handle ismmaint.
10507 (write_decls): Add bif_mmaint_bit and bif_is_mmaint.
10508 (write_bif_static_init): Handle ismmaint.
10510 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
10512 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin): New
10514 (rs6000_gimple_fold_builtin): Call rs6000_gimple_fold_new_builtin.
10515 (rs6000_new_builtin_valid_without_lhs): New function.
10516 (rs6000_gimple_fold_new_mma_builtin): Likewise.
10517 (rs6000_gimple_fold_new_builtin): Likewise.
10519 2021-09-17 Thomas Schwinge <thomas@codesourcery.com>
10521 * hash-table.h (hash_table<Descriptor, Lazy, Allocator>::expand):
10522 Destruct stale Value objects.
10523 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
10526 2021-09-17 Roger Sayle <roger@nextmovesoftware.com>
10529 * match.pd (shift optimizations): Disable recent sign-changing
10530 optimization for shifts by zero, these will be folded later.
10532 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
10534 * config/rs6000/rs6000-builtin-new.def (__builtin_mffsl): Move from
10535 [power9] to [always].
10537 2021-09-17 Richard Biener <rguenther@suse.de>
10539 * tree-vect-stmts.c (vectorizable_load): Do not frob
10542 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
10544 * config/i386/i386-features.c (remove_partial_avx_dependency):
10545 Also check TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY and
10546 and TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY before generating
10548 * config/i386/i386.h (TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY):
10550 (TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
10551 * config/i386/i386.md (SSE FP to FP splitters): Replace
10552 TARGET_SSE_PARTIAL_REG_DEPENDENCY with
10553 TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY.
10554 (SSE INT to FP splitter): Replace TARGET_SSE_PARTIAL_REG_DEPENDENCY
10555 with TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY.
10556 * config/i386/x86-tune.def
10557 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): New.
10558 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
10560 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
10563 * config/i386/i386-features.c (remove_partial_avx_dependency):
10564 Check TARGET_USE_VECTOR_FP_CONVERTS and TARGET_USE_VECTOR_CONVERTS
10565 before generating vxorps.
10567 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
10569 * config/i386/i386-options.c (processor_cost_table): Use
10570 tremont_cost for Tremont.
10571 * config/i386/x86-tune-costs.h (tremont_memcpy): New.
10572 (tremont_memset): Likewise.
10573 (tremont_cost): Likewise.
10574 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
10575 Enable for Tremont.
10577 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
10579 * common/config/i386/i386-common.c: Use Haswell scheduling model
10581 * config/i386/i386.c (ix86_sched_init_global): Prepare for Tremont
10583 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Tremont
10585 (ix86_adjust_cost): Handle Tremont.
10586 * config/i386/x86-tune.def (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY):
10587 Enable for Tremont.
10588 (X86_TUNE_USE_LEAVE): Likewise.
10589 (X86_TUNE_PUSH_MEMORY): Likewise.
10590 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
10591 (X86_TUNE_USE_CLTD): Likewise.
10592 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
10593 (X86_TUNE_AVOID_MFENCE): Likewise.
10594 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
10595 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
10596 (X86_TUNE_ACCUMULATE_OUTGOING_ARGS): Disable for Tremont.
10597 (X86_TUNE_FOUR_JUMP_LIMIT): Likewise.
10598 (X86_TUNE_OPT_AGU): Likewise.
10599 (X86_TUNE_AVOID_LEA_FOR_ADDR): Likewise.
10600 (X86_TUNE_AVOID_MEM_OPND_FOR_CMOVE): Likewise.
10601 (X86_TUNE_EXPAND_ABS): Likewise.
10602 (X86_TUNE_SPLIT_MEM_OPND_FOR_FP_CONVERTS): Likewise.
10603 (X86_TUNE_SLOW_PSHUFB): Likewise.
10605 2021-09-17 Eric Botcazou <ebotcazou@adacore.com>
10607 PR rtl-optimization/102306
10608 * combine.c (try_combine): Abort the combination if we are about to
10609 duplicate volatile references.
10611 2021-09-17 liuhongt <hongtao.liu@intel.com>
10613 * config/i386/avx512fp16intrin.h (_mm_undefined_ph):
10615 (_mm256_undefined_ph): Likewise.
10616 (_mm512_undefined_ph): Likewise.
10617 (_mm_cvtsh_h): Likewise.
10618 (_mm256_cvtsh_h): Likewise.
10619 (_mm512_cvtsh_h): Likewise.
10620 (_mm512_castph_ps): Likewise.
10621 (_mm512_castph_pd): Likewise.
10622 (_mm512_castph_si512): Likewise.
10623 (_mm512_castph512_ph128): Likewise.
10624 (_mm512_castph512_ph256): Likewise.
10625 (_mm512_castph128_ph512): Likewise.
10626 (_mm512_castph256_ph512): Likewise.
10627 (_mm512_zextph128_ph512): Likewise.
10628 (_mm512_zextph256_ph512): Likewise.
10629 (_mm512_castps_ph): Likewise.
10630 (_mm512_castpd_ph): Likewise.
10631 (_mm512_castsi512_ph): Likewise.
10632 * config/i386/avx512fp16vlintrin.h (_mm_castph_ps):
10634 (_mm256_castph_ps): Likewise.
10635 (_mm_castph_pd): Likewise.
10636 (_mm256_castph_pd): Likewise.
10637 (_mm_castph_si128): Likewise.
10638 (_mm256_castph_si256): Likewise.
10639 (_mm_castps_ph): Likewise.
10640 (_mm256_castps_ph): Likewise.
10641 (_mm_castpd_ph): Likewise.
10642 (_mm256_castpd_ph): Likewise.
10643 (_mm_castsi128_ph): Likewise.
10644 (_mm256_castsi256_ph): Likewise.
10645 (_mm256_castph256_ph128): Likewise.
10646 (_mm256_castph128_ph256): Likewise.
10647 (_mm256_zextph128_ph256): Likewise.
10649 2021-09-17 liuhongt <hongtao.liu@intel.com>
10651 * config/i386/avx512fp16intrin.h (_mm_cvtsh_ss):
10653 (_mm_mask_cvtsh_ss): Likewise.
10654 (_mm_maskz_cvtsh_ss): Likewise.
10655 (_mm_cvtsh_sd): Likewise.
10656 (_mm_mask_cvtsh_sd): Likewise.
10657 (_mm_maskz_cvtsh_sd): Likewise.
10658 (_mm_cvt_roundsh_ss): Likewise.
10659 (_mm_mask_cvt_roundsh_ss): Likewise.
10660 (_mm_maskz_cvt_roundsh_ss): Likewise.
10661 (_mm_cvt_roundsh_sd): Likewise.
10662 (_mm_mask_cvt_roundsh_sd): Likewise.
10663 (_mm_maskz_cvt_roundsh_sd): Likewise.
10664 (_mm_cvtss_sh): Likewise.
10665 (_mm_mask_cvtss_sh): Likewise.
10666 (_mm_maskz_cvtss_sh): Likewise.
10667 (_mm_cvtsd_sh): Likewise.
10668 (_mm_mask_cvtsd_sh): Likewise.
10669 (_mm_maskz_cvtsd_sh): Likewise.
10670 (_mm_cvt_roundss_sh): Likewise.
10671 (_mm_mask_cvt_roundss_sh): Likewise.
10672 (_mm_maskz_cvt_roundss_sh): Likewise.
10673 (_mm_cvt_roundsd_sh): Likewise.
10674 (_mm_mask_cvt_roundsd_sh): Likewise.
10675 (_mm_maskz_cvt_roundsd_sh): Likewise.
10676 * config/i386/i386-builtin-types.def
10677 (V8HF_FTYPE_V2DF_V8HF_V8HF_UQI_INT,
10678 V8HF_FTYPE_V4SF_V8HF_V8HF_UQI_INT,
10679 V2DF_FTYPE_V8HF_V2DF_V2DF_UQI_INT,
10680 V4SF_FTYPE_V8HF_V4SF_V4SF_UQI_INT): Add new builtin types.
10681 * config/i386/i386-builtin.def: Add corrresponding new builtins.
10682 * config/i386/i386-expand.c: Handle new builtin types.
10683 * config/i386/sse.md (VF48_128): New mode iterator.
10684 (avx512fp16_vcvtsh2<ssescalarmodesuffix><mask_scalar_name><round_saeonly_scalar_name>):
10686 (avx512fp16_vcvt<ssescalarmodesuffix>2sh<mask_scalar_name><round_scalar_name>):
10689 2021-09-17 liuhongt <hongtao.liu@intel.com>
10691 * config/i386/avx512fp16intrin.h (_mm512_cvtph_pd):
10693 (_mm512_mask_cvtph_pd): Likewise.
10694 (_mm512_maskz_cvtph_pd): Likewise.
10695 (_mm512_cvt_roundph_pd): Likewise.
10696 (_mm512_mask_cvt_roundph_pd): Likewise.
10697 (_mm512_maskz_cvt_roundph_pd): Likewise.
10698 (_mm512_cvtxph_ps): Likewise.
10699 (_mm512_mask_cvtxph_ps): Likewise.
10700 (_mm512_maskz_cvtxph_ps): Likewise.
10701 (_mm512_cvtx_roundph_ps): Likewise.
10702 (_mm512_mask_cvtx_roundph_ps): Likewise.
10703 (_mm512_maskz_cvtx_roundph_ps): Likewise.
10704 (_mm512_cvtxps_ph): Likewise.
10705 (_mm512_mask_cvtxps_ph): Likewise.
10706 (_mm512_maskz_cvtxps_ph): Likewise.
10707 (_mm512_cvtx_roundps_ph): Likewise.
10708 (_mm512_mask_cvtx_roundps_ph): Likewise.
10709 (_mm512_maskz_cvtx_roundps_ph): Likewise.
10710 (_mm512_cvtpd_ph): Likewise.
10711 (_mm512_mask_cvtpd_ph): Likewise.
10712 (_mm512_maskz_cvtpd_ph): Likewise.
10713 (_mm512_cvt_roundpd_ph): Likewise.
10714 (_mm512_mask_cvt_roundpd_ph): Likewise.
10715 (_mm512_maskz_cvt_roundpd_ph): Likewise.
10716 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_pd):
10718 (_mm_mask_cvtph_pd): Likewise.
10719 (_mm_maskz_cvtph_pd): Likewise.
10720 (_mm256_cvtph_pd): Likewise.
10721 (_mm256_mask_cvtph_pd): Likewise.
10722 (_mm256_maskz_cvtph_pd): Likewise.
10723 (_mm_cvtxph_ps): Likewise.
10724 (_mm_mask_cvtxph_ps): Likewise.
10725 (_mm_maskz_cvtxph_ps): Likewise.
10726 (_mm256_cvtxph_ps): Likewise.
10727 (_mm256_mask_cvtxph_ps): Likewise.
10728 (_mm256_maskz_cvtxph_ps): Likewise.
10729 (_mm_cvtxps_ph): Likewise.
10730 (_mm_mask_cvtxps_ph): Likewise.
10731 (_mm_maskz_cvtxps_ph): Likewise.
10732 (_mm256_cvtxps_ph): Likewise.
10733 (_mm256_mask_cvtxps_ph): Likewise.
10734 (_mm256_maskz_cvtxps_ph): Likewise.
10735 (_mm_cvtpd_ph): Likewise.
10736 (_mm_mask_cvtpd_ph): Likewise.
10737 (_mm_maskz_cvtpd_ph): Likewise.
10738 (_mm256_cvtpd_ph): Likewise.
10739 (_mm256_mask_cvtpd_ph): Likewise.
10740 (_mm256_maskz_cvtpd_ph): Likewise.
10741 * config/i386/i386-builtin.def: Add corresponding new builtins.
10742 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10743 * config/i386/i386-expand.c: Handle new builtin types.
10744 * config/i386/sse.md
10745 (VF4_128_8_256): New.
10746 (VF48H_AVX512VL): Ditto.
10747 (ssePHmode): Add HF vector modes.
10748 (castmode): Add new convertable modes.
10749 (qq2phsuff): Ditto.
10750 (ph2pssuffix): New.
10751 (avx512fp16_vcvt<castmode>2ph_<mode><mask_name><round_name>): Ditto.
10752 (avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
10753 (*avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
10754 (avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
10755 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
10756 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask_1): Ditto.
10757 (avx512fp16_float_extend_ph<mode>2<mask_name><round_saeonly_name>):
10759 (avx512fp16_float_extend_ph<mode>2<mask_name>): Ditto.
10760 (*avx512fp16_float_extend_ph<mode>2_load<mask_name>): Ditto.
10761 (avx512fp16_float_extend_phv2df2<mask_name>): Ditto.
10762 (*avx512fp16_float_extend_phv2df2_load<mask_name>): Ditto.
10764 2021-09-17 liuhongt <hongtao.liu@intel.com>
10766 * config/i386/avx512fp16intrin.h (_mm_cvttsh_i32):
10768 (_mm_cvttsh_u32): Likewise.
10769 (_mm_cvtt_roundsh_i32): Likewise.
10770 (_mm_cvtt_roundsh_u32): Likewise.
10771 (_mm_cvttsh_i64): Likewise.
10772 (_mm_cvttsh_u64): Likewise.
10773 (_mm_cvtt_roundsh_i64): Likewise.
10774 (_mm_cvtt_roundsh_u64): Likewise.
10775 * config/i386/i386-builtin.def: Add corresponding new builtins.
10776 * config/i386/sse.md
10777 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<round_saeonly_name>):
10780 2021-09-17 liuhongt <hongtao.liu@intel.com>
10782 * config/i386/avx512fp16intrin.h (_mm512_cvttph_epi32):
10784 (_mm512_mask_cvttph_epi32): Likewise.
10785 (_mm512_maskz_cvttph_epi32): Likewise.
10786 (_mm512_cvtt_roundph_epi32): Likewise.
10787 (_mm512_mask_cvtt_roundph_epi32): Likewise.
10788 (_mm512_maskz_cvtt_roundph_epi32): Likewise.
10789 (_mm512_cvttph_epu32): Likewise.
10790 (_mm512_mask_cvttph_epu32): Likewise.
10791 (_mm512_maskz_cvttph_epu32): Likewise.
10792 (_mm512_cvtt_roundph_epu32): Likewise.
10793 (_mm512_mask_cvtt_roundph_epu32): Likewise.
10794 (_mm512_maskz_cvtt_roundph_epu32): Likewise.
10795 (_mm512_cvttph_epi64): Likewise.
10796 (_mm512_mask_cvttph_epi64): Likewise.
10797 (_mm512_maskz_cvttph_epi64): Likewise.
10798 (_mm512_cvtt_roundph_epi64): Likewise.
10799 (_mm512_mask_cvtt_roundph_epi64): Likewise.
10800 (_mm512_maskz_cvtt_roundph_epi64): Likewise.
10801 (_mm512_cvttph_epu64): Likewise.
10802 (_mm512_mask_cvttph_epu64): Likewise.
10803 (_mm512_maskz_cvttph_epu64): Likewise.
10804 (_mm512_cvtt_roundph_epu64): Likewise.
10805 (_mm512_mask_cvtt_roundph_epu64): Likewise.
10806 (_mm512_maskz_cvtt_roundph_epu64): Likewise.
10807 (_mm512_cvttph_epi16): Likewise.
10808 (_mm512_mask_cvttph_epi16): Likewise.
10809 (_mm512_maskz_cvttph_epi16): Likewise.
10810 (_mm512_cvtt_roundph_epi16): Likewise.
10811 (_mm512_mask_cvtt_roundph_epi16): Likewise.
10812 (_mm512_maskz_cvtt_roundph_epi16): Likewise.
10813 (_mm512_cvttph_epu16): Likewise.
10814 (_mm512_mask_cvttph_epu16): Likewise.
10815 (_mm512_maskz_cvttph_epu16): Likewise.
10816 (_mm512_cvtt_roundph_epu16): Likewise.
10817 (_mm512_mask_cvtt_roundph_epu16): Likewise.
10818 (_mm512_maskz_cvtt_roundph_epu16): Likewise.
10819 * config/i386/avx512fp16vlintrin.h (_mm_cvttph_epi32):
10821 (_mm_mask_cvttph_epi32): Likewise.
10822 (_mm_maskz_cvttph_epi32): Likewise.
10823 (_mm256_cvttph_epi32): Likewise.
10824 (_mm256_mask_cvttph_epi32): Likewise.
10825 (_mm256_maskz_cvttph_epi32): Likewise.
10826 (_mm_cvttph_epu32): Likewise.
10827 (_mm_mask_cvttph_epu32): Likewise.
10828 (_mm_maskz_cvttph_epu32): Likewise.
10829 (_mm256_cvttph_epu32): Likewise.
10830 (_mm256_mask_cvttph_epu32): Likewise.
10831 (_mm256_maskz_cvttph_epu32): Likewise.
10832 (_mm_cvttph_epi64): Likewise.
10833 (_mm_mask_cvttph_epi64): Likewise.
10834 (_mm_maskz_cvttph_epi64): Likewise.
10835 (_mm256_cvttph_epi64): Likewise.
10836 (_mm256_mask_cvttph_epi64): Likewise.
10837 (_mm256_maskz_cvttph_epi64): Likewise.
10838 (_mm_cvttph_epu64): Likewise.
10839 (_mm_mask_cvttph_epu64): Likewise.
10840 (_mm_maskz_cvttph_epu64): Likewise.
10841 (_mm256_cvttph_epu64): Likewise.
10842 (_mm256_mask_cvttph_epu64): Likewise.
10843 (_mm256_maskz_cvttph_epu64): Likewise.
10844 (_mm_cvttph_epi16): Likewise.
10845 (_mm_mask_cvttph_epi16): Likewise.
10846 (_mm_maskz_cvttph_epi16): Likewise.
10847 (_mm256_cvttph_epi16): Likewise.
10848 (_mm256_mask_cvttph_epi16): Likewise.
10849 (_mm256_maskz_cvttph_epi16): Likewise.
10850 (_mm_cvttph_epu16): Likewise.
10851 (_mm_mask_cvttph_epu16): Likewise.
10852 (_mm_maskz_cvttph_epu16): Likewise.
10853 (_mm256_cvttph_epu16): Likewise.
10854 (_mm256_mask_cvttph_epu16): Likewise.
10855 (_mm256_maskz_cvttph_epu16): Likewise.
10856 * config/i386/i386-builtin.def: Add new builtins.
10857 * config/i386/sse.md
10858 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name><round_saeonly_name>):
10860 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name>): Ditto.
10861 (*avx512fp16_fix<fixunssuffix>_trunc<mode>2_load<mask_name>): Ditto.
10862 (avx512fp16_fix<fixunssuffix>_truncv2di2<mask_name>): Ditto.
10863 (avx512fp16_fix<fixunssuffix>_truncv2di2_load<mask_name>): Ditto.
10865 2021-09-17 liuhongt <hongtao.liu@intel.com>
10867 * config/i386/avx512fp16intrin.h (_mm_cvtsh_i32): New intrinsic.
10868 (_mm_cvtsh_u32): Likewise.
10869 (_mm_cvt_roundsh_i32): Likewise.
10870 (_mm_cvt_roundsh_u32): Likewise.
10871 (_mm_cvtsh_i64): Likewise.
10872 (_mm_cvtsh_u64): Likewise.
10873 (_mm_cvt_roundsh_i64): Likewise.
10874 (_mm_cvt_roundsh_u64): Likewise.
10875 (_mm_cvti32_sh): Likewise.
10876 (_mm_cvtu32_sh): Likewise.
10877 (_mm_cvt_roundi32_sh): Likewise.
10878 (_mm_cvt_roundu32_sh): Likewise.
10879 (_mm_cvti64_sh): Likewise.
10880 (_mm_cvtu64_sh): Likewise.
10881 (_mm_cvt_roundi64_sh): Likewise.
10882 (_mm_cvt_roundu64_sh): Likewise.
10883 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10884 * config/i386/i386-builtin.def: Add corresponding new builtins.
10885 * config/i386/i386-expand.c (ix86_expand_round_builtin):
10886 Handle new builtin types.
10887 * config/i386/sse.md
10888 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix><round_name>):
10890 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix>_2): Likewise.
10891 (avx512fp16_vcvt<floatsuffix>si2sh<rex64namesuffix><round_name>): Likewise.
10893 2021-09-16 Bill Schmidt <wschmidt@linux.ibm.com>
10895 * config/rs6000/rs6000-c.c (rs6000-builtins.h): New include.
10896 (altivec_resolve_new_overloaded_builtin): New forward decl.
10897 (rs6000_new_builtin_type_compatible): New function.
10898 (altivec_resolve_overloaded_builtin): Call
10899 altivec_resolve_new_overloaded_builtin.
10900 (altivec_build_new_resolved_builtin): New function.
10901 (altivec_resolve_new_overloaded_builtin): Likewise.
10902 * config/rs6000/rs6000-call.c (rs6000_new_builtin_is_supported):
10904 * config/rs6000/rs6000-gen-builtins.c (write_decls): Remove _p from
10905 name of rs6000_new_builtin_is_supported.
10907 2021-09-16 Uroš Bizjak <ubizjak@gmail.com>
10909 * config/i386/i386-protos.h (ix86_decompose_address):
10910 Change return type to bool.
10911 * config/i386/i386.c (ix86_decompose_address): Ditto.
10913 2021-09-16 Tobias Burnus <tobias@codesourcery.com>
10916 * config/rs6000/t-rs6000 (build/rs6000-gen-builtins.o, build/rbtree.o):
10917 Added 'build/' to target, use build/%.o rule.
10918 (build/rs6000-gen-builtins$(build_exeext)): Add 'build/' and
10919 '$(build_exeext)' to target and 'build/' for the *.o files.
10920 (rs6000-builtins.c): Update for those changes; run rs6000-gen-builtins
10923 2021-09-16 Martin Jambor <mjambor@suse.cz>
10925 * cgraph.c (cgraph_node::dump): Do not check caller count sums if
10926 the body has been removed. Remove trailing whitespace.
10928 2021-09-16 Richard Biener <rguenther@suse.de>
10930 PR middle-end/102360
10931 * internal-fn.c (expand_DEFERRED_INIT): Make pattern-init
10932 of non-memory more robust.
10934 2021-09-16 Daniel Cederman <cederman@gaisler.com>
10936 * config/sparc/sparc-opts.h (enum sparc_processor_type): Add LEON5
10937 * config/sparc/sparc.c (struct processor_costs): Add LEON5 costs
10938 (leon5_adjust_cost): Increase cost of store with data dependency
10939 on ALU instruction and FPU anti-dependencies.
10940 (sparc_option_override): Add LEON5 costs
10941 (sparc_adjust_cost): Add LEON5 cost adjustments
10942 * config/sparc/sparc.h: Add LEON5
10943 * config/sparc/sparc.md: Include LEON5 scheduling information
10944 * config/sparc/sparc.opt: Add LEON5
10945 * doc/invoke.texi: Add LEON5
10946 * config/sparc/leon5.md: New file.
10948 2021-09-16 Daniel Cederman <cederman@gaisler.com>
10950 * config/sparc/sparc.md (stack_protect_set32): Add NOP to prevent
10951 sensitive sequence for B2BST errata workaround.
10953 2021-09-16 Daniel Cederman <cederman@gaisler.com>
10955 * config/sparc/sparc.c (sparc_do_work_around_errata): Do not begin
10956 functions with atomic instruction in the UT700 errata workaround.
10958 2021-09-16 Daniel Cederman <cederman@gaisler.com>
10960 * config/sparc/sparc.c (next_active_non_empty_insn): New function
10961 that returns next active non empty assembly instruction.
10962 (sparc_do_work_around_errata): Use new function.
10964 2021-09-16 Daniel Cederman <cederman@gaisler.com>
10966 * config/sparc/sparc.c (store_insn_p): Add predicate for store
10968 (load_insn_p): Add predicate for load attributes.
10969 (sparc_do_work_around_errata): Use new predicates.
10971 2021-09-16 Andreas Larsson <andreas@gaisler.com>
10973 * config/sparc/sparc.c (dump_target_flag_bits): Print bit names for
10976 2021-09-16 Martin Liska <mliska@suse.cz>
10978 * config/mips/netbsd.h: Fix typo in name of a macro.
10980 2021-09-16 liuhongt <hongtao.liu@intel.com>
10982 PR middle-end/102080
10983 * match.pd: Check mask type when doing cond_op related gimple
10985 * tree.c (is_truth_type_for): New function.
10986 * tree.h (is_truth_type_for): New declaration.
10988 2021-09-16 liuhongt <hongtao.liu@intel.com>
10990 * config/i386/avx512fp16intrin.h (_mm512_cvtepi32_ph): New
10992 (_mm512_mask_cvtepi32_ph): Likewise.
10993 (_mm512_maskz_cvtepi32_ph): Likewise.
10994 (_mm512_cvt_roundepi32_ph): Likewise.
10995 (_mm512_mask_cvt_roundepi32_ph): Likewise.
10996 (_mm512_maskz_cvt_roundepi32_ph): Likewise.
10997 (_mm512_cvtepu32_ph): Likewise.
10998 (_mm512_mask_cvtepu32_ph): Likewise.
10999 (_mm512_maskz_cvtepu32_ph): Likewise.
11000 (_mm512_cvt_roundepu32_ph): Likewise.
11001 (_mm512_mask_cvt_roundepu32_ph): Likewise.
11002 (_mm512_maskz_cvt_roundepu32_ph): Likewise.
11003 (_mm512_cvtepi64_ph): Likewise.
11004 (_mm512_mask_cvtepi64_ph): Likewise.
11005 (_mm512_maskz_cvtepi64_ph): Likewise.
11006 (_mm512_cvt_roundepi64_ph): Likewise.
11007 (_mm512_mask_cvt_roundepi64_ph): Likewise.
11008 (_mm512_maskz_cvt_roundepi64_ph): Likewise.
11009 (_mm512_cvtepu64_ph): Likewise.
11010 (_mm512_mask_cvtepu64_ph): Likewise.
11011 (_mm512_maskz_cvtepu64_ph): Likewise.
11012 (_mm512_cvt_roundepu64_ph): Likewise.
11013 (_mm512_mask_cvt_roundepu64_ph): Likewise.
11014 (_mm512_maskz_cvt_roundepu64_ph): Likewise.
11015 (_mm512_cvtepi16_ph): Likewise.
11016 (_mm512_mask_cvtepi16_ph): Likewise.
11017 (_mm512_maskz_cvtepi16_ph): Likewise.
11018 (_mm512_cvt_roundepi16_ph): Likewise.
11019 (_mm512_mask_cvt_roundepi16_ph): Likewise.
11020 (_mm512_maskz_cvt_roundepi16_ph): Likewise.
11021 (_mm512_cvtepu16_ph): Likewise.
11022 (_mm512_mask_cvtepu16_ph): Likewise.
11023 (_mm512_maskz_cvtepu16_ph): Likewise.
11024 (_mm512_cvt_roundepu16_ph): Likewise.
11025 (_mm512_mask_cvt_roundepu16_ph): Likewise.
11026 (_mm512_maskz_cvt_roundepu16_ph): Likewise.
11027 * config/i386/avx512fp16vlintrin.h (_mm_cvtepi32_ph): New
11029 (_mm_mask_cvtepi32_ph): Likewise.
11030 (_mm_maskz_cvtepi32_ph): Likewise.
11031 (_mm256_cvtepi32_ph): Likewise.
11032 (_mm256_mask_cvtepi32_ph): Likewise.
11033 (_mm256_maskz_cvtepi32_ph): Likewise.
11034 (_mm_cvtepu32_ph): Likewise.
11035 (_mm_mask_cvtepu32_ph): Likewise.
11036 (_mm_maskz_cvtepu32_ph): Likewise.
11037 (_mm256_cvtepu32_ph): Likewise.
11038 (_mm256_mask_cvtepu32_ph): Likewise.
11039 (_mm256_maskz_cvtepu32_ph): Likewise.
11040 (_mm_cvtepi64_ph): Likewise.
11041 (_mm_mask_cvtepi64_ph): Likewise.
11042 (_mm_maskz_cvtepi64_ph): Likewise.
11043 (_mm256_cvtepi64_ph): Likewise.
11044 (_mm256_mask_cvtepi64_ph): Likewise.
11045 (_mm256_maskz_cvtepi64_ph): Likewise.
11046 (_mm_cvtepu64_ph): Likewise.
11047 (_mm_mask_cvtepu64_ph): Likewise.
11048 (_mm_maskz_cvtepu64_ph): Likewise.
11049 (_mm256_cvtepu64_ph): Likewise.
11050 (_mm256_mask_cvtepu64_ph): Likewise.
11051 (_mm256_maskz_cvtepu64_ph): Likewise.
11052 (_mm_cvtepi16_ph): Likewise.
11053 (_mm_mask_cvtepi16_ph): Likewise.
11054 (_mm_maskz_cvtepi16_ph): Likewise.
11055 (_mm256_cvtepi16_ph): Likewise.
11056 (_mm256_mask_cvtepi16_ph): Likewise.
11057 (_mm256_maskz_cvtepi16_ph): Likewise.
11058 (_mm_cvtepu16_ph): Likewise.
11059 (_mm_mask_cvtepu16_ph): Likewise.
11060 (_mm_maskz_cvtepu16_ph): Likewise.
11061 (_mm256_cvtepu16_ph): Likewise.
11062 (_mm256_mask_cvtepu16_ph): Likewise.
11063 (_mm256_maskz_cvtepu16_ph): Likewise.
11064 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11065 * config/i386/i386-builtin.def: Add corresponding new builtins.
11066 * config/i386/i386-expand.c
11067 (ix86_expand_args_builtin): Handle new builtin types.
11068 (ix86_expand_round_builtin): Ditto.
11069 * config/i386/i386-modes.def: Declare V2HF and V6HF.
11070 * config/i386/sse.md (VI2H_AVX512VL): New.
11071 (qq2phsuff): Ditto.
11072 (sseintvecmode): Add HF vector modes.
11073 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode><mask_name><round_name>):
11075 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
11076 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
11077 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
11078 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
11079 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask_1): Ditto.
11080 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
11081 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
11082 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
11083 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
11084 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask_1): Ditto.
11085 * config/i386/subst.md (round_qq2phsuff): New subst_attr.
11087 2021-09-16 liuhongt <hongtao.liu@intel.com>
11089 * config/i386/avx512fp16intrin.h (_mm512_cvtph_epi32):
11091 (_mm512_mask_cvtph_epi32): Likewise.
11092 (_mm512_maskz_cvtph_epi32): Likewise.
11093 (_mm512_cvt_roundph_epi32): Likewise.
11094 (_mm512_mask_cvt_roundph_epi32): Likewise.
11095 (_mm512_maskz_cvt_roundph_epi32): Likewise.
11096 (_mm512_cvtph_epu32): Likewise.
11097 (_mm512_mask_cvtph_epu32): Likewise.
11098 (_mm512_maskz_cvtph_epu32): Likewise.
11099 (_mm512_cvt_roundph_epu32): Likewise.
11100 (_mm512_mask_cvt_roundph_epu32): Likewise.
11101 (_mm512_maskz_cvt_roundph_epu32): Likewise.
11102 (_mm512_cvtph_epi64): Likewise.
11103 (_mm512_mask_cvtph_epi64): Likewise.
11104 (_mm512_maskz_cvtph_epi64): Likewise.
11105 (_mm512_cvt_roundph_epi64): Likewise.
11106 (_mm512_mask_cvt_roundph_epi64): Likewise.
11107 (_mm512_maskz_cvt_roundph_epi64): Likewise.
11108 (_mm512_cvtph_epu64): Likewise.
11109 (_mm512_mask_cvtph_epu64): Likewise.
11110 (_mm512_maskz_cvtph_epu64): Likewise.
11111 (_mm512_cvt_roundph_epu64): Likewise.
11112 (_mm512_mask_cvt_roundph_epu64): Likewise.
11113 (_mm512_maskz_cvt_roundph_epu64): Likewise.
11114 (_mm512_cvtph_epi16): Likewise.
11115 (_mm512_mask_cvtph_epi16): Likewise.
11116 (_mm512_maskz_cvtph_epi16): Likewise.
11117 (_mm512_cvt_roundph_epi16): Likewise.
11118 (_mm512_mask_cvt_roundph_epi16): Likewise.
11119 (_mm512_maskz_cvt_roundph_epi16): Likewise.
11120 (_mm512_cvtph_epu16): Likewise.
11121 (_mm512_mask_cvtph_epu16): Likewise.
11122 (_mm512_maskz_cvtph_epu16): Likewise.
11123 (_mm512_cvt_roundph_epu16): Likewise.
11124 (_mm512_mask_cvt_roundph_epu16): Likewise.
11125 (_mm512_maskz_cvt_roundph_epu16): Likewise.
11126 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_epi32):
11128 (_mm_mask_cvtph_epi32): Likewise.
11129 (_mm_maskz_cvtph_epi32): Likewise.
11130 (_mm256_cvtph_epi32): Likewise.
11131 (_mm256_mask_cvtph_epi32): Likewise.
11132 (_mm256_maskz_cvtph_epi32): Likewise.
11133 (_mm_cvtph_epu32): Likewise.
11134 (_mm_mask_cvtph_epu32): Likewise.
11135 (_mm_maskz_cvtph_epu32): Likewise.
11136 (_mm256_cvtph_epu32): Likewise.
11137 (_mm256_mask_cvtph_epu32): Likewise.
11138 (_mm256_maskz_cvtph_epu32): Likewise.
11139 (_mm_cvtph_epi64): Likewise.
11140 (_mm_mask_cvtph_epi64): Likewise.
11141 (_mm_maskz_cvtph_epi64): Likewise.
11142 (_mm256_cvtph_epi64): Likewise.
11143 (_mm256_mask_cvtph_epi64): Likewise.
11144 (_mm256_maskz_cvtph_epi64): Likewise.
11145 (_mm_cvtph_epu64): Likewise.
11146 (_mm_mask_cvtph_epu64): Likewise.
11147 (_mm_maskz_cvtph_epu64): Likewise.
11148 (_mm256_cvtph_epu64): Likewise.
11149 (_mm256_mask_cvtph_epu64): Likewise.
11150 (_mm256_maskz_cvtph_epu64): Likewise.
11151 (_mm_cvtph_epi16): Likewise.
11152 (_mm_mask_cvtph_epi16): Likewise.
11153 (_mm_maskz_cvtph_epi16): Likewise.
11154 (_mm256_cvtph_epi16): Likewise.
11155 (_mm256_mask_cvtph_epi16): Likewise.
11156 (_mm256_maskz_cvtph_epi16): Likewise.
11157 (_mm_cvtph_epu16): Likewise.
11158 (_mm_mask_cvtph_epu16): Likewise.
11159 (_mm_maskz_cvtph_epu16): Likewise.
11160 (_mm256_cvtph_epu16): Likewise.
11161 (_mm256_mask_cvtph_epu16): Likewise.
11162 (_mm256_maskz_cvtph_epu16): Likewise.
11163 * config/i386/i386-builtin-types.def: Add new builtin types.
11164 * config/i386/i386-builtin.def: Add new builtins.
11165 * config/i386/i386-expand.c
11166 (ix86_expand_args_builtin): Handle new builtin types.
11167 (ix86_expand_round_builtin): Ditto.
11168 * config/i386/sse.md (sseintconvert): New.
11169 (ssePHmode): Ditto.
11170 (UNSPEC_US_FIX_NOTRUNC): Ditto.
11171 (sseintconvertsignprefix): Ditto.
11172 (avx512fp16_vcvtph2<sseintconvertsignprefix><sseintconvert>_<mode><mask_name><round_name>):
11175 2021-09-16 liuhongt <hongtao.liu@intel.com>
11177 * config/i386/avx512fp16intrin.h: (_mm_cvtsi16_si128):
11179 (_mm_cvtsi128_si16): Likewise.
11180 (_mm_mask_load_sh): Likewise.
11181 (_mm_maskz_load_sh): Likewise.
11182 (_mm_mask_store_sh): Likewise.
11183 (_mm_move_sh): Likewise.
11184 (_mm_mask_move_sh): Likewise.
11185 (_mm_maskz_move_sh): Likewise.
11186 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11187 * config/i386/i386-builtin.def: Add corresponding new builtins.
11188 * config/i386/i386-expand.c
11189 (ix86_expand_special_args_builtin): Handle new builtin types.
11190 (ix86_expand_vector_init_one_nonzero): Adjust for FP16 target.
11191 * config/i386/sse.md (VI2F): New mode iterator.
11192 (vec_set<mode>_0): Use new mode iterator.
11193 (avx512f_mov<ssescalarmodelower>_mask): Adjust for HF vector mode.
11194 (avx512f_store<mode>_mask): Ditto.
11196 2021-09-16 Kewen Lin <linkw@linux.ibm.com>
11198 * config/rs6000/rs6000.opt (-mtoc-fusion): Remove.
11200 2021-09-15 David Edelsohn <dje.gcc@gmail.com>
11202 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
11203 Proceed if no symbol summary or the symbol alias flag is false.
11205 2021-09-15 Jakub Jelinek <jakub@redhat.com>
11209 * varasm.c (output_constructor_regular_field): Instead of assertion
11210 that array_size_for_constructor result is equal to size of
11211 TREE_TYPE (local->val) in bytes, assert that the type size is greater
11212 or equal to array_size_for_constructor result and use type size as
11215 2021-09-15 Martin Liska <mliska@suse.cz>
11218 * config/i386/vxworks.h: Use new macro TARGET_CPU_P.
11220 2021-09-15 Martin Liska <mliska@suse.cz>
11223 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
11224 Check that we have a symbol summary for a symbol.
11226 2021-09-15 Richard Biener <rguenther@suse.de>
11229 * config/rs6000/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
11230 to inherit from elfos.h
11232 2021-09-15 liuhongt <hongtao.liu@intel.com>
11235 * config/i386/i386-expand.c
11236 (ix86_expand_vector_init_interleave): Use puncklwd to pack 2
11238 (ix86_expand_vector_set): Use blendw instead of pinsrw.
11239 * config/i386/i386.c (ix86_can_change_mode_class): Adjust for
11240 AVX512FP16 which supports 16bit vector load.
11241 * config/i386/sse.md (avx512bw_interleave_highv32hi<mask_name>):
11243 (avx512bw_interleave_high<mode><mask_name>): .. this, and
11244 extend to V32HFmode.
11245 (avx2_interleave_highv16hi<mask_name>): Rename to ..
11246 (avx2_interleave_high<mode><mask_name>): .. this, and extend
11248 (vec_interleave_highv8hi<mask_name>): Rename to ..
11249 (vec_interleave_high<mode><mask_name>): .. this, and extend to V8HFmode.
11250 (<mask_codefor>avx512bw_interleave_lowv32hi<mask_name>):
11252 (<mask_codefor>avx512bw_interleave_low<mode><mask_name>):
11253 this, and extend to V32HFmode.
11254 (avx2_interleave_lowv16hi<mask_name>): Rename to ..
11255 (avx2_interleave_low<mode><mask_name>): .. this, and extend to V16HFmode.
11256 (vec_interleave_lowv8hi<mask_name>): Rename to ..
11257 (vec_interleave_low<mode><mask_name>): .. this, and extend to V8HFmode.
11258 (sse4_1_pblendw): Rename to ..
11259 (sse4_1_pblend<blendsuf>): .. this, and extend to V8HFmode.
11260 (avx2_pblendph): New define_expand.
11261 (<sse2p4_1>_pinsr<ssemodesuffix>): Refactor, use
11262 sseintmodesuffix instead of ssemodesuffix.
11263 (blendsuf): New mode attr.
11265 2021-09-15 Richard Biener <rguenther@suse.de>
11267 * tree-vectorizer.h (dr_misalignment): Move out of line.
11268 (dr_target_alignment): New.
11269 (DR_TARGET_ALIGNMENT): Wrap dr_target_alignment.
11270 (set_dr_target_alignment): New.
11271 (SET_DR_TARGET_ALIGNMENT): Wrap set_dr_target_alignment.
11272 * tree-vect-data-refs.c (dr_misalignment): Compute and
11273 return the group members misalignment.
11274 (vect_compute_data_ref_alignment): Use SET_DR_TARGET_ALIGNMENT.
11275 (vect_analyze_data_refs_alignment): Compute alignment only
11276 for the first element of a DR group.
11277 (vect_slp_analyze_node_alignment): Likewise.
11279 2021-09-15 Hongyu Wang <hongyu.wang@intel.com>
11281 * config/i386/avx512fp16intrin.h: Adjust all builtin calls.
11282 * config/i386/avx512fp16vlintrin.h: Likewise.
11283 * config/i386/i386-builtin.def: Adjust builtin name and
11284 enumeration to match AVX512F style.
11286 2021-09-15 Richard Biener <rguenther@suse.de>
11288 PR tree-optimization/102318
11289 * tree-vect-loop.c (vect_transform_cycle_phi): Revert
11290 previous change and do the mode conversion separately from
11291 the sign conversion.
11293 2021-09-15 Hongtao Liu <hongtao.liu@intel.com>
11294 Peter Cordes <peter@cordes.ca>
11297 * config/i386/sse.md (extract_suf): Add V8SF/V8SI/V4DF/V4DI.
11298 (*vec_extract<mode><ssescalarmodelower>_valign): Output
11299 vextract{i,f}{32x4,64x2} instruction when byte_offset % 16 ==
11302 2021-09-15 Richard Biener <rguenther@suse.de>
11304 * config.gcc: Remove vax-*-openbsd* configuration.
11306 2021-09-15 Richard Biener <rguenther@suse.de>
11308 * config.gcc: Remove m68k-openbsd.
11310 2021-09-15 Max Filippov <jcmvbkbc@gmail.com>
11313 * config/xtensa/t-xtensa (TM_H): Add include/xtensa-config.h.
11315 2021-09-14 Peter Bergner <bergner@linux.ibm.com>
11317 * config/rs6000/mma.md (unspec): Delete UNSPEC_MMA_XXSETACCZ.
11318 (unspecv): Add UNSPECV_MMA_XXSETACCZ.
11319 (*mma_xxsetaccz): Delete.
11320 (mma_xxsetaccz): Change to define_insn. Remove operand 1.
11321 Use UNSPECV_MMA_XXSETACCZ. Update comment.
11322 * config/rs6000/rs6000.c (rs6000_rtx_costs): Use UNSPECV_MMA_XXSETACCZ.
11324 2021-09-14 Iain Sandoe <iain@sandoe.co.uk>
11326 * Makefile.in: Remove variables related to applying no-PIE
11327 to the exes on $build.
11328 * configure: Regenerate.
11329 * configure.ac: Remove configuration related to applying
11330 no-PIE to the exes on $build.
11332 2021-09-14 Claudiu Zissulescu <claziss@synopsys.com>
11334 * config/arc/arc.md (doloop_end): Add missing mode.
11335 (loop_end): Likewise.
11337 2021-09-14 Jakub Jelinek <jakub@redhat.com>
11339 * gimplify.c (goa_stabilize_expr): Add depth argument, propagate
11340 it to recursive calls, for depth above 7 just gimplify or return.
11341 Perform a test even for MODIFY_EXPR, ADDR_EXPR, COMPOUND_EXPR with
11342 __builtin_clear_padding and TARGET_EXPR.
11343 (gimplify_omp_atomic): Adjust goa_stabilize_expr callers.
11345 2021-09-14 liuhongt <hongtao.liu@intel.com>
11347 * config/i386/avx512fp16intrin.h (_mm_fpclass_sh_mask):
11349 (_mm_mask_fpclass_sh_mask): Likewise.
11350 (_mm512_mask_fpclass_ph_mask): Likewise.
11351 (_mm512_fpclass_ph_mask): Likewise.
11352 (_mm_getexp_sh): Likewise.
11353 (_mm_mask_getexp_sh): Likewise.
11354 (_mm_maskz_getexp_sh): Likewise.
11355 (_mm512_getexp_ph): Likewise.
11356 (_mm512_mask_getexp_ph): Likewise.
11357 (_mm512_maskz_getexp_ph): Likewise.
11358 (_mm_getexp_round_sh): Likewise.
11359 (_mm_mask_getexp_round_sh): Likewise.
11360 (_mm_maskz_getexp_round_sh): Likewise.
11361 (_mm512_getexp_round_ph): Likewise.
11362 (_mm512_mask_getexp_round_ph): Likewise.
11363 (_mm512_maskz_getexp_round_ph): Likewise.
11364 (_mm_getmant_sh): Likewise.
11365 (_mm_mask_getmant_sh): Likewise.
11366 (_mm_maskz_getmant_sh): Likewise.
11367 (_mm512_getmant_ph): Likewise.
11368 (_mm512_mask_getmant_ph): Likewise.
11369 (_mm512_maskz_getmant_ph): Likewise.
11370 (_mm_getmant_round_sh): Likewise.
11371 (_mm_mask_getmant_round_sh): Likewise.
11372 (_mm_maskz_getmant_round_sh): Likewise.
11373 (_mm512_getmant_round_ph): Likewise.
11374 (_mm512_mask_getmant_round_ph): Likewise.
11375 (_mm512_maskz_getmant_round_ph): Likewise.
11376 * config/i386/avx512fp16vlintrin.h (_mm_mask_fpclass_ph_mask):
11378 (_mm_fpclass_ph_mask): Likewise.
11379 (_mm256_mask_fpclass_ph_mask): Likewise.
11380 (_mm256_fpclass_ph_mask): Likewise.
11381 (_mm256_getexp_ph): Likewise.
11382 (_mm256_mask_getexp_ph): Likewise.
11383 (_mm256_maskz_getexp_ph): Likewise.
11384 (_mm_getexp_ph): Likewise.
11385 (_mm_mask_getexp_ph): Likewise.
11386 (_mm_maskz_getexp_ph): Likewise.
11387 (_mm256_getmant_ph): Likewise.
11388 (_mm256_mask_getmant_ph): Likewise.
11389 (_mm256_maskz_getmant_ph): Likewise.
11390 (_mm_getmant_ph): Likewise.
11391 (_mm_mask_getmant_ph): Likewise.
11392 (_mm_maskz_getmant_ph): Likewise.
11393 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11394 * config/i386/i386-builtin.def: Add corresponding new builtins.
11395 * config/i386/i386-expand.c
11396 (ix86_expand_args_builtin): Handle new builtin types.
11397 (ix86_expand_round_builtin): Ditto.
11398 * config/i386/sse.md (vecmemsuffix): Add HF vector modes.
11399 (<avx512>_getexp<mode><mask_name><round_saeonly_name>): Adjust
11400 to support HF vector modes.
11401 (avx512f_sgetexp<mode><mask_scalar_name><round_saeonly_scalar_name):
11403 (avx512dq_fpclass<mode><mask_scalar_merge_name>): Ditto.
11404 (avx512dq_vmfpclass<mode><mask_scalar_merge_name>): Ditto.
11405 (<avx512>_getmant<mode><mask_name><round_saeonly_name>): Ditto.
11406 (avx512f_vgetmant<mode><mask_scalar_name><round_saeonly_scalar_name>):
11409 2021-09-14 liuhongt <hongtao.liu@intel.com>
11411 * config/i386/avx512fp16intrin.h (_mm512_reduce_ph):
11413 (_mm512_mask_reduce_ph): Likewise.
11414 (_mm512_maskz_reduce_ph): Likewise.
11415 (_mm512_reduce_round_ph): Likewise.
11416 (_mm512_mask_reduce_round_ph): Likewise.
11417 (_mm512_maskz_reduce_round_ph): Likewise.
11418 (_mm_reduce_sh): Likewise.
11419 (_mm_mask_reduce_sh): Likewise.
11420 (_mm_maskz_reduce_sh): Likewise.
11421 (_mm_reduce_round_sh): Likewise.
11422 (_mm_mask_reduce_round_sh): Likewise.
11423 (_mm_maskz_reduce_round_sh): Likewise.
11424 (_mm512_roundscale_ph): Likewise.
11425 (_mm512_mask_roundscale_ph): Likewise.
11426 (_mm512_maskz_roundscale_ph): Likewise.
11427 (_mm512_roundscale_round_ph): Likewise.
11428 (_mm512_mask_roundscale_round_ph): Likewise.
11429 (_mm512_maskz_roundscale_round_ph): Likewise.
11430 (_mm_roundscale_sh): Likewise.
11431 (_mm_mask_roundscale_sh): Likewise.
11432 (_mm_maskz_roundscale_sh): Likewise.
11433 (_mm_roundscale_round_sh): Likewise.
11434 (_mm_mask_roundscale_round_sh): Likewise.
11435 (_mm_maskz_roundscale_round_sh): Likewise.
11436 * config/i386/avx512fp16vlintrin.h: (_mm_reduce_ph):
11438 (_mm_mask_reduce_ph): Likewise.
11439 (_mm_maskz_reduce_ph): Likewise.
11440 (_mm256_reduce_ph): Likewise.
11441 (_mm256_mask_reduce_ph): Likewise.
11442 (_mm256_maskz_reduce_ph): Likewise.
11443 (_mm_roundscale_ph): Likewise.
11444 (_mm_mask_roundscale_ph): Likewise.
11445 (_mm_maskz_roundscale_ph): Likewise.
11446 (_mm256_roundscale_ph): Likewise.
11447 (_mm256_mask_roundscale_ph): Likewise.
11448 (_mm256_maskz_roundscale_ph): Likewise.
11449 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11450 * config/i386/i386-builtin.def: Add corresponding new builtins.
11451 * config/i386/i386-expand.c
11452 (ix86_expand_args_builtin): Handle new builtin types.
11453 (ix86_expand_round_builtin): Ditto.
11454 * config/i386/sse.md (<mask_codefor>reducep<mode><mask_name>):
11456 (<mask_codefor>reducep<mode><mask_name><round_saeonly_name>):
11457 ... this, and adjust for round operands.
11458 (reduces<mode><mask_scalar_name>): Likewise, with ...
11459 (reduces<mode><mask_scalar_name><round_saeonly_scalar_name):
11461 (<avx512>_rndscale<mode><mask_name><round_saeonly_name>):
11462 Adjust for HF vector modes.
11463 (avx512f_rndscale<mode><mask_scalar_name><round_saeonly_scalar_name>):
11465 (*avx512f_rndscale<mode><round_saeonly_name>): Ditto.
11467 2021-09-14 liuhongt <hongtao.liu@intel.com>
11469 * config/i386/avx512fp16intrin.h: (_mm512_rcp_ph):
11471 (_mm512_mask_rcp_ph): Likewise.
11472 (_mm512_maskz_rcp_ph): Likewise.
11473 (_mm_rcp_sh): Likewise.
11474 (_mm_mask_rcp_sh): Likewise.
11475 (_mm_maskz_rcp_sh): Likewise.
11476 (_mm512_scalef_ph): Likewise.
11477 (_mm512_mask_scalef_ph): Likewise.
11478 (_mm512_maskz_scalef_ph): Likewise.
11479 (_mm512_scalef_round_ph): Likewise.
11480 (_mm512_mask_scalef_round_ph): Likewise.
11481 (_mm512_maskz_scalef_round_ph): Likewise.
11482 (_mm_scalef_sh): Likewise.
11483 (_mm_mask_scalef_sh): Likewise.
11484 (_mm_maskz_scalef_sh): Likewise.
11485 (_mm_scalef_round_sh): Likewise.
11486 (_mm_mask_scalef_round_sh): Likewise.
11487 (_mm_maskz_scalef_round_sh): Likewise.
11488 * config/i386/avx512fp16vlintrin.h (_mm_rcp_ph):
11490 (_mm256_rcp_ph): Likewise.
11491 (_mm_mask_rcp_ph): Likewise.
11492 (_mm256_mask_rcp_ph): Likewise.
11493 (_mm_maskz_rcp_ph): Likewise.
11494 (_mm256_maskz_rcp_ph): Likewise.
11495 (_mm_scalef_ph): Likewise.
11496 (_mm256_scalef_ph): Likewise.
11497 (_mm_mask_scalef_ph): Likewise.
11498 (_mm256_mask_scalef_ph): Likewise.
11499 (_mm_maskz_scalef_ph): Likewise.
11500 (_mm256_maskz_scalef_ph): Likewise.
11501 * config/i386/i386-builtin.def: Add new builtins.
11502 * config/i386/sse.md (VFH_AVX512VL): New.
11503 (avx512fp16_rcp<mode>2<mask_name>): Ditto.
11504 (avx512fp16_vmrcpv8hf2<mask_scalar_name>): Ditto.
11505 (avx512f_vmscalef<mode><mask_scalar_name><round_scalar_name>):
11506 Adjust to support HF vector modes.
11507 (<avx512>_scalef<mode><mask_name><round_name>): Ditto.
11509 2021-09-14 liuhongt <hongtao.liu@intel.com>
11511 * config/i386/avx512fp16intrin.h: (_mm512_sqrt_ph):
11513 (_mm512_mask_sqrt_ph): Likewise.
11514 (_mm512_maskz_sqrt_ph): Likewise.
11515 (_mm512_sqrt_round_ph): Likewise.
11516 (_mm512_mask_sqrt_round_ph): Likewise.
11517 (_mm512_maskz_sqrt_round_ph): Likewise.
11518 (_mm512_rsqrt_ph): Likewise.
11519 (_mm512_mask_rsqrt_ph): Likewise.
11520 (_mm512_maskz_rsqrt_ph): Likewise.
11521 (_mm_rsqrt_sh): Likewise.
11522 (_mm_mask_rsqrt_sh): Likewise.
11523 (_mm_maskz_rsqrt_sh): Likewise.
11524 (_mm_sqrt_sh): Likewise.
11525 (_mm_mask_sqrt_sh): Likewise.
11526 (_mm_maskz_sqrt_sh): Likewise.
11527 (_mm_sqrt_round_sh): Likewise.
11528 (_mm_mask_sqrt_round_sh): Likewise.
11529 (_mm_maskz_sqrt_round_sh): Likewise.
11530 * config/i386/avx512fp16vlintrin.h (_mm_sqrt_ph): New intrinsic.
11531 (_mm256_sqrt_ph): Likewise.
11532 (_mm_mask_sqrt_ph): Likewise.
11533 (_mm256_mask_sqrt_ph): Likewise.
11534 (_mm_maskz_sqrt_ph): Likewise.
11535 (_mm256_maskz_sqrt_ph): Likewise.
11536 (_mm_rsqrt_ph): Likewise.
11537 (_mm256_rsqrt_ph): Likewise.
11538 (_mm_mask_rsqrt_ph): Likewise.
11539 (_mm256_mask_rsqrt_ph): Likewise.
11540 (_mm_maskz_rsqrt_ph): Likewise.
11541 (_mm256_maskz_rsqrt_ph): Likewise.
11542 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11543 * config/i386/i386-builtin.def: Add corresponding new builtins.
11544 * config/i386/i386-expand.c
11545 (ix86_expand_args_builtin): Handle new builtins.
11546 (ix86_expand_round_builtin): Ditto.
11547 * config/i386/sse.md (VF_AVX512FP16VL): New.
11548 (sqrt<mode>2): Adjust for HF vector modes.
11549 (<sse>_sqrt<mode>2<mask_name><round_name>): Likewise.
11550 (<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
11552 (<sse>_rsqrt<mode>2<mask_name>): New.
11553 (avx512fp16_vmrsqrtv8hf2<mask_scalar_name>): Likewise.
11555 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
11557 PR bootstrap/101574
11558 * diagnostic-spec.c (warning_suppressed_at, copy_warning): Handle
11559 'RESERVED_LOCATION_P' locations.
11560 * warning-control.cc (get_nowarn_spec, suppress_warning)
11561 (copy_warning): Likewise.
11563 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
11565 * diagnostic-spec.h (typedef xint_hash_t): Use 'location_t' instead of...
11566 (typedef key_type_t): ... this. Remove.
11567 (nowarn_map): Document.
11568 * diagnostic-spec.c (nowarn_map): Likewise.
11569 * warning-control.cc (convert_to_key): Evolve functions into...
11570 (get_location): ... these. Adjust all users.
11572 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
11574 * warning-control.cc (copy_warning): Remove 'nowarn_map' setup.
11576 2021-09-13 Jason Merrill <jason@redhat.com>
11578 * params.opt: Add destructive-interference-size and
11579 constructive-interference-size.
11580 * doc/invoke.texi: Document them.
11581 * config/aarch64/aarch64.c (aarch64_override_options_internal):
11583 * config/arm/arm.c (arm_option_override): Set them.
11584 * config/i386/i386-options.c (ix86_option_override_internal):
11587 2021-09-13 Martin Liska <mliska@suse.cz>
11588 H.J. Lu <hjl.tools@gmail.com>
11591 * common/config/i386/cpuinfo.h (cpu_indicator_init): Add support
11592 for x86-64 micro levels for __builtin_cpu_supports.
11593 * common/config/i386/i386-cpuinfo.h (enum feature_priority):
11594 Add priorities for the micro-arch levels.
11595 (enum processor_features): Add new features.
11596 * common/config/i386/i386-isas.h: Add micro-arch features.
11597 * config/i386/i386-builtins.c (get_builtin_code_for_version):
11598 Support the micro-arch levels by callsing
11599 __builtin_cpu_supports.
11600 * doc/extend.texi: Document that the levels are support by
11601 __builtin_cpu_supports.
11603 2021-09-13 Andrew Pinski <apinski@marvell.com>
11606 * config/aarch64/aarch64-builtins.c (aarch64_fold_builtin_lane_check):
11608 (aarch64_general_fold_builtin): Handle AARCH64_SIMD_BUILTIN_LANE_CHECK.
11609 (aarch64_general_gimple_fold_builtin): Likewise.
11611 2021-09-13 Andrew Pinski <apinski@marvell.com>
11613 * config.gcc: Add m32r-*-linux* and m32rle-*-linux*
11614 to the Unsupported targets list.
11615 Remove support for m32r-*-linux* and m32rle-*-linux*.
11616 * config/m32r/linux.h: Removed.
11617 * config/m32r/t-linux: Removed.
11619 2021-09-13 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
11622 * config/aarch64/aarch64.c (aarch64_classify_address): Don't allow
11623 register index for SVE predicate modes.
11625 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
11627 * tree-ssa-threadbackward.c
11628 (back_threader_profitability::profitable_path_p): Remove FSM
11630 (back_threader_registry::register_path): Same.
11631 * tree-ssa-threadedge.c
11632 (jump_threader::simplify_control_stmt_condition): Same.
11633 * tree-ssa-threadupdate.c (jt_path_registry::jt_path_registry):
11634 Add backedge_threads argument.
11635 (fwd_jt_path_registry::fwd_jt_path_registry): Pass
11636 backedge_threads argument.
11637 (back_jt_path_registry::back_jt_path_registry): Same.
11638 (dump_jump_thread_path): Adjust for FSM removal.
11639 (back_jt_path_registry::rewire_first_differing_edge): Same.
11640 (back_jt_path_registry::adjust_paths_after_duplication): Same.
11641 (back_jt_path_registry::update_cfg): Same.
11642 (jt_path_registry::register_jump_thread): Same.
11643 * tree-ssa-threadupdate.h (enum jump_thread_edge_type): Remove
11645 (class back_jt_path_registry): Add backedge_threads to
11648 2021-09-13 Martin Liska <mliska@suse.cz>
11651 * asan.h (sanitize_coverage_p): Handle when fn == NULL.
11653 2021-09-13 H.J. Lu <hjl.tools@gmail.com>
11656 * config/i386/i386.h (TARGET_AVX256_MOVE_BY_PIECES): New.
11657 (TARGET_AVX256_STORE_BY_PIECES): Likewise.
11658 (MOVE_MAX): Check TARGET_AVX256_MOVE_BY_PIECES and
11659 TARGET_AVX256_STORE_BY_PIECES instead of
11660 TARGET_AVX256_SPLIT_UNALIGNED_LOAD and
11661 TARGET_AVX256_SPLIT_UNALIGNED_STORE.
11662 (STORE_MAX_PIECES): Check TARGET_AVX256_STORE_BY_PIECES instead
11663 of TARGET_AVX256_SPLIT_UNALIGNED_STORE.
11664 * config/i386/x86-tune.def (X86_TUNE_AVX256_MOVE_BY_PIECES): New.
11665 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
11667 2021-09-13 liuhongt <hongtao.liu@intel.com>
11669 PR bootstrap/102302
11670 * expmed.c (extract_bit_field_using_extv): Use
11671 gen_lowpart_if_possible instead of gen_lowpart to avoid ICE.
11673 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
11675 * Makefile.in (OBJS): Add value-pointer-equiv.o.
11676 * gimple-ssa-evrp.c (class ssa_equiv_stack): Move to
11677 value-pointer-equiv.*.
11678 (ssa_equiv_stack::ssa_equiv_stack): Same.
11679 (ssa_equiv_stack::enter): Same.
11680 (ssa_equiv_stack::leave): Same.
11681 (ssa_equiv_stack::push_replacement): Same.
11682 (ssa_equiv_stack::get_replacement): Same.
11683 (is_pointer_ssa): Same.
11684 (class pointer_equiv_analyzer): Same.
11685 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
11686 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Same.
11687 (pointer_equiv_analyzer::set_global_equiv): Same.
11688 (pointer_equiv_analyzer::set_cond_equiv): Same.
11689 (pointer_equiv_analyzer::get_equiv): Same.
11690 (pointer_equiv_analyzer::enter): Same.
11691 (pointer_equiv_analyzer::leave): Same.
11692 (pointer_equiv_analyzer::get_equiv_expr): Same.
11693 (pta_valueize): Same.
11694 (pointer_equiv_analyzer::visit_stmt): Same.
11695 (pointer_equiv_analyzer::visit_edge): Same.
11696 (hybrid_folder::value_of_expr): Same.
11697 (hybrid_folder::value_on_edge): Same.
11698 * value-pointer-equiv.cc: New file.
11699 * value-pointer-equiv.h: New file.
11701 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
11704 * gimple-fold.c (gimple_fold_builtin_memory_op): Allow folding
11705 memcpy if the size is not more than MOVE_MAX * MOVE_RATIO.
11707 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
11710 * config/arm/arm.md (movmisaligndi): New define_expand.
11711 * config/arm/vec-common.md (movmisalign<mode>): Iterate over VDQ mode.
11713 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
11716 * emit-rtl.c (gen_highpart): Use adjust_address to handle
11717 MEM rather than calling simplify_gen_subreg.
11719 2021-09-13 Jan-Benedict Glaw <jbglaw@ług-owl.de>
11721 * config/alpha/vms.h (INIT_CUMULATIVE_ARGS): Wrap multi-statment
11722 define into a block.
11724 2021-09-13 Richard Biener <rguenther@suse.de>
11726 * config/darwin.h (DARWIN_PREFER_DWARF): Do not define.
11727 * config/i386/darwin.h (PREFERRED_DEBUGGING_TYPE): Do not
11728 change based on DARWIN_PREFER_DWARF not being defined.
11730 2021-09-13 Richard Biener <rguenther@suse.de>
11732 * config/i386/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
11733 to inherit from elfos.h
11735 2021-09-13 Richard Biener <rguenther@suse.de>
11737 * config.gcc: Add cr16-*-* to the list of obsoleted targets.
11739 2021-09-13 Richard Biener <rguenther@suse.de>
11741 * config/avr/elf.h (PREFERRED_DEBUGGING_TYPE): Remove
11742 override, pick up DWARF2_DEBUG define from elfos.h
11744 2021-09-13 Richard Biener <rguenther@suse.de>
11746 * config/rx/rx.h (PREFERRED_DEBUGGING_TYPE): Always define to
11749 2021-09-13 Richard Biener <rguenther@suse.de>
11751 * config/alpha/vms.h (PREFERRED_DEBUGGING_TYPE): Define to
11754 2021-09-13 Richard Biener <rguenther@suse.de>
11756 * config/i386/cygming.h: Always default to DWARF2 debugging.
11757 Do not define DBX_DEBUGGING_INFO, that's done via dbxcoff.h
11759 * doc/install.texi: Document binutils 2.16 as minimum
11760 requirement for mingw.
11762 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
11764 * config/rs6000/rs6000.c (struct rs6000_cost_data): New members
11765 nstmts, nloads and extra_ctor_cost.
11766 (rs6000_density_test): Add load density related heuristics. Do
11767 extra costing on vector construction statements if need.
11768 (rs6000_init_cost): Init new members.
11769 (rs6000_update_target_cost_per_stmt): New function.
11770 (rs6000_add_stmt_cost): Factor vect_nonmem hunk out to function
11771 rs6000_update_target_cost_per_stmt and call it.
11773 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
11775 * config/rs6000/rs6000.c (struct rs6000_cost_data): Remove typedef.
11776 (rs6000_init_cost): Adjust.
11778 2021-09-13 liuhongt <hongtao.liu@intel.com>
11780 * config/i386/i386.md: (UNSPEC_COPYSIGN): Remove.
11781 (UNSPEC_XORSIGN): Ditto.
11783 2021-09-12 Roger Sayle <roger@nextmovesoftware.com>
11785 * expr.c (convert_move): Preserve SUBREG_PROMOTED_VAR_P when
11786 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
11789 2021-09-11 Aldy Hernandez <aldyh@redhat.com>
11791 * tree-ssa-threadbackward.c (class back_threader_registry): Use
11792 back_jt_path_registry.
11793 * tree-ssa-threadedge.c (jump_threader::jump_threader): Use
11794 fwd_jt_path_registry.
11795 * tree-ssa-threadedge.h (class jump_threader): Same..
11796 * tree-ssa-threadupdate.c
11797 (jump_thread_path_registry::jump_thread_path_registry): Rename...
11798 (jt_path_registry::jt_path_registry): ...to this.
11799 (jump_thread_path_registry::~jump_thread_path_registry): Rename...
11800 (jt_path_registry::~jt_path_registry): ...this.
11801 (fwd_jt_path_registry::fwd_jt_path_registry): New.
11802 (fwd_jt_path_registry::~fwd_jt_path_registry): New.
11803 (jump_thread_path_registry::allocate_thread_edge): Rename...
11804 (jt_path_registry::allocate_thread_edge): ...to this.
11805 (jump_thread_path_registry::allocate_thread_path): Rename...
11806 (jt_path_registry::allocate_thread_path): ...to this.
11807 (jump_thread_path_registry::lookup_redirection_data): Rename...
11808 (fwd_jt_path_registry::lookup_redirection_data): ...to this.
11809 (jump_thread_path_registry::thread_block_1): Rename...
11810 (fwd_jt_path_registry::thread_block_1): ...to this.
11811 (jump_thread_path_registry::thread_block): Rename...
11812 (fwd_jt_path_registry::thread_block): ...to this.
11813 (jt_path_registry::thread_through_loop_header): Rename...
11814 (fwd_jt_path_registry::thread_through_loop_header): ...to this.
11815 (jump_thread_path_registry::mark_threaded_blocks): Rename...
11816 (fwd_jt_path_registry::mark_threaded_blocks): ...to this.
11817 (jump_thread_path_registry::debug_path): Rename...
11818 (jt_path_registry::debug_path): ...to this.
11819 (jump_thread_path_registry::dump): Rename...
11820 (jt_path_registry::debug): ...to this.
11821 (jump_thread_path_registry::rewire_first_differing_edge): Rename...
11822 (back_jt_path_registry::rewire_first_differing_edge): ...to this.
11823 (jump_thread_path_registry::adjust_paths_after_duplication): Rename...
11824 (back_jt_path_registry::adjust_paths_after_duplication): ...to this.
11825 (jump_thread_path_registry::duplicate_thread_path): Rename...
11826 (back_jt_path_registry::duplicate_thread_path): ...to this. Also,
11827 drop ill-formed candidates.
11828 (jump_thread_path_registry::remove_jump_threads_including): Rename...
11829 (fwd_jt_path_registry::remove_jump_threads_including): ...to this.
11830 (jt_path_registry::thread_through_all_blocks): New.
11831 (back_jt_path_registry::update_cfg): New.
11832 (fwd_jt_path_registry::update_cfg): New.
11833 (jump_thread_path_registry::register_jump_thread): Rename...
11834 (jt_path_registry::register_jump_thread): ...to this.
11835 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
11837 (class jt_path_registry): ...here.
11838 (class fwd_jt_path_registry): New.
11839 (class back_jt_path_registry): New.
11841 2021-09-10 liuhongt <hongtao.liu@intel.com>
11844 2021-09-01 liuhongt <hongtao.liu@intel.com>
11846 * emit-rtl.c (validate_subreg): Get rid of all float-int
11849 2021-09-10 Jakub Jelinek <jakub@redhat.com>
11851 * tree-core.h (enum omp_memory_order): Add OMP_MEMORY_ORDER_MASK,
11852 OMP_FAIL_MEMORY_ORDER_UNSPECIFIED, OMP_FAIL_MEMORY_ORDER_RELAXED,
11853 OMP_FAIL_MEMORY_ORDER_ACQUIRE, OMP_FAIL_MEMORY_ORDER_RELEASE,
11854 OMP_FAIL_MEMORY_ORDER_ACQ_REL, OMP_FAIL_MEMORY_ORDER_SEQ_CST and
11855 OMP_FAIL_MEMORY_ORDER_MASK enumerators.
11856 (OMP_FAIL_MEMORY_ORDER_SHIFT): Define.
11857 * gimple-pretty-print.c (dump_gimple_omp_atomic_load,
11858 dump_gimple_omp_atomic_store): Print [weak] for weak atomic
11860 * gimple.h (enum gf_mask): Change GF_OMP_ATOMIC_MEMORY_ORDER
11861 to 6-bit mask, adjust GF_OMP_ATOMIC_NEED_VALUE value and add
11862 GF_OMP_ATOMIC_WEAK.
11863 (gimple_omp_atomic_weak_p, gimple_omp_atomic_set_weak): New inline
11865 * tree.h (OMP_ATOMIC_WEAK): Define.
11866 * tree-pretty-print.c (dump_omp_atomic_memory_order): Adjust for
11867 fail memory order being encoded in the same enum and also print
11868 fail clause if present.
11869 (dump_generic_node): Print weak clause if OMP_ATOMIC_WEAK.
11870 * gimplify.c (goa_stabilize_expr): Add target_expr and rhs arguments,
11871 handle pre_p == NULL case as a test mode that only returns value
11872 but doesn't change gimplify nor change anything otherwise, adjust
11873 recursive calls, add MODIFY_EXPR, ADDR_EXPR, COND_EXPR, TARGET_EXPR
11874 and CALL_EXPR handling, adjust COMPOUND_EXPR handling for
11875 __builtin_clear_padding calls, for !rhs gimplify as lvalue rather
11877 (gimplify_omp_atomic): Adjust goa_stabilize_expr caller. Handle
11878 COND_EXPR rhs. Set weak flag on gimple load/store for
11880 * omp-expand.c (omp_memory_order_to_fail_memmodel): New function.
11881 (omp_memory_order_to_memmodel): Adjust for fail clause encoded
11883 (expand_omp_atomic_cas): New function.
11884 (expand_omp_atomic_pipeline): Use omp_memory_order_to_fail_memmodel
11886 (expand_omp_atomic): Attempt to optimize atomic compare and exchange
11887 using expand_omp_atomic_cas.
11889 2021-09-10 Aldy Hernandez <aldyh@redhat.com>
11890 Michael Matz <matz@suse.de>
11892 * tree-pass.h (PROP_loop_opts_done): New.
11893 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
11894 Intersect with global range.
11895 * tree-ssa-loop.c (tree_ssa_loop_done): Set PROP_loop_opts_done.
11896 * tree-ssa-threadbackward.c
11897 (back_threader_profitability::profitable_path_p): Disable
11898 threading through latches until after loop optimizations have run.
11900 2021-09-10 David Faust <david.faust@oracle.com>
11902 * doc/invoke.texi: Document BPF -mcpu, -mjmpext, -mjmp32 and -malu32
11905 2021-09-10 David Faust <david.faust@oracle.com>
11907 * config/bpf/bpf-opts.h (bpf_isa_version): New enum.
11908 * config/bpf/bpf-protos.h (bpf_expand_cbranch): New.
11909 * config/bpf/bpf.c (bpf_option_override): Handle -mcpu option.
11910 (bpf_expand_cbranch): New function.
11911 * config/bpf/bpf.md (AM mode iterator): Conditionalize support for SI
11913 (zero_extendsidi2): Only use mov32 instruction if it is available.
11914 (SIM mode iterator): Conditionalize support for SI mode.
11915 (JM mode iterator): New.
11916 (cbranchdi4): Update name, use new JM iterator. Use bpf_expand_cbranch.
11917 (*branch_on_di): Update name, use new JM iterator.
11918 * config/bpf/bpf.opt: (mjmpext): New option.
11919 (malu32): Likewise.
11920 (mjmp32): Likewise.
11922 (bpf_isa): New enum.
11924 2021-09-10 David Faust <david.faust@oracle.com>
11926 * config/bpf/bpf.md (zero_extendhidi2): Add new output template
11927 for register-to-register extensions.
11928 (zero_extendqidi2): Likewise.
11930 2021-09-10 Richard Biener <rguenther@suse.de>
11932 PR middle-end/102273
11933 * internal-fn.c (expand_DEFERRED_INIT): Always expand non-SSA vars.
11935 2021-09-10 Richard Biener <rguenther@suse.de>
11937 PR middle-end/102269
11938 * gimplify.c (is_var_need_auto_init): Empty types do not need
11941 2021-09-10 Richard Biener <rguenther@suse.de>
11943 * configure.ac (--with-stabs): Remove.
11944 * configure: Regenerate.
11945 * doc/install.texi: Remove --with-stabs documentation.
11947 2021-09-10 liuhongt <hongtao.liu@intel.com>
11949 * config/i386/avx512fp16intrin.h: (_mm512_cmp_ph_mask):
11951 (_mm512_mask_cmp_ph_mask): Likewise.
11952 (_mm512_cmp_round_ph_mask): Likewise.
11953 (_mm512_mask_cmp_round_ph_mask): Likewise.
11954 (_mm_cmp_sh_mask): Likewise.
11955 (_mm_mask_cmp_sh_mask): Likewise.
11956 (_mm_cmp_round_sh_mask): Likewise.
11957 (_mm_mask_cmp_round_sh_mask): Likewise.
11958 (_mm_comieq_sh): Likewise.
11959 (_mm_comilt_sh): Likewise.
11960 (_mm_comile_sh): Likewise.
11961 (_mm_comigt_sh): Likewise.
11962 (_mm_comige_sh): Likewise.
11963 (_mm_comineq_sh): Likewise.
11964 (_mm_ucomieq_sh): Likewise.
11965 (_mm_ucomilt_sh): Likewise.
11966 (_mm_ucomile_sh): Likewise.
11967 (_mm_ucomigt_sh): Likewise.
11968 (_mm_ucomige_sh): Likewise.
11969 (_mm_ucomineq_sh): Likewise.
11970 (_mm_comi_round_sh): Likewise.
11971 (_mm_comi_sh): Likewise.
11972 * config/i386/avx512fp16vlintrin.h (_mm_cmp_ph_mask): New intrinsic.
11973 (_mm_mask_cmp_ph_mask): Likewise.
11974 (_mm256_cmp_ph_mask): Likewise.
11975 (_mm256_mask_cmp_ph_mask): Likewise.
11976 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11977 * config/i386/i386-builtin.def: Add corresponding new builtins.
11978 * config/i386/i386-expand.c
11979 (ix86_expand_args_builtin): Handle new builtin types.
11980 (ix86_expand_round_builtin): Ditto.
11981 * config/i386/i386.md (ssevecmode): Add HF mode.
11982 (MODEFH): New mode iterator.
11983 * config/i386/sse.md
11984 (V48H_AVX512VL): New mode iterator to support HF vector modes.
11985 Ajdust corresponding description.
11986 (ssecmpintprefix): New.
11987 (VI12_AVX512VL): Adjust to support HF vector modes.
11988 (cmp_imm_predicate): Likewise.
11989 (<avx512>_cmp<mode>3<mask_scalar_merge_name><round_saeonly_name>):
11991 (avx512f_vmcmp<mode>3<round_saeonly_name>): Likewise.
11992 (avx512f_vmcmp<mode>3_mask<round_saeonly_name>): Likewise.
11993 (<sse>_<unord>comi<round_saeonly_name>): Likewise.
11995 2021-09-10 liuhongt <hongtao.liu@intel.com>
11997 * config/i386/avx512fp16intrin.h: (_mm512_max_ph): New intrinsic.
11998 (_mm512_mask_max_ph): Likewise.
11999 (_mm512_maskz_max_ph): Likewise.
12000 (_mm512_min_ph): Likewise.
12001 (_mm512_mask_min_ph): Likewise.
12002 (_mm512_maskz_min_ph): Likewise.
12003 (_mm512_max_round_ph): Likewise.
12004 (_mm512_mask_max_round_ph): Likewise.
12005 (_mm512_maskz_max_round_ph): Likewise.
12006 (_mm512_min_round_ph): Likewise.
12007 (_mm512_mask_min_round_ph): Likewise.
12008 (_mm512_maskz_min_round_ph): Likewise.
12009 (_mm_max_sh): Likewise.
12010 (_mm_mask_max_sh): Likewise.
12011 (_mm_maskz_max_sh): Likewise.
12012 (_mm_min_sh): Likewise.
12013 (_mm_mask_min_sh): Likewise.
12014 (_mm_maskz_min_sh): Likewise.
12015 (_mm_max_round_sh): Likewise.
12016 (_mm_mask_max_round_sh): Likewise.
12017 (_mm_maskz_max_round_sh): Likewise.
12018 (_mm_min_round_sh): Likewise.
12019 (_mm_mask_min_round_sh): Likewise.
12020 (_mm_maskz_min_round_sh): Likewise.
12021 * config/i386/avx512fp16vlintrin.h (_mm_max_ph): New intrinsic.
12022 (_mm256_max_ph): Likewise.
12023 (_mm_mask_max_ph): Likewise.
12024 (_mm256_mask_max_ph): Likewise.
12025 (_mm_maskz_max_ph): Likewise.
12026 (_mm256_maskz_max_ph): Likewise.
12027 (_mm_min_ph): Likewise.
12028 (_mm256_min_ph): Likewise.
12029 (_mm_mask_min_ph): Likewise.
12030 (_mm256_mask_min_ph): Likewise.
12031 (_mm_maskz_min_ph): Likewise.
12032 (_mm256_maskz_min_ph): Likewise.
12033 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12034 * config/i386/i386-builtin.def: Add corresponding new builtins.
12035 * config/i386/i386-expand.c
12036 (ix86_expand_args_builtin): Handle new builtin types.
12037 * config/i386/sse.md
12038 (<code><mode>3<mask_name><round_saeonly_name>): Adjust to
12039 support HF vector modes.
12040 (*<code><mode>3<mask_name><round_saeonly_name>): Likewise.
12041 (ieee_<ieee_maxmin><mode>3<mask_name><round_saeonly_name>):
12043 (<sse>_vm<code><mode>3<mask_scalar_name><round_saeonly_scalar_name>):
12045 * config/i386/subst.md (round_saeonly_mode512bit_condition):
12046 Adjust for HF vector modes.
12048 2021-09-10 Liu, Hongtao <hongtao.liu@intel.com>
12050 * config/i386/avx512fp16intrin.h (_mm_add_sh): New intrinsic.
12051 (_mm_mask_add_sh): Likewise.
12052 (_mm_maskz_add_sh): Likewise.
12053 (_mm_sub_sh): Likewise.
12054 (_mm_mask_sub_sh): Likewise.
12055 (_mm_maskz_sub_sh): Likewise.
12056 (_mm_mul_sh): Likewise.
12057 (_mm_mask_mul_sh): Likewise.
12058 (_mm_maskz_mul_sh): Likewise.
12059 (_mm_div_sh): Likewise.
12060 (_mm_mask_div_sh): Likewise.
12061 (_mm_maskz_div_sh): Likewise.
12062 (_mm_add_round_sh): Likewise.
12063 (_mm_mask_add_round_sh): Likewise.
12064 (_mm_maskz_add_round_sh): Likewise.
12065 (_mm_sub_round_sh): Likewise.
12066 (_mm_mask_sub_round_sh): Likewise.
12067 (_mm_maskz_sub_round_sh): Likewise.
12068 (_mm_mul_round_sh): Likewise.
12069 (_mm_mask_mul_round_sh): Likewise.
12070 (_mm_maskz_mul_round_sh): Likewise.
12071 (_mm_div_round_sh): Likewise.
12072 (_mm_mask_div_round_sh): Likewise.
12073 (_mm_maskz_div_round_sh): Likewise.
12074 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
12075 * config/i386/i386-builtin.def: Add corresponding new builtins.
12076 * config/i386/i386-expand.c
12077 (ix86_expand_round_builtin): Handle new builtins.
12078 * config/i386/sse.md (VF_128): Change description.
12079 (<sse>_vm<plusminus_insn><mode>3<mask_scalar_name><round_scalar_name>):
12080 Adjust to support HF vector modes.
12081 (<sse>_vm<multdiv_mnemonic><mode>3<mask_scalar_name><round_scalar_name>):
12084 2021-09-10 H.J. Lu <hjl.tools@gmail.com>
12086 * config/i386/i386-expand.c
12087 (ix86_avx256_split_vector_move_misalign): Handle V16HF mode.
12088 * config/i386/i386.c
12089 (ix86_preferred_simd_mode): Handle HF mode.
12090 * config/i386/sse.md (V_256H): New mode iterator.
12091 (avx_vextractf128<mode>): Use it.
12092 (VEC_INIT_MODE): Align vector HFmode condition to vector
12093 HImodes since there're no real HF instruction used.
12094 (VEC_INIT_HALF_MODE): Ditto.
12096 (VIHF_AVX512BW): Ditto.
12097 (*vec_extracthf): Ditto.
12098 (VEC_EXTRACT_MODE): Ditto.
12100 2021-09-10 Richard Biener <rguenther@suse.de>
12103 * config/dbx.h: Remove.
12104 * config/dbxcoff.h: Do not define PREFERRED_DEBUGGING_TYPE.
12105 * config/lynx.h: Likewise.
12107 2021-09-10 liuhongt <hongtao.liu@intel.com>
12109 * config/i386/i386-expand.c (ix86_expand_copysign): Expand
12110 right into ANDNOT + AND + IOR, using paradoxical subregs.
12111 (ix86_split_copysign_const): Remove.
12112 (ix86_split_copysign_var): Ditto.
12113 * config/i386/i386-protos.h (ix86_split_copysign_const): Dotto.
12114 (ix86_split_copysign_var): Ditto.
12115 * config/i386/i386.md (@copysign<mode>3_const): Ditto.
12116 (@copysign<mode>3_var): Ditto.
12118 2021-09-09 qing zhao <qing.zhao@oracle.com>
12120 * builtins.c (expand_builtin_memset): Make external visible.
12121 * builtins.h (expand_builtin_memset): Declare extern.
12122 * common.opt (ftrivial-auto-var-init=): New option.
12123 * doc/extend.texi: Document the uninitialized attribute.
12124 * doc/invoke.texi: Document -ftrivial-auto-var-init.
12125 * flag-types.h (enum auto_init_type): New enumerated type
12127 * gimple-fold.c (clear_padding_type): Add one new parameter.
12128 (clear_padding_union): Likewise.
12129 (clear_padding_emit_loop): Likewise.
12130 (clear_type_padding_in_mask): Likewise.
12131 (gimple_fold_builtin_clear_padding): Handle this new parameter.
12132 * gimplify.c (gimple_add_init_for_auto_var): New function.
12133 (gimple_add_padding_init_for_auto_var): New function.
12134 (is_var_need_auto_init): New function.
12135 (gimplify_decl_expr): Add initialization to automatic variables per
12137 (gimplify_call_expr): Add one new parameter for call to
12138 __builtin_clear_padding.
12139 (gimplify_init_constructor): Add padding initialization in the end.
12140 * internal-fn.c (INIT_PATTERN_VALUE): New macro.
12141 (expand_DEFERRED_INIT): New function.
12142 * internal-fn.def (DEFERRED_INIT): New internal function.
12143 * tree-cfg.c (verify_gimple_call): Verify calls to .DEFERRED_INIT.
12144 * tree-sra.c (generate_subtree_deferred_init): New function.
12145 (scan_function): Avoid setting cannot_scalarize_away_bitmap for
12146 calls to .DEFERRED_INIT.
12147 (sra_modify_deferred_init): New function.
12148 (sra_modify_function_body): Handle calls to DEFERRED_INIT specially.
12149 * tree-ssa-structalias.c (find_func_aliases_for_call): Likewise.
12150 * tree-ssa-uninit.c (warn_uninit): Handle calls to DEFERRED_INIT
12152 (check_defs): Likewise.
12153 (warn_uninitialized_vars): Likewise.
12154 * tree-ssa.c (ssa_undefined_value_p): Likewise.
12155 * tree.c (build_common_builtin_nodes): Build tree node for
12156 BUILT_IN_CLEAR_PADDING when needed.
12158 2021-09-09 Richard Biener <rguenther@suse.de>
12160 * tree-ssa-loop-im.c (fill_always_executed_in_1): Walk
12163 2021-09-09 Richard Biener <rguenther@suse.de>
12165 * tree-ssa-loop-im.c (fill_always_executed_in_1): Integrate
12166 DOM walk from get_loop_body_in_dom_order using a worklist
12169 2021-09-09 liuhongt <hongtao.liu@intel.com>
12171 * config.gcc: Add avx512fp16vlintrin.h.
12172 * config/i386/avx512fp16intrin.h: (_mm512_add_ph): New intrinsic.
12173 (_mm512_mask_add_ph): Likewise.
12174 (_mm512_maskz_add_ph): Likewise.
12175 (_mm512_sub_ph): Likewise.
12176 (_mm512_mask_sub_ph): Likewise.
12177 (_mm512_maskz_sub_ph): Likewise.
12178 (_mm512_mul_ph): Likewise.
12179 (_mm512_mask_mul_ph): Likewise.
12180 (_mm512_maskz_mul_ph): Likewise.
12181 (_mm512_div_ph): Likewise.
12182 (_mm512_mask_div_ph): Likewise.
12183 (_mm512_maskz_div_ph): Likewise.
12184 (_mm512_add_round_ph): Likewise.
12185 (_mm512_mask_add_round_ph): Likewise.
12186 (_mm512_maskz_add_round_ph): Likewise.
12187 (_mm512_sub_round_ph): Likewise.
12188 (_mm512_mask_sub_round_ph): Likewise.
12189 (_mm512_maskz_sub_round_ph): Likewise.
12190 (_mm512_mul_round_ph): Likewise.
12191 (_mm512_mask_mul_round_ph): Likewise.
12192 (_mm512_maskz_mul_round_ph): Likewise.
12193 (_mm512_div_round_ph): Likewise.
12194 (_mm512_mask_div_round_ph): Likewise.
12195 (_mm512_maskz_div_round_ph): Likewise.
12196 * config/i386/avx512fp16vlintrin.h: New header.
12197 * config/i386/i386-builtin-types.def (V16HF, V8HF, V32HF):
12198 Add new builtin types.
12199 * config/i386/i386-builtin.def: Add corresponding builtins.
12200 * config/i386/i386-expand.c
12201 (ix86_expand_args_builtin): Handle new builtin types.
12202 (ix86_expand_round_builtin): Likewise.
12203 * config/i386/immintrin.h: Include avx512fp16vlintrin.h
12204 * config/i386/sse.md (VFH): New mode_iterator.
12206 (avx512fmaskmode): Add HF vector modes.
12207 (avx512fmaskhalfmode): Likewise.
12208 (<plusminus_insn><mode>3<mask_name><round_name>): Adjust to for
12210 (*<plusminus_insn><mode>3<mask_name><round_name>): Likewise.
12211 (mul<mode>3<mask_name><round_name>): Likewise.
12212 (*mul<mode>3<mask_name><round_name>): Likewise.
12213 (div<mode>3): Likewise.
12214 (<sse>_div<mode>3<mask_name><round_name>): Likewise.
12215 * config/i386/subst.md (SUBST_V): Add HF vector modes.
12216 (SUBST_A): Likewise.
12217 (round_mode512bit_condition): Adjust for V32HFmode.
12219 2021-09-09 liuhongt <hongtao.liu@intel.com>
12222 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
12223 (reduc_plus_scal_v4sf): .. this, New define_expand.
12224 (reduc_plus_scal_v2df): .. and this, New define_expand.
12226 2021-09-09 liuhongt <hongtao.liu@intel.com>
12229 * config/i386/sse.md (*vec_extract<mode><ssescalarmodelower>_valign):
12232 2021-09-08 Jonathan Wakely <jwakely@redhat.com>
12235 * doc/trouble.texi (Copy Assignment): Fix description of
12236 behaviour and fix code in example.
12238 2021-09-08 Segher Boessenkool <segher@kernel.crashing.org>
12241 * config/rs6000/rs6000-logue.c (rs6000_emit_epilogue): For ELFv2 use
12242 r11 instead of r12 for restoring CR.
12244 2021-09-08 Jakub Jelinek <jakub@redhat.com>
12245 liuhongt <hongtao.liu@intel.com>
12248 * config/i386/i386.md (@xorsign<mode>3_1): Remove.
12249 * config/i386/i386-expand.c (ix86_expand_xorsign): Expand right away
12250 into AND with mask and XOR, using paradoxical subregs.
12251 (ix86_split_xorsign): Remove.
12252 * config/i386/i386-protos.h (ix86_split_xorsign): Remove.
12254 2021-09-08 Di Zhao <dizhao@os.amperecomputing.com>
12256 * tree-ssa-sccvn.c (vn_nary_op_insert_into): fix result compare
12258 2021-09-08 Jakub Jelinek <jakub@redhat.com>
12261 * config/i386/i386.md (xorsign<mode>3): If operands[1] is equal to
12262 operands[2], emit abs<mode>2 instead.
12263 (@xorsign<mode>3_1): Add early-clobbers for output operand, enable
12264 first alternative even for avx, add another alternative with
12265 =&Yv <- 0, Yv, Yvm constraints.
12266 * config/i386/i386-expand.c (ix86_split_xorsign): If op0 is equal
12267 to op1, emit vpandn instead.
12269 2021-09-08 liuhongt <hongtao.liu@intel.com>
12271 * config/i386/avx512fp16intrin.h (_mm_set_ph): New intrinsic.
12272 (_mm256_set_ph): Likewise.
12273 (_mm512_set_ph): Likewise.
12274 (_mm_setr_ph): Likewise.
12275 (_mm256_setr_ph): Likewise.
12276 (_mm512_setr_ph): Likewise.
12277 (_mm_set1_ph): Likewise.
12278 (_mm256_set1_ph): Likewise.
12279 (_mm512_set1_ph): Likewise.
12280 (_mm_setzero_ph): Likewise.
12281 (_mm256_setzero_ph): Likewise.
12282 (_mm512_setzero_ph): Likewise.
12283 (_mm_set_sh): Likewise.
12284 (_mm_load_sh): Likewise.
12285 (_mm_store_sh): Likewise.
12286 * config/i386/i386-builtin-types.def (V8HF): New type.
12287 (DEF_FUNCTION_TYPE (V8HF, V8HI)): New builtin function type
12288 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
12289 Support vector HFmodes.
12290 (ix86_expand_vector_init_one_nonzero): Likewise.
12291 (ix86_expand_vector_init_one_var): Likewise.
12292 (ix86_expand_vector_init_interleave): Likewise.
12293 (ix86_expand_vector_init_general): Likewise.
12294 (ix86_expand_vector_set): Likewise.
12295 (ix86_expand_vector_extract): Likewise.
12296 (ix86_expand_vector_init_concat): Likewise.
12297 (ix86_expand_sse_movcc): Handle vector HFmodes.
12298 (ix86_expand_vector_set_var): Ditto.
12299 * config/i386/i386-modes.def: Add HF vector modes in comment.
12300 * config/i386/i386.c (classify_argument): Add HF vector modes.
12301 (ix86_hard_regno_mode_ok): Allow HF vector modes for AVX512FP16.
12302 (ix86_vector_mode_supported_p): Likewise.
12303 (ix86_set_reg_reg_cost): Handle vector HFmode.
12304 (ix86_get_ssemov): Handle vector HFmode.
12305 (function_arg_advance_64): Pass unamed V16HFmode and V32HFmode
12307 (function_arg_advance_32): Pass V8HF/V16HF/V32HF by sse reg for 32bit
12309 (function_arg_advance_32): Ditto.
12310 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): New.
12311 (VALID_AVX256_REG_OR_OI_MODE): Rename to ..
12312 (VALID_AVX256_REG_OR_OI_VHF_MODE): .. this, and add V16HF.
12313 (VALID_SSE2_REG_VHF_MODE): New.
12314 (VALID_AVX512VL_128_REG_MODE): Add V8HF and TImode.
12315 (SSE_REG_MODE_P): Add vector HFmode.
12316 * config/i386/i386.md (mode): Add HF vector modes.
12317 (MODE_SIZE): Likewise.
12318 (ssemodesuffix): Add ph suffix for HF vector modes.
12319 * config/i386/sse.md (VFH_128): New mode iterator.
12320 (VMOVE): Adjust for HF vector modes.
12322 (V_256_512): Likewise.
12323 (avx512): Likewise.
12324 (avx512fmaskmode): Likewise.
12325 (shuffletype): Likewise.
12326 (sseinsnmode): Likewise.
12327 (ssedoublevecmode): Likewise.
12328 (ssehalfvecmode): Likewise.
12329 (ssehalfvecmodelower): Likewise.
12330 (ssePScmode): Likewise.
12331 (ssescalarmode): Likewise.
12332 (ssescalarmodelower): Likewise.
12333 (sseintprefix): Likewise.
12335 (bcstscalarsuff): Likewise.
12336 (xtg_mode): Likewise.
12337 (VI12HF_AVX512VL): New mode_iterator.
12338 (VF_AVX512FP16): Likewise.
12340 (VIHF_256): Likewise.
12341 (VIHF_AVX512BW): Likewise.
12342 (V16_256): Likewise.
12343 (V32_512): Likewise.
12344 (sseintmodesuffix): New mode_attr.
12345 (sse): Add scalar and vector HFmodes.
12346 (ssescalarmode): Add vector HFmode mapping.
12347 (ssescalarmodesuffix): Add sh suffix for HFmode.
12348 (*<sse>_vm<insn><mode>3): Use VFH_128.
12349 (*<sse>_vm<multdiv_mnemonic><mode>3): Likewise.
12350 (*ieee_<ieee_maxmin><mode>3): Likewise.
12351 (<avx512>_blendm<mode>): New define_insn.
12352 (vec_setv8hf): New define_expand.
12353 (vec_set<mode>_0): New define_insn for HF vector set.
12354 (*avx512fp16_movsh): Likewise.
12355 (avx512fp16_movsh): Likewise.
12356 (vec_extract_lo_v32hi): Rename to ...
12357 (vec_extract_lo_<mode>): ... this, and adjust to allow HF
12359 (vec_extract_hi_v32hi): Likewise.
12360 (vec_extract_hi_<mode>): Likewise.
12361 (vec_extract_lo_v16hi): Likewise.
12362 (vec_extract_lo_<mode>): Likewise.
12363 (vec_extract_hi_v16hi): Likewise.
12364 (vec_extract_hi_<mode>): Likewise.
12365 (vec_set_hi_v16hi): Likewise.
12366 (vec_set_hi_<mode>): Likewise.
12367 (vec_set_lo_v16hi): Likewise.
12368 (vec_set_lo_<mode>): Likewise.
12369 (*vec_extract<mode>_0): New define_insn_and_split for HF
12371 (*vec_extracthf): New define_insn.
12372 (VEC_EXTRACT_MODE): Add HF vector modes.
12373 (PINSR_MODE): Add V8HF.
12374 (sse2p4_1): Likewise.
12375 (pinsr_evex_isa): Likewise.
12376 (<sse2p4_1>_pinsr<ssemodesuffix>): Adjust to support
12377 insert for V8HFmode.
12378 (pbroadcast_evex_isa): Add HF vector modes.
12379 (AVX2_VEC_DUP_MODE): Likewise.
12380 (VEC_INIT_MODE): Likewise.
12381 (VEC_INIT_HALF_MODE): Likewise.
12382 (avx2_pbroadcast<mode>): Adjust to support HF vector mode
12384 (avx2_pbroadcast<mode>_1): Likewise.
12385 (<avx512>_vec_dup<mode>_1): Likewise.
12386 (<avx512>_vec_dup<mode><mask_name>): Likewise.
12387 (<mask_codefor><avx512>_vec_dup_gpr<mode><mask_name>):
12390 2021-09-08 Guo, Xuepeng <xuepeng.guo@intel.com>
12391 H.J. Lu <hongjiu.lu@intel.com>
12392 Liu Hongtao <hongtao.liu@intel.com>
12393 Wang Hongyu <hongyu.wang@intel.com>
12394 Xu Dianhong <dianhong.xu@intel.com>
12396 * common/config/i386/cpuinfo.h (get_available_features):
12397 Detect FEATURE_AVX512FP16.
12398 * common/config/i386/i386-common.c
12399 (OPTION_MASK_ISA_AVX512FP16_SET,
12400 OPTION_MASK_ISA_AVX512FP16_UNSET,
12401 OPTION_MASK_ISA2_AVX512FP16_SET,
12402 OPTION_MASK_ISA2_AVX512FP16_UNSET): New.
12403 (OPTION_MASK_ISA2_AVX512BW_UNSET,
12404 OPTION_MASK_ISA2_AVX512BF16_UNSET): Add AVX512FP16.
12405 (ix86_handle_option): Handle -mavx512fp16.
12406 * common/config/i386/i386-cpuinfo.h (enum processor_features):
12407 Add FEATURE_AVX512FP16.
12408 * common/config/i386/i386-isas.h: Add entry for AVX512FP16.
12409 * config.gcc: Add avx512fp16intrin.h.
12410 * config/i386/avx512fp16intrin.h: New intrinsic header.
12411 * config/i386/cpuid.h: Add bit_AVX512FP16.
12412 * config/i386/i386-builtin-types.def: (FLOAT16): New primitive type.
12413 * config/i386/i386-builtins.c: Support _Float16 type for i386
12415 (ix86_register_float16_builtin_type): New function.
12416 (ix86_float16_type_node): New.
12417 * config/i386/i386-c.c (ix86_target_macros_internal): Define
12419 * config/i386/i386-expand.c (ix86_expand_branch): Support
12421 (ix86_prepare_fp_compare_args): Adjust TARGET_SSE_MATH &&
12422 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
12423 (ix86_expand_fp_movcc): Ditto.
12424 * config/i386/i386-isa.def: Add PTA define for AVX512FP16.
12425 * config/i386/i386-options.c (isa2_opts): Add -mavx512fp16.
12426 (ix86_valid_target_attribute_inner_p): Add avx512fp16 attribute.
12427 * config/i386/i386.c (ix86_get_ssemov): Use
12428 vmovdqu16/vmovw/vmovsh for HFmode/HImode scalar or vector.
12429 (ix86_get_excess_precision): Use
12430 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when TARGET_AVX512FP16
12432 (sse_store_index): Use SFmode cost for HFmode cost.
12433 (inline_memory_move_cost): Add HFmode, and perfer SSE cost over
12434 GPR cost for HFmode.
12435 (ix86_hard_regno_mode_ok): Allow HImode in sse register.
12436 (ix86_mangle_type): Add manlging for _Float16 type.
12437 (inline_secondary_memory_needed): No memory is needed for
12438 16bit movement between gpr and sse reg under
12440 (ix86_multiplication_cost): Adjust TARGET_SSE_MATH &&
12441 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
12442 (ix86_division_cost): Ditto.
12443 (ix86_rtx_costs): Ditto.
12444 (ix86_add_stmt_cost): Ditto.
12445 (ix86_optab_supported_p): Ditto.
12446 * config/i386/i386.h (VALID_AVX512F_SCALAR_MODE): Add HFmode.
12447 (SSE_FLOAT_MODE_SSEMATH_OR_HF_P): Add HFmode.
12448 (PTA_SAPPHIRERAPIDS): Add PTA_AVX512FP16.
12449 * config/i386/i386.md (mode): Add HFmode.
12450 (MODE_SIZE): Add HFmode.
12451 (isa): Add avx512fp16.
12452 (enabled): Handle avx512fp16.
12453 (ssemodesuffix): Add sh suffix for HFmode.
12454 (comm): Add mult, div.
12455 (plusminusmultdiv): New code iterator.
12456 (insn): Add mult, div.
12457 (*movhf_internal): Adjust for avx512fp16 instruction.
12458 (*movhi_internal): Ditto.
12459 (*cmpi<unord>hf): New define_insn for HFmode.
12460 (*ieee_s<ieee_maxmin>hf3): Likewise.
12461 (extendhf<mode>2): Likewise.
12462 (trunc<mode>hf2): Likewise.
12463 (float<floatunssuffix><mode>hf2): Likewise.
12464 (*<insn>hf): Likewise.
12465 (cbranchhf4): New expander.
12466 (movhfcc): Likewise.
12467 (<insn>hf3): Likewise.
12468 (mulhf3): Likewise.
12469 (divhf3): Likewise.
12470 * config/i386/i386.opt: Add mavx512fp16.
12471 * config/i386/immintrin.h: Include avx512fp16intrin.h.
12472 * doc/invoke.texi: Add mavx512fp16.
12473 * doc/extend.texi: Add avx512fp16 Usage Notes.
12475 2021-09-08 liuhongt <hongtao.liu@intel.com>
12477 * common.opt: Support -fexcess-precision=16.
12478 * config/aarch64/aarch64.c (aarch64_excess_precision): Return
12479 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when
12480 EXCESS_PRECISION_TYPE_FLOAT16.
12481 * config/arm/arm.c (arm_excess_precision): Ditto.
12482 * config/i386/i386.c (ix86_get_excess_precision): Ditto.
12483 * config/m68k/m68k.c (m68k_excess_precision): Issue an error
12484 when EXCESS_PRECISION_TYPE_FLOAT16.
12485 * config/s390/s390.c (s390_excess_precision): Ditto.
12486 * coretypes.h (enum excess_precision_type): Add
12487 EXCESS_PRECISION_TYPE_FLOAT16.
12488 * doc/tm.texi (TARGET_C_EXCESS_PRECISION): Update documents.
12489 * doc/tm.texi.in (TARGET_C_EXCESS_PRECISION): Ditto.
12490 * doc/extend.texi (Half-Precision): Document
12491 -fexcess-precision=16.
12492 * flag-types.h (enum excess_precision): Add
12493 EXCESS_PRECISION_FLOAT16.
12494 * target.def (excess_precision): Update document.
12495 * tree.c (excess_precision_type): Set excess_precision_type to
12496 EXCESS_PRECISION_FLOAT16 when -fexcess-precision=16.
12498 2021-09-08 liuhongt <hongtao.liu@intel.com>
12500 * doc/extend.texi: (@node Floating Types): Adjust the wording.
12501 (@node Half-Precision): Ditto.
12503 2021-09-07 Takayuki 'January June' Suwa <jjsuwa_sys3175@yahoo.co.jp>
12506 * config/xtensa/xtensa.c (xtensa_emit_move_sequence): Add
12507 'CONST_INT_P (src)' to the condition of the block that tries to
12508 eliminate literal when loading integer contant.
12510 2021-09-07 David Faust <david.faust@oracle.com>
12512 * doc/extend.texi (BPF Type Attributes) New node.
12513 Document new preserve_access_index attribute.
12514 Document new preserve_access_index builtin.
12515 * doc/invoke.texi: Document -mco-re and -mno-co-re options.
12517 2021-09-07 David Faust <david.faust@oracle.com>
12519 * config/bpf/bpf.c: Adjust includes.
12520 (bpf_handle_preserve_access_index_attribute): New function.
12521 (bpf_attribute_table): Use it here.
12522 (bpf_builtins): Add BPF_BUILTIN_PRESERVE_ACCESS_INDEX.
12523 (bpf_option_override): Handle "-mco-re" option.
12524 (bpf_asm_init_sections): New.
12525 (TARGET_ASM_INIT_SECTIONS): Redefine.
12526 (bpf_file_end): New.
12527 (TARGET_ASM_FILE_END): Redefine.
12528 (bpf_init_builtins): Add "__builtin_preserve_access_index".
12529 (bpf_core_compute, bpf_core_get_index): New.
12530 (is_attr_preserve_access): New.
12531 (bpf_expand_builtin): Handle new builtins.
12532 (bpf_core_newdecl, bpf_core_is_maybe_aggregate_access): New.
12533 (bpf_core_walk): New.
12534 (bpf_resolve_overloaded_builtin): New.
12535 (TARGET_RESOLVE_OVERLOADED_BUILTIN): Redefine.
12536 (handle_attr): New.
12537 (pass_bpf_core_attr): New RTL pass.
12538 * config/bpf/bpf-passes.def: New file.
12539 * config/bpf/bpf-protos.h (make_pass_bpf_core_attr): New.
12540 * config/bpf/coreout.c: New file.
12541 * config/bpf/coreout.h: Likewise.
12542 * config/bpf/t-bpf (TM_H): Add $(srcdir)/config/bpf/coreout.h.
12543 (coreout.o): New rule.
12544 (PASSES_EXTRA): Add $(srcdir)/config/bpf/bpf-passes.def.
12545 * config.gcc (bpf): Add coreout.h to extra_headers.
12546 Add coreout.o to extra_objs.
12547 Add $(srcdir)/config/bpf/coreout.c to target_gtfiles.
12549 2021-09-07 David Faust <david.faust@oracle.com>
12551 * btfout.c (get_btf_id): Function is no longer static.
12552 * ctfc.h: Expose it here.
12554 2021-09-07 David Faust <david.faust@oracle.com>
12556 * ctfc.c (ctf_lookup_tree_type): New function.
12557 * ctfc.h: Likewise.
12559 2021-09-07 David Faust <david.faust@oracle.com>
12561 * ctfc.c (ctf_dtd_lookup): Function is no longer static.
12562 * ctfc.h: Analogous change.
12564 2021-09-07 David Faust <david.faust@oracle.com>
12566 * dwarf2out.c (lookup_type_die): Function is no longer static.
12567 * dwarf2out.h: Expose it here.
12569 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
12571 * dwarf2ctf.c (ctf_debug_finalize): Make it static.
12572 (ctf_debug_early_finish): New definition.
12573 (ctf_debug_finish): Likewise.
12574 * dwarf2ctf.h (ctf_debug_finalize): Remove declaration.
12575 (ctf_debug_early_finish): New declaration.
12576 (ctf_debug_finish): Likewise.
12577 * dwarf2out.c (dwarf2out_finish): Invoke ctf_debug_finish.
12578 (dwarf2out_early_finish): Invoke ctf_debug_early_finish.
12580 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
12582 * config/bpf/bpf.c (bpf_option_override): For BPF backend, disable LTO
12583 support when compiling for CO-RE.
12584 * config/bpf/bpf.opt: Add new command line option -mco-re.
12586 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
12588 * flag-types.h (enum debug_info_type): Add new enum
12589 DINFO_TYPE_BTF_WITH_CORE.
12590 (BTF_WITH_CORE_DEBUG): New bitmask.
12591 * flags.h (btf_with_core_debuginfo_p): New declaration.
12592 * opts.c (btf_with_core_debuginfo_p): New definition.
12594 2021-09-07 Jason Merrill <jason@redhat.com>
12596 * tree.h (error_operand_p): Change to inline function.
12598 2021-09-07 Aldy Hernandez <aldyh@redhat.com>
12600 * tree-ssa-threadedge.c (forwarder_block_p): Rename to...
12601 (empty_block_with_phis_p): ...this.
12602 (potentially_threadable_block): Same.
12603 (jump_threader::thread_through_normal_block): Same.
12605 2021-09-07 Eric Botcazou <ebotcazou@adacore.com>
12608 * dwarf2out.c (mark_base_types): New overloaded function.
12609 (dwarf2out_early_finish): Invoke it on the COMDAT type list as well
12610 as the compilation unit, and call move_marked_base_types afterward.
12612 2021-09-07 H.J. Lu <hjl.tools@gmail.com>
12615 * config/i386/i386-expand.c (ix86_expand_convert_uns_sisf_sse):
12617 (ix86_expand_vector_convert_uns_vsivsf): Likewise.
12619 2021-09-07 Richard Biener <rguenther@suse.de>
12621 PR tree-optimization/102226
12622 * tree-vect-loop.c (vect_transform_cycle_phi): Record
12623 the converted value for the epilogue PHI use.
12625 2021-09-07 Martin Liska <mliska@suse.cz>
12627 PR gcov-profile/80223
12628 * ipa-inline.c (can_inline_edge_p): Similarly to sanitizer
12629 options, do not inline when no_profile_instrument_function
12630 attributes are different in early inliner. It's fine to inline
12631 it after PGO instrumentation.
12633 2021-09-07 Richard Biener <rguenther@suse.de>
12635 PR tree-optimization/101555
12636 * tree-ssa-pre.c (translate_vuse_through_block): Do not
12637 perform an alias walk to determine the validity of the
12638 mem at the start of the block which is already guaranteed
12639 by means of prune_clobbered_mems.
12640 (phi_translate_1): Pass edge to translate_vuse_through_block.
12642 2021-09-07 Xionghu Luo <luoxhu@linux.ibm.com>
12645 * config/rs6000/rs6000.md (fmod<mode>3): New define_expand.
12646 (remainder<mode>3): Likewise.
12648 2021-09-07 YunQiang Su <yunqiang.su@cipunited.com>
12650 * config/mips/mips.c (mips_file_start): add .module for
12653 2021-09-06 Roger Sayle <roger@nextmovesoftware.com>
12655 * wide-int.cc (wi::clz): Reorder tests to ensure the result
12656 is zero for all negative values.
12658 2021-09-06 Tobias Burnus <tobias@codesourcery.com>
12660 * doc/invoke.texi (-foffload-options): Fix @opindex.
12662 2021-09-06 H.J. Lu <hjl.tools@gmail.com>
12665 * config/i386/i386-expand.c (ix86_split_xorsign): Use operands[2].
12666 * config/i386/i386.md (@xorsign<mode>3_1): Add non-destructive
12667 source alternative for AVX.
12669 2021-09-06 liuhongt <hongtao.liu@intel.com>
12671 PR middle-end/102182
12672 * optabs.c (expand_fix): Add from1 to avoid from being
12675 2021-09-06 Eric Botcazou <ebotcazou@adacore.com>
12677 * dwarf2out.c (modified_type_die): Deal with all array types earlier
12678 and use local variable consistently throughout the function.
12680 2021-09-06 Jakub Jelinek <jakub@redhat.com>
12682 PR tree-optimization/102207
12683 * match.pd: Don't demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
12684 were promoted from signed to wider unsigned type.
12686 2021-09-06 Andrew Pinski <apinski@marvell.com>
12688 PR tree-optimization/63184
12689 * match.pd: Add simplification of pointer_diff of two pointer_plus
12690 with addr_expr in the first operand of each pointer_plus.
12691 Add simplificatoin of ne/eq of two pointer_plus with addr_expr
12692 in the first operand of each pointer_plus.
12694 2021-09-06 Richard Biener <rguenther@suse.de>
12696 PR tree-optimization/102176
12697 * tree-vect-slp.c (vect_slp_gather_vectorized_scalar_stmts):
12699 (vect_bb_slp_scalar_cost): Use the computed set of
12700 vectorized scalar stmts instead of relying on the out-of-date
12701 and not accurate PURE_SLP_STMT.
12702 (vect_bb_vectorization_profitable_p): Compute the set
12703 of vectorized scalar stmts.
12705 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
12707 * gimple-range-path.cc (path_range_query::range_of_stmt): Remove
12708 GIMPLE_COND special casing.
12709 (path_range_query::range_defined_in_block): Use range_of_stmt
12710 instead of calling fold_range directly.
12712 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
12714 * gimple-range-path.cc (path_range_query::range_of_expr): Set
12715 m_undefined_path when appropriate.
12716 (path_range_query::internal_range_of_expr): Copy from range_of_expr.
12717 (path_range_query::unreachable_path_p): New.
12718 (path_range_query::precompute_ranges): Set m_undefined_path.
12719 * gimple-range-path.h (path_range_query::unreachable_path_p): New.
12720 (path_range_query::internal_range_of_expr): New.
12721 * tree-ssa-threadbackward.c (back_threader::find_taken_edge_cond):
12722 Use unreachable_path_p.
12724 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
12726 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
12727 Remove argument and call find_taken_edge.
12728 (back_threader::resolve_phi): Do not calculate taken edge before
12729 calling maybe_register_path.
12730 (back_threader::find_paths_to_names): Same.
12732 2021-09-05 Jeff Law <jlaw@localhost.localdomain>
12734 * config/h8300/h8300.md (QHSI2 mode iterator): New mode iterator.
12735 * config/h8300/testcompare.md (store_c): Update name, use new
12737 (store_neg_c, store_shifted_c): New patterns.
12739 2021-09-03 Segher Boessenkool <segher@kernel.crashing.org>
12742 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): On ELFv2 use r11
12743 instead of r12 for CR save, in all cases.
12745 2021-09-03 Andrew Pinski <apinski@marvell.com>
12747 * config/aarch64/aarch64-sve-builtins.cc (register_vector_type):
12748 Handle error_mark_node as the type of the type_decl.
12750 2021-09-03 Andrew Pinski <apinski@marvell.com>
12752 * config/aarch64/aarch64-builtins.c (struct aarch64_simd_type_info):
12754 (aarch64_simd_types): Likewise.
12755 (aarch64_simd_intOI_type_node): Likewise.
12756 (aarch64_simd_intCI_type_node): Likewise.
12757 (aarch64_simd_intXI_type_node): Likewise.
12758 * config/aarch64/aarch64.h (aarch64_fp16_type_node): Likewise.
12759 (aarch64_fp16_ptr_type_node): Likewise.
12760 (aarch64_bf16_type_node): Likewise.
12761 (aarch64_bf16_ptr_type_node): Likewise.
12763 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12765 * range-op.cc (operator_minus::op1_op2_relation_effect): Abstract
12767 (minus_op1_op2_relation_effect): ...here.
12768 (class operator_pointer_diff): New.
12769 (operator_pointer_diff::op1_op2_relation_effect): Call
12770 minus_op1_op2_relation_effect.
12771 (integral_table::integral_table): Add entry for POINTER_DIFF_EXPR.
12773 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12775 * tree-ssa-threadbackward.c (back_threader::thread_through_all_blocks):
12776 Add may_peel_loop_headers.
12777 (back_threader_registry::thread_through_all_blocks): Same.
12778 (try_thread_blocks): Pass may_peel_loop_headers argument.
12779 (pass_early_thread_jumps::execute): Same.
12781 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12783 * tree-ssa-threadedge.c (has_phis_p): New.
12784 (forwarder_block_p): New.
12785 (potentially_threadable_block): Call forwarder_block_p.
12786 (jump_threader::thread_around_empty_blocks): Call has_phis_p.
12787 (jump_threader::thread_through_normal_block): Call
12790 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12792 * tree-ssa-threadbackward.c (back_threader::dump): New.
12793 (back_threader::debug): New.
12794 (back_threader_profitability::profitable_path_p): Dump blocks
12795 even if we are bailing early.
12797 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12799 * tree-ssa-threadupdate.c (cancel_thread): New.
12800 (jump_thread_path_registry::thread_block_1): Use cancel_thread.
12801 (jump_thread_path_registry::mark_threaded_blocks): Same.
12802 (jump_thread_path_registry::register_jump_thread): Same.
12804 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12806 * tree-ssa-threadedge.c (jt_state::push): Only call methods for
12807 which objects are available.
12808 (jt_state::pop): Same.
12809 (jt_state::register_equiv): Same.
12810 (jt_state::register_equivs_on_edge): Same.
12812 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12814 * tree-ssa-threadedge.c (jump_threader::thread_across_edge):
12815 Move pop until after a thread is registered.
12817 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12819 * tree-ssa-threadupdate.c (debug): New.
12821 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12823 * gimple-range-trace.cc (push_dump_file::push_dump_file): New.
12824 (push_dump_file::~push_dump_file): New.
12825 (dump_ranger): Change dump_file temporarily while dumping
12827 * gimple-range-trace.h (class push_dump_file): New.
12829 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12831 * gimple-range-trace.cc (debug_seed_ranger): Remove static.
12832 (dump_ranger): Dump function name.
12834 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12836 * gimple-range-path.cc (path_range_query::range_defined_in_block):
12837 Adjust for non-null.
12838 (path_range_query::adjust_for_non_null_uses): New.
12839 (path_range_query::precompute_ranges): Call
12840 adjust_for_non_null_uses.
12841 * gimple-range-path.h: Add m_non_null and
12842 adjust_for_non_null_uses.
12844 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12846 * gimple-range-path.cc (path_range_query::dump): Dump path
12848 (path_range_query::precompute_ranges): Dump entire path.
12850 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12852 * value-relation.cc (relation_oracle::debug): New.
12853 * value-relation.h (relation_oracle::debug): New.
12855 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12857 * tree-ssa-loop-ch.c: Remove unnecessary include file.
12859 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12861 * gimple-range-fold.cc (fold_using_range::postfold_gcond_edges):
12862 Skip statements with no defining BB.
12863 * gimple-range-path.cc (path_range_query::range_defined_in_block):
12864 Do not get confused by statements with no defining BB.
12866 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12868 * gimple-range-fold.cc (adjust_imagpart_expr): Move from
12869 gimple_range_adjustment. Add support for constants.
12870 (adjust_realpart_expr): New.
12871 (gimple_range_adjustment): Move IMAGPART_EXPR code to
12872 adjust_imagpart_expr.
12873 * range-op.cc (integral_table::integral_table): Add entry for
12876 2021-09-03 Jakub Jelinek <jakub@redhat.com>
12878 * omp-expand.c (expand_omp_atomic_pipeline): Use
12879 IFN_ATOMIC_COMPARE_EXCHANGE instead of
12880 BUILT_IN_SYNC_VAL_COMPARE_AND_SWAP_? so that memory order
12883 2021-09-03 Jakub Jelinek <jakub@redhat.com>
12886 * tree.h (DECL_FIELD_ABI_IGNORED): Changed into rvalue only macro
12887 that is false if DECL_BIT_FIELD.
12888 (SET_DECL_FIELD_ABI_IGNORED, DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD,
12889 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD): Define.
12890 * tree-streamer-out.c (pack_ts_decl_common_value_fields): For
12891 DECL_BIT_FIELD stream DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead
12892 of DECL_FIELD_ABI_IGNORED.
12893 * tree-streamer-in.c (unpack_ts_decl_common_value_fields): Use
12894 SET_DECL_FIELD_ABI_IGNORED instead of writing to
12895 DECL_FIELD_ABI_IGNORED and for DECL_BIT_FIELD use
12896 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead.
12897 * lto-streamer-out.c (hash_tree): For DECL_BIT_FIELD hash
12898 DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead of DECL_FIELD_ABI_IGNORED.
12900 2021-09-03 liuhongt <hongtao.liu@intel.com>
12903 * config/i386/amxbf16intrin.h : Remove macro check for __AMX_BF16__.
12904 * config/i386/amxint8intrin.h : Remove macro check for __AMX_INT8__.
12905 * config/i386/amxtileintrin.h : Remove macro check for __AMX_TILE__.
12907 2021-09-02 Martin Sebor <msebor@redhat.com>
12909 PR tree-optimization/17506
12911 * tree-ssa-uninit.c (warn_uninit): Remove conditional guarding note.
12913 2021-09-02 Richard Biener <rguenther@suse.de>
12915 * tree-ssa-loop-im.c (fill_always_executed_in_1): Refine
12916 fix for PR78185 and continue processing when leaving
12917 finite inner loops.
12919 2021-09-02 Jakub Jelinek <jakub@redhat.com>
12921 PR tree-optimization/99591
12922 * match.pd: Demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
12925 2021-09-02 Richard Biener <rguenther@suse.de>
12928 2021-09-02 Richard Biener <rguenther@suse.de>
12930 PR tree-optimization/102155
12931 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
12932 over a part of the RPO array and do not recurse here.
12933 Dump blocks marked as always executed.
12934 (fill_always_executed_in): Walk over the RPO array and
12935 process loops whose header we run into.
12936 (loop_invariant_motion_in_fun): Compute the first RPO
12937 using rev_post_order_and_mark_dfs_back_seme in iteration
12938 order and pass that to fill_always_executed_in.
12940 2021-09-02 liuhongt <hongtao.liu@intel.com>
12942 * config/i386/i386-modes.def (FLOAT_MODE): Define ieee HFmode.
12943 * config/i386/i386.c (enum x86_64_reg_class): Add
12944 X86_64_SSEHF_CLASS.
12945 (merge_classes): Handle X86_64_SSEHF_CLASS.
12946 (examine_argument): Ditto.
12947 (construct_container): Ditto.
12948 (classify_argument): Ditto, and set HFmode/HCmode to
12949 X86_64_SSEHF_CLASS.
12950 (function_value_32): Return _FLoat16/Complex Float16 by
12952 (function_value_64): Return _Float16/Complex Float16 by SSE
12954 (ix86_print_operand): Handle CONST_DOUBLE HFmode.
12955 (ix86_secondary_reload): Require gpr as intermediate register
12956 to store _Float16 from sse register when sse4 is not
12958 (ix86_libgcc_floating_mode_supported_p): Enable _FLoat16 under
12960 (ix86_scalar_mode_supported_p): Ditto.
12961 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Defined.
12962 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add HFmode.
12963 (VALID_INT_MODE_P): Add HFmode and HCmode.
12964 * config/i386/i386.md (*pushhf_rex64): New define_insn.
12966 (*movhf_internal): Ditto.
12967 * doc/extend.texi (Half-Precision Floating Point): Documemt
12970 2021-09-02 Richard Biener <rguenther@suse.de>
12972 PR tree-optimization/102155
12973 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
12974 over a part of the RPO array and do not recurse here.
12975 Dump blocks marked as always executed.
12976 (fill_always_executed_in): Walk over the RPO array and
12977 process loops whose header we run into.
12978 (loop_invariant_motion_in_fun): Compute the first RPO
12979 using rev_post_order_and_mark_dfs_back_seme in iteration
12980 order and pass that to fill_always_executed_in.
12982 2021-09-02 YunQiang Su <syq@debian.org>
12985 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
12987 * config/mips/mips.c (mips_module_isa_name): New.
12988 mips_file_start: add .module mipsREV to all asm output
12990 2021-09-01 Jeff Law <jlaw@localhost.localdomain>
12992 PR tree-optimization/102152
12993 * tree-ssa-dom.c (dom_opt_dom_walker::optimize_stmt): Reduce a vector
12994 comparison to a scalar comparison before calling
12995 update_stmt_if_modified.
12997 2021-09-01 Andrew Pinski <apinski@marvell.com>
13000 * config/aarch64/aarch64.c (aarch64_expand_setmem):
13001 Check STRICT_ALIGNMENT before creating an overlapping
13004 2021-09-01 Martin Sebor <msebor@redhat.com>
13006 * gimple-ssa-warn-access.cc (get_size_range): Add argument.
13007 (check_access): Pass additional argument.
13008 (check_memop_access): Remove template and make a member function.
13009 (maybe_check_dealloc_call): Make a pass_waccess member function.
13010 (class pass_waccess): Add, rename, and remove members.
13011 (pass_waccess::pass_waccess): Adjust to name change.
13012 (pass_waccess::~pass_waccess): Same.
13013 (check_alloca): Make a member function.
13014 (check_alloc_size_call): Same.
13015 (check_strcat): Same.
13016 (check_strncat): Same.
13017 (check_stxcpy): Same.
13018 (check_stxncpy): Same.
13019 (check_strncmp): Same.
13020 (maybe_warn_rdwr_sizes): Rename...
13021 (pass_waccess::maybe_check_access_sizes): ...to this.
13022 (pass_waccess::check_call): Adjust to name changes.
13023 (pass_waccess::maybe_check_dealloc_call): Make a pass_waccess member
13025 (pass_waccess::execute): Adjust to name changes.
13026 * gimple-ssa-warn-access.h (check_memop_access): Remove.
13027 * pointer-query.cc (access_ref::phi): Handle null pointer.
13028 (access_ref::inform_access): Same.
13029 (pointer_query::put_ref): Modify a cached value, not a copy of it.
13030 (pointer_query::dump): New function.
13031 (compute_objsize_r): Avoid overwriting access_ref::bndrng. Cache
13033 * pointer-query.h (pointer_query::dump): Declare.
13034 * tree-ssa-strlen.c (get_range): Simplify. Use function query.
13035 (dump_strlen_info): Use function query.
13036 (printf_strlen_execute): Factor code out into pointer_query::put_ref.
13038 2021-09-01 Thomas Schwinge <thomas@codesourcery.com>
13040 * tree.c (walk_tree_1) <OMP_CLAUSE>: Simplify.
13042 2021-09-01 Iain Sandoe <iain@sandoe.co.uk>
13044 * doc/extend.texi: Document unavailable attribute.
13045 * print-tree.c (print_node): Handle unavailable attribute.
13046 * tree-core.h (struct tree_base): Add a bit to carry unavailability.
13047 * tree.c (error_unavailable_use): New.
13048 * tree.h (TREE_UNAVAILABLE): New.
13049 (error_unavailable_use): New.
13051 2021-09-01 Jakub Jelinek <jakub@redhat.com>
13053 PR tree-optimization/102124
13054 * tree-vect-patterns.c (vect_recog_widen_op_pattern): For ORIG_CODE
13055 MINUS_EXPR, if itype is unsigned with smaller precision than type,
13056 add an extra cast to signed variant of itype to ensure sign-extension.
13058 2021-09-01 Martin Liska <mliska@suse.cz>
13060 * graph.c (draw_cfg_node_succ_edges): Do not color fallthru
13061 edges and rather use colors for TRUE and FALSE edges.
13063 2021-09-01 Richard Biener <rguenther@suse.de>
13065 PR tree-optimization/93491
13066 * tree-ssa-pre.c (compute_avail): Set BB_MAY_NOTRETURN
13067 after processing the stmt itself. Do not consider
13068 pure functions possibly not returning. Properly avoid
13069 adding possibly trapping calls to EXP_GEN when there's
13070 a preceeding possibly not returning call.
13071 * tree-ssa-sccvn.c (vn_reference_may_trap): Conservatively
13074 2021-09-01 Richard Biener <rguenther@suse.de>
13076 PR tree-optimization/102139
13077 * tree-vectorizer.h (vec_base_alignments): Adjust hash-map
13078 type to record a std::pair of the stmt-info and the innermost
13080 (dr_vec_info::group): New member.
13081 * tree-vect-data-refs.c (vect_record_base_alignment): Adjust.
13082 (vect_compute_data_ref_alignment): Verify the recorded
13083 base alignment can be used.
13084 (data_ref_pair): Remove.
13085 (dr_group_sort_cmp): Adjust.
13086 (vect_analyze_data_ref_accesses): Store the group-ID in the
13087 dr_vec_info and operate on a vector of dr_vec_infos.
13089 2021-09-01 YunQiang Su <yunqiang.su@cipunited.com>
13091 * read-md.c (md_reader::handle_enum): support value assignation.
13092 * doc/md.texi: record define_c_enum value assignation support.
13094 2021-09-01 Jakub Jelinek <jakub@redhat.com>
13096 PR tree-optimization/102141
13097 * gimple-ssa-store-merging.c (bswap_view_convert): Add BEFORE
13098 argument. If false, emit stmts after gsi instead of before, and
13100 (bswap_replace): Adjust callers. When converting output of bswap,
13101 emit VIEW_CONVERT prepratation stmts after a copy of gsi instead
13104 2021-09-01 liuhongt <hongtao.liu@intel.com>
13106 * emit-rtl.c (validate_subreg): Get rid of all float-int
13109 2021-09-01 liuhongt <hongtao.liu@intel.com>
13112 2021-08-30 liuhongt <hongtao.liu@intel.com>
13114 * expmed.c (extract_bit_field_1): Make sure we're playing with
13115 integral modes before call extract_integral_bit_field.
13116 (extract_integral_bit_field): Add a parameter of type
13117 scalar_int_mode which corresponds to of tmode.
13118 And call extract_and_convert_fixed_bit_field instead of
13119 extract_fixed_bit_field and convert_extracted_bit_field.
13120 (extract_and_convert_fixed_bit_field): New function, it's a
13121 combination of extract_fixed_bit_field and
13122 convert_extracted_bit_field.
13124 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
13126 * tree.c (walk_tree_1) <OMP_CLAUSE_TILE>: Handle three operands.
13128 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
13130 * omp-general.h (omp_is_reference): Rename to...
13131 (omp_privatize_by_reference): ... this. Adjust all users...
13132 * omp-general.c: ... here, ...
13133 * gimplify.c: ... here, ...
13134 * omp-expand.c: ... here, ...
13135 * omp-low.c: ... here.
13137 2021-08-31 Martin Sebor <msebor@redhat.com>
13139 * gimple-ssa-warn-access.cc (maybe_warn_alloc_args_overflow): Test
13140 pointer element for equality to zero, not that of the cotaining
13143 2021-08-31 Martin Sebor <msebor@redhat.com>
13145 * gcc-rich-location.h (gcc_rich_location): Make ctor explicit.
13147 2021-08-31 Martin Sebor <msebor@redhat.com>
13149 * function.h (function): Add comments.
13150 (get_range_query): Same. Add attribute returns nonnull.
13152 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
13154 * expr.c (convert_modes): Don't use subreg_promoted_mode on a
13155 SUBREG if it can't be guaranteed to a SUBREG_PROMOTED_VAR_P set.
13156 Instead use the standard (safer) is_a <scalar_int_mode> idiom.
13158 2021-08-31 Jeff Law <jlaw@localhost.localdomain>
13160 * config.gcc (cris-*-elf, cris-*-none): Remove dbxelf.h from
13162 (m32r-*-elf, m32rle-*-elf, m32r-*-linux): Likewise.
13163 (mn10300-*-*, am33_2.0-*-linux*): Likewise.
13164 (xtensa*-*-elf, xtensa*-*-linux, xtensa*-*-uclinux): Likewise.
13165 (m32c-*-elf*, m32c-*-rtems*): Likewise.
13166 * config/cris/cris.h (DBX_NO_XREFS): Remove.
13167 (DBX_CONTIN_LENGTH, DBX_CONTIN_CHAR): Likewise.
13168 * config/m32r/m32r.h (DBXOUT_SOURCE_LINE): Likewise.
13169 (DBX_DEBUGGING_INFO, DBX_CONTIN_LENGTH): Likewise.
13170 * config/mn10300/mn10300.h (DEFAULT_GDB_EXTENSIONS): Likewise.
13171 * config/mn10300/linux.h (DBX_REGISTER_NAMES): Likewise.
13173 2021-08-31 Marcel Vollweiler <marcel@codesourcery.com>
13175 * gimplify.c (gimplify_scan_omp_clauses): Error handling. 'ancestor' only
13176 allowed on target constructs and only with particular other clauses.
13177 * omp-expand.c (expand_omp_target): Output of 'sorry, not supported' if
13178 'ancestor' is used.
13179 * omp-low.c (check_omp_nesting_restrictions): Error handling. No nested OpenMP
13180 structs when 'ancestor' is used.
13181 (scan_omp_1_stmt): No usage of OpenMP runtime routines in a target region when
13182 'ancestor' is used.
13183 * tree-pretty-print.c (dump_omp_clause): Append 'ancestor'.
13184 * tree.h (OMP_CLAUSE_DEVICE_ANCESTOR): Define macro.
13186 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
13188 * expr.c (convert_modes): Preserve SUBREG_PROMOTED_VAR_P when
13189 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
13191 * simplify-rtx.c (simplify_unary_operation_1) [SIGN_EXTEND]:
13192 Likewise, preserve SUBREG_PROMOTED_VAR_P when creating a (wider)
13193 partial subreg from a SUBREG_PROMOTED_VAR_P subreg. Generate
13194 SIGN_EXTEND of the SUBREG_REG when a subreg would be paradoxical.
13195 [ZERO_EXTEND]: Likewise, preserve SUBREG_PROMOTED_VAR_P when
13196 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
13197 subreg. Generate ZERO_EXTEND of the SUBREG_REG when a subreg
13198 would be paradoxical.
13200 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
13202 * combine.c (combine_simplify_rtx): Avoid converting an explicit
13203 TRUNCATE into a lowpart SUBREG on !TRULY_NOOP_TRUNCATION targets.
13204 * simplify-rtx.c (simplify_unary_operation_1): Likewise.
13206 2021-08-31 Richard Biener <rguenther@suse.de>
13208 PR tree-optimization/102142
13209 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Fix
13210 condition under which to unset the visited flag.
13212 2021-08-31 Richard Biener <rguenther@suse.de>
13214 PR middle-end/102129
13215 * tree-ssa-ter.c (find_replaceable_in_bb): Do not move
13216 possibly trapping expressions across calls.
13218 2021-08-31 Jakub Jelinek <jakub@redhat.com>
13220 PR tree-optimization/102134
13221 * tree-ssa-ccp.c (bit_value_binop) <case RSHIFT_EXPR>: If sgn is
13222 UNSIGNED and r1val | r1mask has MSB set, ensure lzcount doesn't
13225 2021-08-31 Andrew Pinski <apinski@marvell.com>
13228 * collect-utils.c (setup_signals): New declaration.
13229 * collect-utils.h (setup_signals): New function.
13230 * collect2.c (handler): Delete.
13231 (main): Instead of manually setting up the signals,
13232 just call setup_signals.
13233 * lto-wrapper.c (main): Likewise.
13235 2021-08-31 Andrew Pinski <apinski@marvell.com>
13238 * config/i386/i386-protos.h (x86_output_aligned_bss):
13239 Change align argument to unsigned type.
13240 (x86_elf_aligned_decl_common): Likewise.
13241 * config/i386/i386.c (x86_elf_aligned_decl_common): Likewise.
13242 (x86_output_aligned_bss): Likewise.
13244 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
13246 * config/mips/mips.c (mips_module_isa_name): New.
13247 mips_file_start: add .module mipsREV to all asm output
13249 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
13251 * config/mips/mips.h (struct mips_cpu_info): define enum mips_isa;
13252 use enum instead of int for 'isa' member.
13253 * config.gcc, config/mips/mips.c, config/mips/mips-cpus.def,
13254 config/mips/netbsd.h: replace hardcoded numbers with enum.
13256 2021-08-31 liuhongt <hongtao.liu@intel.com>
13258 * config/i386/sse.md (*<avx512>_ucmp<mode>3_1): Change from
13259 define_split to define_insn_and_split.
13260 (*avx2_eq<mode>3): Removed.
13261 (<avx512>_eq<mode>3<mask_scalar_merge_name>): Adjust pattern
13262 (<avx512>_eq<mode>3<mask_scalar_merge_name>_1): Rename to ..
13263 (*<avx512>_eq<mode>3<mask_scalar_merge_name>_1): .. this, and
13265 (*avx2_gt<mode>3): Removed.
13266 (<avx512>_gt<mode>3<mask_scalar_merge_name>): Change from
13267 define_insn to define_expand, and adjust pattern.
13268 (UNSPEC_MASKED_EQ, UNSPEC_MASKED_GT): Removed.
13270 2021-08-30 David Malcolm <dmalcolm@redhat.com>
13273 * Makefile.in (ANALYZER_OBJS): Add analyzer/call-info.o.
13275 2021-08-30 Jason Merrill <jason@redhat.com>
13277 * doc/invoke.texi: Document -Wmissing-requires.
13279 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
13281 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Remove
13282 TARGET_EXTRA_BUILTINS guard.
13284 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
13286 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Change
13287 initialization of V2DI_type_node and unsigned_V2DI_type_node.
13289 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
13291 * config/rs6000/darwin.h (SUBTARGET_INIT_BUILTINS): Use the new
13292 decl when new_builtins_are_live.
13293 * config/rs6000/rs6000-builtin-new.def (__builtin_cfstring): New
13296 2021-08-30 Pat Haugen <pthaugen@linux.ibm.com>
13298 * config/rs6000/rs6000-cpus.def (ISA_3_1_MASKS_SERVER): Add
13299 OPTION_MASK_P10_FUSION_2STORE.
13300 (POWERPC_MASKS): Likewise.
13301 * config/rs6000/rs6000.c (rs6000_option_override_internal): Enable
13302 store fusion for Power10.
13303 (is_fusable_store): New.
13304 (power10_sched_reorder): Likewise.
13305 (rs6000_sched_reorder): Do Power10 specific reordering.
13306 (rs6000_sched_reorder2): Likewise.
13307 * config/rs6000/rs6000.opt: Add new option.
13309 2021-08-30 Richard Biener <rguenther@suse.de>
13311 PR tree-optimization/102128
13312 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
13313 Move scanning for if-converted scalar code to the caller
13314 and instead delay clearing the visited flag for profitable
13316 (vect_slp_region): Cost all subgraphs before scheduling.
13317 For if-converted BB vectorization scan for scalar COND_EXPRs
13318 and do not vectorize if any found and the cost model is
13321 2021-08-30 Richard Biener <rguenther@suse.de>
13323 * common.opt (fexceptions): Mark
13324 EnabledBy(fnon-call-exceptions).
13325 * doc/invoke.texi (fnon-call-exceptions): Document this
13326 enables -fexceptions.
13328 2021-08-30 Sebastian Huber <sebastian.huber@embedded-brains.de>
13330 * tsystem.h (abort): Define abort() if inhibit_libc is defined and it
13331 is not already defined.
13333 2021-08-30 liuhongt <hongtao.liu@intel.com>
13335 * expmed.c (extract_bit_field_1): Make sure we're playing with
13336 integral modes before call extract_integral_bit_field.
13337 (extract_integral_bit_field): Add a parameter of type
13338 scalar_int_mode which corresponds to of tmode.
13339 And call extract_and_convert_fixed_bit_field instead of
13340 extract_fixed_bit_field and convert_extracted_bit_field.
13341 (extract_and_convert_fixed_bit_field): New function, it's a
13342 combination of extract_fixed_bit_field and
13343 convert_extracted_bit_field.
13345 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
13347 * config/darwin.c (darwin_libc_has_function): Do not run
13348 the checks for x86 or modern Darwin. Make sure that there
13349 is a value set for darwin_macosx_version_min before testing.
13351 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
13353 * config/i386/darwin.h (CLEAR_INSN_CACHE): New.
13355 2021-08-28 Jan Hubicka <hubicka@ucw.cz>
13357 * ipa-modref-tree.h (modref_access_node::merge): Break out
13358 logic combining offsets and logic merging ranges to ...
13359 (modref_access_node::combined_offsets): ... here
13360 (modref_access_node::update2): ... here
13361 (modref_access_node::closer_pair_p): New member function.
13362 (modref_access_node::forced_merge): New member function.
13363 (modre_ref_node::insert): Do merging when table is full.
13365 2021-08-28 YunQiang Su <yunqiang.su@cipunited.com>
13368 * config.gcc: MIPS: use N64 ABI by default if the triple end
13369 with -gnuabi64, which is used by Debian since 2013.
13371 2021-08-28 Alexandre Oliva <oliva@adacore.com>
13373 * ipa-modref.c (analyze_function): Skip debug stmts.
13374 * tree-inline.c (estimate_num_insn): Consider builtins even
13375 without a cgraph_node.
13377 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
13379 * config/h8300/bitfield.md (cstore<mode>4): Remove expander.
13380 * config/h8300/h8300.c (h8300_expand_branch): Remove function.
13381 * config/h8300/h8300-protos.h (h8300_expadn_branch): Remove prototype.
13382 * config/h8300/h8300.md (eqne): New code iterator.
13383 (geultu, geultu_to_c): Similarly.
13384 * config/h8300/testcompare.md (cstore<mode>4): Dummy expander.
13385 (store_c_<mode>, store_c_i_<mode>): New define_insn_and_splits
13386 (cmp<mode>_c): New pattern
13388 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
13390 * tree-ssa-dom.c (reduce_vector_comparison_to_scalar_comparison): New
13392 (dom_opt_dom_walker::optimize_stmt): Use it.
13394 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
13396 * config/darwin.c (finalize_ctors): Add a section-start linker-
13398 (finalize_dtors): Likewise.
13399 * config/darwin.h (MIN_LD64_INIT_TERM_START_LABELS): New.
13401 2021-08-27 Bill Schmidt <wschmidt@linux.ibm.com>
13403 * config/rs6000/rs6000-call.c (rs6000-builtins.h): New #include.
13404 (rs6000_init_builtins): Call rs6000_init_generated_builtins. Skip the
13405 old initialization logic when new builtins are enabled.
13406 * config/rs6000/rs6000-gen-builtins.c (write_decls): Rename
13407 rs6000_autoinit_builtins to rs6000_init_generated_builtins.
13408 (write_init_file): Likewise.
13410 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
13412 * configure.ac (darwin2[[0-9]]* | darwin19*): Alter use of
13413 gcc_GAS_CHECK_FEATURE to remove an extraneous parameter.
13414 (amdgcn-* | gcn-*) Likewise.
13416 2021-08-27 Anthony Sharp <anthonysharp15@gmail.com>
13418 * symbol-summary.h: Added missing template keyword.
13420 2021-08-27 Richard Biener <rguenther@suse.de>
13422 PR tree-optimization/45178
13423 * tree-ssa-dce.c (find_obviously_necessary_stmts): For
13424 infinite loops without exit do not mark control dependent
13425 edges of the latch necessary.
13427 2021-08-27 konglin1 <lingling.kong@intel.com>
13430 * config/i386/sse.md: (<avx512>scattersi<mode>): Add mask operand to
13432 (<avx512>scattersi<mode>): Likewise.
13433 (*avx512f_scattersi<VI48F:mode>): Merge mask operand to set_dest.
13434 (*avx512f_scatterdi<VI48F:mode>): Likewise
13436 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
13438 * config/rs6000/rs6000.c (rs6000_builtin_md_vectorized_function): Add
13439 support for built-in functions MISC_BUILTIN_DIVWE, MISC_BUILTIN_DIVWEU,
13440 MISC_BUILTIN_DIVDE, MISC_BUILTIN_DIVDEU, P10_BUILTIN_CFUGED,
13441 P10_BUILTIN_CNTLZDM, P10_BUILTIN_CNTTZDM, P10_BUILTIN_PDEPD and
13442 P10_BUILTIN_PEXTD on Power10.
13444 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
13446 * config/rs6000/rs6000-call.c (builtin_function_type): Add unsigned
13447 signedness for some Power10 bifs.
13449 2021-08-27 David Edelsohn <dje.gcc@gmail.com>
13452 * config/rs6000/rs6000.c (rs6000_adjust_field_align): Use
13453 computed alignment if the entire struct has attribute packed.
13455 2021-08-27 liuhongt <hongtao.liu@intel.com>
13459 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
13460 IX86_BUILTIN_SHUFPD512, IX86_BUILTIN_SHUFPS512,
13461 IX86_BUILTIN_SHUFPD256, IX86_BUILTIN_SHUFPS,
13462 IX86_BUILTIN_SHUFPS256.
13463 (ix86_masked_all_ones): New function.
13465 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
13467 * config/i386/i386.md (*btr<mode>_1): Call force_reg unconditionally.
13468 (conditional moves with memory inputs splitters): Ditto.
13469 * config/i386/sse.md (one_cmpl<mode>2): Simplify.
13471 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
13473 * ipa-modref-tree.h (modref_access_node::try_merge_with): Restart
13474 search after merging.
13476 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
13478 * config/rs6000/rs6000-overload.def: Add remaining overloads.
13480 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
13482 * config/rs6000/rs6000-builtin-new.def: Add cell stanza.
13484 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
13486 * config/rs6000/rs6000-builtin-new.def: Add ieee128-hw, dfp,
13487 crypto, and htm stanzas.
13489 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
13491 * config/rs6000/rs6000-builtin-new.def: Add mma stanza.
13493 2021-08-26 Martin Sebor <msebor@redhat.com>
13495 * tree-ssa-uninit.c (warn_uninit): Refactor and simplify.
13496 (warn_uninit_phi_uses): Remove argument from calls to warn_uninit.
13497 (warn_uninitialized_vars): Same. Reduce visibility of locals.
13498 (warn_uninitialized_phi): Same.
13500 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
13502 * tree-ssa-ccp.c (get_individual_bits): Helper function to
13503 extract the individual bits from a widest_int constant (mask).
13504 (gray_code_bit_flips): New read-only table for effiently
13505 enumerating permutations/combinations of bits.
13506 (bit_value_binop) [LROTATE_EXPR, RROTATE_EXPR]: Handle rotates
13507 by unknown counts that are guaranteed less than the target
13508 precision and four or fewer unknown bits by enumeration.
13509 [LSHIFT_EXPR, RSHIFT_EXPR]: Likewise, also handle shifts by
13510 enumeration under the same conditions. Handle remaining
13511 shifts as a mask based upon the minimum possible shift value.
13513 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
13514 Richard Biener <rguenther@suse.de>
13516 * match.pd (shift transformations): Remove a redundant
13517 !POINTER_TYPE_P check.
13519 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
13522 * config/i386/i386.md (cmove reg-reg move elimination peephole2s):
13523 Set all_regs to true in the call to replace_rtx.
13525 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
13527 * ipa-modref-tree.c (test_insert_search_collapse): Update test.
13528 * ipa-modref-tree.h (modref_base_node::insert): Be smarter when
13529 hiting --param modref-max-refs limit.
13530 (modref_tree:insert_base): Be smarter when hitting
13531 --param modref-max-bases limit. Add new parameter REF.
13532 (modref_tree:insert): Update.
13533 (modref_tree:merge): Update.
13534 * ipa-modref.c (read_modref_records): Update.
13536 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
13538 * params.opt: (modref-max-adjustments): Add full stop.
13540 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
13542 * ipa-modref-tree.h (modref_ref_node::verify): New member
13544 (modref_ref_node::insert): Use it.
13545 (modref_ref_node::try_mere_with): Fix off by one error.
13547 2021-08-26 Martin Liska <mliska@suse.cz>
13548 Stefan Kneifel <stefan.kneifel@bluewin.ch>
13550 * cgraph.h (create_version_clone_with_body): Add new parameter.
13551 * cgraphclones.c: Likewise.
13552 * multiple_target.c (create_dispatcher_calls): Do not use
13554 (create_target_clone): Likewise here.
13556 2021-08-26 Jonathan Yong <10walls@gmail.com>
13558 * doc/extend.texi: Add note about reserved priorities
13559 to the constructor attribute.
13561 2021-08-25 Martin Sebor <msebor@redhat.com>
13563 * gimple-range-cache.cc (ssa_global_cache::dump): Avoid printing
13564 range table header alone.
13565 * gimple-range.cc (gimple_ranger::export_global_ranges): Same.
13567 2021-08-25 Jan Hubicka <hubicka@ucw.cz>
13569 * doc/invoke.texi: Document --param modref-max-adjustments.
13570 * ipa-modref-tree.c (test_insert_search_collapse): Update.
13571 (test_merge): Update.
13572 * ipa-modref-tree.h (struct modref_access_node): Add adjustments;
13573 (modref_access_node::operator==): Fix handling of access ranges.
13574 (modref_access_node::contains): Constify parameter; handle also
13575 mismatched parm offsets.
13576 (modref_access_node::update): New function.
13577 (modref_access_node::merge): New function.
13578 (unspecified_modref_access_node): Update constructor.
13579 (modref_ref_node::insert_access): Add record_adjustments parameter;
13581 (modref_ref_node::try_merge_with): New private function.
13582 (modref_tree::insert): New record_adjustments parameter.
13583 (modref_tree::merge): New record_adjustments parameter.
13584 (modref_tree::copy_from): Update.
13585 * ipa-modref.c (dump_access): Dump adjustments field.
13586 (get_access): Update constructor.
13587 (record_access): Update call of insert.
13588 (record_access_lto): Update call of insert.
13589 (merge_call_side_effects): Add record_adjustments parameter.
13590 (get_access_for_fnspec): Update.
13591 (process_fnspec): Update.
13592 (analyze_call): Update.
13593 (analyze_function): Update.
13594 (read_modref_records): Update.
13595 (ipa_merge_modref_summary_after_inlining): Update.
13596 (propagate_unknown_call): Update.
13597 (modref_propagate_in_scc): Update.
13598 * params.opt (param-max-modref-adjustments=): New.
13600 2021-08-25 Michael Meissner <meissner@linux.ibm.com>
13602 * config/rs6000/vsx.md (UNSPEC_XXSPLTIDP): Rename from
13604 (xxspltiw_v4si): Use vecperm type attribute.
13605 (xxspltiw_v4si_inst): Use vecperm type attribute.
13606 (xxspltiw_v4sf_inst): Likewise.
13607 (xxspltidp_v2df): Use vecperm type attribute. Use
13608 UNSPEC_XXSPLTIDP instead of UNSPEC_XXSPLTID.
13609 (xxspltidp_v2df_inst): Likewise.
13610 (xxsplti32dx_v4si): Use vecperm type attribute.
13611 (xxsplti32dx_v4si_inst): Likewise.
13612 (xxsplti32dx_v4sf_inst): Likewise.
13613 (xxblend_<mode>): Likewise.
13614 (xxpermx): Likewise.
13615 (xxpermx_inst): Likewise.
13616 (xxeval): Likewise.
13618 2021-08-25 Lewis Hyatt <lhyatt@gmail.com>
13621 * coretypes.h (typedef diagnostic_input_charset_callback): Declare.
13622 * diagnostic.c (diagnostic_initialize_input_context): New function.
13623 * diagnostic.h (diagnostic_initialize_input_context): Declare.
13624 * input.c (default_charset_callback): New function.
13625 (file_cache::initialize_input_context): New function.
13626 (file_cache_slot::create): Added ability to convert the input
13627 according to the input context.
13628 (file_cache::file_cache): Initialize the new input context.
13629 (class file_cache_slot): Added new m_alloc_offset member.
13630 (file_cache_slot::file_cache_slot): Initialize the new member.
13631 (file_cache_slot::~file_cache_slot): Handle potentially offset buffer.
13632 (file_cache_slot::maybe_grow): Likewise.
13633 (file_cache_slot::needs_read_p): Handle NULL fp, which is now possible.
13634 (file_cache_slot::get_next_line): Likewise.
13635 * input.h (class file_cache): Added input context member.
13637 2021-08-25 Richard Biener <rguenther@suse.de>
13639 PR tree-optimization/102046
13640 * tree-vect-slp.c (vect_build_slp_tree_2): Conservatively
13641 update ->any_pattern when swapping operands.
13643 2021-08-25 Hongyu Wang <hongyu.wang@intel.com>
13646 * config/i386/i386.c (ix86_live_on_entry): Adjust comment.
13647 (ix86_decompose_address): Remove retval check for ASHIFT,
13648 allow non-canonical zero extend if AND mask covers ASHIFT
13650 (ix86_legitimate_address_p): Adjust condition for decompose.
13651 (ix86_rtx_costs): Adjust cost for lea with non-canonical
13653 Co-Authored by: Uros Bizjak <ubizjak@gmail.com>
13655 2021-08-25 Jiufu Guo <guojiufu@linux.ibm.com>
13657 PR tree-optimization/101145
13658 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
13660 (number_of_iterations_lt): Invoke above function.
13661 (adjust_cond_for_loop_until_wrap):
13662 Merge to number_of_iterations_until_wrap.
13663 (number_of_iterations_cond): Update invokes for
13664 adjust_cond_for_loop_until_wrap and number_of_iterations_lt.
13666 2021-08-25 konglin1 <lingling.kong@intel.com>
13669 * config/i386/avx512dqintrin.h (_mm512_fpclass_ps_mask): Fix
13670 macro define in O0.
13671 (_mm512_mask_fpclass_ps_mask): Ditto.
13673 2021-08-25 Kewen Lin <linkw@linux.ibm.com>
13675 * config/rs6000/altivec.md (vec_unpacku_hi_v16qi): Remove.
13676 (vec_unpacku_hi_v8hi): Likewise.
13677 (vec_unpacku_lo_v16qi): Likewise.
13678 (vec_unpacku_lo_v8hi): Likewise.
13679 (vec_unpacku_hi_<VP_small_lc>): New define_expand.
13680 (vec_unpacku_lo_<VP_small_lc>): Likewise.
13682 2021-08-24 David Edelsohn <dje.gcc@gmail.com>
13684 * config/rs6000/aix.h (SYSTEM_IMPLICIT_EXTERN_C): Delete.
13685 * config/rs6000/aix71.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
13686 * config/rs6000/aix72.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
13687 * config/rs6000/aix73.h (TARGET_AIX_VERSION): Increase to 73.
13689 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
13691 PR middle-end/102031
13692 * simplify-rtx.c (simplify_truncation): When comparing precisions
13693 use "subreg_prec" variable, not "subreg_mode".
13695 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13697 * config/rs6000/rs6000-builtin-new.def: Add power10 and power10-64
13700 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13702 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Initialize
13703 various pointer type nodes.
13704 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add enum
13705 values for various pointer types.
13706 (ptr_V16QI_type_node): New macro.
13707 (ptr_V1TI_type_node): New macro.
13708 (ptr_V2DI_type_node): New macro.
13709 (ptr_V2DF_type_node): New macro.
13710 (ptr_V4SI_type_node): New macro.
13711 (ptr_V4SF_type_node): New macro.
13712 (ptr_V8HI_type_node): New macro.
13713 (ptr_unsigned_V16QI_type_node): New macro.
13714 (ptr_unsigned_V1TI_type_node): New macro.
13715 (ptr_unsigned_V8HI_type_node): New macro.
13716 (ptr_unsigned_V4SI_type_node): New macro.
13717 (ptr_unsigned_V2DI_type_node): New macro.
13718 (ptr_bool_V16QI_type_node): New macro.
13719 (ptr_bool_V8HI_type_node): New macro.
13720 (ptr_bool_V4SI_type_node): New macro.
13721 (ptr_bool_V2DI_type_node): New macro.
13722 (ptr_bool_V1TI_type_node): New macro.
13723 (ptr_pixel_type_node): New macro.
13724 (ptr_intQI_type_node): New macro.
13725 (ptr_uintQI_type_node): New macro.
13726 (ptr_intHI_type_node): New macro.
13727 (ptr_uintHI_type_node): New macro.
13728 (ptr_intSI_type_node): New macro.
13729 (ptr_uintSI_type_node): New macro.
13730 (ptr_intDI_type_node): New macro.
13731 (ptr_uintDI_type_node): New macro.
13732 (ptr_intTI_type_node): New macro.
13733 (ptr_uintTI_type_node): New macro.
13734 (ptr_long_integer_type_node): New macro.
13735 (ptr_long_unsigned_type_node): New macro.
13736 (ptr_float_type_node): New macro.
13737 (ptr_double_type_node): New macro.
13738 (ptr_long_double_type_node): New macro.
13739 (ptr_dfloat64_type_node): New macro.
13740 (ptr_dfloat128_type_node): New macro.
13741 (ptr_ieee128_type_node): New macro.
13742 (ptr_ibm128_type_node): New macro.
13743 (ptr_vector_pair_type_node): New macro.
13744 (ptr_vector_quad_type_node): New macro.
13745 (ptr_long_long_integer_type_node): New macro.
13746 (ptr_long_long_unsigned_type_node): New macro.
13748 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13750 * config/rs6000/rs6000-builtin-new.def: Add power9-vector, power9,
13751 and power9-64 stanzas.
13753 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
13754 Tom de Vries <tdevries@suse.de>
13756 * config.gcc (nvptx-*-*): Define {c,c++}_target_objs.
13757 * config/nvptx/nvptx-protos.h (nvptx_cpu_cpp_builtins): Prototype.
13758 * config/nvptx/nvptx.h (TARGET_CPU_CPP_BUILTINS): Implement with
13759 a call to the new nvptx_cpu_cpp_builtins function in nvptx-c.c.
13760 * config/nvptx/t-nvptx (nvptx-c.o): New rule.
13761 * config/nvptx/nvptx-c.c: New source file.
13762 (nvptx_cpu_cpp_builtins): Move implementation here.
13764 2021-08-24 Martin Sebor <msebor@redhat.com>
13766 PR middle-end/101600
13767 PR middle-end/101977
13768 * gimple-ssa-warn-access.cc (maybe_warn_for_bound): Tighten up
13769 the phrasing of a warning.
13770 (check_access): Use the remaining size after subtracting any offset
13771 rather than the whole object size.
13772 * pointer-query.cc (access_ref::get_ref): Clear BASE0 flag if it's
13773 clear for any nonnull PHI argument.
13774 (compute_objsize): Clear argument.
13776 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13778 * config/rs6000/rs6000-builtin-new.def: Add power8-vector stanza.
13780 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13782 * config/rs6000/rs6000-builtin-new.def: Add power7 and power7-64
13785 2021-08-24 Andrew MacLeod <amacleod@redhat.com>
13787 * value-relation.cc (rr_transitive_table): New.
13788 (relation_transitive): New.
13789 (value_relation::swap): Remove.
13790 (value_relation::apply_transitive): New.
13791 (relation_oracle::relation_oracle): Allocate a new tmp bitmap.
13792 (relation_oracle::register_relation): Call register_transitives.
13793 (relation_oracle::register_transitives): New.
13794 * value-relation.h (relation_oracle): Add new temporary bitmap and
13797 2021-08-24 H.J. Lu <hjl.tools@gmail.com>
13800 * config/i386/i386-expand.c (ix86_expand_vector_move): Broadcast
13801 from integer to a pseudo vector register.
13803 2021-08-24 Richard Biener <rguenther@suse.de>
13805 PR tree-optimization/100089
13806 * tree-vectorizer.h (vect_slp_bb): Rename to ...
13807 (vect_slp_if_converted_bb): ... this and get the original
13808 loop as new argument.
13809 * tree-vectorizer.c (try_vectorize_loop_1): Revert previous fix,
13810 pass original loop to vect_slp_if_converted_bb.
13811 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
13812 If orig_loop was passed scan the not vectorized stmts
13813 for COND_EXPRs and force not profitable if found.
13814 (vect_slp_region): Pass down all SLP instances to costing
13815 if orig_loop was specified.
13816 (vect_slp_bbs): Pass through orig_loop.
13817 (vect_slp_bb): Rename to ...
13818 (vect_slp_if_converted_bb): ... this and get the original
13819 loop as new argument.
13820 (vect_slp_function): Adjust.
13822 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
13825 * config/arm/arm.md (attribute arch): Add fix_vlldm.
13826 (arch_enabled): Use it.
13827 * config/arm/vfp.md (lazy_store_multiple_insn): Add alternative to
13828 use when erratum mitigation is needed.
13830 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
13833 * config/arm/arm.opt (mfix-cmse-cve-2021-35465): New option.
13834 * doc/invoke.texi (Arm Options): Document it.
13835 * config/arm/arm-cpus.in (quirk_vlldm): New feature bit.
13836 (ALL_QUIRKS): Add quirk_vlldm.
13837 (cortex-m33): Add quirk_vlldm.
13838 (cortex-m35p, cortex-m55): Likewise.
13839 * config/arm/arm.c (arm_option_override): Enable fix_vlldm if
13840 targetting an affected CPU and not explicitly controlled on
13843 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
13845 * config/arm/vfp.md (lazy_store_multiple_insn): Rewrite as valid RTL.
13846 (lazy_load_multiple_insn): Likewise.
13848 2021-08-24 liuhongt <hongtao.liu@intel.com>
13851 * config/i386/sse.md (<avx512>_vternlog<mode><sd_maskz_name>):
13852 Enable avx512 embedded broadcast.
13853 (*<avx512>_vternlog<mode>_all): Ditto.
13854 (<avx512>_vternlog<mode>_mask): Ditto.
13856 2021-08-24 liuhongt <hongtao.liu@intel.com>
13859 * config/i386/i386.c (ix86_rtx_costs): Define cost for
13861 * config/i386/i386.h (STRIP_UNARY): New macro.
13862 * config/i386/predicates.md (reg_or_notreg_operand): New
13864 * config/i386/sse.md (*<avx512>_vternlog<mode>_all): New define_insn.
13865 (*<avx512>_vternlog<mode>_1): New pre_reload
13866 define_insn_and_split.
13867 (*<avx512>_vternlog<mode>_2): Ditto.
13868 (*<avx512>_vternlog<mode>_3): Ditto.
13869 (any_logic1,any_logic2): New code iterator.
13870 (logic_op): New code attribute.
13871 (ternlogsuffix): Extend to VNxDF and VNxSF.
13873 2021-08-24 Richard Biener <rguenther@suse.de>
13875 * doc/invoke.texi (vect-inner-loop-cost-factor): Adjust.
13876 * params.opt (--param vect-inner-loop-cost-factor): Adjust
13878 * tree-vect-loop.c (vect_analyze_loop_form): Initialize
13879 inner_loop_cost_factor to the minimum of the estimated number
13880 of iterations of the inner loop and vect-inner-loop-cost-factor.
13882 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
13883 Richard Biener <rguenther@suse.de>
13885 * config/i386/i386-features.c (compute_convert_gain): Provide
13886 more accurate values for CONST_INT, when optimizing for size.
13887 * config/i386/i386.c (COSTS_N_BYTES): Move definition from here...
13888 * config/i386/i386.h (COSTS_N_BYTES): to here.
13890 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
13891 Jakub Jelinek <jakub@redhat.com>
13893 PR middle-end/102029
13894 * match.pd (shift transformations): Add an additional check for
13895 !POINTER_TYPE_P in the recently added left shift transformation.
13897 2021-08-24 liuhongt <hongtao.liu@intel.com>
13899 PR tree-optimization/100089
13900 * tree-vectorizer.c (try_vectorize_loop_1): Disable slp in
13901 loop vectorizer when cost model is very-cheap.
13903 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
13905 * config/rs6000/rs6000-gen-builtins.c (parse_bif_entry): Don't call
13906 asprintf, which is not available on AIX.
13908 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
13910 * config.gcc (target_gtfiles): Add ./rs6000-builtins.h.
13911 * config/rs6000/t-rs6000 (EXTRA_GTYPE_DEPS): Set.
13913 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
13915 * config.gcc (powerpc*-*-*): Add rs6000-builtins.o to extra_objs.
13916 * config/rs6000/rs6000-gen-builtins.c (main): Close init_file
13918 * config/rs6000/t-rs6000 (rs6000-gen-builtins.o): New target.
13919 (rbtree.o): Likewise.
13920 (rs6000-gen-builtins): Likewise.
13921 (rs6000-builtins.c): Likewise.
13922 (rs6000-builtins.h): Likewise.
13923 (rs6000.o): Add dependency.
13924 (EXTRA_HEADERS): Add rs6000-vecdefines.h.
13925 (rs6000-vecdefines.h): New target.
13926 (rs6000-builtins.o): Likewise.
13927 (rs6000-call.o): Add rs6000-builtins.h as a dependency.
13928 (rs6000-c.o): Likewise.
13930 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
13933 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace):
13934 Diagnose buffer overrun.
13935 (safe_inc_pos): Fix overrun detection.
13936 (match_identifier): Diagnose buffer overrun.
13937 (match_integer): Likewise.
13938 (match_to_right_bracket): Likewise.
13940 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
13942 * ipa-modref-tree.h (modref_access_node::range_info_useful_p):
13943 Improve range compare.
13944 (modref_access_node::contains): New member function.
13945 (modref_access_node::search): Remove.
13946 (modref_access_node::insert): Be smarter about subaccesses.
13948 2021-08-23 Thomas Schwinge <thomas@codesourcery.com>
13950 * config/i386/i386-options.c (ix86_omp_device_kind_arch_isa)
13951 <omp_device_arch> [ACCEL_COMPILER]: Match "intel_mic".
13952 * config/i386/t-omp-device (omp-device-properties-i386) <arch>:
13955 2021-08-23 Jeff Law <jlaw@localhost.localdomain>
13957 * config/h8300/h8300-protos.h (h8300_expand_epilogue): Add new
13959 * config/h8300/jumpcall.md (call, call_value): Restrict to
13960 !SIBLING_CALL_P cases.
13961 (subcall, sibcall_value): New patterns & expanders.
13962 * config/h8300/proepi.md (epilogue): Pass new argument to
13963 h8300_expand_epilogue.
13964 (sibcall_epilogue): New expander.
13965 * config/h8300/h8300.c (h8300_expand_epilogue): Handle sibcall
13967 (h8300_ok_for_sibcall_p): New function.
13968 (TARGET_FUNCTION_OK_FOR_SIBCALL): define.
13970 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
13972 * simplify-rtx.c (simplify_unary_operation_1): [TRUNCATE]:
13973 Handle case where the operand is already the desired mode.
13975 2021-08-23 Richard Biener <rguenther@suse.de>
13978 * tree-ssa-structalias.c (ipa_pta_execute): Check in_other_partition
13979 in addition to has_gimple_body.
13981 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
13983 PR middle-end/101949
13984 * ipa-modref.c (analyze_ssa_name_flags): Fix merging of
13987 2021-08-23 Martin Liska <mliska@suse.cz>
13989 * doc/invoke.texi: Put the option out of -mxl-mode-app-model
13992 2021-08-23 Richard Biener <rguenther@suse.de>
13994 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
13995 Properly scale the inner loop cost only once.
13997 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
13999 * tree-ssa-ccp.c (bit_value_binop) [TRUNC_MOD_EXPR, TRUNC_DIV_EXPR]:
14000 Provide bounds for unsigned (and signed with non-negative operands)
14001 division and modulus.
14003 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14005 * simplify-rtx.c (simplify_truncation): Generalize simplification
14006 of (truncate:A (subreg:B X)).
14007 (simplify_unary_operation_1) [FLOAT_TRUNCATE, FLOAT_EXTEND,
14008 SIGN_EXTEND, ZERO_EXTEND]: Handle cases where the operand
14009 already has the desired machine mode.
14010 (test_scalar_int_ops): Add tests that useless extensions and
14011 truncations are optimized away.
14012 (test_scalar_int_ext_ops): New self-test function to confirm
14013 that truncations of extensions are correctly simplified.
14014 (test_scalar_int_ext_ops2): New self-test function to check
14015 truncations of truncations, extensions of extensions, and
14016 truncations of extensions.
14017 (test_scalar_ops): Call the above two functions with a
14018 representative sampling of integer machine modes.
14020 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
14022 * match.pd (shift transformations): Change the sign of an
14023 LSHIFT_EXPR if it reduces the number of explicit conversions.
14025 2021-08-23 Jakub Jelinek <jakub@redhat.com>
14027 PR tree-optimization/86723
14028 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Add
14029 cast64_to_32 argument, set *cast64_to_32 to false, unless n is
14030 non-memory permutation of 64-bit src which only has bytes of
14031 0 or [5..8] and n->range is 4.
14032 (find_bswap_or_nop): Add cast64_to_32 and mask arguments, adjust
14033 find_bswap_or_nop_finalize caller, support bswap with some bytes
14034 zeroed, as long as at least two bytes are not zeroed.
14035 (bswap_replace): Add mask argument and handle masking of bswap
14037 (maybe_optimize_vector_constructor): Adjust find_bswap_or_nop
14038 caller, punt if cast64_to_32 or mask is not all ones.
14039 (pass_optimize_bswap::execute): Adjust find_bswap_or_nop_finalize
14040 caller, for now punt if cast64_to_32.
14042 2021-08-23 Richard Biener <rguenther@suse.de>
14044 PR tree-optimization/79334
14045 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Record
14046 a type also for COMPONENT_REFs.
14047 (vn_reference_may_trap): Check ARRAY_REF with constant index
14048 against the array domain.
14050 2021-08-23 liuhongt <hongtao.liu@intel.com>
14053 * config/i386/sse.md (*avx512f_pshufb_truncv8hiv8qi_1): Add
14054 TARGET_AVX512BW to condition.
14056 2021-08-23 Jakub Jelinek <jakub@redhat.com>
14059 * dwarf2out.c (gen_variable_die): Add DW_AT_location for global
14060 register variables already during early_dwarf if possible.
14062 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
14064 * config/arm/arm_mve.h: Fix __arm_vctp16q return type.
14066 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
14069 * config/arm/arm.opt: Fix typo.
14070 * config/arm/t-rmprofile: Fix typo.
14072 2021-08-23 Jakub Jelinek <jakub@redhat.com>
14074 * tree.h (OMP_CLAUSE_GRAINSIZE_STRICT): Define.
14075 (OMP_CLAUSE_NUM_TASKS_STRICT): Define.
14076 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_GRAINSIZE,
14077 case OMP_CLAUSE_NUM_TASKS>: Print strict: modifier.
14078 * omp-expand.c (expand_task_call): Use GOMP_TASK_FLAG_STRICT in iflags
14079 if either grainsize or num_tasks clause has the strict modifier.
14081 2021-08-23 Martin Liska <mliska@suse.cz>
14083 * dbgcnt.def (DEBUG_COUNTER): New counter.
14084 * gimple.c (gimple_call_arg_flags): Use it in IPA PTA.
14086 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
14088 * ipa-modref.c (analyze_ssa_name_flags): Improve handling of return slot.
14090 2021-08-23 Xi Ruoyao <xry111@mengyan1223.wang>
14093 * config/mips/mips-protos.h (mips_msa_output_shift_immediate):
14095 * config/mips/mips.c (mips_msa_output_shift_immediate): New
14097 * config/mips/mips-msa.md (vashl<mode>3, vashr<mode>3,
14098 vlshr<mode>3): Call it.
14100 2021-08-22 Jan Hubicka <hubicka@ucw.cz>
14101 Martin Liska <mliska@suse.cz>
14103 PR middle-end/101949
14104 * ipa-modref.c (analyze_ssa_name_flags): Indirect call implies
14107 2021-08-21 Dragan Mladjenovic <OT_Dragan.Mladjenovic@mediatek.com>
14109 * config/mips/mips.c (mips_function_rodata_section,
14110 TARGET_ASM_FUNCTION_RODATA_SECTION): Removed.
14112 2021-08-21 John David Anglin <danglin@gcc.gnu.org>
14114 * config/pa/pa.c (pa_asm_output_aligned_common): Remove warning.
14116 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14118 * configure.ac (thread-local storage support): Remove tls_first_major
14119 and tls_first_minor. Use "$conftest_s" to check support.
14120 * configure: Regenerate.
14122 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14124 * configure.ac: Fixup formatting.
14126 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14128 * acinclude.m4 (gcc_GAS_CHECK_FEATURE): Remove third argument and ...
14129 * configure.ac: ... update all callers.
14131 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
14134 * acinclude.m4 (_gcc_COMPUTE_GAS_VERSION, _gcc_GAS_VERSION_GTE_IFELSE)
14135 (gcc_GAS_VERSION_GTE_IFELSE): Remove.
14136 (gcc_GAS_CHECK_FEATURE): Do not handle in-tree case specially.
14137 * configure.ac: Remove gcc_cv_gas_major_version, gcc_cv_gas_minor_version.
14138 Remove remaining checks for in-tree assembler.
14139 * configure: Regenerate.
14141 2021-08-20 Jeff Law <jlaw@localhost.localdomain>
14143 * config/h8300/h8300.c (shift_alg_hi): Improve arithmetic shift right
14144 by 15 bits for H8/300H and H8/S. Improve logical shifts by 12
14146 (shift_alg_si): Improve arithmetic right shift by 28-30 bits for
14147 H8/300H. Improve arithmetic shift right by 15 bits for H8/S.
14148 Improve logical shifts by 27 bits for H8/S.
14149 (get_shift_alg): Corresponding changes.
14150 (h8300_option_override): Revert to loops for -Os when profitable.
14152 2021-08-20 Richard Biener <rguenther@suse.de>
14154 * tree-vect-data-refs.c (dr_group_sort_cmp): Do not compare
14156 (vect_analyze_data_ref_accesses): Likewise. Assign the BB
14157 index as group_id when dataref_groups were not computed.
14158 * tree-vect-slp.c (vect_slp_bbs): Bump current_group when
14159 we advace to the next BB.
14161 2021-08-20 Jakub Jelinek <jakub@redhat.com>
14163 * omp-builtins.def (BUILT_IN_GOMP_WARNING, BUILT_IN_GOMP_ERROR): New
14166 2021-08-20 Martin Liska <mliska@suse.cz>
14168 PR gcov-profile/89961
14169 * gcov.c (make_gcov_file_name): Rewrite using std::string.
14170 (mangle_name): Simplify, do not used the second argument.
14171 (strip_extention): New function.
14172 (get_md5sum): Likewise.
14173 (get_gcov_intermediate_filename): Handle properly -p and -x
14175 (output_gcov_file): Use string type.
14176 (generate_results): Likewise.
14177 (md5sum_to_hex): Remove.
14179 2021-08-20 Michael Meissner <meissner@linux.ibm.com>
14181 * config/rs6000/altivec.md (UNSPEC_XXEVAL): Move to vsx.md.
14182 (UNSPEC_XXSPLTIW): Move to vsx.md.
14183 (UNSPEC_XXSPLTID): Move to vsx.md.
14184 (UNSPEC_XXSPLTI32DX): Move to vsx.md.
14185 (UNSPEC_XXBLEND): Move to vsx.md.
14186 (UNSPEC_XXPERMX): Move to vsx.md.
14187 (VM3): Move to vsx.md.
14188 (VM3_char): Move to vsx.md.
14189 (xxspltiw_v4si): Move to vsx.md.
14190 (xxspltiw_v4sf): Move to vsx.md.
14191 (xxspltiw_v4sf_inst): Move to vsx.md.
14192 (xxspltidp_v2df): Move to vsx.md.
14193 (xxspltidp_v2df_inst): Move to vsx.md.
14194 (xxsplti32dx_v4si_inst): Move to vsx.md.
14195 (xxsplti32dx_v4sf): Move to vsx.md.
14196 (xxsplti32dx_v4sf_inst): Move to vsx.md.
14197 (xxblend_<mode>): Move to vsx.md.
14198 (xxpermx): Move to vsx.md.
14199 (xxpermx_inst): Move to vsx.md.
14200 * config/rs6000/vsx.md (UNSPEC_XXEVAL): Move from altivec.md.
14201 (UNSPEC_XXSPLTIW): Move from altivec.md.
14202 (UNSPEC_XXSPLTID): Move from altivec.md.
14203 (UNSPEC_XXSPLTI32DX): Move from altivec.md.
14204 (UNSPEC_XXBLEND): Move from altivec.md.
14205 (UNSPEC_XXPERMX): Move from altivec.md.
14206 (VM3): Move from altivec.md.
14207 (VM3_char): Move from altivec.md.
14208 (xxspltiw_v4si): Move from altivec.md.
14209 (xxspltiw_v4sf): Move from altivec.md.
14210 (xxspltiw_v4sf_inst): Move from altivec.md.
14211 (xxspltidp_v2df): Move from altivec.md.
14212 (xxspltidp_v2df_inst): Move from altivec.md.
14213 (xxsplti32dx_v4si_inst): Move from altivec.md.
14214 (xxsplti32dx_v4sf): Move from altivec.md.
14215 (xxsplti32dx_v4sf_inst): Move from altivec.md.
14216 (xxblend_<mode>): Move from altivec.md.
14217 (xxpermx): Move from altivec.md.
14218 (xxpermx_inst): Move from altivec.md.
14220 2021-08-19 Roger Sayle <roger@nextmovesoftware.com>
14222 * tree-vect-generic.c (expand_vector_operations_1): Use either
14223 gimplify_build1 or gimplify_build2 instead of gimple_build_assign
14224 when constructing scalar splat expressions.
14226 2021-08-19 Peter Bergner <bergner@linux.ibm.com>
14229 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Cast
14230 pointer to __vector_pair *.
14232 2021-08-19 Martin Sebor <msebor@redhat.com>
14234 * gimple-range.cc: Add comments.
14235 * gimple-range.h: Same.
14237 2021-08-19 Martin Sebor <msebor@redhat.com>
14239 PR middle-end/101984
14240 * gimple-ssa-warn-access.cc (pass_waccess::execute): Also call
14243 2021-08-19 Jeff Law <jlaw@localhost.localdomain>
14245 * config.gcc (h8300-*-elf*): Do not include dbxelf.h.
14246 (h8300-*-linux*, v850-*-rtems*, v850*-elf*): Likewise.
14247 * config/v850/v850.h (DEFAULT_GDB_EXTENSIONS): Remove.
14249 2021-08-19 Jakub Jelinek <jakub@redhat.com>
14251 PR middle-end/101950
14252 * optabs.c (expand_clrsb_using_clz): New function.
14253 (expand_unop): Use it as another clrsb expansion fallback.
14255 2021-08-19 liuhongt <hongtao.liu@intel.com>
14258 2021-07-28 liuhongt <hongtao.liu@intel.com>
14261 * config/i386/i386.h (processor_costs): Add new member
14263 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
14264 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
14265 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
14266 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
14267 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
14268 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
14269 generic_cost, core_cost): Initialize integer_to_sse same value
14271 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
14272 * config/i386/i386.c (ix86_builtin_vectorization_cost):
14273 Use integer_to_sse instead of sse_op to calculate the cost of
14276 2021-08-18 Iain Sandoe <iain@sandoe.co.uk>
14278 * config.gcc: Include rpath.opt for Darwin.
14279 * config/darwin.h (DRIVER_SELF_SPECS): Handle -rpath.
14281 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
14283 PR bootstrap/101959
14284 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
14287 2021-08-18 Jonathan Wright <jonathan.wright@arm.com>
14289 * config/aarch64/arm_neon.h (vld3_lane_f64): Use float RTL
14290 pattern and type cast.
14291 (vld4_lane_f32): Use float RTL pattern.
14292 (vld4q_lane_f64): Use float type cast.
14294 2021-08-18 Jan Hubicka <hubicka@ucw.cz>
14296 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Check also
14299 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
14301 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor): Extend.
14302 (test_map_of_type_with_ctor_and_dtor_expand): Add function.
14303 (hash_map_tests_c_tests): Call it.
14305 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
14307 * ggc.h (enum ggc_collect): New.
14308 (ggc_collect): Use it.
14309 * ggc-page.c: Adjust.
14310 * ggc-common.c: Likewise.
14311 * ggc-tests.c: Likewise.
14312 * read-rtl-function.c: Likewise.
14313 * selftest-run-tests.c: Likewise.
14314 * doc/gty.texi (Invoking the garbage collector): Likewise.
14316 2021-08-18 liuhongt <hongtao.liu@intel.com>
14319 * config/i386/i386.h (TARGET_V2DF_REDUCTION_PREFER_HADDPD):
14321 * config/i386/sse.md (*sse3_haddv2df3_low): Add
14322 TARGET_V2DF_REDUCTION_PREFER_HADDPD.
14323 (*sse3_hsubv2df3_low): Ditto.
14324 * config/i386/x86-tune.def
14325 (X86_TUNE_V2DF_REDUCTION_PREFER_HADDPD): New tune.
14327 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14329 * gimple-range-gori.cc (gori_compute::gori_compute): Enable tracing.
14330 (gori_compute::compute_operand_range): Add tracing.
14331 (gori_compute::logical_combine): Ditto.
14332 (gori_compute::compute_logical_operands): Ditto.
14333 (gori_compute::compute_operand1_range): Ditto.
14334 (gori_compute::compute_operand2_range): Ditto.
14335 (gori_compute::outgoing_edge_range_p): Ditto.
14336 * gimple-range-gori.h (class gori_compute): Add range_tracer.
14338 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14340 * flag-types.h (enum evrp_mode): Adjust evrp-mode values.
14341 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Relocate from.
14342 * gimple-range-trace.h (DEBUG_RANGE_CACHE): Here.
14343 * params.opt (--param=evrp-mode): Adjust options.
14345 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14347 * Makefile.in (OBJS): Add gimple-range-trace.o.
14348 * gimple-range-cache.h (enable_new_values): Remove unused prototype.
14349 * gimple-range-fold.cc: Adjust headers.
14350 * gimple-range-trace.cc: New.
14351 * gimple-range-trace.h: New.
14352 * gimple-range.cc (gimple_ranger::gimple_ranger): Enable tracer.
14353 (gimple_ranger::range_of_expr): Add tracing.
14354 (gimple_ranger::range_on_entry): Ditto.
14355 (gimple_ranger::range_on_exit): Ditto.
14356 (gimple_ranger::range_on_edge): Ditto.
14357 (gimple_ranger::fold_range_internal): Ditto.
14358 (gimple_ranger::dump_bb): Do not calculate edge range twice.
14359 (trace_ranger::*): Remove.
14360 (enable_ranger): Never create a trace_ranger.
14361 (debug_seed_ranger): Move to gimple-range-trace.cc.
14362 (dump_ranger): Ditto.
14363 (debug_ranger): Ditto.
14364 * gimple-range.h: Include gimple-range-trace.h.
14365 (range_on_entry, range_on_exit): No longer virtual.
14366 (class trace_ranger): Remove.
14367 (DEBUG_RANGE_CACHE): Move to gimple-range-trace.h.
14369 2021-08-17 Martin Sebor <msebor@redhat.com>
14371 PR middle-end/101854
14372 * builtins.c (expand_builtin_alloca): Move warning code to check_alloca
14373 in gimple-ssa-warn-access.cc.
14374 * calls.c (alloc_max_size): Move code to check_alloca.
14375 (get_size_range): Move to pointer-query.cc.
14376 (maybe_warn_alloc_args_overflow): Move to gimple-ssa-warn-access.cc.
14377 (get_attr_nonstring_decl): Move to tree.c.
14378 (fntype_argno_type): Move to gimple-ssa-warn-access.cc.
14379 (append_attrname): Same.
14380 (maybe_warn_rdwr_sizes): Same.
14381 (initialize_argument_information): Move code to
14382 gimple-ssa-warn-access.cc.
14383 * calls.h (maybe_warn_alloc_args_overflow): Move to
14384 gimple-ssa-warn-access.h.
14385 (get_attr_nonstring_decl): Move to tree.h.
14386 (maybe_warn_nonstring_arg): Move to gimple-ssa-warn-access.h.
14387 (enum size_range_flags): Move to pointer-query.h.
14388 (get_size_range): Same.
14389 * gimple-ssa-warn-access.cc (has_location): Remove unused overload
14390 to avoid Clang -Wunused-function.
14391 (get_size_range): Declare static.
14392 (maybe_emit_free_warning): Rename...
14393 (maybe_check_dealloc_call): ...to this for consistency.
14394 (class pass_waccess): Add members.
14395 (pass_waccess::~pass_waccess): Defined.
14396 (alloc_max_size): Move here from calls.c.
14397 (maybe_warn_alloc_args_overflow): Same.
14398 (check_alloca): New function.
14399 (check_alloc_size_call): New function.
14400 (check_strncat): Handle another warning flag.
14401 (pass_waccess::check_builtin): Handle alloca.
14402 (fntype_argno_type): Move here from calls.c.
14403 (append_attrname): Same.
14404 (maybe_warn_rdwr_sizes): Same.
14405 (pass_waccess::check_call): Define.
14406 (check_nonstring_args): New function.
14407 (pass_waccess::check): Call new member functions.
14408 (pass_waccess::execute): Enable ranger.
14409 * gimple-ssa-warn-access.h (get_size_range): Move here from calls.h.
14410 (maybe_warn_nonstring_arg): Same.
14411 * gimple-ssa-warn-restrict.c: Remove #include.
14412 * pointer-query.cc (get_size_range): Move here from calls.c.
14413 * pointer-query.h (enum size_range_flags): Same.
14414 (get_size_range): Same.
14415 * tree.c (get_attr_nonstring_decl): Move here from calls.c.
14416 * tree.h (get_attr_nonstring_decl): Move here from calls.h.
14418 2021-08-17 Thomas Schwinge <thomas@codesourcery.com>
14420 * ggc.h (ggc_collect): Add 'force_collect' parameter.
14421 * ggc-page.c (ggc_collect): Use that one instead of global
14422 'ggc_force_collect'. Adjust all users.
14423 * doc/gty.texi (Invoking the garbage collector): Update.
14424 * ggc-internal.h (ggc_force_collect): Remove.
14425 * ggc-common.c (ggc_force_collect): Likewise.
14426 * selftest.h (forcibly_ggc_collect): Remove.
14427 * ggc-tests.c (selftest::forcibly_ggc_collect): Likewise.
14428 * read-rtl-function.c (test_loading_labels): Adjust.
14429 * selftest-run-tests.c (run_tests): Likewise.
14431 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
14433 * config/darwin.c (darwin_file_end): Reset and reclaim the
14434 section names table at the end of compile.
14436 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
14439 * config.in: Regenerate.
14440 * config/i386/darwin.h (EXTRA_ASM_OPTS): New
14441 (ASM_SPEC): Pass options to disable branch shortening where
14443 * configure: Regenerate.
14444 * configure.ac: Detect versions of 'as' that support the
14445 optimisation which has the bug.
14447 2021-08-17 Richard Biener <rguenther@suse.de>
14449 * optabs-query.c (supports_vec_gather_load_p): Also check
14451 (supports_vec_scatter_store_p): Likewise.
14452 * tree-vect-data-refs.c (vect_gather_scatter_fn_p): Fall
14453 back to masked variants if non-masked are not supported.
14454 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
14455 When we need to use masked gather/scatter but do not have
14456 a mask set up a constant true one.
14457 * tree-vect-stmts.c (vect_check_scalar_mask): Also allow
14458 non-SSA_NAME masks.
14460 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
14462 * tree-ssa-ccp.c (bit_value_binop) [MINUS_EXPR]: Use same
14463 algorithm as PLUS_EXPR to improve subtraction bit bounds.
14464 [POINTER_DIFF_EXPR]: Treat as synonymous with MINUS_EXPR.
14466 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
14468 * tree-ssa-ccp.c (bit_value_mult_const): New helper function to
14469 calculate the mask-value pair result of a multiplication by an
14471 (bit_value_binop) [MULT_EXPR]: Call it from here for
14472 multiplications by (sparse) non-negative constants.
14474 2021-08-17 Christophe Lyon <christophe.lyon@foss.st.com>
14477 * config.gcc (gcc_cv_initfini_array): Leave undefined for
14478 uclinuxfdpiceabi targets.
14480 2021-08-17 Alexandre Oliva <oliva@adacore.com>
14482 * tree-inline.c (maybe_move_debug_stmts_to_successors): Don't
14483 reverse debug stmts.
14485 2021-08-17 Alexandre Oliva <oliva@adacore.com>
14487 * tree-cfg.c (dump_function_to_file): Use fun, not cfun.
14489 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
14491 * config/aarch64/arm_neon.h (__LD4_LANE_FUNC): Delete.
14492 (__LD4Q_LANE_FUNC): Likewise.
14493 (vld4_lane_u8): Define without macro.
14494 (vld4_lane_u16): Likewise.
14495 (vld4_lane_u32): Likewise.
14496 (vld4_lane_u64): Likewise.
14497 (vld4_lane_s8): Likewise.
14498 (vld4_lane_s16): Likewise.
14499 (vld4_lane_s32): Likewise.
14500 (vld4_lane_s64): Likewise.
14501 (vld4_lane_f16): Likewise.
14502 (vld4_lane_f32): Likewise.
14503 (vld4_lane_f64): Likewise.
14504 (vld4_lane_p8): Likewise.
14505 (vld4_lane_p16): Likewise.
14506 (vld4_lane_p64): Likewise.
14507 (vld4q_lane_u8): Likewise.
14508 (vld4q_lane_u16): Likewise.
14509 (vld4q_lane_u32): Likewise.
14510 (vld4q_lane_u64): Likewise.
14511 (vld4q_lane_s8): Likewise.
14512 (vld4q_lane_s16): Likewise.
14513 (vld4q_lane_s32): Likewise.
14514 (vld4q_lane_s64): Likewise.
14515 (vld4q_lane_f16): Likewise.
14516 (vld4q_lane_f32): Likewise.
14517 (vld4q_lane_f64): Likewise.
14518 (vld4q_lane_p8): Likewise.
14519 (vld4q_lane_p16): Likewise.
14520 (vld4q_lane_p64): Likewise.
14521 (vld4_lane_bf16): Likewise.
14522 (vld4q_lane_bf16): Likewise.
14524 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
14526 * config/aarch64/arm_neon.h (__LD3_LANE_FUNC): Delete.
14527 (__LD3Q_LANE_FUNC): Delete.
14528 (vld3_lane_u8): Define without macro.
14529 (vld3_lane_u16): Likewise.
14530 (vld3_lane_u32): Likewise.
14531 (vld3_lane_u64): Likewise.
14532 (vld3_lane_s8): Likewise.
14533 (vld3_lane_s16): Likewise.
14534 (vld3_lane_s32): Likewise.
14535 (vld3_lane_s64): Likewise.
14536 (vld3_lane_f16): Likewise.
14537 (vld3_lane_f32): Likewise.
14538 (vld3_lane_f64): Likewise.
14539 (vld3_lane_p8): Likewise.
14540 (vld3_lane_p16): Likewise.
14541 (vld3_lane_p64): Likewise.
14542 (vld3q_lane_u8): Likewise.
14543 (vld3q_lane_u16): Likewise.
14544 (vld3q_lane_u32): Likewise.
14545 (vld3q_lane_u64): Likewise.
14546 (vld3q_lane_s8): Likewise.
14547 (vld3q_lane_s16): Likewise.
14548 (vld3q_lane_s32): Likewise.
14549 (vld3q_lane_s64): Likewise.
14550 (vld3q_lane_f16): Likewise.
14551 (vld3q_lane_f32): Likewise.
14552 (vld3q_lane_f64): Likewise.
14553 (vld3q_lane_p8): Likewise.
14554 (vld3q_lane_p16): Likewise.
14555 (vld3q_lane_p64): Likewise.
14556 (vld3_lane_bf16): Likewise.
14557 (vld3q_lane_bf16): Likewise.
14559 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
14561 * config/aarch64/arm_neon.h (__LD2_LANE_FUNC): Delete.
14562 (__LD2Q_LANE_FUNC): Likewise.
14563 (vld2_lane_u8): Define without macro.
14564 (vld2_lane_u16): Likewise.
14565 (vld2_lane_u32): Likewise.
14566 (vld2_lane_u64): Likewise.
14567 (vld2_lane_s8): Likewise.
14568 (vld2_lane_s16): Likewise.
14569 (vld2_lane_s32): Likewise.
14570 (vld2_lane_s64): Likewise.
14571 (vld2_lane_f16): Likewise.
14572 (vld2_lane_f32): Likewise.
14573 (vld2_lane_f64): Likewise.
14574 (vld2_lane_p8): Likewise.
14575 (vld2_lane_p16): Likewise.
14576 (vld2_lane_p64): Likewise.
14577 (vld2q_lane_u8): Likewise.
14578 (vld2q_lane_u16): Likewise.
14579 (vld2q_lane_u32): Likewise.
14580 (vld2q_lane_u64): Likewise.
14581 (vld2q_lane_s8): Likewise.
14582 (vld2q_lane_s16): Likewise.
14583 (vld2q_lane_s32): Likewise.
14584 (vld2q_lane_s64): Likewise.
14585 (vld2q_lane_f16): Likewise.
14586 (vld2q_lane_f32): Likewise.
14587 (vld2q_lane_f64): Likewise.
14588 (vld2q_lane_p8): Likewise.
14589 (vld2q_lane_p16): Likewise.
14590 (vld2q_lane_p64): Likewise.
14591 (vld2_lane_bf16): Likewise.
14592 (vld2q_lane_bf16): Likewise.
14594 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
14596 * haifa-sched.c (advance_one_cycle): Output more context-synchronization
14599 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
14601 * haifa-sched.c (enum rfs_decision, rfs_str): Add RFS_AUTOPREF.
14602 (rank_for_schedule): Use it.
14604 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
14606 PR rtl-optimization/91598
14607 * haifa-sched.c (autopref_rank_for_schedule): Prioritize "irrelevant"
14608 insns after memory reads and before memory writes.
14610 2021-08-17 Alistair_Lee <alistair.lee@arm.com>
14612 * rtl.h (CONST_VECTOR_P): New macro.
14613 * config/aarch64/aarch64.c (aarch64_get_sve_pred_bits): Use RTL
14614 code testing macros.
14615 (aarch64_ptrue_all_mode): Likewise.
14616 (aarch64_expand_mov_immediate): Likewise.
14617 (aarch64_const_vec_all_in_range_p): Likewise.
14618 (aarch64_rtx_costs): Likewise.
14619 (aarch64_legitimate_constant_p): Likewise.
14620 (aarch64_simd_valid_immediate): Likewise.
14621 (aarch64_simd_make_constant): Likewise.
14622 (aarch64_convert_mult_to_shift): Likewise.
14623 (aarch64_expand_sve_vec_perm): Likewise.
14624 (aarch64_vec_fpconst_pow_of_2): Likewise.
14626 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14628 PR tree-optimization/101938
14629 * range-op.cc (operator_abs::op1_range): Special case
14630 -TYPE_MIN_VALUE for flag_wrapv.
14632 2021-08-17 Kewen Lin <linkw@linux.ibm.com>
14634 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Add the cost for
14637 2021-08-17 Jakub Jelinek <jakub@redhat.com>
14639 * tree.def (OMP_SCOPE): New tree code.
14640 * tree.h (OMP_SCOPE_BODY, OMP_SCOPE_CLAUSES): Define.
14641 * tree-nested.c (convert_nonlocal_reference_stmt,
14642 convert_local_reference_stmt, convert_gimple_call): Handle
14644 * tree-pretty-print.c (dump_generic_node): Handle OMP_SCOPE.
14645 * gimple.def (GIMPLE_OMP_SCOPE): New gimple code.
14646 * gimple.c (gimple_build_omp_scope): New function.
14647 (gimple_copy): Handle GIMPLE_OMP_SCOPE.
14648 * gimple.h (gimple_build_omp_scope): Declare.
14649 (gimple_has_substatements): Handle GIMPLE_OMP_SCOPE.
14650 (gimple_omp_scope_clauses, gimple_omp_scope_clauses_ptr,
14651 gimple_omp_scope_set_clauses): New inline functions.
14652 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_SCOPE.
14653 * gimple-pretty-print.c (dump_gimple_omp_scope): New function.
14654 (pp_gimple_stmt_1): Handle GIMPLE_OMP_SCOPE.
14655 * gimple-walk.c (walk_gimple_stmt): Likewise.
14656 * gimple-low.c (lower_stmt): Likewise.
14657 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
14658 (gimplify_scan_omp_clauses): For task reductions, handle OMP_SCOPE
14659 like ORT_WORKSHARE constructs. Adjust diagnostics for %<scope%>
14660 allowing task reductions. Reject inscan reductions on scope.
14661 (omp_find_stores_stmt): Handle GIMPLE_OMP_SCOPE.
14662 (gimplify_omp_workshare, gimplify_expr): Handle OMP_SCOPE.
14663 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_SCOPE.
14664 (estimate_num_insns): Likewise.
14665 * omp-low.c (build_outer_var_ref): Look through GIMPLE_OMP_SCOPE
14666 contexts if var isn't privatized there.
14667 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_SCOPE.
14668 (scan_omp_1_stmt): Likewise.
14669 (maybe_add_implicit_barrier_cancel): Look through outer
14671 (lower_omp_scope): New function.
14672 (lower_omp_task_reductions): Handle OMP_SCOPE.
14673 (lower_omp_1): Handle GIMPLE_OMP_SCOPE.
14674 (diagnose_sb_1, diagnose_sb_2): Likewise.
14675 * omp-expand.c (expand_omp_single): Support also GIMPLE_OMP_SCOPE.
14676 (expand_omp): Handle GIMPLE_OMP_SCOPE.
14677 (omp_make_gimple_edges): Likewise.
14678 * omp-builtins.def (BUILT_IN_GOMP_SCOPE_START): New built-in.
14680 2021-08-17 Richard Biener <rguenther@suse.de>
14682 PR tree-optimization/101925
14683 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Set
14684 reverse on COMPONENT_REF and ARRAY_REF according to
14685 what reverse_storage_order_for_component_p does.
14686 (vn_reference_eq): Compare reversed on reference ops.
14687 (reverse_storage_order_for_component_p): New overload.
14688 (vn_reference_lookup_3): Check reverse_storage_order_for_component_p
14689 on the reference looked up.
14691 2021-08-17 Jeff Law <jlaw@localhost.localdomain>
14693 * config/h8300/h8300.c (shift_alg_si): Avoid loops for most SImode
14694 shifts on the H8/S.
14695 (h8300_option_override): Use loops on H8/S more often when optimizing
14697 (get_shift_alg): Handle new "special" cases on H8/S. Simplify
14698 accordingly. Handle various arithmetic right shifts with special
14699 sequences that we couldn't handle before.
14701 2021-08-16 Jeff Law <jlaw@localhost.localdomain>
14703 * config.gcc (rl78-*-elf*): Do not include dbxelf.h.
14705 2021-08-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
14707 * config/sparc/rtemself.h (SPARC_GCOV_TYPE_SIZE): Define.
14708 * config/sparc/sparc.c (sparc_gcov_type_size): New.
14709 (TARGET_GCOV_TYPE_SIZE): Redefine if SPARC_GCOV_TYPE_SIZE is defined.
14710 * coverage.c (get_gcov_type): Use targetm.gcov_type_size().
14711 * doc/tm.texi (TARGET_GCOV_TYPE_SIZE): Add hook under "Misc".
14712 * doc/tm.texi.in: Regenerate.
14713 * target.def (gcov_type_size): New target hook.
14714 * targhooks.c (default_gcov_type_size): New.
14715 * targhooks.h (default_gcov_type_size): Declare.
14716 * tree-profile.c (gimple_gen_edge_profiler): Use precision of
14718 (gimple_gen_time_profiler): Likewise.
14720 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
14722 * dwarf2out.c (add_scalar_info): Deal with DW_AT_data_bit_offset.
14724 2021-08-16 Tobias Burnus <tobias@codesourcery.com>
14726 PR middle-end/101931
14727 * omp-low.c (omp_runtime_api_call): Update for routines
14728 added in the meanwhile.
14730 2021-08-16 Martin Liska <mliska@suse.cz>
14732 PR tree-optimization/100393
14733 * tree-switch-conversion.c (group_cluster::dump): Use
14734 get_comparison_count.
14735 (jump_table_cluster::find_jump_tables): Pre-compute number of
14736 comparisons and then decrement it. Cache also max_ratio.
14737 (jump_table_cluster::can_be_handled): Change signature.
14738 * tree-switch-conversion.h (get_comparison_count): New.
14740 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
14742 * dwarf2out.c (add_data_member_location_attribute): Use GNAT
14743 encodings only when -fgnat-encodings=all is specified.
14744 (add_bound_info): Likewise.
14745 (add_byte_size_attribute): Likewise.
14746 (gen_member_die): Likewise.
14748 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
14750 * omp-oacc-neuter-broadcast.cc
14751 (execute_omp_oacc_neuter_broadcast): Plug 'par' memory leak.
14753 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
14755 * omp-oacc-neuter-broadcast.cc
14756 (execute_omp_oacc_neuter_broadcast): Clarify memory management for
14759 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
14761 * omp-oacc-neuter-broadcast.cc (field_map): Move variable into...
14762 (execute_omp_oacc_neuter_broadcast): ... here.
14763 (install_var_field, build_receiver_ref, build_sender_ref): Take
14764 'field_map_t *' parameter. Adjust all users.
14765 (worker_single_copy, neuter_worker_single): Take a
14766 'record_field_map_t *' parameter. Adjust all users.
14768 2021-08-16 liuhongt <hongtao.liu@intel.com>
14771 * config/i386/i386.md (ldexp<mode>3): Force operands[1] to
14774 2021-08-16 Martin Liska <mliska@suse.cz>
14777 * multiple_target.c (create_dispatcher_calls): Make default
14778 function local only if it is a definition.
14780 2021-08-16 Martin Liska <mliska@suse.cz>
14783 * ipa-icf-gimple.c (func_checker::compare_ssa_name): Do not
14784 consider equal SSA_NAMEs when one is a param.
14786 2021-08-16 liuhongt <hongtao.liu@intel.com>
14789 * config/i386/i386-expand.c (ix86_expand_vec_perm_vpermt2):
14790 Support vpermi2b for V32QI/V16QImode.
14791 (ix86_extract_perm_from_pool_constant): New function.
14792 (ix86_expand_vec_one_operand_perm_avx512): Support
14793 vpermw/vpermb under TARGET_AVX512BW/TARGET_AVX512VBMI.
14794 (expand_vec_perm_1): Adjust comments for upper.
14795 * config/i386/i386-protos.h (ix86_extract_perm_from_pool_constant):
14797 * config/i386/predicates.md (permvar_truncate_operand): New predicate.
14798 (pshufb_truncv4siv4hi_operand): Ditto.
14799 (pshufb_truncv8hiv8qi_operand): Ditto.
14800 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1):
14801 New pre_reload define_insn_and_split.
14802 (*avx512f_permvar_truncv8siv8hi_1): Ditto.
14803 (*avx512f_vpermvar_truncv8div8si_1): Ditto.
14804 (*avx512f_permvar_truncv32hiv32qi_1): Ditto.
14805 (*avx512f_permvar_truncv16hiv16qi_1): Ditto.
14806 (*avx512f_permvar_truncv4div4si_1): Ditto.
14807 (*avx512f_pshufb_truncv8hiv8qi_1): Ditto.
14808 (*avx512f_pshufb_truncv4siv4hi_1): Ditto.
14809 (*avx512f_pshufd_truncv2div2si_1): Ditto.
14811 2021-08-16 Kito Cheng <kito.cheng@sifive.com>
14813 * config/riscv/multilib-generator: Support code model option for
14815 * doc/install.texi: Add document of new option for
14816 --with-multilib-generator.
14818 2021-08-15 Clément Chigot <clement.chigot@atos.net>
14820 * config/rs6000/rs6000.c (xcoff_tls_exec_model_detected): New.
14821 (rs6000_legitimize_tls_address_aix): Use it.
14822 (rs6000_xcoff_file_end): Add ".ref __tls_get_addr" when
14823 xcoff_tls_exec_model_detected is true.
14825 2021-08-15 Jeff Law <jlaw@localhost.localdomain>
14827 * config/h8300/h8300.c (shift_alg_si): Retune H8/300H shifts
14828 to allow a bit more code growth, saving many dozens of cycles.
14829 (h8300_option_override): Adjus shift_alg_si if optimizing for
14831 (get_shift_alg): Use special + inline shifts for residuals
14834 2021-08-14 Stafford Horne <shorne@gmail.com>
14837 * config/or1k/or1k-opts.h: New file.
14838 * config/or1k/or1k.c (or1k_legitimize_address_1, print_reloc):
14839 Support generating gotha relocations if -mcmodel=large is
14841 * config/or1k/or1k.h (TARGET_CMODEL_SMALL, TARGET_CMODEL_LARGE):
14843 * config/or1k/or1k.opt (mcmodel=): New option.
14844 * doc/invoke.texi (OpenRISC Options): Document mcmodel.
14846 2021-08-14 Martin Sebor <msebor@redhat.com>
14848 PR middle-end/101791
14849 * gimple-ssa-warn-access.cc (new_delete_mismatch_p): Use new argument
14850 to valid_new_delete_pair_p.
14851 * tree.c (valid_new_delete_pair_p): Add argument.
14852 * tree.h (valid_new_delete_pair_p): Same.
14854 2021-08-14 Jakub Jelinek <jakub@redhat.com>
14857 * config/i386/i386-expand.c (expand_vec_perm_broadcast_1)
14858 <case E_V64QImode>: For this mode assert
14859 !TARGET_AVX512BW || d->perm[0] rather than !TARGET_AVX2 || d->perm[0].
14861 2021-08-13 Michael Meissner <meissner@linux.ibm.com>
14864 * config/rs6000/altivec.md (xxeval): Use register_predicate
14865 instead of altivec_register_predicate.
14867 2021-08-13 Martin Sebor <msebor@redhat.com>
14869 PR middle-end/101734
14870 * tree-ssa-uninit.c (maybe_warn_read_write_only): New function.
14871 (maybe_warn_operand): Call it.
14873 2021-08-13 Martin Liska <mliska@suse.cz>
14876 * attribs.c (decl_attributes): Make naked functions "noipa"
14879 2021-08-13 Martin Liska <mliska@suse.cz>
14882 * symtab.c (symtab_node::noninterposable_alias): Do not create
14883 local aliases for target_clone functions as the clonning pass
14886 2021-08-13 Martin Liska <mliska@suse.cz>
14888 * opts.c (LIVE_PATCHING_OPTION): Define.
14889 (control_options_for_live_patching): Use it in error messages.
14891 2021-08-13 Jan Hubicka <hubicka@ucw.cz>
14893 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOREAD.
14894 (implicit_const_eaf_flags, implicit_pure_eaf_flags,
14895 ignore_stores_eaf_flags): New constants.
14896 (remove_useless_eaf_flags): New function.
14897 (eaf_flags_useful_p): Use it.
14898 (deref_flags): Add EAF_NOT_RETURNED if flag is unused;
14900 (modref_lattice::init): Add EAF_NOREAD.
14901 (modref_lattice::add_escape_point): Do not reacord escape point if
14903 (modref_lattice::merge): EAF_NOESCAPE implies EAF_NODIRECTESCAPE;
14904 use remove_useless_eaf_flags.
14905 (modref_lattice::merge_deref): Use ignore_stores_eaf_flags.
14906 (modref_lattice::merge_direct_load): Add EAF_NOREAD
14907 (analyze_ssa_name_flags): Fix handling EAF_NOT_RETURNED
14908 (analyze_parms): Use remove_useless_eaf_flags.
14909 (ipa_merge_modref_summary_after_inlining): Use ignore_stores_eaf_flags.
14910 (modref_merge_call_site_flags): Add caller and ecf_flags parameter;
14911 use remove_useless_eaf_flags.
14912 (modref_propagate_flags_in_scc): Update.
14913 * ipa-modref.h: Turn eaf_flags_t back to char.
14914 * tree-core.h (EAF_NOT_RETURNED): Fix.
14915 (EAF_NOREAD): New constant
14916 * tree-ssa-alias.c: (ref_maybe_used_by_call_p_1): Check for
14918 * tree-ssa-structalias.c (handle_rhs_call): Handle new flags.
14919 (handle_pure_call): Likewise.
14921 2021-08-12 Jakub Jelinek <jakub@redhat.com>
14923 * tree.def (OMP_MASKED): New tree code.
14924 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_FILTER.
14925 * tree.h (OMP_MASKED_BODY, OMP_MASKED_CLAUSES, OMP_MASKED_COMBINED,
14926 OMP_CLAUSE_FILTER_EXPR): Define.
14927 * tree.c (omp_clause_num_ops): Add OMP_CLAUSE_FILTER entry.
14928 (omp_clause_code_name): Likewise.
14929 (walk_tree_1): Handle OMP_CLAUSE_FILTER.
14930 * tree-nested.c (convert_nonlocal_omp_clauses,
14931 convert_local_omp_clauses): Handle OMP_CLAUSE_FILTER.
14932 (convert_nonlocal_reference_stmt, convert_local_reference_stmt,
14933 convert_gimple_call): Handle GIMPLE_OMP_MASTER.
14934 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_FILTER.
14935 (dump_generic_node): Handle OMP_MASTER.
14936 * gimple.def (GIMPLE_OMP_MASKED): New gimple code.
14937 * gimple.c (gimple_build_omp_masked): New function.
14938 (gimple_copy): Handle GIMPLE_OMP_MASKED.
14939 * gimple.h (gimple_build_omp_masked): Declare.
14940 (gimple_has_substatements): Handle GIMPLE_OMP_MASKED.
14941 (gimple_omp_masked_clauses, gimple_omp_masked_clauses_ptr,
14942 gimple_omp_masked_set_clauses): New inline functions.
14943 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_MASKED.
14944 * gimple-pretty-print.c (dump_gimple_omp_masked): New function.
14945 (pp_gimple_stmt_1): Handle GIMPLE_OMP_MASKED.
14946 * gimple-walk.c (walk_gimple_stmt): Likewise.
14947 * gimple-low.c (lower_stmt): Likewise.
14948 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
14949 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_FILTER. For clauses
14950 that take one expression rather than decl or constant, force
14951 gimplification of that into a SSA_NAME or temporary unless min
14953 (gimplify_adjust_omp_clauses): Handle OMP_CLAUSE_FILTER.
14954 (gimplify_expr): Handle OMP_MASKED.
14955 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_MASKED.
14956 (estimate_num_insns): Likewise.
14957 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_FILTER.
14958 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_MASKED. Adjust
14959 diagnostics for existence of masked construct.
14960 (scan_omp_1_stmt, lower_omp_master, lower_omp_1, diagnose_sb_1,
14961 diagnose_sb_2): Handle GIMPLE_OMP_MASKED.
14962 * omp-expand.c (expand_omp_synch, expand_omp, omp_make_gimple_edges):
14965 2021-08-12 Uroš Bizjak <ubizjak@gmail.com>
14968 * config/i386/i386.md (avx512f_scalef<mode>2): New insn pattern.
14969 (ldexp<mode>3): Use avx512f_scalef<mode>2.
14970 (UNSPEC_SCALEF): Move from ...
14971 * config/i386/sse.md (UNSPEC_SCALEF): ... here.
14973 2021-08-12 Jan Hubicka <hubicka@ucw.cz>
14975 * ipa-split.c (consider_split): Fix condition testing void functions.
14977 2021-08-12 Aldy Hernandez <aldyh@redhat.com>
14979 * doc/invoke.texi: Remove docs for threader-mode param.
14980 * flag-types.h (enum threader_mode): Remove.
14981 * params.opt: Remove threader-mode param.
14982 * tree-ssa-threadbackward.c (class back_threader): Remove
14983 path_is_unreachable_p.
14984 Make find_paths private.
14985 Add maybe_thread and thread_through_all_blocks.
14986 Remove reference marker for m_registry.
14987 Remove reference marker for m_profit.
14988 (back_threader::back_threader): Adjust for registry and profit not
14990 (dump_path): Move down.
14991 (debug): Move down.
14992 (class thread_jumps): Remove.
14993 (class back_threader_registry): Remove m_all_paths.
14995 (thread_jumps::thread_through_all_blocks): Move to back_threader
14997 (fsm_find_thread_path): Remove
14998 (back_threader::maybe_thread): New.
14999 (back_threader::thread_through_all_blocks): Move from
15001 (back_threader_registry::back_threader_registry): Remove
15003 (back_threader_registry::~back_threader_registry): Remove.
15004 (thread_jumps::find_taken_edge): Remove.
15005 (thread_jumps::check_subpath_and_update_thread_path): Remove.
15006 (thread_jumps::maybe_register_path): Remove.
15007 (thread_jumps::handle_phi): Remove.
15008 (handle_assignment_p): Remove.
15009 (thread_jumps::handle_assignment): Remove.
15010 (thread_jumps::fsm_find_control_statement_thread_paths): Remove.
15011 (thread_jumps::find_jump_threads_backwards): Remove.
15012 (thread_jumps::find_jump_threads_backwards_with_ranger): Remove.
15013 (try_thread_blocks): Rename find_jump_threads_backwards to
15015 (pass_early_thread_jumps::execute): Same.
15017 2021-08-12 Tobias Burnus <tobias@codesourcery.com>
15019 * tree-core.h (omp_clause_proc_bind_kind): Add
15020 OMP_CLAUSE_PROC_BIND_PRIMARY.
15021 * tree-pretty-print.c (dump_omp_clause): Add TODO comment to
15022 change 'master' to 'primary' in proc_bind for OpenMP 5.1.
15024 2021-08-12 Claudiu Zissulescu <claziss@synopsys.com>
15026 * common/config/arc/arc-common.c (arc_option_init_struct): Remove
15027 fno-common reference.
15028 * config/arc/arc.c (arc_override_options): Remove overriding of
15031 2021-08-12 Jakub Jelinek <jakub@redhat.com>
15034 * config/i386/i386-expand.c (ix86_expand_vec_one_operand_perm_avx512):
15035 If d->testing_p, return true after performing checks instead of
15036 actually expanding the insn.
15037 (expand_vec_perm_broadcast_1): Handle V32HImode - assert
15038 !TARGET_AVX512BW and return false.
15040 2021-08-12 Eric Botcazou <ebotcazou@gcc.gnu.org>
15042 * configure.ac (PE linker --disable-dynamicbase support): New check.
15043 * configure: Regenerate.
15044 * config.in: Likewise.
15045 * config/i386/mingw32.h (LINK_SPEC_DISABLE_DYNAMICBASE): New define.
15046 (LINK_SPEC): Use it.
15047 * config/i386/mingw-w64.h (LINK_SPEC_DISABLE_DYNAMICBASE): Likewise.
15048 (LINK_SPEC): Likewise.
15050 2021-08-12 liuhongt <hongtao.liu@intel.com>
15053 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_2): New
15054 post_reload define_insn_and_split.
15055 (*avx512bw_zero_extendv32qiv32hi2_2): Ditto.
15056 (*sse4_1_zero_extendv8qiv8hi2_4): Ditto.
15057 (*avx512f_zero_extendv16hiv16si2_2): Ditto.
15058 (*avx2_zero_extendv8hiv8si2_2): Ditto.
15059 (*sse4_1_zero_extendv4hiv4si2_4): Ditto.
15060 (*avx512f_zero_extendv8siv8di2_2): Ditto.
15061 (*avx2_zero_extendv4siv4di2_2): Ditto.
15062 (*sse4_1_zero_extendv2siv2di2_4): Ditto.
15063 (VI248_256, VI248_512, VI148_512, VI148_256, VI148_128): New
15066 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
15068 * config/rs6000/rs6000-builtin-new.def: Add always, power5, and
15071 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
15073 * config/rs6000/rs6000-builtin-new.def: Add vsx stanza.
15075 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
15077 * config/rs6000/rs6000-builtin-new.def: Finish altivec stanza.
15078 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Move
15079 initialization of pcvoid_type_node here...
15080 (altivec_init_builtins): ...from here.
15081 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add
15082 RS6000_BTI_const_ptr_void.
15083 (pcvoid_type_node): New macro.
15085 2021-08-11 Richard Biener <rguenther@suse.de>
15088 * tree-ssa-forwprop.c (pass_forwprop::execute): Do not decompose
15089 hard-register accesses.
15091 2021-08-11 Richard Biener <rguenther@suse.de>
15093 * tree-ssa-operands.c (operands_scanner::get_expr_operands):
15094 Do not look at COMPONENT_REF FIELD_DECLs TREE_THIS_VOLATILE
15095 to determine has_volatile_ops.
15097 2021-08-11 Eric Botcazou <ebotcazou@gcc.gnu.org>
15099 * cfgexpand.c (expand_used_vars): Reuse attribs local variable.
15101 2021-08-11 Jan Hubicka <hubicka@ucw.cz>
15102 Alexandre Oliva <oliva@adacore.com>
15104 * ipa-modref.c (modref_lattice::dump): Fix escape_point's min_flags
15106 (modref_lattice::merge_deref): Fix handling of indirect scape points.
15107 (update_escape_summary_1): Likewise.
15108 (update_escape_summary): Likewise.
15109 (ipa_merge_modref_summary_after_inlining): Likewise.
15111 2021-08-11 Richard Biener <rguenther@suse.de>
15113 PR middle-end/101858
15114 * fold-const.c (fold_binary_loc): Guard simplification
15115 of X < (cast) (1 << Y) to integer types.
15117 2021-08-11 Richard Biener <rguenther@suse.de>
15119 PR tree-optimization/101861
15120 * tree-vect-stmts.c (vectorizable_load): Fix error in
15121 previous change with regard to gather vectorization.
15123 2021-08-11 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
15126 * config/arm/arm_neon.h (vdup_n_s8): Replace call to builtin
15128 (vdup_n_s16): Likewise.
15129 (vdup_n_s32): Likewise.
15130 (vdup_n_s64): Likewise.
15131 (vdup_n_u8): Likewise.
15132 (vdup_n_u16): Likewise.
15133 (vdup_n_u32): Likewise.
15134 (vdup_n_u64): Likewise.
15135 (vdup_n_p8): Likewise.
15136 (vdup_n_p16): Likewise.
15137 (vdup_n_p64): Likewise.
15138 (vdup_n_f16): Likewise.
15139 (vdup_n_f32): Likewise.
15140 (vdupq_n_s8): Likewise.
15141 (vdupq_n_s16): Likewise.
15142 (vdupq_n_s32): Likewise.
15143 (vdupq_n_s64): Likewise.
15144 (vdupq_n_u8): Likewise.
15145 (vdupq_n_u16): Likewise.
15146 (vdupq_n_u32): Likewise.
15147 (vdupq_n_u64): Likewise.
15148 (vdupq_n_p8): Likewise.
15149 (vdupq_n_p16): Likewise.
15150 (vdupq_n_p64): Likewise.
15151 (vdupq_n_f16): Likewise.
15152 (vdupq_n_f32): Likewise.
15153 (vmov_n_s8): Replace call to builtin with call to corresponding
15155 (vmov_n_s16): Likewise.
15156 (vmov_n_s32): Likewise.
15157 (vmov_n_s64): Likewise.
15158 (vmov_n_u8): Likewise.
15159 (vmov_n_u16): Likewise.
15160 (vmov_n_u32): Likewise.
15161 (vmov_n_u64): Likewise.
15162 (vmov_n_p8): Likewise.
15163 (vmov_n_p16): Likewise.
15164 (vmov_n_f16): Likewise.
15165 (vmov_n_f32): Likewise.
15166 (vmovq_n_s8): Likewise.
15167 (vmovq_n_s16): Likewise.
15168 (vmovq_n_s32): Likewise.
15169 (vmovq_n_s64): Likewise.
15170 (vmovq_n_u8): Likewise.
15171 (vmovq_n_u16): Likewise.
15172 (vmovq_n_u32): Likewise.
15173 (vmovq_n_u64): Likewise.
15174 (vmovq_n_p8): Likewise.
15175 (vmovq_n_p16): Likewise.
15176 (vmovq_n_f16): Likewise.
15177 (vmovq_n_f32): Likewise.
15178 * config/arm/arm_neon_builtins.def: Remove entries for vdup_n.
15180 2021-08-11 liuhongt <hongtao.liu@intel.com>
15183 * config/i386/i386.md (ldexp<mode>3): Extend to vscalefs[sd]
15184 when TARGET_AVX512F and TARGET_SSE_MATH.
15186 2021-08-10 Jakub Jelinek <jakub@redhat.com>
15189 * config/i386/i386-expand.c (expand_vec_perm_even_odd): Return false
15190 for V32HImode if !TARGET_AVX512BW.
15191 (ix86_vectorize_vec_perm_const) <case E_V32HImode, case E_V64QImode>:
15192 If !TARGET_AVX512BW and TARGET_AVX512F and d.testing_p, don't fail
15193 early, but actually check the permutation.
15195 2021-08-10 Richard Biener <rguenther@suse.de>
15197 PR tree-optimization/101809
15198 * tree-vect-stmts.c (get_load_store_type): Allow emulated
15199 gathers with offset vector nunits being a constant multiple
15200 of the data vector nunits.
15201 (vect_get_gather_scatter_ops): Use the appropriate nunits
15202 for the offset vector defs.
15203 (vectorizable_store): Adjust call to
15204 vect_get_gather_scatter_ops.
15205 (vectorizable_load): Likewise. Handle the case of less
15206 offset vectors than data vectors.
15208 2021-08-10 Jakub Jelinek <jakub@redhat.com>
15211 * config/i386/sse.md (*avx512f_shuf_<shuffletype>64x2_1<mask_name>_1,
15212 *avx512f_shuf_<shuffletype>32x4_1<mask_name>_1): New define_insn
15215 2021-08-10 Richard Biener <rguenther@suse.de>
15217 PR tree-optimization/101801
15218 PR tree-optimization/101819
15219 * tree-vectorizer.h (vect_emulated_vector_p): Declare.
15220 * tree-vect-loop.c (vect_emulated_vector_p): New function.
15221 (vectorizable_reduction): Re-instantiate a check for emulated
15223 * tree-vect-stmts.c (vectorizable_shift): Likewise.
15224 (vectorizable_operation): Likewise. Cost emulated vector
15225 operations according to the scalar sequence synthesized by
15228 2021-08-10 Richard Biener <rguenther@suse.de>
15230 PR middle-end/101824
15231 * tree-nested.c (get_frame_field): Mark the COMPONENT_REF as
15232 volatile in case the variable was.
15234 2021-08-10 H.J. Lu <hjl.tools@gmail.com>
15237 * config/i386/constraints.md (BC): Document for integer SSE
15238 constant all bits set operand.
15239 (BF): New constraint for const floating-point all bits set
15241 * config/i386/i386.c (standard_sse_constant_p): Likewise.
15242 (standard_sse_constant_opcode): Likewise.
15243 * config/i386/sse.md (sseconstm1): New mode attribute.
15244 (mov<mode>_internal): Replace BC with <sseconstm1>.
15246 2021-08-10 liuhongt <hongtao.liu@intel.com>
15248 * config/i386/sse.md (cond_<insn><mode>): New expander.
15249 (VI248_AVX512VLBW): New mode iterator.
15250 * config/i386/predicates.md
15251 (nonimmediate_or_const_vec_dup_operand): New predicate.
15253 2021-08-09 Andrew MacLeod <amacleod@redhat.com>
15255 PR tree-optimization/101741
15256 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Check
15257 type of parameter for toupper/tolower.
15259 2021-08-09 Martin Jambor <mjambor@suse.cz>
15261 PR testsuite/101654
15262 * ipa-prop.c (propagate_controlled_uses): Removed a spurious space.
15264 2021-08-09 Pat Haugen <pthaugen@linux.ibm.com>
15266 * config/rs6000/rs6000.c (is_load_insn1): Verify destination is a
15268 (is_store_insn1): Verify source is a register.
15270 2021-08-09 Uroš Bizjak <ubizjak@gmail.com>
15273 * config/i386/mmx.md (<any_logic:code>v2sf3):
15274 Rename from *mmx_<any_logic:code>v2sf3
15276 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15278 * config/nvptx/nvptx.c: Cross-reference parts adapted in
15279 'gcc/omp-oacc-neuter-broadcast.cc'.
15280 * omp-low.c: Likewise.
15281 * omp-oacc-neuter-broadcast.cc: Cross-reference parts adapted from
15284 2021-08-09 Julian Brown <julian@codesourcery.com>
15285 Kwok Cheung Yeung <kcy@codesourcery.com>
15286 Thomas Schwinge <thomas@codesourcery.com>
15288 * config/gcn/gcn.c (gcn_init_builtins): Override decls for
15289 BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START,
15290 BUILT_IN_GOACC_SINGLE_COPY_END and BUILT_IN_GOACC_BARRIER.
15291 (gcn_goacc_validate_dims): Turn on worker partitioning unconditionally.
15292 (gcn_fork_join): Update comment.
15293 * config/gcn/gcn.opt (flag_worker_partitioning): Remove.
15294 (macc_experimental_workers): Remove unused option.
15296 2021-08-09 Julian Brown <julian@codesourcery.com>
15297 Nathan Sidwell <nathan@codesourcery.com> (via 'gcc/config/nvptx/nvptx.c' master)
15298 Kwok Cheung Yeung <kcy@codesourcery.com>
15299 Thomas Schwinge <thomas@codesourcery.com>
15301 * Makefile.in (OBJS): Add omp-oacc-neuter-broadcast.o.
15302 * doc/tm.texi.in (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD):
15303 Add documentation hook.
15304 * doc/tm.texi: Regenerate.
15305 * omp-oacc-neuter-broadcast.cc: New file.
15306 * omp-builtins.def (BUILT_IN_GOACC_BARRIER)
15307 (BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START)
15308 (BUILT_IN_GOACC_SINGLE_COPY_END): New builtins.
15309 * passes.def (pass_omp_oacc_neuter_broadcast): Add pass.
15310 * target.def (goacc.create_worker_broadcast_record): Add target
15312 * tree-pass.h (make_pass_omp_oacc_neuter_broadcast): Add
15314 * config/gcn/gcn-protos.h (gcn_goacc_adjust_propagation_record):
15315 Rename prototype to...
15316 (gcn_goacc_create_worker_broadcast_record): ... this.
15317 * config/gcn/gcn-tree.c (gcn_goacc_adjust_propagation_record): Rename
15319 (gcn_goacc_create_worker_broadcast_record): ... this.
15320 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_PROPAGATION_RECORD):
15322 (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD): ... this.
15324 2021-08-09 Tejas Belagod <tejas.belagod@arm.com>
15327 * config/aarch64/aarch64-simd.md (vlshr<mode>3, vashr<mode>3): Use
15328 the right iterator.
15330 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15332 * Makefile.in (GTFILES): Remove '$(srcdir)/omp-offload.c'.
15334 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15336 * builtins.def (DEF_GOACC_BUILTIN, DEF_GOMP_BUILTIN): Don't
15337 consider '-foffload-abi'.
15338 * common.opt (-foffload-abi): Remove 'Var', 'Init'.
15339 * opts.c (common_handle_option) <-foffload-abi> [ACCEL_COMPILER]:
15342 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15344 * optc-gen.awk: Sanity check that 'Init' doesn't appear without
15347 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15349 * omp-builtins.def (BUILT_IN_ACC_GET_DEVICE_TYPE): Remove.
15351 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15353 * doc/gty.texi (Files): Update.
15355 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15357 * doc/gty.texi (Files): Fix GTY header file example.
15359 2021-08-09 Roger Sayle <roger@nextmovesoftware.com>
15361 * tree-ssa-ccp.c (value_mask_to_min_max): Helper function to
15362 determine the upper and lower bounds from a mask-value pair.
15363 (bit_value_unop) [ABS_EXPR, ABSU_EXPR]: Add support for
15364 absolute value and unsigned absolute value expressions.
15365 (bit_value_binop): Initialize *VAL's precision.
15366 [LT_EXPR, LE_EXPR]: Use value_mask_to_min_max to determine
15367 upper and lower bounds of operands. Add LE_EXPR/GE_EXPR
15368 support when the operands are unknown but potentially equal.
15369 [MIN_EXPR, MAX_EXPR]: Support minimum/maximum expressions.
15371 2021-08-09 Bin Cheng <bin.cheng@linux.alibaba.com>
15373 * config/aarch64/aarch64.md
15374 (*extend<SHORT:mode><GPI:mode>2_aarch64): Use %<GPI:w>0.
15376 2021-08-08 Sergei Trofimovich <siarheit@google.com>
15378 * lra-constraints.c: Fix s/otput/output/ typo.
15380 2021-08-06 Martin Sebor <msebor@redhat.com>
15382 * builtins.c (expand_builtin_memchr): Move to gimple-ssa-warn-access.cc.
15383 (expand_builtin_strcat): Same.
15384 (expand_builtin_stpncpy): Same.
15385 (expand_builtin_strncat): Same.
15386 (check_read_access): Same.
15387 (check_memop_access): Same.
15388 (expand_builtin_strlen): Move checks to gimple-ssa-warn-access.cc.
15389 (expand_builtin_strnlen): Same.
15390 (expand_builtin_memcpy): Same.
15391 (expand_builtin_memmove): Same.
15392 (expand_builtin_mempcpy): Same.
15393 (expand_builtin_strcpy): Same.
15394 (expand_builtin_strcpy_args): Same.
15395 (expand_builtin_stpcpy_1): Same.
15396 (expand_builtin_strncpy): Same.
15397 (expand_builtin_memset): Same.
15398 (expand_builtin_bzero): Same.
15399 (expand_builtin_strcmp): Same.
15400 (expand_builtin_strncmp): Same.
15401 (expand_builtin): Remove handlers.
15402 (fold_builtin_strlen): Add a comment.
15403 * builtins.h (check_access): Move to gimple-ssa-warn-access.cc.
15404 * calls.c (maybe_warn_nonstring_arg): Same.
15405 * diagnostic-spec.c (nowarn_spec_t::nowarn_spec_t): Add warning option.
15406 * gimple-fold.c (gimple_fold_builtin_strcpy): Pass argument to callee.
15407 (gimple_fold_builtin_stpcpy): Same.
15408 * gimple-ssa-warn-access.cc (has_location): New function.
15409 (get_location): Same.
15410 (get_callee_fndecl): Same.
15411 (call_nargs): Same.
15413 (warn_string_no_nul): Define.
15414 (unterminated_array): Same.
15415 (check_nul_terminated_array): Same.
15416 (maybe_warn_nonstring_arg): Same.
15417 (maybe_warn_for_bound): Same.
15418 (warn_for_access): Same.
15419 (check_access): Same.
15420 (check_memop_access): Same.
15421 (check_read_access): Same.
15422 (warn_dealloc_offset): Use helper functions.
15423 (maybe_emit_free_warning): Same.
15424 (class pass_waccess): Add members.
15425 (check_strcat): New function.
15426 (check_strncat): New function.
15427 (check_stxcpy): New function.
15428 (check_stxncpy): New function.
15429 (check_strncmp): New function.
15430 (pass_waccess::check_builtin): New function.
15431 (pass_waccess::check): Call it.
15432 * gimple-ssa-warn-access.h (warn_string_no_nul): Move here from
15434 (maybe_warn_for_bound): Same.
15435 (check_access): Same.
15436 (check_memop_access): Same.
15437 (check_read_access): Same.
15438 * pointer-query.h (struct access_data): Define a ctor overload.
15440 2021-08-06 Richard Biener <rguenther@suse.de>
15442 PR tree-optimization/101801
15443 * tree-vectorizer.h (vect_worthwhile_without_simd_p): Rename...
15444 (vect_can_vectorize_without_simd_p): ... to this.
15445 * tree-vect-loop.c (vect_worthwhile_without_simd_p): Rename...
15446 (vect_can_vectorize_without_simd_p): ... to this and fold
15447 in vect_min_worthwhile_factor.
15448 (vect_min_worthwhile_factor): Remove.
15449 (vectorizable_reduction): Adjust and remove the cost part.
15450 * tree-vect-stmts.c (vectorizable_shift): Likewise.
15451 (vectorizable_operation): Likewise.
15453 2021-08-06 Uroš Bizjak <ubizjak@gmail.com>
15456 * config/i386/i386.md (cmove reg-to-reg move elimination peephole2s):
15457 Add general_gr_operand predicate to operand 3.
15459 2021-08-06 Roger Sayle <roger@nextmovesoftware.com>
15461 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern): Use
15462 CFN_BUILT_IN_CLRSB* instead of BUILT_IN_CLRSB* for consistency.
15464 2021-08-06 Tamar Christina <tamar.christina@arm.com>
15466 * config/aarch64/aarch64-sve-builtins.cc (register_svpattern,
15467 register_svprfop): Pass vec<> by pointer.
15468 * langhooks-def.h (lhd_simulate_enum_decl): Likewise.
15469 * langhooks.c (lhd_simulate_enum_decl): Likewise.
15470 * langhooks.h (struct lang_hooks_for_types): Likewise.
15472 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
15474 * config/aarch64/arm_neon.h (vst1_bf16_x2): Use
15475 __builtin_memcpy instead of constructing an additional
15476 __builtin_aarch64_simd_oi one vector at a time.
15477 (vst1q_bf16_x2): Likewise.
15478 (vst1_bf16_x3): Use __builtin_memcpy instead of constructing
15479 an additional __builtin_aarch64_simd_ci one vector at a time.
15480 (vst1q_bf16_x3): Likewise.
15481 (vst1_bf16_x4): Use __builtin_memcpy instead of a union.
15482 (vst1q_bf16_x4): Likewise.
15483 (vst2_bf16): Use __builtin_memcpy instead of constructing an
15484 additional __builtin_aarch64_simd_oi one vector at a time.
15485 (vst2q_bf16): Likewise.
15486 (vst3_bf16): Use __builtin_memcpy instead of constructing an
15487 additional __builtin_aarch64_simd_ci mode one vector at a
15489 (vst3q_bf16): Likewise.
15490 (vst4_bf16): Use __builtin_memcpy instead of constructing an
15491 additional __builtin_aarch64_simd_xi one vector at a time.
15492 (vst4q_bf16): Likewise.
15494 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
15496 * config/aarch64/arm_neon.h (__ST2_LANE_FUNC): Delete.
15497 (__ST2Q_LANE_FUNC): Delete.
15498 (vst2_lane_f16): Use __builtin_memcpy to copy vector
15499 structure instead of constructing __builtin_aarch64_simd_oi
15500 one vector at a time.
15501 (vst2_lane_f32): Likewise.
15502 (vst2_lane_f64): Likewise.
15503 (vst2_lane_p8): Likewise.
15504 (vst2_lane_p16): Likewise.
15505 (vst2_lane_p64): Likewise.
15506 (vst2_lane_s8): Likewise.
15507 (vst2_lane_s16): Likewise.
15508 (vst2_lane_s32): Likewise.
15509 (vst2_lane_s64): Likewise.
15510 (vst2_lane_u8): Likewise.
15511 (vst2_lane_u16): Likewise.
15512 (vst2_lane_u32): Likewise.
15513 (vst2_lane_u64): Likewise.
15514 (vst2_lane_bf16): Likewise.
15515 (vst2q_lane_f16): Use __builtin_memcpy to copy vector
15516 structure instead of using a union.
15517 (vst2q_lane_f32): Likewise.
15518 (vst2q_lane_f64): Likewise.
15519 (vst2q_lane_p8): Likewise.
15520 (vst2q_lane_p16): Likewise.
15521 (vst2q_lane_p64): Likewise.
15522 (vst2q_lane_s8): Likewise.
15523 (vst2q_lane_s16): Likewise.
15524 (vst2q_lane_s32): Likewise.
15525 (vst2q_lane_s64): Likewise.
15526 (vst2q_lane_u8): Likewise.
15527 (vst2q_lane_u16): Likewise.
15528 (vst2q_lane_u32): Likewise.
15529 (vst2q_lane_u64): Likewise.
15530 (vst2q_lane_bf16): Likewise.
15532 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
15534 * config/aarch64/arm_neon.h (__ST3_LANE_FUNC): Delete.
15535 (__ST3Q_LANE_FUNC): Delete.
15536 (vst3_lane_f16): Use __builtin_memcpy to copy vector
15537 structure instead of constructing __builtin_aarch64_simd_ci
15538 one vector at a time.
15539 (vst3_lane_f32): Likewise.
15540 (vst3_lane_f64): Likewise.
15541 (vst3_lane_p8): Likewise.
15542 (vst3_lane_p16): Likewise.
15543 (vst3_lane_p64): Likewise.
15544 (vst3_lane_s8): Likewise.
15545 (vst3_lane_s16): Likewise.
15546 (vst3_lane_s32): Likewise.
15547 (vst3_lane_s64): Likewise.
15548 (vst3_lane_u8): Likewise.
15549 (vst3_lane_u16): Likewise.
15550 (vst3_lane_u32): Likewise.
15551 (vst3_lane_u64): Likewise.
15552 (vst3_lane_bf16): Likewise.
15553 (vst3q_lane_f16): Use __builtin_memcpy to copy vector
15554 structure instead of using a union.
15555 (vst3q_lane_f32): Likewise.
15556 (vst3q_lane_f64): Likewise.
15557 (vst3q_lane_p8): Likewise.
15558 (vst3q_lane_p16): Likewise.
15559 (vst3q_lane_p64): Likewise.
15560 (vst3q_lane_s8): Likewise.
15561 (vst3q_lane_s16): Likewise.
15562 (vst3q_lane_s32): Likewise.
15563 (vst3q_lane_s64): Likewise.
15564 (vst3q_lane_u8): Likewise.
15565 (vst3q_lane_u16): Likewise.
15566 (vst3q_lane_u32): Likewise.
15567 (vst3q_lane_u64): Likewise.
15568 (vst3q_lane_bf16): Likewise.
15570 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
15572 * config/aarch64/arm_neon.h (__ST4_LANE_FUNC): Delete.
15573 (__ST4Q_LANE_FUNC): Delete.
15574 (vst4_lane_f16): Use __builtin_memcpy to copy vector
15575 structure instead of constructing __builtin_aarch64_simd_xi
15576 one vector at a time.
15577 (vst4_lane_f32): Likewise.
15578 (vst4_lane_f64): Likewise.
15579 (vst4_lane_p8): Likewise.
15580 (vst4_lane_p16): Likewise.
15581 (vst4_lane_p64): Likewise.
15582 (vst4_lane_s8): Likewise.
15583 (vst4_lane_s16): Likewise.
15584 (vst4_lane_s32): Likewise.
15585 (vst4_lane_s64): Likewise.
15586 (vst4_lane_u8): Likewise.
15587 (vst4_lane_u16): Likewise.
15588 (vst4_lane_u32): Likewise.
15589 (vst4_lane_u64): Likewise.
15590 (vst4_lane_bf16): Likewise.
15591 (vst4q_lane_f16): Use __builtin_memcpy to copy vector
15592 structure instead of using a union.
15593 (vst4q_lane_f32): Likewise.
15594 (vst4q_lane_f64): Likewise.
15595 (vst4q_lane_p8): Likewise.
15596 (vst4q_lane_p16): Likewise.
15597 (vst4q_lane_p64): Likewise.
15598 (vst4q_lane_s8): Likewise.
15599 (vst4q_lane_s16): Likewise.
15600 (vst4q_lane_s32): Likewise.
15601 (vst4q_lane_s64): Likewise.
15602 (vst4q_lane_u8): Likewise.
15603 (vst4q_lane_u16): Likewise.
15604 (vst4q_lane_u32): Likewise.
15605 (vst4q_lane_u64): Likewise.
15606 (vst4q_lane_bf16): Likewise.
15608 2021-08-06 Martin Liska <mliska@suse.cz>
15610 * config/rs6000/rs6000.c (rs6000_option_override_internal): When
15611 a target option is restored, it can have
15612 rs6000_long_double_type_size set to FLOAT_PRECISION_TFmode
15613 and error should not be emitted.
15615 2021-08-06 Sebastian Huber <sebastian.huber@embedded-brains.de>
15617 * gcov-io.h (gcov_write): Declare.
15618 * gcov-io.c (gcov_write): New.
15619 (gcov_write_counter): Remove.
15620 (gcov_write_tag_length): Likewise.
15621 (gcov_write_summary): Replace gcov_write_tag_length() with calls to
15622 gcov_write_unsigned().
15623 * doc/invoke.texi (fprofile-info-section): Mention
15624 __gcov_info_to_gdca().
15626 2021-08-06 Martin Sebor <msebor@redhat.com>
15628 * dominance.c (prune_bbs_to_update_dominators): Adjust by-value vec
15629 arguments to by-reference.
15630 (iterate_fix_dominators): Same.
15631 * dominance.h (iterate_fix_dominators): Same.
15632 * ipa-prop.h: Call auto_vec::to_vec_legacy.
15633 * tree-data-ref.c (dump_data_dependence_relation): Adjust by-value vec
15634 arguments to by-reference.
15635 (debug_data_dependence_relation): Same.
15636 (dump_data_dependence_relations): Same.
15637 * tree-data-ref.h (debug_data_dependence_relation): Same.
15638 (dump_data_dependence_relations): Same.
15639 * tree-predcom.c (dump_chains): Same.
15640 (initialize_root_vars_lm): Same.
15641 (determine_unroll_factor): Same.
15642 (replace_phis_by_defined_names): Same.
15643 (insert_init_seqs): Same.
15644 (pcom_worker::tree_predictive_commoning_loop): Call
15645 auto_vec::to_vec_legacy.
15646 * tree-ssa-pre.c (insert_into_preds_of_block): Adjust by-value vec
15647 arguments to by-reference.
15648 * tree-ssa-threadbackward.c (populate_worklist): Same.
15649 (back_threader::resolve_def): Same.
15650 * tree-vect-data-refs.c (vect_check_nonzero_value): Same.
15651 (vect_enhance_data_refs_alignment): Same.
15652 (vect_check_lower_bound): Same.
15653 (vect_prune_runtime_alias_test_list): Same.
15654 (vect_permute_store_chain): Same.
15655 * tree-vect-slp-patterns.c (vect_normalize_conj_loc): Same.
15656 * tree-vect-stmts.c (vect_create_vectorized_demotion_stmts): Same.
15657 * tree-vectorizer.h (vect_permute_store_chain): Same.
15658 * vec.c (test_init): New function.
15659 (vec_c_tests): Call new function.
15660 * vec.h (vec): Declare ctors, dtor, and assignment.
15661 (auto_vec::vec_to_legacy): New function.
15662 (vec::copy): Adjust initialization.
15664 2021-08-05 H.J. Lu <hjl.tools@gmail.com>
15667 * config/i386/i386.c (ix86_can_inline_p): Ignore MASK_80387 if
15668 callee only uses GPRs.
15669 * config/i386/ia32intrin.h: Revert commit 5463cee2770.
15670 * config/i386/serializeintrin.h: Revert commit 71958f740f1.
15671 * config/i386/x86gprintrin.h: Add
15672 #pragma GCC target("general-regs-only") and #pragma GCC pop_options
15673 to disable non-GPR ISAs.
15675 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
15677 PR middle-end/101787
15678 * doc/md.texi (cond_ashl, cond_ashr, cond_lshr): Document.
15680 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
15682 * tree-vectorizer.h (vect_is_store_elt_extraction, vect_is_reduction)
15683 (vect_reduc_type, vect_embedded_comparison_type, vect_comparison_type)
15684 (vect_is_extending_load, vect_is_integer_truncation): New functions,
15685 moved from aarch64.c but given different names.
15686 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction)
15687 (aarch64_is_reduction, aarch64_reduc_type)
15688 (aarch64_embedded_comparison_type, aarch64_comparison_type)
15689 (aarch64_extending_load_p, aarch64_integer_truncation_p): Delete
15690 in favor of the above. Update callers accordingly.
15692 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
15695 * config/arm/arm-cpus.in (generic-armv7-a): Add quirk to suppress
15696 writing .cpu directive in asm output.
15697 * config/arm/arm.c (arm_identify_fpu_from_isa): New variable.
15698 (arm_last_printed_arch_string): Delete.
15699 (arm_last-printed_fpu_string): Delete.
15700 (arm_configure_build_target): If use of floating-point/SIMD is
15701 disabled, remove all fp/simd related features from the target ISA.
15702 (last_arm_targ_options): New variable.
15703 (arm_print_asm_arch_directives): Add new parameters. Change order
15704 of emitted directives and handle all cases here.
15705 (arm_file_start): Always call arm_print_asm_arch_directives, move
15706 all generation of .arch/.arch_extension here.
15707 (arm_file_end): Call arm_print_asm_arch.
15708 (arm_declare_function_name): Call arm_print_asm_arch_directives
15709 instead of printing .arch/.fpu directives directly.
15711 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
15713 * config/arm/arm.c (arm_configure_build_target): Don't call
15714 arm_option_reconfigure_globals.
15715 (arm_option_restore): Call arm_option_reconfigure_globals after
15716 reconfiguring the target.
15717 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
15719 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
15721 * config/arm/arm.c (arm_configure_build_target): Ensure the target's
15722 arch_name is always set.
15724 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
15726 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
15727 of vec_select high-half from being added into Neon subtract
15730 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
15732 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
15733 of vec_select high-half from being added into Neon add cost.
15735 2021-08-05 Kewen Lin <linkw@linux.ibm.com>
15737 * cfgloop.h (loops_list::loops_list): Add one optional argument
15738 root and adjust accordingly, update loop tree walking and factor
15740 * cfgloop.c (loops_list::walk_loop_tree): ... this. New function.
15742 2021-08-05 Eric Botcazou <ebotcazou@gcc.gnu.org>
15744 PR tree-optimization/101626
15745 * tree-sra.c (propagate_subaccesses_from_rhs): Do not set the
15746 reverse scalar storage order on a pointer or vector component.
15748 2021-08-05 liuhongt <hongtao.liu@intel.com>
15750 * config/i386/sse.md (cond_<code><mode>): New expander.
15752 2021-08-05 liuhongt <hongtao.liu@intel.com>
15754 * config/i386/sse.md (cond_<code><mode>): New expander.
15756 2021-08-05 liuhongt <hongtao.liu@intel.com>
15758 * config/i386/sse.md (cond_<code><mode>): New expander.
15760 2021-08-04 David Malcolm <dmalcolm@redhat.com>
15763 * Makefile.in (ANALYZER_OBJS): Add analyzer/region-model-asm.o.
15765 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
15768 * config/i386/i386.h (STORE_MAX_PIECES): Allow 16/32/64 bytes
15769 only if TARGET_INTER_UNIT_MOVES_TO_VEC is true.
15771 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
15774 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
15775 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
15776 data with SSE register from one memory location to another.
15778 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15780 * config/s390/s390.c (expand_perm_with_vpdi): New function.
15781 (vectorize_vec_perm_const_1): Call expand_perm_with_vpdi.
15782 * config/s390/vector.md (*vpdi1<mode>, @vpdi1<mode>): Enable a
15783 parameterized expander.
15784 (*vpdi4<mode>, @vpdi4<mode>): Likewise.
15786 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15788 * config/s390/s390.c (MAX_VECT_LEN): Define macro.
15789 (struct expand_vec_perm_d): Define struct.
15790 (expand_perm_with_merge): New function.
15791 (vectorize_vec_perm_const_1): New function.
15792 (s390_vectorize_vec_perm_const): New function.
15793 (TARGET_VECTORIZE_VEC_PERM_CONST): Define target macro.
15795 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15797 * config/s390/vector.md (V_HW_64): Remove mode iterator.
15798 (*vec_load_pair<mode>): Use V_HW_2 instead of V_HW_64.
15799 * config/s390/vx-builtins.md
15800 (vec_scatter_element<V_HW_2:mode>_SI): Use V_HW_2 instead of
15803 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15805 * config/s390/s390.md (UNSPEC_VEC_PERMI): Remove constant
15807 * config/s390/vector.md (*vpdi1<mode>, *vpdi4<mode>): New pattern
15809 * config/s390/vx-builtins.md (*vec_permi<mode>): Emit generic rtx
15810 instead of an unspec.
15812 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15814 * config/s390/s390-modes.def: Add more vector modes to support
15815 concatenation of two vectors.
15816 * config/s390/s390-protos.h (s390_expand_merge_perm_const): Add
15818 (s390_expand_merge): Likewise.
15819 * config/s390/s390.c (s390_expand_merge_perm_const): New function.
15820 (s390_expand_merge): New function.
15821 * config/s390/s390.md (UNSPEC_VEC_MERGEH, UNSPEC_VEC_MERGEL):
15822 Remove constant definitions.
15823 * config/s390/vector.md (V_HW_2): Add mode iterators.
15824 (VI_HW_4, V_HW_4): Rename VI_HW_4 to V_HW_4.
15825 (vec_2x_nelts, vec_2x_wide): New mode attributes.
15826 (*vmrhb, *vmrlb, *vmrhh, *vmrlh, *vmrhf, *vmrlf, *vmrhg, *vmrlg):
15827 New pattern definitions.
15828 (vec_widen_umult_lo_<mode>, vec_widen_umult_hi_<mode>)
15829 (vec_widen_smult_lo_<mode>, vec_widen_smult_hi_<mode>)
15830 (vec_unpacks_lo_v4sf, vec_unpacks_hi_v4sf, vec_unpacks_lo_v2df)
15831 (vec_unpacks_hi_v2df): Adjust expanders to emit non-unspec RTX for
15833 * config/s390/vx-builtins.md (V_HW_4): Remove mode iterator. Now
15835 (vec_mergeh<mode>, vec_mergel<mode>): Use s390_expand_merge to
15836 emit vec merge pattern.
15838 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
15840 * config/aarch64/aarch64.c (aarch64_strip_extend_vec_half):
15842 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent cost of
15843 vec_select high-half from being added into Neon multiply
15845 * rtlanal.c (vec_series_highpart_p): Define.
15846 * rtlanal.h (vec_series_highpart_p): Declare.
15848 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
15850 * config/aarch64/aarch64.c (aarch64_strip_duplicate_vec_elt):
15852 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent
15853 vec_select cost from being added into Neon multiply cost.
15855 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
15857 * tree-vect-loop.c (vect_better_loop_vinfo_p): Detect cases in
15858 which old_loop_vinfo is an epilogue loop that handles a constant
15859 number of iterations.
15861 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
15863 * tree-vect-loop.c (vect_analyze_loop): Print a dump message
15864 when a reanalyzed loop fails to be cheaper than the current
15867 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
15869 * config/aarch64/aarch64.c: Fix a typo.
15871 2021-08-04 Vincent Lefèvre <vincent-gcc@vinc17.net>
15873 PR gcov-profile/101773
15874 * gcov-io.c (gcov_close): Check return code of a fclose.
15876 2021-08-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
15879 * dwarf2out.c (dwarf2out_assembly_start): Emit a dummy
15880 .file statement when needed.
15882 2021-08-04 Richard Biener <rguenther@suse.de>
15884 * tree-vect-data-refs.c (vect_check_gather_scatter):
15885 Include widening conversions only when the result is
15886 still handed by native gather or the current offset
15887 size not already matches the data size.
15888 Also succeed analysis in case there's no native support,
15889 noted by a IFN_LAST ifn and a NULL decl.
15890 (vect_analyze_data_refs): Always consider gathers.
15891 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
15892 Test for no IFN gather rather than decl gather.
15893 * tree-vect-stmts.c (vect_model_load_cost): Pass in the
15894 gather-scatter info and cost emulated gathers accordingly.
15895 (vect_truncate_gather_scatter_offset): Properly test for
15897 (vect_use_strided_gather_scatters_p): Likewise.
15898 (get_load_store_type): Handle emulated gathers and its
15900 (vectorizable_load): Likewise. Emulate them by extracting
15901 scalar offsets, doing scalar loads and a vector construct.
15903 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
15906 * expr.c (op_by_pieces_d::op_by_pieces_d): Add a max_pieces
15907 argument to set m_max_size.
15908 (move_by_pieces_d): Pass MOVE_MAX_PIECES to op_by_pieces_d.
15909 (store_by_pieces_d): Pass STORE_MAX_PIECES to op_by_pieces_d.
15910 (compare_by_pieces_d): Pass COMPARE_MAX_PIECES to op_by_pieces_d.
15912 2021-08-04 Roger Sayle <roger@nextmovesoftware.com>
15913 Marc Glisse <marc.glisse@inria.fr>
15915 * match.pd (bit_ior, bit_xor): Canonicalize (X*C1)|(X*C2) and
15916 (X*C1)^(X*C2) as X*(C1+C2), and related variants, using
15917 tree_nonzero_bits to ensure that operands are bit-wise disjoint.
15919 2021-08-04 Richard Biener <rguenther@suse.de>
15921 * tree-ssa-forwprop.c (pass_forwprop::execute): Split
15922 out code to decompose vector loads ...
15923 (optimize_vector_load): ... here. Generalize it to
15924 handle intermediate widening and TARGET_MEM_REF loads
15925 and apply it to loads with a supported vector mode as well.
15927 2021-08-04 Richard Biener <rguenther@suse.de>
15929 PR tree-optimization/101756
15930 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Make sure
15931 the result of the reduction epilogue is compatible to the original
15934 2021-08-04 liuhongt <hongtao.liu@intel.com>
15937 * config/i386/i386.md (peephole2): Refine predicate from
15938 register_operand to general_reg_operand.
15940 2021-08-04 Aldy Hernandez <aldyh@redhat.com>
15942 * gimple-range-path.h (path_range_query::dump): Mark override.
15944 2021-08-04 Richard Biener <rguenther@suse.de>
15946 PR tree-optimization/101769
15947 * tree-tailcall.c (eliminate_tail_call): Add the created loop
15948 for the first recursion and return it via the new output parameter.
15949 (optimize_tail_call): Pass through new output param.
15950 (tree_optimize_tail_calls_1): After creating all latches,
15951 add the created loop to the loop tree. Do not mark loops for fixup.
15953 2021-08-04 Martin Liska <mliska@suse.cz>
15955 * doc/invoke.texi: Document threader-mode param.
15957 2021-08-04 liuhongt <hongtao.liu@intel.com>
15959 * config/i386/sse.md (cond_fma<mode>): New expander.
15960 (cond_fms<mode>): Ditto.
15961 (cond_fnma<mode>): Ditto.
15962 (cond_fnms<mode>): Ditto.
15964 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
15966 * config/rs6000/vsx.md (*vsx_le_perm_store_<mode>): Use && instead of &.
15968 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
15970 * config/rs6000/constraints.md: Remove "e" from the list of available
15971 constraint characters.
15973 2021-08-03 Eugene Rozenfeld <erozen@microsoft.com>
15975 PR gcov-profile/71672
15976 * auto-profile.c (afdo_indirect_call): Fix setup of the historgram value for indirect calls.
15978 2021-08-03 Paul A. Clarke <pc@us.ibm.com>
15980 * config/rs6000/smmintrin.h (_mm_minpos_epu16): New.
15982 2021-08-03 H.J. Lu <hjl.tools@gmail.com>
15984 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): In 64-bit mode,
15985 try XMM31 to avoid vzeroupper.
15987 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15989 * doc/invoke.texi: Document -mtune=neoverse-512tvb and
15990 -mcpu=neoverse-512tvb.
15991 * config/aarch64/aarch64-cores.def (neoverse-512tvb): New entry.
15992 * config/aarch64/aarch64-tune.md: Regenerate.
15993 * config/aarch64/aarch64.c (neoverse512tvb_sve_vector_cost)
15994 (neoverse512tvb_sve_issue_info, neoverse512tvb_vec_issue_info)
15995 (neoverse512tvb_vector_cost, neoverse512tvb_tunings): New structures.
15996 (aarch64_adjust_body_cost_sve): Handle -mtune=neoverse-512tvb.
15997 (aarch64_adjust_body_cost): Likewise.
15999 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16001 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Only
16002 record issue information for operations that occur in the
16005 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16007 * config/aarch64/aarch64.c (aarch64_multiply_add_p): Add a vec_flags
16008 parameter. Detect cases in which an Advanced SIMD MLA would almost
16009 certainly require a MOV.
16010 (aarch64_count_ops): Update accordingly.
16012 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16014 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction): New
16015 function, split out from...
16016 (aarch64_detect_vector_stmt_subtype): ...here.
16017 (aarch64_add_stmt_cost): Treat extracting element 0 as free.
16019 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16021 * config/aarch64/aarch64-protos.h (sve_vec_cost):
16022 Add gather_load_x32_cost and gather_load_x64_cost.
16023 * config/aarch64/aarch64.c (generic_sve_vector_cost)
16024 (a64fx_sve_vector_cost, neoversev1_sve_vector_cost): Update
16025 accordingly, using the values given by the scalar_load * number
16026 of elements calculation that we used previously.
16027 (aarch64_detect_vector_stmt_subtype): Use the new fields.
16029 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16031 * config/aarch64/aarch64.c (aarch64_adjust_body_cost_sve): New
16032 function, split out from...
16033 (aarch64_adjust_body_cost): ...here.
16035 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16037 * config/aarch64/fractional-cost.h: New file.
16038 * config/aarch64/aarch64.c: Include <algorithm> (indirectly)
16039 and cost_fraction.h.
16040 (vec_cost_fraction): New typedef.
16041 (aarch64_detect_scalar_stmt_subtype): Use it for statement costs.
16042 (aarch64_detect_vector_stmt_subtype): Likewise.
16043 (aarch64_sve_adjust_stmt_cost, aarch64_adjust_stmt_cost): Likewise.
16044 (aarch64_estimate_min_cycles_per_iter): Use vec_cost_fraction
16046 (aarch64_adjust_body_cost): Likewise.
16047 (aarch64_test_cost_fraction): New function.
16048 (aarch64_run_selftests): Call it.
16050 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
16052 * config/aarch64/aarch64-protos.h (tune_params::sve_width): Turn
16054 * config/aarch64/aarch64.c (aarch64_cmp_autovec_modes): Update
16056 (aarch64_estimated_poly_value): Likewise. Use the least significant
16057 set bit for the minimum and likely values. Use the most significant
16058 set bit for the maximum value.
16060 2021-08-03 liuhongt <hongtao.liu@intel.com>
16062 * config/i386/sse.md (cond_<insn><mode>): New expander.
16063 (cond_mul<mode>): Ditto.
16065 2021-08-03 Kewen Lin <linkw@linux.ibm.com>
16067 * tree-cfg.c (move_sese_region_to_fn): Fix typos on dloop.
16069 2021-08-03 liuhongt <hongtao.liu@intel.com>
16071 * config/i386/sse.md (cond_<insn><mode>):New expander.
16072 (cond_mul<mode>): Ditto.
16073 (cond_div<mode>): Ditto.
16075 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
16077 * config/i386/i386.c (ix86_finalize_stack_frame_flags): Also
16078 check stack_realign_needed for stack realignment.
16079 (ix86_legitimate_constant_p): Always allow CONST_WIDE_INT smaller
16080 than the largest integer supported by vector register.
16081 * config/i386/i386.h (MAX_MOVE_MAX): New. Set to 64.
16082 (MOVE_MAX): Set to bytes of the largest integer supported by
16084 (STORE_MAX_PIECES): New.
16086 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
16088 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
16089 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
16090 data from one memory location to another.
16092 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
16094 PR middle-end/90773
16095 * config/i386/i386.c (TARGET_GEN_MEMSET_SCRATCH_RTX): New.
16097 2021-08-02 Aldy Hernandez <aldyh@redhat.com>
16099 PR tree-optimization/101724
16100 * params.opt: Remove --param=threader-iterative.
16101 * tree-ssa-threadbackward.c (pass_thread_jumps::execute): Remove
16104 2021-08-02 Tom de Vries <tdevries@suse.de>
16106 PR middle-end/101665
16107 * doc/extend.texi (nonnull attribute): Improve documentation.
16109 2021-08-02 Andrew Pinski <apinski@marvell.com>
16111 PR rtl-optimization/101683
16112 * rtlanal.c (may_trap_p_1): Handle UNSIGNED_FIX.
16114 2021-08-02 Roger Sayle <roger@nextmovesoftware.com>
16116 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern):
16117 Renamed from cond_removal_in_popcount_clz_ctz_pattern.
16118 Add support for BSWAP, FFS, PARITY and CLRSB builtins.
16119 (tree_ssa_phiop_worker): Update call to function above.
16121 2021-08-01 H.J. Lu <hjl.tools@gmail.com>
16124 * config/i386/i386.md (bsr_rex64_1_zext): New.
16125 (combine splitter for constant - clzll): Replace gen_bsr_rex64_1
16126 with gen_bsr_rex64_1_zext.
16128 2021-07-31 Jakub Jelinek <jakub@redhat.com>
16131 * config/i386/i386.md (bsr_rex64_1, bsr_1, bsr_zext_1): New
16132 define_insn patterns.
16133 (*bsr_rex64_2, *bsr_2): New define_insn_and_split patterns.
16134 Add combine splitters for constant - clz.
16135 (clz<mode>2): Use a temporary pseudo for bsr result.
16137 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
16139 * config/rs6000/smmintrin.h (_mm_floor_pd, _mm_floor_ps,
16140 _mm_floor_sd, _mm_floor_ss): New.
16142 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
16144 * config/rs6000/smmintrin.h (_mm_ceil_pd, _mm_ceil_ps,
16145 _mm_ceil_sd, _mm_ceil_ss): New.
16147 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
16149 * config/rs6000/smmintrin.h (_mm_blend_pd, _mm_blendv_pd,
16150 _mm_blend_ps, _mm_blendv_ps): New.
16152 2021-07-30 Roger Sayle <roger@nextmovesoftware.com>
16153 Uroš Bizjak <ubizjak@gmail.com>
16155 * config/i386/i386.md (*dec_cmov<mode>): New define_insn_and_split
16156 to generate a conditional move using the carry flag after sub $1.
16157 (peephole2): Eliminate a register-to-register move by inverting
16158 the condition of a conditional move.
16160 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
16162 * config/mmix/mmix.md ("call", "call_value", "*call_real")
16163 ("*call_value_real"): Don't generate rtx mentioning the generic
16164 operands 1 and 2 to "call", and similarly for "call_value".
16165 * config/mmix/mmix.c (mmix_print_operand_punct_valid_p)
16166 (mmix_print_operand): Use '!' instead of 'p'.
16168 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
16170 * doc/md.texi (call): Correct information about operand 2.
16171 * config/mmix/mmix.md ("call", "call_value"): Remove fixed FIXMEs.
16173 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
16175 * range-op.cc (operator_trunc_mod::wi_fold): Fold constants.
16177 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
16179 * range-op.cc (operator_div::wi_fold): Return UNDEFINED for [0, 0] divisor.
16181 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
16183 * gimple-range-cache.cc (*::set_bb_range): Change const basic_block to
16184 const_basic_block..
16185 (*::get_bb_range): Ditto.
16186 (*::bb_range_p): Ditto.
16187 * gimple-range-cache.h: Change prototypes.
16189 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
16191 PR middle-end/90773
16192 * builtins.c (builtin_memcpy_read_str): Change the mode argument
16193 from scalar_int_mode to fixed_size_mode.
16194 (builtin_strncpy_read_str): Likewise.
16195 (gen_memset_value_from_prev): New function.
16196 (builtin_memset_read_str): Change the mode argument from
16197 scalar_int_mode to fixed_size_mode. Use gen_memset_value_from_prev
16198 and support CONST_VECTOR.
16199 (builtin_memset_gen_str): Likewise.
16200 (try_store_by_multiple_pieces): Use by_pieces_constfn to declare
16202 * builtins.h (builtin_strncpy_read_str): Replace scalar_int_mode
16203 with fixed_size_mode.
16204 (builtin_memset_read_str): Likewise.
16205 * expr.c (widest_int_mode_for_size): Renamed to ...
16206 (widest_fixed_size_mode_for_size): Add a bool argument to
16207 indicate if QI vector mode can be used.
16208 (by_pieces_ninsns): Call widest_fixed_size_mode_for_size
16209 instead of widest_int_mode_for_size.
16210 (pieces_addr::adjust): Change the mode argument from
16211 scalar_int_mode to fixed_size_mode.
16212 (op_by_pieces_d): Make m_len read-only. Add a bool member,
16213 m_qi_vector_mode, to indicate that QI vector mode can be used.
16214 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
16215 initialize m_qi_vector_mode. Call widest_fixed_size_mode_for_size
16216 instead of widest_int_mode_for_size.
16217 (op_by_pieces_d::get_usable_mode): Change the mode argument from
16218 scalar_int_mode to fixed_size_mode. Call
16219 widest_fixed_size_mode_for_size instead of
16220 widest_int_mode_for_size.
16221 (op_by_pieces_d::smallest_fixed_size_mode_for_size): New member
16222 function to return the smallest integer or QI vector mode.
16223 (op_by_pieces_d::run): Call widest_fixed_size_mode_for_size
16224 instead of widest_int_mode_for_size. Call
16225 smallest_fixed_size_mode_for_size instead of
16226 smallest_int_mode_for_size.
16227 (store_by_pieces_d::store_by_pieces_d): Add a bool argument to
16228 indicate that QI vector mode can be used and pass it to
16229 op_by_pieces_d::op_by_pieces_d.
16230 (can_store_by_pieces): Call widest_fixed_size_mode_for_size
16231 instead of widest_int_mode_for_size. Pass memsetp to
16232 widest_fixed_size_mode_for_size to support QI vector mode.
16233 Allow all CONST_VECTORs for memset if vec_duplicate is supported.
16234 (store_by_pieces): Pass memsetp to
16235 store_by_pieces_d::store_by_pieces_d.
16236 (clear_by_pieces_1): Removed.
16237 (clear_by_pieces): Replace clear_by_pieces_1 with
16238 builtin_memset_read_str and pass true to store_by_pieces_d to
16239 support vector mode broadcast.
16240 (string_cst_read_str): Change the mode argument from
16241 scalar_int_mode to fixed_size_mode.
16242 * expr.h (by_pieces_constfn): Change scalar_int_mode to
16244 (by_pieces_prev): Likewise.
16245 * rtl.h (lowpart_subreg_regno): New.
16246 * rtlanal.c (lowpart_subreg_regno): New. A wrapper around
16247 simplify_subreg_regno.
16248 * target.def (gen_memset_scratch_rtx): New hook.
16249 * doc/tm.texi.in: Add TARGET_GEN_MEMSET_SCRATCH_RTX.
16250 * doc/tm.texi: Regenerated.
16252 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
16255 * config/mips/mips.c (mips_atomic_assign_expand_fenv): Use
16256 TARGET_EXPR instead of MODIFY_EXPR.
16258 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
16261 * config/mips/mips-protos.h (mips_expand_vec_cmp_expr): Declare.
16262 * config/mips/mips.c (mips_expand_vec_cmp_expr): New function.
16263 * config/mips/mips-msa.md (vec_cmp<MSA:mode><mode_i>): New
16265 (vec_cmpu<IMSA:mode><mode_i>): New expander.
16267 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
16270 * config/i386/i386-options.c (ix86_option_override_internal):
16271 Don't enable LZCNT/POPCNT if they have been disabled explicitly.
16273 2021-07-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
16276 * config/arm/arm_neon.h (vld1_p64): Replace call to builtin by
16277 explicitly dereferencing __a.
16278 (vld1_s64): Likewise.
16279 (vld1_u64): Likewise.
16280 * config/arm/arm_neon_builtins.def (vld1): Remove entry for di
16281 and change to VAR13.
16283 2021-07-30 Aldy Hernandez <aldyh@redhat.com>
16285 * gimple-loop-versioning.cc (lv_dom_walker::lv_dom_walker): Remove
16286 use of m_range_analyzer.
16287 (loop_versioning::lv_dom_walker::before_dom_children): Same.
16288 (loop_versioning::lv_dom_walker::after_dom_children): Remove.
16289 (loop_versioning::prune_loop_conditions): Replace vr_values use
16290 with range_query interface.
16291 (pass_loop_versioning::execute): Use ranger.
16293 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
16296 * ipa-devirt.c (ipa_odr_read_section): Compare the precision of
16297 enum values, and emit a warning if they mismatch.
16299 2021-07-30 Kewen Lin <linkw@linux.ibm.com>
16301 * cfgloop.h (as_const): New function.
16302 (class loop_iterator): Rename to ...
16303 (class loops_list): ... this.
16304 (loop_iterator::next): Rename to ...
16305 (loops_list::Iter::fill_curr_loop): ... this and adjust.
16306 (loop_iterator::loop_iterator): Rename to ...
16307 (loops_list::loops_list): ... this and adjust.
16308 (loops_list::Iter): New class.
16309 (loops_list::iterator): New type.
16310 (loops_list::const_iterator): New type.
16311 (loops_list::begin): New function.
16312 (loops_list::end): Likewise.
16313 (loops_list::begin const): Likewise.
16314 (loops_list::end const): Likewise.
16315 (FOR_EACH_LOOP): Remove.
16316 (FOR_EACH_LOOP_FN): Remove.
16317 * cfgloop.c (flow_loops_dump): Adjust FOR_EACH_LOOP* with range-based
16318 for loop with loops_list instance.
16319 (sort_sibling_loops): Likewise.
16320 (disambiguate_loops_with_multiple_latches): Likewise.
16321 (verify_loop_structure): Likewise.
16322 * cfgloopmanip.c (create_preheaders): Likewise.
16323 (force_single_succ_latches): Likewise.
16324 * config/aarch64/falkor-tag-collision-avoidance.c
16325 (execute_tag_collision_avoidance): Likewise.
16326 * config/mn10300/mn10300.c (mn10300_scan_for_setlb_lcc): Likewise.
16327 * config/s390/s390.c (s390_adjust_loops): Likewise.
16328 * doc/loop.texi: Likewise.
16329 * gimple-loop-interchange.cc (pass_linterchange::execute): Likewise.
16330 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
16331 * gimple-loop-versioning.cc (loop_versioning::analyze_blocks): Likewise.
16332 (loop_versioning::make_versioning_decisions): Likewise.
16333 * gimple-ssa-split-paths.c (split_paths): Likewise.
16334 * graphite-isl-ast-to-gimple.c (graphite_regenerate_ast_isl): Likewise.
16335 * graphite.c (canonicalize_loop_form): Likewise.
16336 (graphite_transform_loops): Likewise.
16337 * ipa-fnsummary.c (analyze_function_body): Likewise.
16338 * ipa-pure-const.c (analyze_function): Likewise.
16339 * loop-doloop.c (doloop_optimize_loops): Likewise.
16340 * loop-init.c (loop_optimizer_finalize): Likewise.
16341 (fix_loop_structure): Likewise.
16342 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
16343 (move_loop_invariants): Likewise.
16344 * loop-unroll.c (decide_unrolling): Likewise.
16345 (unroll_loops): Likewise.
16346 * modulo-sched.c (sms_schedule): Likewise.
16347 * predict.c (predict_loops): Likewise.
16348 (pass_profile::execute): Likewise.
16349 * profile.c (branch_prob): Likewise.
16350 * sel-sched-ir.c (sel_finish_pipelining): Likewise.
16351 (sel_find_rgns): Likewise.
16352 * tree-cfg.c (replace_loop_annotate): Likewise.
16353 (replace_uses_by): Likewise.
16354 (move_sese_region_to_fn): Likewise.
16355 * tree-if-conv.c (pass_if_conversion::execute): Likewise.
16356 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
16357 * tree-parloops.c (parallelize_loops): Likewise.
16358 * tree-predcom.c (tree_predictive_commoning): Likewise.
16359 * tree-scalar-evolution.c (scev_initialize): Likewise.
16360 (scev_reset): Likewise.
16361 * tree-ssa-dce.c (find_obviously_necessary_stmts): Likewise.
16362 * tree-ssa-live.c (remove_unused_locals): Likewise.
16363 * tree-ssa-loop-ch.c (ch_base::copy_headers): Likewise.
16364 * tree-ssa-loop-im.c (analyze_memory_references): Likewise.
16365 (tree_ssa_lim_initialize): Likewise.
16366 * tree-ssa-loop-ivcanon.c (canonicalize_induction_variables): Likewise.
16367 * tree-ssa-loop-ivopts.c (tree_ssa_iv_optimize): Likewise.
16368 * tree-ssa-loop-manip.c (get_loops_exits): Likewise.
16369 * tree-ssa-loop-niter.c (estimate_numbers_of_iterations): Likewise.
16370 (free_numbers_of_iterations_estimates): Likewise.
16371 * tree-ssa-loop-prefetch.c (tree_ssa_prefetch_arrays): Likewise.
16372 * tree-ssa-loop-split.c (tree_ssa_split_loops): Likewise.
16373 * tree-ssa-loop-unswitch.c (tree_ssa_unswitch_loops): Likewise.
16374 * tree-ssa-loop.c (gate_oacc_kernels): Likewise.
16375 (pass_scev_cprop::execute): Likewise.
16376 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
16377 * tree-ssa-sccvn.c (do_rpo_vn): Likewise.
16378 * tree-ssa-threadupdate.c
16379 (jump_thread_path_registry::thread_through_all_blocks): Likewise.
16380 * tree-vectorizer.c (vectorize_loops): Likewise.
16381 * tree-vrp.c (vrp_asserts::find_assert_locations): Likewise.
16383 2021-07-29 Hans-Peter Nilsson <hp@bitrange.com>
16385 * config/mmix/mmix.c (mmix_function_arg_1): Avoid
16386 generating a VOIDmode register for e.g the
16387 function_arg_info::end_marker.
16389 2021-07-29 Jeff Law <jeffreyalaw@gmail.com>
16391 * config/h8300/h8300-modes.def: Add CCZ, CCV and CCC, drop CCZNV.
16392 * config/h8300/h8300.md (H8cc mode iterator): Add CCZ.
16393 (cc mode_attr): Similarly.
16394 (ccz subst_attr): Similarly.
16395 * config/h8300/jumpcall.md: Add new patterns for branch-on-bit.
16396 * config/h8300/testcompare.md: Remove various cc0 based patterns
16397 that had been commented out. Add pattern to set CCZ from a bit
16400 2021-07-29 Thomas Schwinge <thomas@codesourcery.com>
16401 Julian Brown <julian@codesourcery.com>
16402 Kwok Cheung Yeung <kcy@codesourcery.com>
16404 * omp-offload.c (oacc_loop_xform_head_tail, oacc_loop_process):
16405 'update_stmt' after modification.
16406 (pass_oacc_loop_designation): New function, extracted out of...
16407 (pass_oacc_device_lower): ... this.
16408 (pass_data_oacc_loop_designation, pass_oacc_loop_designation)
16409 (make_pass_oacc_loop_designation): New
16410 * passes.def: Add it.
16411 * tree-parloops.c (create_parallel_loop): Adjust.
16412 * tree-pass.h (make_pass_oacc_loop_designation): New.
16414 2021-07-29 Aldy Hernandez <aldyh@redhat.com>
16416 * flag-types.h (enum threader_mode): New.
16417 * params.opt: Add entry for --param=threader-mode.
16418 * tree-ssa-threadbackward.c (THREADER_ITERATIVE_MODE): New.
16419 (class back_threader): New.
16420 (back_threader::back_threader): New.
16421 (back_threader::~back_threader): New.
16422 (back_threader::maybe_register_path): New.
16423 (back_threader::find_taken_edge): New.
16424 (back_threader::find_taken_edge_switch): New.
16425 (back_threader::find_taken_edge_cond): New.
16426 (back_threader::resolve_def): New.
16427 (back_threader::resolve_phi): New.
16428 (back_threader::find_paths_to_names): New.
16429 (back_threader::find_paths): New.
16432 (thread_jumps::find_jump_threads_backwards): Call ranger threader.
16433 (thread_jumps::find_jump_threads_backwards_with_ranger): New.
16434 (pass_thread_jumps::execute): Abstract out code...
16435 (try_thread_blocks): ...here.
16436 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
16437 Abstract out threading candidate code to...
16438 (single_succ_to_potentially_threadable_block): ...here.
16439 * tree-ssa-threadedge.h (single_succ_to_potentially_threadable_block):
16441 * tree-ssa-threadupdate.c (register_jump_thread): Return boolean.
16442 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
16443 Return bool from register_jump_thread.
16445 2021-07-29 Andreas Krebbel <krebbel@linux.ibm.com>
16447 * target.def: in0 and in1 do not need to be registers.
16448 * doc/tm.texi: Regenerate.
16450 2021-07-29 liuhongt <hongtao.liu@intel.com>
16453 * config/i386/i386.c (ix86_widen_mult_cost): New function.
16454 (ix86_add_stmt_cost): Use ix86_widen_mult_cost for
16457 2021-07-29 Jiufu Guo <guojiufu@linux.ibm.com>
16460 * config/rs6000/rs6000.c (TARGET_PREFERRED_DOLOOP_MODE): New hook.
16461 (rs6000_preferred_doloop_mode): New hook.
16462 * doc/tm.texi: Regenerate.
16463 * doc/tm.texi.in: Add hook preferred_doloop_mode.
16464 * target.def (preferred_doloop_mode): New hook.
16465 * targhooks.c (default_preferred_doloop_mode): New hook.
16466 * targhooks.h (default_preferred_doloop_mode): New hook.
16467 * tree-ssa-loop-ivopts.c (compute_doloop_base_on_mode): New function.
16468 (add_iv_candidate_for_doloop): Call targetm.preferred_doloop_mode
16469 and compute_doloop_base_on_mode.
16471 2021-07-28 Martin Sebor <msebor@redhat.com>
16473 PR middle-end/101494
16474 * tree-ssa-uninit.c (maybe_warn_operand): Correct object offset
16475 and size computation.
16477 2021-07-28 Martin Sebor <msebor@redhat.com>
16479 PR middle-end/101601
16480 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Remove
16482 Handle pointers to functions.
16484 2021-07-28 Martin Sebor <msebor@redhat.com>
16486 * Makefile.in (OBJS): Add gimple-ssa-warn-access.o and pointer-query.o.
16487 * attribs.h (fndecl_dealloc_argno): Move fndecl_dealloc_argno to tree.h.
16488 * builtins.c (compute_objsize_r): Move to pointer-query.cc.
16489 (access_ref::access_ref): Same.
16490 (access_ref::phi): Same.
16491 (access_ref::get_ref): Same.
16492 (access_ref::size_remaining): Same.
16493 (access_ref::offset_in_range): Same.
16494 (access_ref::add_offset): Same.
16495 (access_ref::inform_access): Same.
16496 (ssa_name_limit_t::visit_phi): Same.
16497 (ssa_name_limit_t::leave_phi): Same.
16498 (ssa_name_limit_t::next): Same.
16499 (ssa_name_limit_t::next_phi): Same.
16500 (ssa_name_limit_t::~ssa_name_limit_t): Same.
16501 (pointer_query::pointer_query): Same.
16502 (pointer_query::get_ref): Same.
16503 (pointer_query::put_ref): Same.
16504 (pointer_query::flush_cache): Same.
16505 (warn_string_no_nul): Move to gimple-ssa-warn-access.cc.
16506 (check_nul_terminated_array): Same.
16507 (unterminated_array): Same.
16508 (maybe_warn_for_bound): Same.
16509 (check_read_access): Same.
16510 (warn_for_access): Same.
16511 (get_size_range): Same.
16512 (check_access): Same.
16513 (gimple_call_alloc_size): Move to tree.c.
16514 (gimple_parm_array_size): Move to pointer-query.cc.
16515 (get_offset_range): Same.
16516 (gimple_call_return_array): Same.
16517 (handle_min_max_size): Same.
16518 (handle_array_ref): Same.
16519 (handle_mem_ref): Same.
16520 (compute_objsize): Same.
16521 (gimple_call_alloc_p): Move to gimple-ssa-warn-access.cc.
16522 (call_dealloc_argno): Same.
16523 (fndecl_dealloc_argno): Same.
16524 (new_delete_mismatch_p): Same.
16525 (matching_alloc_calls_p): Same.
16526 (warn_dealloc_offset): Same.
16527 (maybe_emit_free_warning): Same.
16528 * builtins.h (check_nul_terminated_array): Move to
16529 gimple-ssa-warn-access.h.
16530 (check_nul_terminated_array): Same.
16531 (warn_string_no_nul): Same.
16532 (unterminated_array): Same.
16533 (class ssa_name_limit_t): Same.
16534 (class pointer_query): Same.
16535 (struct access_ref): Same.
16536 (class range_query): Same.
16537 (struct access_data): Same.
16538 (gimple_call_alloc_size): Same.
16539 (gimple_parm_array_size): Same.
16540 (compute_objsize): Same.
16541 (class access_data): Same.
16542 (maybe_emit_free_warning): Same.
16543 * calls.c (initialize_argument_information): Remove call to
16544 maybe_emit_free_warning.
16545 * gimple-array-bounds.cc: Include new header..
16546 * gimple-fold.c: Same.
16547 * gimple-ssa-sprintf.c: Same.
16548 * gimple-ssa-warn-restrict.c: Same.
16549 * passes.def: Add pass_warn_access.
16550 * tree-pass.h (make_pass_warn_access): Declare.
16551 * tree-ssa-strlen.c: Include new headers.
16552 * tree.c (fndecl_dealloc_argno): Move here from builtins.c.
16553 * tree.h (fndecl_dealloc_argno): Move here from attribs.h.
16554 * gimple-ssa-warn-access.cc: New file.
16555 * gimple-ssa-warn-access.h: New file.
16556 * pointer-query.cc: New file.
16557 * pointer-query.h: New file.
16559 2021-07-28 Jakub Jelinek <jakub@redhat.com>
16561 PR middle-end/101624
16562 * ubsan.c (maybe_instrument_pointer_overflow,
16563 instrument_object_size): Only test DECL_REGISTER on VAR_DECLs,
16564 PARM_DECLs or RESULT_DECLs.
16565 * sanopt.c (maybe_optimize_ubsan_ptr_ifn): Likewise.
16567 2021-07-28 Jakub Jelinek <jakub@redhat.com>
16569 PR middle-end/101642
16570 * match.pd (bswap16 (x) == bswap16 (y)): Cast both operands
16571 to type of bswap16 for comparison.
16572 (bswap16 (x) == cst): Cast bswap16 operand to type of cst.
16574 2021-07-28 Richard Biener <rguenther@suse.de>
16576 PR tree-optimization/101615
16577 * tree-vect-slp.c (vect_optimize_slp): Materialize permutes
16578 at CTOR SLP graph entries.
16580 2021-07-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
16582 * config/aarch64/aarch64.md (*extend<SHORT:mode><GPI:mode>2_aarch64):
16583 Add "r,w" alternative.
16585 2021-07-28 H.J. Lu <hjl.tools@gmail.com>
16588 * config/i386/i386.c (ix86_avx_u128_mode_needed): Don't set
16589 AVX_U128_DIRTY when all bits are zero.
16591 2021-07-28 Richard Biener <rguenther@suse.de>
16593 PR tree-optimization/101615
16594 * tree-vect-slp.c (vect_optimize_slp): Pre-existing vector
16595 external nodes cannot be permuted so make them perm_out 0.
16597 2021-07-28 Andrew Stubbs <ams@codesourcery.com>
16600 * config.in: Regenerate.
16601 * config/gcn/gcn-hsa.h (A_FIJI): New define.
16602 (A_900): New define.
16603 (A_906): New define.
16604 (A_908): New define.
16605 (ASM_SPEC): Use A_FIJI, A_900, A_906 and A_908.
16606 * config/gcn/gcn.c (output_file_start): Adjust attributes according
16607 to the assembler capabilities.
16608 * config/gcn/mkoffload.c (main): Likewise.
16609 * configure: Regenerate.
16610 * configure.ac: Add tests for LLVM assembler attribute features.
16612 2021-07-28 Andrew MacLeod <amacleod@redhat.com>
16614 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Check for
16615 cond_false and cond_true on branches.
16617 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
16619 * config/aarch64/aarch64.c (aarch64_gen_adjusted_ldpstp): use
16622 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
16624 * alias.c (init_alias_analysis): Don't skip prologue/epilogue.
16626 2021-07-28 Jakub Jelinek <jakub@redhat.com>
16629 * config/i386/sse.md (vashr<mode>3): Split into vashrv8di3 expander
16630 and vashrv4di3 expander, where the latter requires just TARGET_AVX2
16631 and has special !TARGET_AVX512VL expansion.
16632 (vashrv2di3<mask_name>): Rename to ...
16633 (vashrv2di3): ... this. Change condition to TARGET_XOP || TARGET_AVX2
16634 and add special !TARGET_XOP && !TARGET_AVX512VL expansion.
16636 2021-07-28 Martin Uecker <muecker@gwdg.de>
16638 * calls.c (maybe_warn_rdwr_sizes): Correct argument
16639 numbers in warning that were switched.
16641 2021-07-28 Kewen Lin <linkw@linux.ibm.com>
16643 PR tree-optimization/101596
16644 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Fix wrong check
16645 by using new_type's precision instead.
16647 2021-07-28 liuhongt <hongtao.liu@intel.com>
16650 * config/i386/i386.h (processor_costs): Add new member
16652 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
16653 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
16654 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
16655 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
16656 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
16657 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
16658 generic_cost, core_cost): Initialize integer_to_sse same value
16660 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
16661 * config/i386/i386.c (ix86_builtin_vectorization_cost):
16662 Use integer_to_sse instead of sse_op to calculate the cost of
16665 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
16667 * config/rs6000/rs6000-gen-builtins.c (write_ovld_static_init): New
16669 (write_init_file): Call write_ovld_static_init.
16671 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
16673 * config/rs6000/rs6000-gen-builtins.c (write_bif_static_init): New
16675 (write_init_file): Call write_bif_static_init.
16677 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
16679 * config/rs6000/rs6000-gen-builtins.c (typemap): New struct.
16680 (TYPE_MAP_SIZE): New macro.
16681 (type_map): New initialized variable.
16682 (typemap_cmp): New function.
16683 (write_type_node): Likewise.
16684 (write_fntype_init): Implement.
16686 2021-07-27 Martin Sebor <msebor@redhat.com>
16688 PR tree-optimization/101584
16689 * tree-ssa-uninit.c (builtin_call_nomodifying_p): New function.
16690 (check_defs): Call it.
16692 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
16694 * tree-ssa-dom.c (dom_jump_threader_simplifier):
16695 Put avail_exprs_stack in the class, instead of passing it to
16696 jump_threader_simplifier.
16697 (dom_jump_threader_simplifier::simplify): Add state argument.
16698 (dom_opt_dom_walker): Add state.
16699 (pass_dominator::execute): Pass state to threader.
16700 (dom_opt_dom_walker::before_dom_children): Use state.
16701 * tree-ssa-threadedge.c (jump_threader::jump_threader): Replace
16702 arguments by state.
16703 (jump_threader::record_temporary_equivalences_from_phis):
16704 Register equivalences through the state variable.
16705 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
16706 Record ranges in a statement through the state variable.
16707 (jump_threader::simplify_control_stmt_condition): Pass state to
16709 (jump_threader::simplify_control_stmt_condition_1): Same.
16710 (jump_threader::thread_around_empty_blocks): Remove obsolete
16712 (jump_threader::thread_through_normal_block): Record equivalences
16713 on edge through the state variable.
16714 (jump_threader::thread_across_edge): Abstract state pushing.
16715 (jt_state::jt_state): New.
16716 (jt_state::push): New.
16717 (jt_state::pop): New.
16718 (jt_state::register_equiv): New.
16719 (jt_state::record_ranges_from_stmt): New.
16720 (jt_state::register_equivs_on_edge): New.
16721 (jump_threader_simplifier::jump_threader_simplifier): Move from
16723 (jump_threader_simplifier::simplify): Add state argument.
16724 * tree-ssa-threadedge.h (class jt_state): New.
16725 (class jump_threader): Add state to constructor.
16726 (class jump_threader_simplifier): Add state to simplify. Remove
16727 avail_exprs_stack from class.
16728 * tree-vrp.c (vrp_jump_threader_simplifier::simplify): Add state
16730 (vrp_jump_threader::vrp_jump_threader): Add state.
16731 (vrp_jump_threader::~vrp_jump_threader): Cleanup state.
16733 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
16735 * Makefile.in (OBJS): Add gimple-range-path.o.
16736 * gimple-range-path.cc: New file.
16737 * gimple-range-path.h: New file.
16739 2021-07-27 Jonathan Wright <jonathan.wright@arm.com>
16741 * config/aarch64/aarch64-simd.md: Push sign/zero-extension
16742 inside vec_duplicate for all patterns.
16743 * simplify-rtx.c (simplify_context::simplify_unary_operation_1):
16744 Push sign/zero-extension inside vec_duplicate.
16746 2021-07-27 Richard Biener <rguenther@suse.de>
16748 PR tree-optimization/101573
16749 * tree-ssa-uninit.c (warn_uninit_phi_uses): New function
16750 looking at uninitialized PHI arg defs in some constrained cases.
16751 (warn_uninitialized_vars): Call it.
16752 (execute_early_warn_uninitialized): Calculate dominators.
16754 2021-07-27 Richard Biener <rguenther@suse.de>
16756 PR tree-optimization/39821
16757 * tree-vect-stmts.c (vect_model_promotion_demotion_cost): Use
16758 vector_stmt for widening arithmetic.
16759 (vectorizable_conversion): Adjust.
16761 2021-07-27 Martin Jambor <mjambor@suse.cz>
16763 * cgraph.h (ipa_replace_map): New field force_load_ref.
16764 * ipa-prop.h (ipa_param_descriptor): Reduce precision of move_cost,
16765 aded new flag load_dereferenced, adjusted comments.
16766 (ipa_get_param_dereferenced): New function.
16767 (ipa_set_param_dereferenced): Likewise.
16768 * cgraphclones.c (cgraph_node::create_virtual_clone): Follow it.
16769 * ipa-cp.c: Include gimple.h.
16770 (ipcp_discover_new_direct_edges): Take into account dereferenced flag.
16771 (get_replacement_map): New parameter force_load_ref, set the
16772 appropriate flag in ipa_replace_map if set.
16773 (struct symbol_and_index_together): New type.
16774 (adjust_refs_in_act_callers): New function.
16775 (adjust_references_in_caller): Likewise.
16776 (create_specialized_node): When appropriate, call
16777 adjust_references_in_caller and force only load references.
16778 * ipa-prop.c (load_from_dereferenced_name): New function.
16779 (ipa_analyze_controlled_uses): Also detect loads from a
16780 dereference, harden testing of call statements.
16781 (ipa_write_node_info): Stream the dereferenced flag.
16782 (ipa_read_node_info): Likewise.
16783 (ipa_set_jf_constant): Also create refdesc when jump function
16784 references a variable.
16785 (cgraph_node_for_jfunc): Rename to symtab_node_for_jfunc, work
16786 also on references of variables and return a symtab_node. Adjust
16788 (propagate_controlled_uses): Also remove references to VAR_DECLs.
16790 2021-07-27 Jakub Jelinek <jakub@redhat.com>
16792 PR middle-end/101586
16793 * gimple-fold.c (clear_padding_type): Ignore FIELD_DECLs with byte
16794 positions above or equal to sz except for diagnostics of flexible
16797 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
16799 PR tree-optimization/78888
16800 * gimple-range-fold.cc (get_letter_range): New.
16801 (fold_using_range::range_of_builtin_call): Call get_letter_range.
16803 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
16805 PR tree-optimization/78888
16806 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Add cases
16807 for CFN_BUILT_IN_TOUPPER and CFN_BUILT_IN_TOLOWER.
16809 2021-07-26 Roger Sayle <roger@nextmovesoftware.com>
16810 Marc Glisse <marc.glisse@inria.fr>
16812 * match.pd (rotate): Simplify equality/inequality of rotations.
16813 (bswap): Simplify equality/inequality tests of byte swapping.
16815 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
16817 * range-op.cc (operator_bitwise_xor::op1_op2_relation_effect):
16820 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
16822 * range-op.cc (operator_lshift::fold_range): Pass rel to
16823 base class fold_range.
16824 (operator_rshift::fold_range): Same.
16826 2021-07-26 Ashimida <ashimida@linux.alibaba.com>
16829 * toplev.h (min_align_loops_log): Remove declaration.
16830 (min_align_jumps_log, min_align_labels_log): Likewise.
16831 (min_align_functions_log): Likewise.
16833 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
16835 * tree-vrp.c (vrp_simplify_cond_using_ranges): Rename vr_values
16837 (execute_vrp): Abstract out simplification of conditionals...
16838 (simplify_casted_conds): ...here.
16840 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
16842 * gimple-array-bounds.cc (array_bounds_checker::get_value_range):
16843 Add gimple argument.
16844 (array_bounds_checker::check_array_ref): Same.
16845 (array_bounds_checker::check_addr_expr): Same.
16846 (array_bounds_checker::check_array_bounds): Pass statement to
16847 check_array_bounds and check_addr_expr.
16848 * gimple-array-bounds.h (check_array_bounds): Add gimple argument.
16849 (check_addr_expr): Same.
16850 (get_value_range): Same.
16852 2021-07-26 Tamar Christina <tamar.christina@arm.com>
16854 * config/aarch64/aarch64-simd-builtins.def (sdot, udot): Rename to..
16855 (sdot_prod, udot_prod): ... This.
16856 * config/aarch64/aarch64-simd.md (aarch64_<sur>dot<vsi2qi>): Merged
16858 (<sur>dot_prod<vsi2qi>): ... this.
16859 (aarch64_<sur>dot_lane<vsi2qi>, aarch64_<sur>dot_laneq<vsi2qi>):
16860 Change operands order.
16861 (<sur>sadv16qi): Use new operands order.
16862 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32,
16863 vdotq_s32): Use new RTL ordering.
16865 2021-07-26 Tamar Christina <tamar.christina@arm.com>
16867 * config/aarch64/aarch64-builtins.c (TYPES_TERNOP_SUSS,
16868 aarch64_types_ternop_suss_qualifiers): New.
16869 * config/aarch64/aarch64-simd-builtins.def (usdot_prod): Use it.
16870 * config/aarch64/aarch64-simd.md (usdot_prod<vsi2qi>): Re-organize RTL.
16871 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Use it.
16873 2021-07-23 Jakub Jelinek <jakub@redhat.com>
16875 PR rtl-optimization/101562
16876 * expmed.c (store_integral_bit_field): Only use movstrict_optab
16877 if the operand isn't paradoxical.
16879 2021-07-23 Aldy Hernandez <aldyh@redhat.com>
16881 * gimple-array-bounds.h (class array_bounds_checker): Change
16882 ranges type to range_query.
16884 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16886 * config/aarch64/arm_neon.h (vst1_s64_x2): Use
16887 __builtin_memcpy instead of constructing
16888 __builtin_aarch64_simd_oi one vector at a time.
16889 (vst1_u64_x2): Likewise.
16890 (vst1_f64_x2): Likewise.
16891 (vst1_s8_x2): Likewise.
16892 (vst1_p8_x2): Likewise.
16893 (vst1_s16_x2): Likewise.
16894 (vst1_p16_x2): Likewise.
16895 (vst1_s32_x2): Likewise.
16896 (vst1_u8_x2): Likewise.
16897 (vst1_u16_x2): Likewise.
16898 (vst1_u32_x2): Likewise.
16899 (vst1_f16_x2): Likewise.
16900 (vst1_f32_x2): Likewise.
16901 (vst1_p64_x2): Likewise.
16902 (vst1q_s8_x2): Likewise.
16903 (vst1q_p8_x2): Likewise.
16904 (vst1q_s16_x2): Likewise.
16905 (vst1q_p16_x2): Likewise.
16906 (vst1q_s32_x2): Likewise.
16907 (vst1q_s64_x2): Likewise.
16908 (vst1q_u8_x2): Likewise.
16909 (vst1q_u16_x2): Likewise.
16910 (vst1q_u32_x2): Likewise.
16911 (vst1q_u64_x2): Likewise.
16912 (vst1q_f16_x2): Likewise.
16913 (vst1q_f32_x2): Likewise.
16914 (vst1q_f64_x2): Likewise.
16915 (vst1q_p64_x2): Likewise.
16917 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16919 * config/aarch64/arm_neon.h (vst1_s64_x3): Use
16920 __builtin_memcpy instead of constructing
16921 __builtin_aarch64_simd_ci one vector at a time.
16922 (vst1_u64_x3): Likewise.
16923 (vst1_f64_x3): Likewise.
16924 (vst1_s8_x3): Likewise.
16925 (vst1_p8_x3): Likewise.
16926 (vst1_s16_x3): Likewise.
16927 (vst1_p16_x3): Likewise.
16928 (vst1_s32_x3): Likewise.
16929 (vst1_u8_x3): Likewise.
16930 (vst1_u16_x3): Likewise.
16931 (vst1_u32_x3): Likewise.
16932 (vst1_f16_x3): Likewise.
16933 (vst1_f32_x3): Likewise.
16934 (vst1_p64_x3): Likewise.
16935 (vst1q_s8_x3): Likewise.
16936 (vst1q_p8_x3): Likewise.
16937 (vst1q_s16_x3): Likewise.
16938 (vst1q_p16_x3): Likewise.
16939 (vst1q_s32_x3): Likewise.
16940 (vst1q_s64_x3): Likewise.
16941 (vst1q_u8_x3): Likewise.
16942 (vst1q_u16_x3): Likewise.
16943 (vst1q_u32_x3): Likewise.
16944 (vst1q_u64_x3): Likewise.
16945 (vst1q_f16_x3): Likewise.
16946 (vst1q_f32_x3): Likewise.
16947 (vst1q_f64_x3): Likewise.
16948 (vst1q_p64_x3): Likewise.
16950 2021-07-23 H.J. Lu <hjl.tools@gmail.com>
16953 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): Don't return
16954 hard register when LRA is in progress.
16956 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16958 * config/aarch64/arm_neon.h (vst1_s8_x4): Use
16959 __builtin_memcpy instead of using a union.
16960 (vst1q_s8_x4): Likewise.
16961 (vst1_s16_x4): Likewise.
16962 (vst1q_s16_x4): Likewise.
16963 (vst1_s32_x4): Likewise.
16964 (vst1q_s32_x4): Likewise.
16965 (vst1_u8_x4): Likewise.
16966 (vst1q_u8_x4): Likewise.
16967 (vst1_u16_x4): Likewise.
16968 (vst1q_u16_x4): Likewise.
16969 (vst1_u32_x4): Likewise.
16970 (vst1q_u32_x4): Likewise.
16971 (vst1_f16_x4): Likewise.
16972 (vst1q_f16_x4): Likewise.
16973 (vst1_f32_x4): Likewise.
16974 (vst1q_f32_x4): Likewise.
16975 (vst1_p8_x4): Likewise.
16976 (vst1q_p8_x4): Likewise.
16977 (vst1_p16_x4): Likewise.
16978 (vst1q_p16_x4): Likewise.
16979 (vst1_s64_x4): Likewise.
16980 (vst1_u64_x4): Likewise.
16981 (vst1_p64_x4): Likewise.
16982 (vst1q_s64_x4): Likewise.
16983 (vst1q_u64_x4): Likewise.
16984 (vst1q_p64_x4): Likewise.
16985 (vst1_f64_x4): Likewise.
16986 (vst1q_f64_x4): Likewise.
16988 2021-07-23 Jonathan Wrightt <jonathan.wright@arm.com>
16990 * config/aarch64/arm_neon.h (vst2_s64): Use __builtin_memcpy
16991 instead of constructing __builtin_aarch64_simd_oi one vector
16993 (vst2_u64): Likewise.
16994 (vst2_f64): Likewise.
16995 (vst2_s8): Likewise.
16996 (vst2_p8): Likewise.
16997 (vst2_s16): Likewise.
16998 (vst2_p16): Likewise.
16999 (vst2_s32): Likewise.
17000 (vst2_u8): Likewise.
17001 (vst2_u16): Likewise.
17002 (vst2_u32): Likewise.
17003 (vst2_f16): Likewise.
17004 (vst2_f32): Likewise.
17005 (vst2_p64): Likewise.
17006 (vst2q_s8): Likewise.
17007 (vst2q_p8): Likewise.
17008 (vst2q_s16): Likewise.
17009 (vst2q_p16): Likewise.
17010 (vst2q_s32): Likewise.
17011 (vst2q_s64): Likewise.
17012 (vst2q_u8): Likewise.
17013 (vst2q_u16): Likewise.
17014 (vst2q_u32): Likewise.
17015 (vst2q_u64): Likewise.
17016 (vst2q_f16): Likewise.
17017 (vst2q_f32): Likewise.
17018 (vst2q_f64): Likewise.
17019 (vst2q_p64): Likewise.
17021 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17023 * config/aarch64/arm_neon.h (vst3_s64): Use __builtin_memcpy
17024 instead of constructing __builtin_aarch64_simd_ci one vector
17026 (vst3_u64): Likewise.
17027 (vst3_f64): Likewise.
17028 (vst3_s8): Likewise.
17029 (vst3_p8): Likewise.
17030 (vst3_s16): Likewise.
17031 (vst3_p16): Likewise.
17032 (vst3_s32): Likewise.
17033 (vst3_u8): Likewise.
17034 (vst3_u16): Likewise.
17035 (vst3_u32): Likewise.
17036 (vst3_f16): Likewise.
17037 (vst3_f32): Likewise.
17038 (vst3_p64): Likewise.
17039 (vst3q_s8): Likewise.
17040 (vst3q_p8): Likewise.
17041 (vst3q_s16): Likewise.
17042 (vst3q_p16): Likewise.
17043 (vst3q_s32): Likewise.
17044 (vst3q_s64): Likewise.
17045 (vst3q_u8): Likewise.
17046 (vst3q_u16): Likewise.
17047 (vst3q_u32): Likewise.
17048 (vst3q_u64): Likewise.
17049 (vst3q_f16): Likewise.
17050 (vst3q_f32): Likewise.
17051 (vst3q_f64): Likewise.
17052 (vst3q_p64): Likewise.
17054 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17056 * config/aarch64/arm_neon.h (vst4_s64): Use __builtin_memcpy
17057 instead of constructing __builtin_aarch64_simd_xi one vector
17059 (vst4_u64): Likewise.
17060 (vst4_f64): Likewise.
17061 (vst4_s8): Likewise.
17062 (vst4_p8): Likewise.
17063 (vst4_s16): Likewise.
17064 (vst4_p16): Likewise.
17065 (vst4_s32): Likewise.
17066 (vst4_u8): Likewise.
17067 (vst4_u16): Likewise.
17068 (vst4_u32): Likewise.
17069 (vst4_f16): Likewise.
17070 (vst4_f32): Likewise.
17071 (vst4_p64): Likewise.
17072 (vst4q_s8): Likewise.
17073 (vst4q_p8): Likewise.
17074 (vst4q_s16): Likewise.
17075 (vst4q_p16): Likewise.
17076 (vst4q_s32): Likewise.
17077 (vst4q_s64): Likewise.
17078 (vst4q_u8): Likewise.
17079 (vst4q_u16): Likewise.
17080 (vst4q_u32): Likewise.
17081 (vst4q_u64): Likewise.
17082 (vst4q_f16): Likewise.
17083 (vst4q_f32): Likewise.
17084 (vst4q_f64): Likewise.
17085 (vst4q_p64): Likewise.
17087 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17089 * config/aarch64/arm_neon.h (vtbx4_s8): Use __builtin_memcpy
17090 instead of constructing __builtin_aarch64_simd_oi one vector
17092 (vtbx4_u8): Likewise.
17093 (vtbx4_p8): Likewise.
17095 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17097 * config/aarch64/arm_neon.h (vtbl3_s8): Use __builtin_memcpy
17098 instead of constructing __builtin_aarch64_simd_oi one vector
17100 (vtbl3_u8): Likewise.
17101 (vtbl3_p8): Likewise.
17102 (vtbl4_s8): Likewise.
17103 (vtbl4_u8): Likewise.
17104 (vtbl4_p8): Likewise.
17106 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17108 * config/aarch64/arm_neon.h (vqtbx2_s8): Use __builtin_memcpy
17109 instead of constructing __builtin_aarch64_simd_oi one vector
17111 (vqtbx2_u8): Likewise.
17112 (vqtbx2_p8): Likewise.
17113 (vqtbx2q_s8): Likewise.
17114 (vqtbx2q_u8): Likewise.
17115 (vqtbx2q_p8): Likewise.
17116 (vqtbx3_s8): Use __builtin_memcpy instead of constructing
17117 __builtin_aarch64_simd_ci one vector at a time.
17118 (vqtbx3_u8): Likewise.
17119 (vqtbx3_p8): Likewise.
17120 (vqtbx3q_s8): Likewise.
17121 (vqtbx3q_u8): Likewise.
17122 (vqtbx3q_p8): Likewise.
17123 (vqtbx4_s8): Use __builtin_memcpy instead of constructing
17124 __builtin_aarch64_simd_xi one vector at a time.
17125 (vqtbx4_u8): Likewise.
17126 (vqtbx4_p8): Likewise.
17127 (vqtbx4q_s8): Likewise.
17128 (vqtbx4q_u8): Likewise.
17129 (vqtbx4q_p8): Likewise.
17131 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
17133 * config/aarch64/arm_neon.h (vqtbl2_s8): Use __builtin_memcpy
17134 instead of constructing __builtin_aarch64_simd_oi one vector
17136 (vqtbl2_u8): Likewise.
17137 (vqtbl2_p8): Likewise.
17138 (vqtbl2q_s8): Likewise.
17139 (vqtbl2q_u8): Likewise.
17140 (vqtbl2q_p8): Likewise.
17141 (vqtbl3_s8): Use __builtin_memcpy instead of constructing
17142 __builtin_aarch64_simd_ci one vector at a time.
17143 (vqtbl3_u8): Likewise.
17144 (vqtbl3_p8): Likewise.
17145 (vqtbl3q_s8): Likewise.
17146 (vqtbl3q_u8): Likewise.
17147 (vqtbl3q_p8): Likewise.
17148 (vqtbl4_s8): Use __builtin_memcpy instead of constructing
17149 __builtin_aarch64_simd_xi one vector at a time.
17150 (vqtbl4_u8): Likewise.
17151 (vqtbl4_p8): Likewise.
17152 (vqtbl4q_s8): Likewise.
17153 (vqtbl4q_u8): Likewise.
17154 (vqtbl4q_p8): Likewise.
17156 2021-07-23 Haochen Gui <guihaoc@gcc.gnu.org>
17159 * config/rs6000/rs6000.md (cstore<mode>4): Fix wrong fall through.
17161 2021-07-22 Andrew Pinski <apinski@marvell.com>
17163 PR tree-optimization/10153
17164 * tree-tailcall.c (create_tailcall_accumulator):
17165 Don't call fold_convert as the type should be correct already.
17166 (tree_optimize_tail_calls_1): Use build_{one,zero}_cst instead
17167 of integer_{one,zero}_node for the call of create_tailcall_accumulator.
17169 2021-07-22 Aldy Hernandez <aldyh@redhat.com>
17171 * gimple-range-cache.cc (non_null_ref::adjust_range): Replace
17172 varying_p check for null/non-null check.
17174 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
17176 PR tree-optimization/101511
17177 * value-relation.cc (relation_oracle::query_relation): Check if ssa1
17178 is in ssa2's equiv set, and don't trap if so.
17180 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
17182 PR tree-optimization/101497
17183 * gimple-range-fold.cc (fold_using_range::range_of_cond_expr): Check
17186 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
17188 PR tree-optimization/101496
17189 * vr-values.c (simplify_using_ranges::fold_cond): Call range_of_stmt
17190 first, then vrp_visit_cond_Stmt.
17192 2021-07-22 liuhongt <hongtao.liu@intel.com>
17194 * config/i386/i386-expand.c
17195 (ix86_broadcast_from_integer_constant): Rename to ..
17196 (ix86_broadcast_from_constant): .. this, and extend it to
17198 (ix86_expand_vector_move): Extend to float mode.
17199 * config/i386/i386-features.c
17200 (replace_constant_pool_with_broadcast): Remove.
17201 (remove_partial_avx_dependency_gate): Ditto.
17202 (constant_pool_broadcast): Ditto.
17203 (class pass_constant_pool_broadcast): Ditto.
17204 (make_pass_constant_pool_broadcast): Ditto.
17205 (remove_partial_avx_dependency): Adjust gate.
17206 * config/i386/i386-passes.def: Remove pass_constant_pool_broadcast.
17207 * config/i386/i386-protos.h
17208 (make_pass_constant_pool_broadcast): Remove.
17210 2021-07-22 liuhongt <hongtao.liu@intel.com>
17212 * config/i386/constraints.md (Wb): New constraint.
17214 * config/i386/i386.md (*ashlhi3_1): Extend to avx512 mask
17216 (*ashlqi3_1): Ditto.
17217 (*<insn><mode>3_1): Split to ..
17218 (*ashr<mode>3_1): this, ...
17219 (*lshr<mode>3_1): and this, also extend this pattern to avx512
17221 (*<insn><mode>3_1): Split to ..
17222 (*ashr<mode>3_1): this, ...
17223 (*lshrqi3_1): and this, also extend this pattern to avx512
17225 (*lshrhi3_1): And this, also extend this pattern to avx512
17227 * config/i386/sse.md (k<code><mode>): New define_split after
17228 it to convert generic shift pattern to mask shift ones.
17230 2021-07-21 Thomas Schwinge <thomas@codesourcery.com>
17231 Joseph Myers <joseph@codesourcery.com>
17232 Cesar Philippidis <cesar@codesourcery.com>
17234 * tree-core.h (omp_clause_code): Add 'OMP_CLAUSE_NOHOST'.
17235 * tree.c (omp_clause_num_ops, omp_clause_code_name, walk_tree_1):
17237 * tree-pretty-print.c (dump_omp_clause): Likewise.
17238 * omp-general.c (oacc_verify_routine_clauses): Likewise.
17239 * gimplify.c (gimplify_scan_omp_clauses)
17240 (gimplify_adjust_omp_clauses): Likewise.
17241 * tree-nested.c (convert_nonlocal_omp_clauses)
17242 (convert_local_omp_clauses): Likewise.
17243 * omp-low.c (scan_sharing_clauses): Likewise.
17244 * omp-offload.c (execute_oacc_device_lower): Update.
17246 2021-07-21 Martin Sebor <msebor@redhat.com>
17248 * tree-ssa-alias.c (walk_aliased_vdefs_1): Fix typos in a comment.
17250 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17252 * config/rs6000/rs6000-gen-builtins.c (write_init_bif_table):
17255 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17257 * config/rs6000/rs6000-gen-builtins.c (write_fntype): New
17259 (write_fntype_init): New stub function.
17260 (write_init_bif_table): Likewise.
17261 (write_init_ovld_table): New function.
17262 (write_init_file): Implement.
17264 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17266 * config/rs6000/rs6000-gen-builtins.c
17267 (write_autogenerated_header): New function.
17268 (write_decls): Likewise.
17269 (write_extern_fntype): New callback function.
17270 (write_header_file): Implement.
17272 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17274 * config/rs6000/rs6000-gen-builtins.c (write_defines_file):
17277 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17279 * config/rs6000/rs6000-gen-builtins.c (complete_vector_type): New
17281 (complete_base_type): Likewise.
17282 (construct_fntype_id): Likewise.
17283 (parse_bif_entry): Call contruct_fntype_id.
17284 (parse_ovld_entry): Likewise.
17286 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17288 * config/rs6000/rs6000-gen-builtins.c (ovld_stanza): New struct.
17289 (MAXOVLDSTANZAS): New macro.
17290 (ovld_stanzas): New variable.
17291 (curr_ovld_stanza): Likewise.
17292 (MAXOVLDS): New macro.
17293 (ovlddata): New struct.
17294 (ovlds): New variable.
17295 (curr_ovld): Likewise.
17296 (max_ovld_args): Likewise.
17297 (parse_ovld_entry): New function.
17298 (parse_ovld_stanza): Likewise.
17299 (parse_ovld): Implement.
17301 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17303 * config/rs6000/rs6000-gen-builtins.c (parse_bif_attrs):
17306 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17308 * config/rs6000/rs6000-gen-builtins.c (parse_args): New function.
17309 (parse_prototype): Implement.
17311 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17313 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): New enum.
17314 (curr_bif_stanza): New variable.
17315 (stanza_entry): New struct.
17316 (stanza_map): New initialized variable.
17317 (enable_string): Likewise.
17318 (fnkinds): New enum.
17319 (typelist): New struct.
17320 (attrinfo): Likewise.
17321 (MAXRESTROPNDS): New macro.
17322 (prototype): New struct.
17323 (MAXBIFS): New macro.
17324 (bifdata): New struct.
17325 (bifs): New variable.
17326 (curr_bif): Likewise.
17327 (bif_order): Likewise.
17328 (bif_index): Likewise.
17329 (fatal): New function.
17330 (stanza_name_to_stanza): Likewise.
17331 (parse_bif_attrs): New stub function.
17332 (parse_prototype): Likewise.
17333 (parse_bif_entry): New function.
17334 (parse_bif_stanza): Likewise.
17335 (parse_bif): Implement.
17336 (set_bif_order): New function.
17337 (create_bif_order): Implement.
17339 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17341 * config/rs6000/rs6000-gen-builtins.c (rbtree.h): New #include.
17342 (num_bifs): New variable.
17343 (num_ovld_stanzas): Likewise.
17344 (num_ovlds): Likewise.
17345 (parse_codes): New enum.
17346 (bif_rbt): New variable.
17347 (ovld_rbt): Likewise.
17348 (fntype_rbt): Likewise.
17349 (bifo_rbt): Likewise.
17350 (parse_bif): New stub function.
17351 (create_bif_order): Likewise.
17352 (parse_ovld): Likewise.
17353 (write_header_file): Likewise.
17354 (write_init_file): Likewise.
17355 (write_defines_file): Likewise.
17356 (delete_output_files): New function.
17359 2021-07-21 H.J. Lu <hjl.tools@gmail.com>
17362 * config/i386/i386-builtin.def: Remove OPTION_MASK_ISA_SSE4_2
17363 from CRC32 _builtin functions.
17365 2021-07-21 Sebastian Huber <sebastian.huber@embedded-brains.de>
17367 * coverage.c (build_gcov_info_var_registration): Mark the object placed
17368 in the linker set as referenced so that it does not get optimized away.
17370 2021-07-21 Kito Cheng <kito.cheng@sifive.com>
17373 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
17375 * config.gcc (riscv*-*-*): Detect which python is available.
17377 2021-07-21 Jakub Jelinek <jakub@redhat.com>
17379 PR middle-end/101535
17380 * gimplify.c (omp_check_private): Properly skip ORT_TARGET_DATA
17381 contexts in which decl isn't privatized and for ORT_TARGET return
17382 false if decl is mapped.
17384 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
17386 * gimple-loop-jam.c: Include tree-ssa-sccvn.h.
17387 (tree_loop_unroll_and_jam): Run value-numbering on a loop that
17388 has been successfully unrolled.
17390 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
17392 * tree-ssa-loop-manip.c (determine_exit_conditions): Return a null
17393 exit condition if no tail loop is needed, and if the original exit
17394 condition should therefore be kept as-is.
17395 (tree_transform_and_unroll_loop): Handle that case here too.
17397 2021-07-21 Kewen Lin <linkw@linux.ibm.com>
17399 * tree-data-ref.c (free_dependence_relations): Adjust to pass vec
17401 (free_data_refs): Likewise.
17402 * tree-data-ref.h (free_dependence_relations): Likewise.
17403 (free_data_refs): Likewise.
17404 * tree-predcom.c (struct chain): Use auto_vec instead of vec for
17406 (struct component): Likewise.
17407 (pcom_worker::pcom_worker): Adjust for auto_vec and renaming changes.
17408 (pcom_worker::~pcom_worker): Likewise.
17409 (pcom_worker::release_chain): Adjust as auto_vec changes.
17410 (pcom_worker::loop): Rename to ...
17411 (pcom_worker::m_loop): ... this.
17412 (pcom_worker::datarefs): Rename to ...
17413 (pcom_worker::m_datarefs): ... this. Use auto_vec instead of vec.
17414 (pcom_worker::dependences): Rename to ...
17415 (pcom_worker::m_dependences): ... this. Use auto_vec instead of vec.
17416 (pcom_worker::chains): Rename to ...
17417 (pcom_worker::m_chains): ... this. Use auto_vec instead of vec.
17418 (pcom_worker::looparound_phis): Rename to ...
17419 (pcom_worker::m_looparound_phis): ... this. Use auto_vec instead of
17421 (pcom_worker::cache): Rename to ...
17422 (pcom_worker::m_cache): ... this. Use auto_vec instead of vec.
17423 (pcom_worker::release_chain): Adjust for auto_vec changes.
17424 (pcom_worker::release_chains): Adjust for auto_vec and renaming
17426 (release_component): Remove.
17427 (release_components): Adjust for release_component removal.
17428 (component_of): Adjust to use vec.
17429 (merge_comps): Likewise.
17430 (pcom_worker::aff_combination_dr_offset): Adjust for renaming changes.
17431 (pcom_worker::determine_offset): Likewise.
17432 (class comp_ptrs): Remove.
17433 (pcom_worker::split_data_refs_to_components): Adjust for renaming
17434 changes, for comp_ptrs removal with auto_vec.
17435 (pcom_worker::suitable_component_p): Adjust for renaming changes.
17436 (pcom_worker::filter_suitable_components): Adjust for release_component
17438 (pcom_worker::valid_initializer_p): Adjust for renaming changes.
17439 (pcom_worker::find_looparound_phi): Likewise.
17440 (pcom_worker::add_looparound_copies): Likewise.
17441 (pcom_worker::determine_roots_comp): Likewise.
17442 (pcom_worker::single_nonlooparound_use): Likewise.
17443 (pcom_worker::execute_pred_commoning_chain): Likewise.
17444 (pcom_worker::execute_pred_commoning): Likewise.
17445 (pcom_worker::try_combine_chains): Likewise.
17446 (pcom_worker::prepare_initializers_chain): Likewise.
17447 (pcom_worker::prepare_initializers): Likewise.
17448 (pcom_worker::prepare_finalizers_chain): Likewise.
17449 (pcom_worker::prepare_finalizers): Likewise.
17450 (pcom_worker::tree_predictive_commoning_loop): Likewise.
17452 2021-07-20 Martin Sebor <msebor@redhat.com>
17454 PR middle-end/101397
17455 * builtins.c (gimple_call_return_array): Add argument. Correct
17456 offsets for memchr, mempcpy, stpcpy, and stpncpy.
17457 (compute_objsize_r): Adjust offset computation for argument returning
17460 2021-07-20 Martin Sebor <msebor@redhat.com>
17462 PR middle-end/101300
17463 * tree-ssa-uninit.c (check_defs): Handle UBSAN built-ins.
17465 2021-07-20 Jeff Law <jlaw@localhost.localdomain>
17467 * function.c (assign_parm_setup_block): Use adjust_address instead
17468 of change_address to preserve MEM_EXPR and friends.
17470 2021-07-20 Martin Sebor <msebor@redhat.com>
17472 * cfgloop.h (single_likely_exit): Adjust by-value argument to
17473 by-const-reference.
17474 * cfgloopanal.c (single_likely_exit): Same.
17475 * cgraph.h (struct cgraph_node): Same.
17476 * cgraphclones.c (cgraph_node::create_virtual_clone): Same.
17477 * genautomata.c (merge_states): Same.
17478 * genextract.c (VEC_char_to_string): Same.
17479 * genmatch.c (dt_node::gen_kids_1): Same.
17480 (walk_captures): Adjust by-value argument to by-reference.
17481 * gimple-ssa-store-merging.c (check_no_overlap): Adjust by-value argument
17482 to by-const-reference.
17483 * gimple.c (gimple_build_call_vec): Same.
17484 (gimple_build_call_internal_vec): Same.
17485 (gimple_build_switch): Same.
17486 (sort_case_labels): Same.
17487 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
17489 * gimple.h (gimple_build_call_vec): Adjust by-value argument to
17490 by-const-reference.
17491 (gimple_build_call_internal_vec): Same.
17492 (gimple_build_switch): Same.
17493 (sort_case_labels): Same.
17494 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
17496 * haifa-sched.c (calc_priorities): Adjust by-value argument to
17497 by-const-reference.
17498 (sched_init_luids): Same.
17499 (haifa_init_h_i_d): Same.
17500 * ipa-cp.c (ipa_get_indirect_edge_target_1): Same.
17501 (adjust_callers_for_value_intersection): Adjust by-value argument to
17503 (find_more_scalar_values_for_callers_subset): Adjust by-value argument to
17504 by-const-reference.
17505 (find_more_contexts_for_caller_subset): Same.
17506 (find_aggregate_values_for_callers_subset): Same.
17507 (copy_useful_known_contexts): Same.
17508 * ipa-fnsummary.c (remap_edge_summaries): Same.
17509 (remap_freqcounting_predicate): Same.
17510 * ipa-inline.c (add_new_edges_to_heap): Adjust by-value argument to
17512 * ipa-predicate.c (predicate::remap_after_inlining): Adjust by-value argument
17513 to by-const-reference.
17514 * ipa-predicate.h (predicate::remap_after_inlining): Same.
17515 * ipa-prop.c (ipa_find_agg_cst_for_param): Same.
17516 * ipa-prop.h (ipa_find_agg_cst_for_param): Same.
17517 * ira-build.c (ira_loop_tree_body_rev_postorder): Same.
17518 * read-rtl.c (add_overload_instance): Same.
17519 * rtl.h (native_decode_rtx): Same.
17520 (native_decode_vector_rtx): Same.
17521 * sched-int.h (sched_init_luids): Same.
17522 (haifa_init_h_i_d): Same.
17523 * simplify-rtx.c (native_decode_vector_rtx): Same.
17524 (native_decode_rtx): Same.
17525 * tree-call-cdce.c (gen_shrink_wrap_conditions): Same.
17526 (shrink_wrap_one_built_in_call_with_conds): Same.
17527 (shrink_wrap_conditional_dead_built_in_calls): Same.
17528 * tree-data-ref.c (create_runtime_alias_checks): Same.
17529 (compute_all_dependences): Same.
17530 * tree-data-ref.h (compute_all_dependences): Same.
17531 (create_runtime_alias_checks): Same.
17532 (index_in_loop_nest): Same.
17533 * tree-if-conv.c (mask_exists): Same.
17534 * tree-loop-distribution.c (class loop_distribution): Same.
17535 (loop_distribution::create_rdg_vertices): Same.
17536 (dump_rdg_partitions): Same.
17537 (debug_rdg_partitions): Same.
17538 (partition_contains_all_rw): Same.
17539 (loop_distribution::distribute_loop): Same.
17540 * tree-parloops.c (oacc_entry_exit_ok_1): Same.
17541 (oacc_entry_exit_single_gang): Same.
17542 * tree-ssa-loop-im.c (hoist_memory_references): Same.
17543 (loop_suitable_for_sm): Same.
17544 * tree-ssa-loop-niter.c (bound_index): Same.
17545 * tree-ssa-reassoc.c (update_ops): Same.
17546 (swap_ops_for_binary_stmt): Same.
17547 (rewrite_expr_tree): Same.
17548 (rewrite_expr_tree_parallel): Same.
17549 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Same.
17550 * tree-ssa-sccvn.h (ao_ref_init_from_vn_reference): Same.
17551 * tree-ssa-structalias.c (process_all_all_constraints): Same.
17552 (make_constraints_to): Same.
17553 (handle_lhs_call): Same.
17554 (find_func_aliases_for_builtin_call): Same.
17555 (sort_fieldstack): Same.
17556 (check_for_overlaps): Same.
17557 * tree-vect-loop-manip.c (vect_create_cond_for_align_checks): Same.
17558 (vect_create_cond_for_unequal_addrs): Same.
17559 (vect_create_cond_for_lower_bounds): Same.
17560 (vect_create_cond_for_alias_checks): Same.
17561 * tree-vect-slp-patterns.c (vect_validate_multiplication): Same.
17562 * tree-vect-slp.c (vect_analyze_slp_instance): Same.
17563 (vect_make_slp_decision): Same.
17564 (vect_slp_bbs): Same.
17565 (duplicate_and_interleave): Same.
17566 (vect_transform_slp_perm_load): Same.
17567 (vect_schedule_slp): Same.
17568 * tree-vectorizer.h (vect_transform_slp_perm_load): Same.
17569 (vect_schedule_slp): Same.
17570 (duplicate_and_interleave): Same.
17571 * tree.c (build_vector_from_ctor): Same.
17572 (build_vector): Same.
17573 (check_vector_cst): Same.
17574 (check_vector_cst_duplicate): Same.
17575 (check_vector_cst_fill): Same.
17576 (check_vector_cst_stepped): Same.
17577 * tree.h (build_vector_from_ctor): Same.
17579 2021-07-20 Jakub Jelinek <jakub@redhat.com>
17582 * config/rs6000/rs6000-protos.h (easy_altivec_constant): Change return
17583 type from bool to int.
17584 * config/rs6000/rs6000.c (vspltis_constant): Fix up handling the
17585 EASY_VECTOR_MSB case if either step or copies is not 1.
17586 (vspltis_shifted): Fix comment typo.
17587 (easy_altivec_constant): Change return type from bool to int, instead
17588 of returning true return byte size of the element mode that should be
17589 used to synthetize the constant.
17590 * config/rs6000/predicates.md (easy_vector_constant_msb): Require
17591 that vspltis_shifted is 0, handle the case where easy_altivec_constant
17592 assumes using different vector mode from CONST_VECTOR's mode.
17593 * config/rs6000/altivec.md (easy_vector_constant_msb splitter): Use
17594 easy_altivec_constant to determine mode in which -1 >> -1 should be
17595 performed, use rs6000_expand_vector_init instead of gen_vec_initv4sisi.
17597 2021-07-20 Richard Biener <rguenther@suse.de>
17600 * dwarf2out.h (dwarf_file_data): Add key member.
17601 * dwarf2out.c (dwarf_file_hasher::equal): Compare key.
17602 (dwarf_file_hasher::hash): Hash key.
17603 (lookup_filename): Remap the filename and store it in the
17604 filename member of dwarf_file_data when creating a new
17606 (file_name_acquire): Do not remap the filename again.
17607 (maybe_emit_file): Likewise.
17609 2021-07-20 Jonathan Wright <jonathan.wright@arm.com>
17611 * config/aarch64/aarch64-simd-builtins.def: Use two variant
17612 generators for all TBL/TBX intrinsics and rename to
17613 consistent forms: qtbl[1234] or qtbx[1234].
17614 * config/aarch64/aarch64-simd.md (aarch64_tbl1<mode>):
17616 (aarch64_qtbl1<mode>): This.
17617 (aarch64_tbx1<mode>): Rename to...
17618 (aarch64_qtbx1<mode>): This.
17619 (aarch64_tbl2v16qi): Delete.
17620 (aarch64_tbl3<mode>): Rename to...
17621 (aarch64_qtbl2<mode>): This.
17622 (aarch64_tbx4<mode>): Rename to...
17623 (aarch64_qtbx2<mode>): This.
17624 * config/aarch64/aarch64.c (aarch64_expand_vec_perm_1): Use
17625 renamed qtbl1 and qtbl2 RTL patterns.
17626 * config/aarch64/arm_neon.h (vqtbl1_p8): Use renamed qtbl1
17628 (vqtbl1_s8): Likewise.
17629 (vqtbl1_u8): Likewise.
17630 (vqtbl1q_p8): Likewise.
17631 (vqtbl1q_s8): Likewise.
17632 (vqtbl1q_u8): Likewise.
17633 (vqtbx1_s8): Use renamed qtbx1 RTL pattern.
17634 (vqtbx1_u8): Likewise.
17635 (vqtbx1_p8): Likewise.
17636 (vqtbx1q_s8): Likewise.
17637 (vqtbx1q_u8): Likewise.
17638 (vqtbx1q_p8): Likewise.
17639 (vtbl1_s8): Use renamed qtbl1 RTL pattern.
17640 (vtbl1_u8): Likewise.
17641 (vtbl1_p8): Likewise.
17642 (vtbl2_s8): Likewise
17643 (vtbl2_u8): Likewise.
17644 (vtbl2_p8): Likewise.
17645 (vtbl3_s8): Use renamed qtbl2 RTL pattern.
17646 (vtbl3_u8): Likewise.
17647 (vtbl3_p8): Likewise.
17648 (vtbl4_s8): Likewise.
17649 (vtbl4_u8): Likewise.
17650 (vtbl4_p8): Likewise.
17651 (vtbx2_s8): Use renamed qtbx2 RTL pattern.
17652 (vtbx2_u8): Likewise.
17653 (vtbx2_p8): Likewise.
17654 (vqtbl2_s8): Use renamed qtbl2 RTL pattern.
17655 (vqtbl2_u8): Likewise.
17656 (vqtbl2_p8): Likewise.
17657 (vqtbl2q_s8): Likewise.
17658 (vqtbl2q_u8): Likewise.
17659 (vqtbl2q_p8): Likewise.
17660 (vqtbx2_s8): Use renamed qtbx2 RTL pattern.
17661 (vqtbx2_u8): Likewise.
17662 (vqtbx2_p8): Likewise.
17663 (vqtbx2q_s8): Likewise.
17664 (vqtbx2q_u8): Likewise.
17665 (vqtbx2q_p8): Likewise.
17666 (vtbx4_s8): Likewise.
17667 (vtbx4_u8): Likewise.
17668 (vtbx4_p8): Likewise.
17670 2021-07-20 Uroš Bizjak <ubizjak@gmail.com>
17673 * config/i386/sync.md (define_peephole2 atomic_storedi_fpu):
17675 (define_peephole2 atomic_loaddi_fpu): Ditto.
17677 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
17679 * config.gcc (riscv*-*-*): Detect which python is available.
17681 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
17683 * config/rs6000/vsx.md (mulhs_<mode>): Rename to...
17684 (smul<mode>3_highpart): ... this.
17685 (mulhu_<mode>): Rename to...
17686 (umul<mode>3_highpart): ... this.
17687 * config/rs6000/rs6000-builtin.def (MULHS_V2DI, MULHS_V4SI,
17688 MULHU_V2DI, MULHU_V4SI): Adjust.
17690 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
17692 PR tree-optimization/100696
17693 * internal-fn.c (first_commutative_argument): Add info for IFN_MULH.
17694 * internal-fn.def (IFN_MULH): New internal function.
17695 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Add support to
17696 recog normal multiply highpart as IFN_MULH.
17697 * config/i386/i386.c (ix86_add_stmt_cost): Adjust for combined
17700 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
17702 * config/elfos.h (CTF_DEBUGGING_INFO): New definition.
17703 (BTF_DEBUGGING_INFO): Likewise.
17704 * doc/tm.texi.in: Document the new macros.
17705 * doc/tm.texi: Regenerated.
17706 * toplev.c: Guard initialization of debug hooks.
17708 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
17710 * flags.h (ctf_debuginfo_p): New function declaration.
17711 * opts.c (ctf_debuginfo_p): New function definition.
17713 2021-07-19 Andrew Stubbs <ams@codesourcery.com>
17716 * config/gcn/gcn-hsa.h (DRIVER_SELF_SPECS): New.
17717 (ASM_SPEC): Set -mattr for xnack and sram-ecc.
17718 * config/gcn/gcn-opts.h (enum sram_ecc_type): New.
17719 * config/gcn/gcn-valu.md: Add a warning comment.
17720 * config/gcn/gcn.c (gcn_option_override): Add "sorry" for -mxnack.
17721 (output_file_start): Add xnack and sram-ecc state to ".amdgcn_target".
17722 * config/gcn/gcn.md: Add a warning comment.
17723 * config/gcn/gcn.opt: Add -mxnack and -msram-ecc.
17724 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): Remove
17726 (EF_AMDGPU_XNACK): New.
17727 (EF_AMDGPU_SRAM_ECC): New.
17729 (copy_early_debug_info): Use elf_flags.
17730 (main): Handle -mxnack and -msram-ecc options.
17731 * doc/invoke.texi: Document -mxnack and -msram-ecc.
17733 2021-07-19 Andrew Pinski <apinski@marvell.com>
17736 * config/aarch64/aarch64.md (csneg3_uxtw_insn): Rename to ...
17737 (*cs<neg_not_cs>3_uxtw_insn4): and extend to NEG_NOT.
17739 2021-07-19 Richard Biener <rguenther@suse.de>
17741 PR tree-optimization/101505
17742 * tree-vect-patterns.c (vect_determine_precisions): Walk
17743 PHIs also for loop vectorization.
17745 2021-07-19 Richard Biener <rguenther@suse.de>
17747 * gimple.h (gimple_expr_type): Remove.
17748 * doc/gimple.texi: Remove gimple_expr_type documentation.
17750 2021-07-19 Richard Biener <rguenther@suse.de>
17752 * tree-ssa-sccvn.c (vn_reference_eq): Handle NULL vr->type.
17753 (ao_ref_init_from_vn_reference): Likewise.
17754 (fully_constant_reference): Likewise.
17755 (vn_reference_lookup_call): Do not set vr->type to random
17757 * tree-ssa-pre.c (compute_avail): Do not try to PRE calls
17759 * tree-vect-generic.c (expand_vector_piecewise): Pass in
17760 whether we expanded parallel.
17761 (expand_vector_parallel): Adjust.
17762 (expand_vector_addition): Likewise.
17763 (expand_vector_comparison): Likewise.
17764 (expand_vector_operation): Likewise.
17765 (expand_vector_scalar_condition): Likewise.
17766 (expand_vector_conversion): Likewise.
17768 2021-07-19 Richard Biener <rguenther@suse.de>
17770 * tree-vrp.c (register_edge_assert_for_2): Use the
17772 (vrp_folder::fold_predicate_in): Likewise.
17773 * vr-values.c (gimple_assign_nonzero_p): Likewise.
17774 (vr_values::extract_range_from_comparison): Likewise.
17775 (vr_values::extract_range_from_ubsan_builtin): Use the
17776 type of the first operand.
17777 (vr_values::extract_range_basic): Push down type
17778 computation, use the appropriate LHS.
17779 (vr_values::extract_range_from_assignment): Use the
17782 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
17785 * common/config/i386/i386-common.c (ix86_handle_option): For
17786 -mgeneral-regs-only, enable the GPR only instructions which are
17787 enabled implicitly by SSE ISAs unless they have been disabled
17790 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
17793 * config/i386/i386.c (ix86_check_avx_upper_stores): Moved before
17794 ix86_avx_u128_mode_needed.
17795 (ix86_avx_u128_mode_needed): Return AVX_U128_DIRTY if callee
17796 returns AVX register.
17798 2021-07-17 Jan Hubicka <hubicka@ucw.cz>
17800 * tree-ssa-structalias.c (handle_rhs_call): Support EAF_NOT_RETURNED.
17801 (handle_const_call): Liekise
17802 (handle_pure_call): Liekise
17804 2021-07-17 Andrew MacLeod <amacleod@redhat.com>
17806 PR tree-optimization/96542
17807 * range-op.cc (range_operator::wi_fold_in_parts): New.
17808 (range_operator::fold_range): Call wi_fold_in_parts.
17809 (operator_lshift::wi_fold): Fix broken lshift by [0,0].
17810 * range-op.h (wi_fold_in_parts): Add prototype.
17812 2021-07-16 David Malcolm <dmalcolm@redhat.com>
17814 * doc/analyzer.texi: Add __analyzer_dump_state.
17816 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17818 * config/rs6000/rbtree.c: New file.
17819 * config/rs6000/rbtree.h: New file.
17821 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17823 * config/rs6000/rs6000-gen-builtins.c (restriction): New enum.
17824 (typeinfo): Add restr field.
17825 (match_bracketed_pair): New function.
17826 (match_const_restriction): Implement.
17828 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17830 * config/rs6000/rs6000-gen-builtins.c (match_basetype): Implement.
17832 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17834 * config/rs6000/rs6000-gen-builtins.c (void_status): New enum.
17835 (basetype): Likewise.
17836 (typeinfo): Likewise.
17837 (handle_pointer): New function.
17838 (match_basetype): New stub function.
17839 (match_const_restriction): Likewise.
17840 (match_type): New function.
17842 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17844 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace): New
17846 (advance_line): Likewise.
17847 (safe_inc_pos): Likewise.
17848 (match_identifier): Likewise.
17849 (match_integer): Likewise.
17850 (match_to_right_bracket): Likewise.
17852 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17854 * config/rs6000/rs6000-gen-builtins.c (bif_file): New variable.
17855 (ovld_file): Likewise.
17856 (header_file): Likewise.
17857 (init_file): Likewise.
17858 (defines_file): Likewise.
17859 (pgm_path): Likewise.
17860 (bif_path): Likewise.
17861 (ovld_path): Likewise.
17862 (header_path): Likewise.
17863 (init_path): Likewise.
17864 (defines_path): Likewise.
17865 (LINELEN): New macro.
17866 (linebuf): New variable.
17870 (bif_diag): New function.
17871 (ovld_diag): Likewise.
17873 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17875 * config/rs6000/rs6000-builtin-new.def: New.
17876 * config/rs6000/rs6000-overload.def: New.
17878 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17880 * config/rs6000/rs6000-gen-builtins.c: New.
17882 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17884 * Makefile.in (EXTRA_GTYPE_DEPS): New variable.
17885 (s-gtype): Depend on EXTRA_GTYPE_DEPS.
17886 * gengtype-state.c (state_writer::write_state_file_list): Add a
17887 parameter to the fileslist expression for the number of build
17889 (read_state_files_list): Detect build headers and strip the
17890 initial "./" or ".\" from their names.
17891 * gengtype.c (build_headers): New global variable.
17892 (num_build_headers): Likewise.
17893 (open_base_files): Emit #include for each build header.
17894 (main): Detect and count build headers.
17895 * gengtype.h (build_headers): New extern variable.
17896 (num_build_headers): Likewise.
17898 2021-07-16 Richard Biener <rguenther@suse.de>
17900 * gimple-ssa-store-merging.c (verify_symbolic_number_p): Use
17901 the type of the LHS.
17902 (find_bswap_or_nop_1): Likewise.
17903 (find_bswap_or_nop): Likewise.
17904 * tree-vectorizer.h (vect_get_smallest_scalar_type): Adjust
17906 * tree-vect-data-refs.c (vect_get_smallest_scalar_type):
17907 Remove unused parameters, pass in the scalar type. Fix
17908 internal store function handling.
17909 * tree-vect-stmts.c (vect_analyze_stmt): Remove assert.
17910 (vect_get_vector_types_for_stmt): Move down check for
17911 existing vector stmt after we've determined a scalar type.
17912 Pass down the used scalar type to vect_get_smallest_scalar_type.
17913 * tree-vect-generic.c (expand_vector_condition): Use
17914 the type of the LHS.
17915 (expand_vector_scalar_condition): Likewise.
17916 (expand_vector_operations_1): Likewise.
17917 * tree-vect-patterns.c (vect_widened_op_tree): Likewise.
17918 (vect_recog_dot_prod_pattern): Likewise.
17919 (vect_recog_sad_pattern): Likewise.
17920 (vect_recog_widen_op_pattern): Likewise.
17921 (vect_recog_widen_sum_pattern): Likewise.
17922 (vect_recog_mixed_size_cond_pattern): Likewise.
17924 2021-07-16 Jan Hubicka <hubicka@ucw.cz>
17926 * ipa-modref.c (struct escape_entry): Use eaf_fleags_t.
17927 (dump_eaf_flags): Dump EAF_NOT_RETURNED
17928 (eaf_flags_useful_p): Use eaf_fleags_t; handle const functions
17929 and EAF_NOT_RETURNED.
17930 (modref_summary::useful_p): Likewise.
17931 (modref_summary_lto::useful_p): Likewise.
17932 (struct) modref_summary_lto: Use eaf_fleags_t.
17933 (deref_flags): Handle EAF_NOT_RETURNED.
17934 (struct escape_point): Use min_flags.
17935 (modref_lattice::init): Add EAF_NOT_RETURNED.
17936 (merge_call_lhs_flags): Ignore EAF_NOT_RETURNED functions
17937 (analyze_ssa_name_flags): Clear EAF_NOT_RETURNED on return;
17939 (analyze_parms): Also analyze const functions; update conition on
17941 (modref_write): Update streaming.
17942 (read_section): Update streaming.
17943 (remap_arg_flags): Use eaf_flags_t.
17944 (modref_merge_call_site_flags): Hanlde EAF_NOT_RETURNED.
17945 * ipa-modref.h: (eaf_flags_t): New typedef.
17946 (struct modref_summary): Use eaf_flags_t.
17947 * tree-core.h (EAF_NOT_RETURNED): New constant.
17949 2021-07-16 Richard Biener <rguenther@suse.de>
17951 * gimple-fold.c (gimple_fold_stmt_to_constant_1): Use
17952 the type of the LHS.
17953 (gimple_assign_nonnegative_warnv_p): Likewise.
17954 (gimple_call_nonnegative_warnv_p): Likewise. Return false
17955 if the call has no LHS.
17956 * gimple.c (gimple_could_trap_p_1): Use the type of the LHS.
17957 * tree-eh.c (stmt_could_throw_1_p): Likewise.
17958 * tree-inline.c (insert_init_stmt): Likewise.
17959 * tree-ssa-loop-niter.c (get_val_for): Likewise.
17960 * tree-outof-ssa.c (ssa_is_replaceable_p): Use the type of
17962 * tree-ssa-sccvn.c (init_vn_nary_op_from_stmt): Take a
17963 gassign *. Use the type of the lhs.
17964 (vn_nary_op_lookup_stmt): Adjust.
17965 (vn_nary_op_insert_stmt): Likewise.
17967 2021-07-16 Ilya Leoshkevich <iii@linux.ibm.com>
17969 * config/s390/predicates.md (bras_sym_operand): Accept all
17970 functions in 64-bit mode, use UNSPEC_PLT31.
17971 (larl_operand): Use UNSPEC_PLT31.
17972 * config/s390/s390.c (s390_loadrelative_operand_p): Likewise.
17973 (legitimize_pic_address): Likewise.
17974 (s390_emit_tls_call_insn): Mark __tls_get_offset as function,
17976 (s390_delegitimize_address): Use UNSPEC_PLT31.
17977 (s390_output_addr_const_extra): Likewise.
17978 (print_operand): Add @PLT to TLS calls, handle %K.
17979 (s390_function_profiler): Mark __fentry__/_mcount as function,
17980 use %K, use UNSPEC_PLT31.
17981 (s390_output_mi_thunk): Use only UNSPEC_GOT, use %K.
17982 (s390_emit_call): Use UNSPEC_PLT31.
17983 (s390_emit_tpf_eh_return): Mark __tpf_eh_return as function.
17984 * config/s390/s390.md (UNSPEC_PLT31): Rename from UNSPEC_PLT.
17985 (*movdi_64): Use %K.
17986 (reload_base_64): Likewise.
17987 (*sibcall_brc): Likewise.
17988 (*sibcall_brcl): Likewise.
17989 (*sibcall_value_brc): Likewise.
17990 (*sibcall_value_brcl): Likewise.
17992 (*brasl): Likewise.
17993 (*bras_r): Likewise.
17994 (*brasl_r): Likewise.
17995 (*bras_tls): Likewise.
17996 (*brasl_tls): Likewise.
17997 (main_base_64): Likewise.
17998 (reload_base_64): Likewise.
17999 (@split_stack_call<mode>): Likewise.
18001 2021-07-16 Richard Biener <rguenther@suse.de>
18003 PR tree-optimization/101467
18004 * tree-vect-stmts.c (vect_gen_while): Properly guard
18005 make_temp_ssa_name usage.
18007 2021-07-16 Cooper Qu <cooper.qu@linux.alibaba.com>
18009 * config.gcc: Don't use forked print-sysroot-suffix.sh and
18010 t-sysroot-suffix for C-SKY.
18011 * config/csky/print-sysroot-suffix.sh: Delete.
18012 * config/csky/t-csky-linux: Delete.
18013 * config/csky/t-sysroot-suffix: Define MULTILIB_DIRNAMES
18014 instead of CSKY_MULTILIB_DIRNAMES.
18016 2021-07-16 Richard Biener <rguenther@suse.de>
18018 * tree-vect-loop.c (vect_transform_cycle_phi): Correct sign
18019 conversion issues with the partial reduction of the reused
18020 vector accumulator.
18022 2021-07-16 Richard Biener <rguenther@suse.de>
18024 * config/i386/i386-options.c (ix86_option_override_internal): Set
18025 param_vect_partial_vector_usage to zero if not set.
18027 2021-07-15 Uroš Bizjak <ubizjak@gmail.com>
18030 * config/i386/i386.h (VALID_SSE_REG_MODE): Add TDmode.
18031 (VALID_INT_MODE_P): Add SDmode and DDmode.
18032 Add TDmode for TARGET_64BIT.
18033 (VALID_DFP_MODE_P): Remove.
18034 * config/i386/i386.c (ix86_hard_regno_mode_ok):
18035 Do not use VALID_DFP_MODE_P.
18037 2021-07-15 Andrew MacLeod <amacleod@redhat.com>
18039 * gimple-range-fold.cc (adjust_pointer_diff_expr): Use
18041 (fold_using_range::fold_stmt): Ditto.
18042 (fold_using_range::range_of_range_op): Ditto.
18043 (fold_using_range::range_of_phi): Ditto.
18044 (fold_using_range::range_of_call): Ditto.
18045 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
18046 (fold_using_range::range_of_builtin_call): Ditto.
18047 (fold_using_range::range_of_cond_expr): Ditto.
18048 * gimple-range-fold.h (gimple_range_type): New.
18050 2021-07-15 Martin Sebor <msebor@redhat.com>
18052 PR middle-end/97027
18053 * tree-ssa-strlen.c (handle_assign): New function.
18054 (maybe_warn_overflow): Add argument.
18055 (nonzero_bytes_for_type): New function.
18056 (count_nonzero_bytes): Handle more tree types. Call
18057 nonzero_bytes_for_tye.
18058 (count_nonzero_bytes): Handle types.
18059 (handle_store): Handle stores from function calls.
18060 (strlen_check_and_optimize_call): Move code to handle_assign. Call
18061 it for assignments from function calls.
18063 2021-07-15 David Malcolm <dmalcolm@redhat.com>
18068 * doc/invoke.texi: Add -Wanalyzer-use-of-uninitialized-value.
18070 2021-07-15 David Malcolm <dmalcolm@redhat.com>
18072 * doc/invoke.texi (-fdump-analyzer-exploded-paths): New.
18074 2021-07-15 Martin Sebor <msebor@redhat.com>
18078 * fold-const.c (operand_compare::operand_equal_p): Handle OEP_DECL_NAME.
18079 (operand_compare::verify_hash_value): Same.
18080 * tree-core.h (OEP_DECL_NAME): New.
18082 2021-07-15 Martin Jambor <mjambor@suse.cz>
18084 * profile-count.h (profile_count::value): Change the return type to
18086 * gimple-pretty-print.c (dump_gimple_bb_header): Adjust print
18088 * tree-cfg.c (dump_function_to_file): Likewise.
18090 2021-07-15 Bill Schmidt <wschmidt@linux.ibm.com>
18093 * config/rs6000/rs6000-p8swap.c (has_part_mult): New.
18094 (rs6000_analyze_swaps): Insns containing a subreg of a mult are
18097 2021-07-15 Richard Biener <rguenther@suse.de>
18099 * tree-vectorizer.h (vect_gen_while): Match up with
18100 vect_gen_while_not.
18101 * tree-vect-stmts.c (vect_gen_while): Adjust API to that
18102 of vect_gen_while_not.
18103 (vect_gen_while_not): Adjust.
18104 * tree-vect-loop-manip.c (vect_set_loop_controls_directly): Likewise.
18106 2021-07-15 Aldy Hernandez <aldyh@redhat.com>
18108 * gimple-range-cache.cc (non_null_ref::adjust_range): New.
18109 (ranger_cache::range_of_def): Call adjust_range.
18110 (ranger_cache::entry_range): Same.
18111 * gimple-range-cache.h (non_null_ref::adjust_range): New.
18112 * gimple-range.cc (gimple_ranger::range_of_expr): Call
18114 (gimple_ranger::range_on_entry): Same.
18116 2021-07-15 Tamar Christina <tamar.christina@arm.com>
18119 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18121 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
18123 2021-07-15 Tamar Christina <tamar.christina@arm.com>
18126 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18128 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
18129 (sdot_prod, udot_prod): ...These.
18130 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
18131 (aarch64_<sur>dot<vsi2qi>): Rename to...
18132 (<sur>dot_prod<vsi2qi>): ...This.
18133 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
18136 2021-07-15 Jakub Jelinek <jakub@redhat.com>
18138 PR middle-end/101437
18139 * gimplify.c (gimplify_expr): Throw away volatile reads from empty
18140 types even if they have non-BLKmode TYPE_MODE.
18142 2021-07-15 Richard Biener <rguenther@suse.de>
18145 * gcc.c (process_command): Process -gtoggle like process_options
18146 would after parsing options.
18148 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
18150 * cfgexpand.c (expand_asm_loc): Adjust.
18151 (expand_asm_stmt): Likewise.
18152 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Likewise.
18153 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
18154 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
18155 * config/avr/avr.c (avr_md_asm_adjust): Likewise.
18156 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
18157 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
18158 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
18159 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
18160 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
18161 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
18162 * config/s390/s390.c (s390_md_asm_adjust): Likewise.
18163 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
18164 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
18165 * doc/tm.texi: Regenerate.
18166 * target.def: Add location argument to md_asm_adjust.
18168 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
18170 * tree-diagnostic.c (diagnostic_report_current_function): Use the
18171 diagnostic's location, not input_location.
18173 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
18175 * cfgexpand.c (tree_conflicts_with_clobbers_p): Pass location to
18177 (expand_asm_stmt): Likewise.
18179 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
18181 * config/rs6000/rs6000.c (adjacent_mem_locations): Return the lower
18182 addressed memory rtx, if any.
18183 (rs6000_split_multireg_move): Fix code formatting.
18184 Handle MMA build built-ins with operands in adjacent memory locations.
18186 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
18188 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Move to later
18191 2021-07-14 Jason Merrill <jason@redhat.com>
18193 * sel-sched-ir.h (get_all_loop_exits): Use auto_vec.
18195 2021-07-14 Jason Merrill <jason@redhat.com>
18197 * doc/invoke.texi: -fdelete-dead-exceptions is on by default for
18200 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18202 * tree-vect-patterns.c (vect_recog_dot_prod_pattern):
18203 Remove erroneous line.
18205 2021-07-14 Andrew MacLeod <amacleod@redhat.com>
18207 * params.opt (param_evrp_mode): Change default.
18209 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18211 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
18212 (sdot_prod, udot_prod): ...These.
18213 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
18214 (aarch64_<sur>dot<vsi2qi>): Rename to...
18215 (<sur>dot_prod<vsi2qi>): ...This.
18216 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
18219 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18221 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
18223 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18225 * doc/sourcebuild.texi (arm_v8_2a_i8mm_neon_hw): Document.
18227 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18229 * config/arm/neon.md (usdot_prod<vsi2qi>): New.
18231 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18233 * config/aarch64/aarch64-simd.md (aarch64_usdot<vsi2qi>): Rename to...
18234 (usdot_prod<vsi2qi>): ... This.
18235 * config/aarch64/aarch64-simd-builtins.def (usdot): Rename to...
18236 (usdot_prod): ...This.
18237 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Likewise.
18238 * config/aarch64/aarch64-sve.md (@aarch64_<sur>dot_prod<vsi2qi>):
18240 (@<sur>dot_prod<vsi2qi>): ...This.
18241 * config/aarch64/aarch64-sve-builtins-base.cc
18242 (svusdot_impl::expand): Use it.
18244 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18246 * optabs.def (usdot_prod_optab): New.
18247 * doc/md.texi: Document it and clarify other dot prod optabs.
18248 * optabs-tree.h (enum optab_subtype): Add optab_vector_mixed_sign.
18249 * optabs-tree.c (optab_for_tree_code): Support usdot_prod_optab.
18250 * optabs.c (expand_widen_pattern_expr): Likewise.
18251 * tree-cfg.c (verify_gimple_assign_ternary): Likewise.
18252 * tree-vect-loop.c (vectorizable_reduction): Query dot-product kind.
18253 * tree-vect-patterns.c (vect_supportable_direct_optab_p): Take optional
18255 (vect_widened_op_tree): Optionally ignore
18257 (vect_recog_dot_prod_pattern): Support usdot_prod_optab.
18259 2021-07-14 H.J. Lu <hjl.tools@gmail.com>
18262 * config/i386/driver-i386.c (host_detect_local_cpu): Check
18263 "arch [32|64]" and "tune [32|64]" for 32-bit and 64-bit codegen.
18264 Enable UINTR only for 64-bit codegen.
18265 * config/i386/i386-options.c
18266 (ix86_option_override_internal::DEF_PTA): Skip PTA_UINTR if not
18268 * config/i386/i386.h (ARCH_ARG): New.
18269 (CC1_CPU_SPEC): Pass "[arch|tune] 32" for 32-bit codegen and
18270 "[arch|tune] 64" for 64-bit codegen.
18272 2021-07-14 Richard Biener <rguenther@suse.de>
18274 PR tree-optimization/101445
18275 * tree-vect-stmts.c (vectorizable_load): Do the gap adjustment
18276 of the IV in the correct direction for negative stride
18279 2021-07-14 Jakub Jelinek <jakub@redhat.com>
18282 * godump.c (godump_str_hash): New type.
18283 (godump_container::pot_dummy_types): Use string_hash instead of
18284 ptr_hash in the hash_set.
18286 2021-07-14 Richard Biener <rguenther@suse.de>
18288 * tree-vect-loop.c (vect_find_reusable_accumulator): Handle
18289 vector types where the old vector type has a multiple of
18290 the new vector type elements.
18291 (vect_create_partial_epilog): New function, split out from...
18292 (vect_create_epilog_for_reduction): ... here.
18293 (vect_transform_cycle_phi): Reduce the re-used accumulator
18294 to the new vector type.
18296 2021-07-14 Alexandre Oliva <oliva@adacore.com>
18298 * tree-ssa-alias.c (attr_fnspec::verify): Fix index in
18299 non-'t'-sized arg check.
18301 2021-07-14 Alexandre Oliva <oliva@adacore.com>
18303 * tree-cfg.c (cleanup_dead_labels_eh): Update
18304 post_landing_pad label upon change of landing pad block's
18306 (cleanup_dead_labels): Check that a removed label is not that
18309 2021-07-13 Jonathan Wright <jonathan.wright@arm.com>
18311 * combine.c (combine_simplify_rtx): Add vec_select -> subreg
18313 * config/aarch64/aarch64.md (*zero_extend<SHORT:mode><GPI:mode>2_aarch64):
18314 Add Neon to general purpose register case for zero-extend
18316 * config/arm/vfp.md (*arm_movsi_vfp): Remove "*" from *t -> r
18317 case to prevent some cases opting to go through memory.
18318 * cse.c (fold_rtx): Add vec_select -> subreg simplification.
18319 * rtl.c (rtvec_series_p): Define predicate to determine
18320 whether a vector contains a linear series of integers.
18321 * rtl.h (rtvec_series_p): Define.
18322 * rtlanal.c (vec_series_lowpart_p): Define predicate to
18323 determine if a vector selection is equivalent to the low part
18325 * rtlanal.h (vec_series_lowpart_p): Define.
18326 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
18327 Add vec_select -> subreg simplification.
18329 2021-07-13 Paul A. Clarke <pc@us.ibm.com>
18331 * config/rs6000/smmintrin.h (_mm_testz_si128, _mm_testc_si128,
18332 _mm_testnzc_si128, _mm_test_all_ones, _mm_test_all_zeros,
18333 _mm_test_mix_ones_zeros): New.
18335 2021-07-13 Roger Sayle <roger@nextmovesoftware.com>
18336 Richard Biener <rguenther@suse.de>
18338 * gimple.c (gimple_could_trap_p_1): Make S argument a
18339 "const gimple*". Preserve constness in call to
18340 gimple_asm_volatile_p.
18341 (gimple_could_trap_p): Make S argument a "const gimple*".
18342 * gimple.h (gimple_could_trap_p_1, gimple_could_trap_p):
18343 Update function prototypes.
18345 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18347 * tree-vectorizer.h (vect_reusable_accumulator): New structure.
18348 (_loop_vec_info::main_loop_edge): New field.
18349 (_loop_vec_info::skip_main_loop_edge): Likewise.
18350 (_loop_vec_info::skip_this_loop_edge): Likewise.
18351 (_loop_vec_info::reusable_accumulators): Likewise.
18352 (_stmt_vec_info::reduc_scalar_results): Likewise.
18353 (_stmt_vec_info::reused_accumulator): Likewise.
18354 (vect_get_main_loop_result): Declare.
18355 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize
18356 reduc_scalar_inputs.
18357 (vec_info::free_stmt_vec_info): Free reduc_scalar_inputs.
18358 * tree-vect-loop-manip.c (vect_get_main_loop_result): New function.
18359 (vect_do_peeling): Fill an epilogue loop's main_loop_edge,
18360 skip_main_loop_edge and skip_this_loop_edge fields.
18361 * tree-vect-loop.c (INCLUDE_ALGORITHM): Define.
18362 (vect_emit_reduction_init_stmts): New function.
18363 (get_initial_def_for_reduction): Use it.
18364 (get_initial_defs_for_reduction): Likewise. Change the vinfo
18365 parameter to a loop_vec_info.
18366 (vect_create_epilog_for_reduction): Store the scalar results
18367 in the reduc_info. If an epilogue loop is reusing an accumulator
18368 from the main loop, and if the epilogue loop can also be skipped,
18369 try to place the reduction code in the join block. Record
18370 accumulators that could potentially be reused by epilogue loops.
18371 (vect_transform_cycle_phi): When vectorizing epilogue loops,
18372 try to reuse accumulators from the main loop. Record the initial
18373 value in reduc_info for non-SLP reductions too.
18375 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18377 * tree-vect-loop.c (get_initial_def_for_reduction): Remove
18378 adjustment handling. Take the neutral value as an argument,
18379 in place of the code argument.
18380 (vect_transform_cycle_phi): Update accordingly. Handle the
18381 initial values of cond reductions separately from code reductions.
18382 Choose the adjustment here rather than in
18383 get_initial_def_for_reduction. Sink the splat of vec_initial_def.
18385 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18387 * tree-vect-loop.c (neutral_op_for_slp_reduction): Replace with...
18388 (neutral_op_for_reduction): ...this, providing a more general
18390 (vect_create_epilog_for_reduction): Update accordingly.
18391 (vectorizable_reduction): Likewise.
18392 (vect_transform_cycle_phi): Likewise.
18394 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18396 * tree-vect-loop.c (get_initial_def_for_reduction): Take the
18397 reduc_info instead of the original stmt_vec_info.
18398 (vect_transform_cycle_phi): Update accordingly.
18400 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18402 * tree-vect-loop.c (get_initial_defs_for_reduction): Take the
18403 reduc_info as an additional parameter.
18404 (vect_transform_cycle_phi): Update accordingly.
18406 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18408 * tree-vectorizer.h: Include tree-ssa-operands.h.
18409 (vect_phi_initial_value): New function.
18410 * tree-vect-loop.c (neutral_op_for_slp_reduction): Use it.
18411 (get_initial_defs_for_reduction, info_for_reduction): Likewise.
18412 (vect_create_epilog_for_reduction, vectorizable_reduction): Likewise.
18413 (vect_transform_cycle_phi, vectorizable_induction): Likewise.
18415 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18417 * tree-vect-loop.c (vect_create_epilog_for_reduction): Convert
18418 the phi results to vectype after creating them. Remove later
18419 conversion code that thus becomes redundant.
18421 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18423 * tree-vect-loop.c (vect_create_epilog_for_reduction): Replace
18424 the new_phis vector with a reduc_inputs vector. Combine handling
18425 of reduction chains and ncopies > 1.
18427 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18429 * tree-vect-loop.c (vect_create_epilog_for_reduction): Truncate
18430 scalar_results to group_size elements after reducing down from
18431 N*group_size elements. Construct an array_slice of the live-out
18432 stmts and assert that there is one stmt per scalar result.
18434 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18436 * tree-vect-loop.c (vect_create_epilog_for_reduction): Remove
18437 nested_in_vect_loop and use double_reduc everywhere. Remove dead
18438 assignment to "loop".
18440 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18442 * internal-fn.c (vectorized_internal_fn_supported_p): Handle
18443 vector types first. For scalar types, consider both the preferred
18444 vector mode and the alternative vector modes.
18445 * optabs-query.c (can_vec_mask_load_store_p): Use the same
18446 structure as above, in particular using related_vector_mode
18447 for modes provided by autovectorize_vector_modes.
18449 2021-07-13 Jakub Jelinek <jakub@redhat.com>
18450 Richard Biener <rguenther@suse.de>
18452 PR tree-optimization/101419
18453 * tree-pass.h (PROP_objsz): Define.
18454 (make_pass_early_object_sizes): Declare.
18455 * passes.def (pass_all_early_optimizations): Rename pass_object_sizes
18456 there to pass_early_object_sizes, drop parameter.
18457 (pass_all_optimizations): Move pass_object_sizes right after pass_ccp,
18458 drop parameter, move pass_post_ipa_warn right after that.
18459 * tree-object-size.c (pass_object_sizes::execute): Rename to...
18460 (object_sizes_execute): ... this. Add insert_min_max_p argument.
18461 (pass_data_object_sizes): Move after object_sizes_execute.
18462 (pass_object_sizes): Likewise. In execute method call
18463 object_sizes_execute, drop set_pass_param method and insert_min_max_p
18464 non-static data member and its initializer in the ctor.
18465 (pass_data_early_object_sizes, pass_early_object_sizes,
18466 make_pass_early_object_sizes): New.
18467 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Use
18468 (cfun->curr_properties & PROP_objsz) instead of cfun->after_inlining.
18470 2021-07-13 Kito Cheng <kito.cheng@sifive.com>
18473 * config/riscv/constraints.md ("S"): Update description and remove
18475 * doc/md.texi (Machine Constraints): Document the 'S' constraints
18478 2021-07-13 Richard Biener <rguenther@suse.de>
18481 2021-07-12 Richard Biener <rguenther@suse.de>
18483 * tree-vect-slp.c (vect_slp_region): Show the number of
18484 SLP graph entries in the optimization message.
18486 2021-07-13 Michael Meissner <meissner@linux.ibm.com>
18488 * config/rs6000/altivec.md (xxspltiw_v4sf): Change local variable
18490 * config/rs6000/rs6000-protos.h (rs6000_const_f32_to_i32): Change
18491 return type to long.
18492 * config/rs6000/rs6000.c (rs6000_const_f32_to_i32): Change return
18495 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
18497 * gimple-range-fold.cc (fold_using_range::range_of_builtin_ubsan_call):
18498 Query relation between the 2 operands and use it.
18500 2021-07-12 Sergei Trofimovich <siarheit@google.com>
18502 * doc/cfg.texi: Fix s/ei_safe_safe/ei_safe_edge/ typo.
18504 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
18507 * config/i386/predicates.md (vec_setm_sse41_operand):
18508 Rename from vec_setm_operand.
18509 (vec_setm_avx2_operand): New predicate.
18510 * config/i386/sse.md (vec_set<V_128:mode>): Use V_128 mode iterator.
18511 Use vec_setm_sse41_operand as operand 2 predicate.
18512 (vec_set<V_256_512:mode): New expander.
18513 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_sse41_operand
18514 as operand 2 predicate.
18516 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
18518 PR tree-optimization/101335
18519 * range-op.cc (operator_cast::lhs_op1_relation): Delete.
18521 2021-07-12 Andrew Pinski <apinski@marvell.com>
18523 * tree-ssa-phiopt.c (match_simplify_replacement): Move
18524 insert of the sequence before the movement of the
18525 statement. Check if to see if the statement is used
18526 outside of the original phi to see if we should move it.
18528 2021-07-12 Richard Biener <rguenther@suse.de>
18530 * dump-context.h (debug_dump_context::debug_dump_context):
18531 Add FILE * parameter defaulted to stderr.
18532 * dumpfile.c (debug_dump_context::debug_dump_context): Adjust.
18533 * tree-vect-slp.c (dot_slp_tree): New functions.
18535 2021-07-12 Richard Biener <rguenther@suse.de>
18537 PR tree-optimization/101373
18538 * tree-ssa-pre.c (prune_clobbered_mems): Also prune trapping
18539 references when the BB may not return.
18540 (compute_avail): Pass in the function we're working on and
18541 replace cfun references with it. Externally throwing
18542 const calls also possibly terminate the function.
18543 (pass_pre::execute): Pass down the function we're working on.
18544 * gcse.c (compute_hash_table_work): Externally throwing
18545 const/pure calls also need record_last_mem_set_info.
18546 * postreload-gcse.c (record_opr_changes): Looping or externally
18547 throwing const/pure calls also need record_last_mem_set_info.
18549 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
18551 * recog.c (memory_address_addr_space_p): Change the type to bool.
18552 Return true/false instead of 1/0.
18553 (offsettable_memref_p): Ditto.
18554 (offsettable_nonstrict_memref_p): Ditto.
18555 (offsettable_address_addr_space_p): Ditto.
18556 Change the type of addressp indirect function to bool.
18557 * recog.h (memory_address_addr_space_p): Change the type to bool.
18558 (strict_memory_address_addr_space_p): Ditto.
18559 (offsettable_memref_p): Ditto.
18560 (offsettable_nonstrict_memref_p): Ditto.
18561 (offsettable_address_addr_space_p): Ditto.
18562 * reload.c (maybe_memory_address_addr_space_p): Ditto.
18563 (strict_memory_address_addr_space_p): Change the type to bool.
18564 Return true/false instead of 1/0.
18565 (maybe_memory_address_addr_space_p): Change the type to bool.
18567 2021-07-12 Richard Biener <rguenther@suse.de>
18569 * tree-vect-slp.c (vect_slp_region): Show the number of
18570 SLP graph entries in the optimization message.
18572 2021-07-12 Richard Biener <rguenther@suse.de>
18574 PR tree-optimization/101394
18575 * tree-ssa-pre.c (do_pre_regular_insertion): Avoid inserting
18576 copies from abnormals for a full redundancy.
18578 2021-07-12 Richard Biener <rguenther@suse.de>
18580 PR middle-end/101423
18581 * gimple.c (gimple_could_trap_p_1): Internal function calls
18583 * tree-eh.c (tree_could_trap_p): Likewise.
18585 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
18588 * config/arm/arm_neon.h (vmul_n_u32): Replace call to builtin with
18590 (vmulq_n_u32): Likewise.
18591 (vmul_n_f32): Gate __a * __b on __FAST_MATH__.
18592 (vmulq_n_f32): Likewise.
18593 (vmul_n_f16): Likewise.
18594 (vmulq_n_f16): Likewise.
18596 2021-07-12 Martin Liska <mliska@suse.cz>
18598 PR sanitizer/101425
18599 * gcc.c (check_offload_target_name): Call
18600 candidates_list_and_hint only if we have a candidate.
18602 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
18605 * config/arm/neon.md (vec_init): Move to ...
18606 * config/arm/vec-common.md (vec_init): ... here.
18607 Change the pattern's mode to VDQX and gate it on VALID_MVE_MODE.
18609 2021-07-12 Roger Sayle <roger@nextmovesoftware.com>
18611 PR tree-optimization/101403
18612 * match.pd ((T)bswap(X)>>C): Correctly handle cases where
18613 signedness of the shift is not the same as the signedness of
18614 the type extension.
18616 2021-07-09 Roger Sayle <roger@nextmovesoftware.com>
18617 Uroš Bizjak <ubizjak@gmail.com>
18619 * config/i386/i386.md (*divmodsi4_const): Optimize SImode
18620 divmod of a constant numerator with new define_insn_and_split.
18622 2021-07-09 Iain Sandoe <iain@sandoe.co.uk>
18625 * config/i386/i386-expand.c (ix86_expand_call): If a call is
18626 to a non-local-binding, or local but to a public symbol, then
18627 assume that it might be indirected via the lazy symbol binder.
18628 Mark R10 and R10 as clobbered in that case.
18630 2021-07-09 Eric Botcazou <ebotcazou@adacore.com>
18633 * gcc.c (ASM_DEBUG_DWARF_OPTION): Set again to --gdwarf2 in
18634 the case where HAVE_AS_WORKING_DWARF_N_FLAG is not defined
18635 and HAVE_LD_BROKEN_PE_DWARF5 is defined.
18637 2021-07-09 Uroš Bizjak <ubizjak@gmail.com>
18639 * config/i386/i386.md (*udivmodsi4_pow2_zext_1): Limit the
18640 log2 range of operands[3] to [1,31].
18641 (*udivmodsi4_pow2_zext_2): Ditto. Correct insn RTX pattern.
18643 2021-07-09 Sergei Trofimovich <siarheit@google.com>
18645 * doc/md.texi: Don't split @smallexample in multiple @groups.
18647 2021-07-09 Sergei Trofimovich <siarheit@google.com>
18649 * doc/md.texi: Add missing 'see' word.
18651 2021-07-09 Andrew Pinski <apinski@marvell.com>
18653 * tree-ssa-phiopt.c (phiopt_early_allow): Change arguments
18654 to take sequence and gimple_match_op. Accept the case where
18655 op is a SSA_NAME and one statement in the sequence.
18656 Also allow constants.
18657 (gimple_simplify_phiopt): Always pass a sequence to resimplify.
18658 Update call to phiopt_early_allow. Discard the sequence if not
18661 2021-07-09 Xi Ruoyao <xry111@mengyan1223.wang>
18666 * config/mips/mips.c (mips_const_insns): Use MSA_SUPPORTED_MODE_P
18667 instead of ISA_HAS_MSA.
18668 (mips_expand_vec_unpack): Likewise.
18669 (mips_expand_vector_init): Likewise.
18671 2021-07-09 Kewen Lin <linkw@linux.ibm.com>
18673 * config/rs6000/vsx.md (mods_<mode>): Rename to...
18674 (mod<mode>3): ... this.
18675 (modu_<mode>): Rename to...
18676 (umod<mode>3): ... this.
18677 * config/rs6000/rs6000-builtin.def (MODS_V2DI, MODS_V4SI, MODU_V2DI,
18678 MODU_V4SI): Adjust.
18680 2021-07-08 Jeff Law <jeffreyalaw@gmail.com>
18682 * config/h8300/shiftrotate.md (variable shifts): Expose condition
18683 code handling for the test before the loop.
18685 2021-07-08 Martin Jambor <mjambor@suse.cz>
18688 * ipa-sra.c (class isra_call_summary): New member
18689 m_before_any_store, initialize it in the constructor.
18690 (isra_call_summary::dump): Dump the new field.
18691 (ipa_sra_call_summaries::duplicate): Copy it.
18692 (process_scan_results): Set it.
18693 (isra_write_edge_summary): Stream it.
18694 (isra_read_edge_summary): Likewise.
18695 (param_splitting_across_edge): Only override
18696 safe_to_import_accesses if m_before_any_store is set.
18698 2021-07-08 Martin Sebor <msebor@redhat.com>
18700 PR bootstrap/101374
18701 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref):
18702 Use Object Size Type 0 instead of 1.
18704 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
18706 * tree-vect-loop.c (vectorizable_reduction): Remove always-true
18709 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
18711 * match.pd: Simplify an extend-operate-truncate sequence involving
18714 2021-07-08 Roger Sayle <roger@nextmovesoftware.com>
18715 Richard Biener <rguenther@suse.de>
18717 PR tree-optimization/40210
18718 * match.pd (bswap optimizations): Simplify (bswap(x)>>C1)&C2 as
18719 (x>>C3)&C2 when possible. Simplify bswap(x)>>C1 as ((T)x)>>C2
18720 when possible. Simplify bswap(x)&C1 as (x>>C2)&C1 when 0<=C1<=255.
18722 2021-07-08 Uroš Bizjak <ubizjak@gmail.com>
18725 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
18727 * config/i386/mmx.md (V_32): New mode iterator.
18728 (mov<V_32:mode>): Use V_32 mode iterator.
18729 (*mov<V_32:mode>_internal): Ditto.
18730 (*push<V_32:mode>2_rex64): Ditto.
18731 (*push<V_32:mode>2): Ditto.
18732 (movmisalign<V_32:mode>): Ditto.
18733 (mmx_<any_shiftrt:insn>v1si3): New insn pattern.
18734 (sse4_1_<any_extend:code>v2qiv2hi2): Ditto.
18735 (vec_unpacks_lo_v4qi): New expander.
18736 (vec_unpacks_hi_v4qi): Ditto.
18737 (vec_unpacku_lo_v4qi): Ditto.
18738 (vec_unpacku_hi_v4qi): Ditto.
18739 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add V1SImode.
18740 (VALID_INT_MODE_P): Ditto.
18742 2021-07-08 Michael Meissner <meissner@linux.ibm.com>
18745 * config/rs6000/rs6000.md (udivti3): New insn.
18746 (divti3): New insn.
18747 (umodti3): New insn.
18748 (modti3): New insn.
18750 2021-07-07 Martin Sebor <msebor@redhat.com>
18752 PR tree-optimization/100137
18753 PR tree-optimization/99121
18754 PR tree-optimization/97027
18755 * builtins.c (access_ref::access_ref): Also set offmax.
18756 (access_ref::offset_in_range): Define new function.
18757 (access_ref::add_offset): Set offmax.
18758 (access_ref::inform_access): Handle access_none.
18759 (handle_mem_ref): Clear ostype.
18760 (compute_objsize_r): Handle ASSERT_EXPR.
18761 * builtins.h (struct access_ref): Add offmax member.
18762 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Use
18763 compute_objsize() and simplify.
18765 2021-07-07 Peter Bergner <bergner@linux.ibm.com>
18767 * config/rs6000/rs6000-call.c (mma_init_builtins): Use VSX_BUILTIN_LXVP
18768 and VSX_BUILTIN_STXVP.
18770 2021-07-07 Martin Sebor <msebor@redhat.com>
18773 * config/aarch64/aarch64.c (aarch64_simd_lane_bounds): Remove
18774 a stray %K from error_at() missed in r12-2088.
18776 2021-07-07 Richard Biener <rguenther@suse.de>
18778 PR tree-optimization/99728
18779 * tree-ssa-loop-im.c (gather_mem_refs_stmt): Record
18781 (mem_refs_may_alias_p): Add assert we handled aggregate
18783 (sm_seq_valid_bb): Give up when running into aggregate copies.
18784 (ref_indep_loop_p): Handle aggregate copies as never
18785 being invariant themselves but allow other refs to be
18786 disambiguated against them.
18787 (can_sm_ref_p): Do not try to apply store-motion to aggregate
18790 2021-07-06 Indu Bhagat <indu.bhagat@oracle.com>
18793 * dwarf2ctf.c (ctf_get_AT_data_member_location): Multiply by 8 to get
18796 2021-07-06 Martin Sebor <msebor@redhat.com>
18798 * gimple-pretty-print.c (percent_G_format): Remove.
18799 * tree-diagnostic.c (default_tree_printer): Remove calls.
18800 * tree-pretty-print.c (percent_K_format): Remove.
18801 * tree-pretty-print.h (percent_K_format): Remove.
18803 2021-07-06 Martin Sebor <msebor@redhat.com>
18805 * config/aarch64/aarch64-builtins.c (aarch64_simd_expand_builtin):
18806 Remove %K and use error_at.
18807 (aarch64_expand_fcmla_builtin): Same.
18808 (aarch64_expand_builtin_tme): Same.
18809 (aarch64_expand_builtin_memtag): Same.
18810 * config/arm/arm-builtins.c (arm_expand_acle_builtin): Same.
18811 (arm_expand_builtin): Same.
18812 * config/arm/arm.c (bounds_check): Same.
18814 2021-07-06 Martin Sebor <msebor@redhat.com>
18816 * builtins.c (warn_string_no_nul): Remove %G.
18817 (maybe_warn_for_bound): Same.
18818 (warn_for_access): Same.
18819 (check_access): Same.
18820 (check_strncat_sizes): Same.
18821 (expand_builtin_strncat): Same.
18822 (expand_builtin_strncmp): Same.
18823 (expand_builtin): Same.
18824 (expand_builtin_object_size): Same.
18825 (warn_dealloc_offset): Same.
18826 (maybe_emit_free_warning): Same.
18827 * calls.c (maybe_warn_alloc_args_overflow): Same.
18828 (maybe_warn_nonstring_arg): Same.
18829 (maybe_warn_rdwr_sizes): Same.
18830 * expr.c (expand_expr_real_1): Remove %K.
18831 * gimple-fold.c (gimple_fold_builtin_strncpy): Remove %G.
18832 (gimple_fold_builtin_strncat): Same.
18833 * gimple-ssa-sprintf.c (format_directive): Same.
18834 (handle_printf_call): Same.
18835 * gimple-ssa-warn-alloca.c (pass_walloca::execute): Same.
18836 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
18837 (maybe_diag_access_bounds): Same. Call gimple_location.
18838 (check_bounds_or_overlap): Same.
18839 * trans-mem.c (ipa_tm_scan_irr_block): Remove %K. Simplify.
18840 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Remove %G.
18841 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
18842 (maybe_diag_stxncpy_trunc): Same.
18843 (handle_builtin_stxncpy_strncat): Same.
18844 (maybe_warn_pointless_strcmp): Same.
18845 * tree-ssa-uninit.c (maybe_warn_operand): Same.
18847 2021-07-06 Uroš Bizjak <ubizjak@gmail.com>
18850 * config/i386/predicates.md (vec_setm_operand): Enable
18851 register_operand for TARGET_SSE4_1.
18852 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_operand
18853 as operand 2 predicate. Call ix86_expand_vector_set_var
18854 for non-constant index operand.
18855 (vec_setv4qi): Use vec_setm_mmx_operand as operand 2 predicate.
18856 Call ix86_expand_vector_set_var for non-constant index operand.
18858 2021-07-06 Jeff Law <jeffreyalaw@gmail.com>
18860 * config/h8300/jumpcall.md (*branch): When possible, generate
18861 the comparison in CCZN mode.
18862 * config/h8300/predicates.md (simple_memory_operand): Reject all
18863 auto-increment addressing modes.
18865 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
18867 PR bootstrap/100246
18868 * config/i386/i386.h (struct stringop_algs): Define a CTOR for
18871 2021-07-06 Richard Biener <rguenther@suse.de>
18873 * doc/md.texi (vec_fmaddsub<mode>4): Document.
18874 (vec_fmsubadd<mode>4): Likewise.
18875 * optabs.def (vec_fmaddsub$a4): Add.
18876 (vec_fmsubadd$a4): Likewise.
18877 * internal-fn.def (IFN_VEC_FMADDSUB): Add.
18878 (IFN_VEC_FMSUBADD): Likewise.
18879 * tree-vect-slp-patterns.c (addsub_pattern::recognize):
18880 Refactor to handle IFN_VEC_FMADDSUB and IFN_VEC_FMSUBADD.
18881 (addsub_pattern::build): Likewise.
18882 * tree-vect-slp.c (vect_optimize_slp): CFN_VEC_FMADDSUB
18883 and CFN_VEC_FMSUBADD are not transparent for permutes.
18884 * config/i386/sse.md (vec_fmaddsub<mode>4): New expander.
18885 (vec_fmsubadd<mode>4): Likewise.
18887 2021-07-06 Richard Biener <rguenther@suse.de>
18889 * doc/invoke.texi (fmove-loop-stores): Document.
18890 * common.opt (fmove-loop-stores): New option.
18891 * opts.c (default_options_table): Enable -fmove-loop-stores
18892 at -O1 but not -Og.
18893 * tree-ssa-loop-im.c (pass_lim::execute): Pass
18894 flag_move_loop_stores instead of true to
18895 loop_invariant_motion_in_fun.
18897 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
18899 * doc/install.texi: Document --with-dsymutil.
18901 2021-07-06 Andrew Pinski <apinski@marvell.com>
18903 PR tree-optimization/101256
18904 * dbgcnt.def (phiopt_edge_range): New counter.
18905 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
18906 Check to make sure the new name is defined in the same
18907 bb as the conditional before duplicating range info.
18908 Also add debug counter.
18910 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
18912 PR rtl-optimization/100328
18913 * config/i386/i386-options.c (ix86_option_override_internal):
18914 Set param_ira_consider_dup_in_all_alts to 0.
18916 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
18918 PR rtl-optimization/100328
18919 * doc/invoke.texi (ira-consider-dup-in-all-alts): Document new
18921 * ira.c (ira_get_dup_out_num): Adjust as parameter
18922 param_ira_consider_dup_in_all_alts.
18923 * params.opt (ira-consider-dup-in-all-alts): New.
18924 * ira-conflicts.c (process_regs_for_copy): Add one parameter
18925 single_input_op_has_cstr_p.
18926 (get_freq_for_shuffle_copy): New function.
18927 (add_insn_allocno_copies): Adjust as single_input_op_has_cstr_p.
18928 * ira-int.h (ira_get_dup_out_num): Add one bool parameter.
18930 2021-07-05 Jeff Law <jeffreyalaw@gmail.com>
18932 * config/h8300/shiftrotate.md (shift-by-variable patterns): Update to
18933 generate condition code aware RTL directly.
18935 2021-07-05 Andrew Pinski <apinski@marvell.com>
18937 PR tree-optimization/101039
18938 * match.pd (A CMP 0 ? A : -A): New patterns.
18939 * tree-ssa-phiopt.c (abs_replacement): Delete function.
18940 (tree_ssa_phiopt_worker): Don't call abs_replacement.
18941 Update comment about abs_replacement.
18943 2021-07-05 Andrew Pinski <apinski@marvell.com>
18945 * tree-ssa-phiopt.c (gimple_simplify_phiopt):
18946 If "A ? B : C" fails to simplify, try "(!A) ? C : B".
18948 2021-07-05 Andrew Pinski <apinski@marvell.com>
18950 * tree-ssa-phiopt.c (match_simplify_replacement):
18951 Add early_p argument. Call gimple_simplify_phiopt
18952 instead of gimple_simplify.
18953 (tree_ssa_phiopt_worker): Update call to
18954 match_simplify_replacement and allow unconditionally.
18955 (phiopt_early_allow): New function.
18956 (gimple_simplify_phiopt): New function.
18958 2021-07-05 Andrew Pinski <apinski@marvell.com>
18960 PR middle-end/101237
18961 * fold-const.c (negate_expr_p): Remove call to element_mode
18962 and TREE_MODE/TREE_TYPE when calling HONOR_SIGNED_ZEROS,
18963 HONOR_SIGN_DEPENDENT_ROUNDING, and HONOR_SNANS.
18964 (fold_negate_expr_1): Likewise.
18965 (const_unop): Likewise.
18966 (fold_cond_expr_with_comparison): Likewise.
18967 (fold_binary_loc): Likewise.
18968 (fold_ternary_loc): Likewise.
18969 (tree_call_nonnegative_warnv_p): Likewise.
18970 * match.pd (-(A + B) -> (-B) - A): Likewise.
18972 2021-07-05 Iain Sandoe <iain@sandoe.co.uk>
18974 * configure.ac: Handle --with-dsymutil in the same way as we
18975 do for the assembler and linker. (DEFAULT_DSYMUTIL): New.
18976 Extract the type and version for the dsymutil configured or
18977 found by the default searches.
18978 * config.in: Regenerated.
18979 * configure: Regenerated.
18980 * collect2.c (do_dsymutil): Handle locating dsymutil in the
18981 same way as for the assembler and linker.
18982 * config/darwin.h (DSYMUTIL): Delete.
18983 * gcc.c: Report a configured dsymutil correctly.
18984 * exec-tool.in: Allow for dsymutil.
18986 2021-07-05 Uroš Bizjak <ubizjak@gmail.com>
18988 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
18989 Handle V4QI and V2HI modes.
18990 (expand_vec_perm_blend): Allow 4-byte vector modes with TARGET_SSE4_1.
18991 Handle V4QI mode. Emit mmx_pblendvb32 for 4-byte modes.
18992 (expand_vec_perm_pshufb): Rewrite to use switch statemets.
18993 Handle 4-byte dual operands with TARGET_XOP and single operands
18994 with TARGET_SSSE3. Emit mmx_ppermv32 for TARGET_XOP and
18995 mmx_pshufbv4qi3 for TARGET_SSSE3.
18996 (expand_vec_perm_pblendv): Allow 4-byte vector modes with TARGET_SSE4_1.
18997 (expand_vec_perm_interleave2): Allow 4-byte vector modes.
18998 (expand_vec_perm_pshufb2): Allow 4-byte vector modes with TARGET_SSSE3.
18999 (expand_vec_perm_even_odd_1): Handle V4QI mode.
19000 (expand_vec_perm_broadcast_1): Handle V4QI mode.
19001 (ix86_vectorize_vec_perm_const): Handle V4QI mode.
19002 * config/i386/mmx.md (mmx_ppermv32): New insn pattern.
19003 (mmx_pshufbv4qi3): Ditto.
19004 (*mmx_pblendw32): Ditto.
19005 (*mmx_pblendw64): Rename from *mmx_pblendw.
19006 (mmx_punpckhbw_low): New insn_and_split pattern.
19007 (mmx_punpcklbw_low): Ditto.
19009 2021-07-05 Richard Biener <rguenther@suse.de>
19011 * tree-vect-loop-manip.c (vect_loop_versioning): Do not
19012 set LOOP_C_INFINITE on the vectorized loop.
19014 2021-07-05 Richard Biener <rguenther@suse.de>
19016 PR middle-end/101291
19017 * cfgloopmanip.c (loop_version): Set the loop copy of the
19018 versioned loop to the new loop.
19020 2021-07-04 Iain Sandoe <iain@sandoe.co.uk>
19023 * config.gcc: Ensure that Darwin biarch definitions are
19024 added before i386.h.
19025 * config/i386/darwin.h (TARGET_64BIT): Remove.
19026 (PR80556_WORKAROUND): New.
19027 (REAL_LIBGCC_SPEC): Amend to use PR80556_WORKAROUND.
19028 (DARWIN_SUBARCH_SPEC): New.
19029 * config/i386/darwin32-biarch.h (TARGET_64BIT_DEFAULT,
19030 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
19031 (REAL_LIBGCC_SPEC): Remove.
19032 * config/i386/darwin64-biarch.h (TARGET_64BIT_DEFAULT,
19033 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
19034 (REAL_LIBGCC_SPEC): Remove.
19036 2021-07-03 H.J. Lu <hjl.tools@gmail.com>
19038 PR middle-end/101294
19039 * expr.c (store_constructor): Don't use vec_duplicate on vector.
19041 2021-07-02 Martin Sebor <msebor@redhat.com>
19043 PR middle-end/98871
19044 PR middle-end/98512
19045 * diagnostic.c (get_any_inlining_info): New.
19046 (update_effective_level_from_pragmas): Handle inlining context.
19047 (diagnostic_enabled): Same.
19048 (diagnostic_report_diagnostic): Same.
19049 * diagnostic.h (struct diagnostic_info): Add ctor.
19050 (struct diagnostic_context): Add new member.
19051 * tree-diagnostic.c (set_inlining_locations): New.
19052 (tree_diagnostics_defaults): Set new callback pointer.
19054 2021-07-02 Peter Bergner <bergner@linux.ibm.com>
19056 * config/rs6000/rs6000-builtin.def (BU_MMA_PAIR_LD, BU_MMA_PAIR_ST):
19058 (__builtin_vsx_lxvp, __builtin_vsx_stxvp): New built-ins.
19059 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Expand
19060 lxvp and stxvp built-ins.
19061 (mma_init_builtins): Handle lxvp and stxvp built-ins.
19062 (builtin_function_type): Likewise.
19063 * doc/extend.texi (__builtin_vsx_lxvp, __builtin_mma_stxvp): Document.
19065 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19067 * config/h8300/h8300-protos.h (compute_a_shift_cc): Accept
19068 additional argument for the code.
19069 * config/h8300/h8300.c (compute_a_shift_cc): Accept additional
19070 argument for the code. Just return if the ZN bits are useful or
19071 not rather than the old style CC_* enums.
19072 * config/h8300/shiftrotate.md (shiftqi_noscratch): Move before
19073 more generic shiftqi patterns.
19074 (shifthi_noscratch, shiftsi_noscratch): Similarly.
19075 (shiftqi_noscratch_set_flags): New pattern.
19076 (shifthi_noscratch_set_flags, shiftsi_noscratch_set_flags): Likewise.
19078 2021-07-02 Andrew MacLeod <amacleod@redhat.com>
19080 PR tree-optimization/101223
19081 * range-op.cc (build_lt): Add -1 for signed values.
19082 (built_gt): Subtract -1 for signed values.
19084 2021-07-02 David Faust <david.faust@oracle.com>
19086 * btfout.c (get_btf_kind): Support BTF_KIND_FLOAT.
19087 (btf_asm_type): Likewise.
19089 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19091 * config/h8300/h8300-protos.h (output_a_shift): Make first argument
19092 an array of rtx rather than a pointer to rtx. Add code argument.
19093 (compute_a_shift_length): Similarly.
19094 * config/h8300/h8300.c (h8300_shift_costs): Adjust now that the
19095 shift itself isn't an operand. Create dummy operand[0] to carry
19096 a mode and pass a suitable rtx code to compute_a_shift_length.
19097 (get_shift_alg): Adjust operand number of clobber in output templates.
19098 (output_a_shift): Make first argument an array of rtx rather than
19099 a pointer to rtx. Add code argument for the type of shift.
19100 Adjust now that the shift itself is no longer an operand.
19101 (compute_a_shift_length): Similarly.
19102 * config/h8300/shiftrotate.md (shiftqi, shifthi, shiftsi): Use an
19103 iterator rather than nshift_operator.
19104 (shiftqi_noscratch, shifthi_noscratch, shiftsi_noscratch): Likewise.
19105 (shiftqi_clobber_flags): Adjust to API changes in output_a_shift
19106 and compute_a_shift_length.
19107 (shiftqi_noscratch_clobber_flags): Likewise.
19108 (shifthi_noscratch_clobber_flags): Likewise.
19109 (shiftsi_noscratch_clobber_flags): Likewise.
19111 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
19114 * config/darwin.h (DSYMUTIL_SPEC): Do not try to run
19115 dsymutil for BTF/CTF.
19117 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
19120 * config/darwin.h (CTF_INFO_SECTION_NAME): Update the
19121 segment to include BTF.
19122 (BTF_INFO_SECTION_NAME): New.
19124 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19126 * config/m32r/m32r-protos.h (call_operand): Adjust return type.
19127 (small_data_operand, memreg_operand, small_insn_p): Likewise.
19128 * config/m32r/m32r.c (call_operand): Adjust return type.
19129 (small_data_operand, memreg_operand): Likewise.
19131 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19133 * config/frv/frv-protos.h (integer_register_operand): Adjust return
19135 (frv_load_operand, gpr_or_fpr_operand, gpr_no_subreg_operand): Likewise.
19136 (fpr_or_int6_operand, gpr_or_int_operand); Likewise.
19137 (gpr_or_int12_operand, gpr_or_int10_operand); Likewise.
19138 (move_source_operand, move_destination_operand): Likewise.
19139 (condexec_source_operand, condexec_dest_operand): Likewise.
19140 (lr_operand, gpr_or_memory_operand, fpr_or_memory_operand): Likewise.
19141 (reg_or_0_operand, fcc_operand, icc_operand, cc_operand): Likewise.
19142 (fcr_operand, icr_operand, cr_operand, call_operand): Likewise.
19143 (fpr_operand, even_reg_operand, odd_reg_operand): Likewise.
19144 (even_gpr_operand, odd_gpr_operand, quad_fpr_operand): Likewise.
19145 (even_fpr_operand, odd_fpr_operand): Likewise.
19146 (dbl_memory_one_insn_operand, dbl_memory_two_insn_operand): Likewise.
19147 (int12_operand, int6_operand, int5_operand, uint5_operand): Likewise.
19148 (uint4_operand, uint1_operand, int_2word_operand): Likewise
19149 (upper_int16_operand, uint16_operand, symbolic_operand): Likewise.
19150 (relational_operator, float_relational_operator): Likewise.
19151 (ccr_eqne_operator, minmax_operator): Likewise.
19152 (condexec_si_binary_operator, condexec_si_media_operator): Likewise.
19153 (condexec_si_divide_operator, condexec_si_unary_operator): Likewise.
19154 (condexec_sf_conv_operator, condexec_sf_add_operator): Likewise.
19155 (intop_compare_operator, acc_operand, even_acc_operand): Likewise.
19156 (quad_acc_operand, accg_operand): Likewise.
19158 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
19160 * config/stormy16/stormy16-protos.h (xstormy16_below_100_symbol): Change
19161 return type to a bool.
19162 (nonimmediate_nonstack_operand): Likewise.
19163 (xstormy16_splittable_below100_operand): Likewise.
19164 * config/stormy16/stormy16.c (xstormy16_below_100_symbol): Fix
19166 (xstormy16_splittable_below100_operand): Likewise.
19168 2021-07-02 Richard Biener <rguenther@suse.de>
19170 PR tree-optimization/101293
19171 * tree-ssa-loop-im.c (mem_ref_hasher::equal): Compare MEM_REF bases
19172 with combined offsets.
19173 (gather_mem_refs_stmt): Hash MEM_REFs as if their offset were
19174 combined with the rest of the offset.
19176 2021-07-02 Eric Botcazou <ebotcazou@adacore.com>
19178 * config/i386/i386.c (asm_preferred_eh_data_format): Always use the
19179 PIC encodings for PE-COFF targets.
19181 2021-07-02 Jakub Jelinek <jakub@redhat.com>
19184 * config/i386/i386-expand.c (ix86_broadcast_from_integer_constant):
19185 Return nullptr for TImode inner mode.
19187 2021-07-02 Richard Biener <rguenther@suse.de>
19189 PR tree-optimization/101280
19190 PR tree-optimization/101173
19191 * gimple-loop-interchange.cc
19192 (tree_loop_interchange::valid_data_dependences): Properly
19193 guard all dependence checks with DDR_REVERSED_P or its
19196 2021-07-02 Hongyu Wang <hongyu.wang@intel.com>
19198 * config/i386/i386-expand.c (ix86_expand_builtin):
19199 Add branch to clear odata when ZF is set for asedecenc_expand
19200 and wideaesdecenc_expand.
19202 2021-07-02 Eugene Rozenfeld <erozen@microsoft.com>
19204 * config/i386/gcc-auto-profile: regenerate
19206 2021-07-02 liuhongt <hongtao.liu@intel.com>
19208 * config/i386/sse.md (trunc<mode><pmov_dst_4>2): Refined to ..
19209 (trunc<mode><pmov_dst_4_lower>2): this.
19211 2021-07-01 David Malcolm <dmalcolm@redhat.com>
19213 * diagnostic.h (diagnostic_context::m_file_cache): New field.
19214 * input.c (class fcache): Rename to...
19215 (class file_cache_slot): ...this, making most members private and
19216 prefixing fields with "m_".
19217 (file_cache_slot::get_file_path): New accessor.
19218 (file_cache_slot::get_use_count): New accessor.
19219 (file_cache_slot::missing_trailing_newline_p): New accessor.
19220 (file_cache_slot::inc_use_count): New.
19221 (fcache_buffer_size): Move to...
19222 (file_cache_slot::buffer_size): ...here.
19223 (fcache_line_record_size): Move to...
19224 (file_cache_slot::line_record_size): ...here.
19225 (fcache_tab): Delete, in favor of global_dc->m_file_cache.
19226 (fcache_tab_size): Move to file_cache::num_file_slots.
19227 (diagnostic_file_cache_init): Update for move of fcache_tab
19228 to global_dc->m_file_cache.
19229 (diagnostic_file_cache_fini): Likewise.
19230 (lookup_file_in_cache_tab): Convert to...
19231 (file_cache::lookup_file): ...this.
19232 (diagnostics_file_cache_forcibly_evict_file): Update for move of
19233 fcache_tab to global_dc->m_file_cache, moving most of
19234 implementation to...
19235 (file_cache::forcibly_evict_file): ...this new function and...
19236 (file_cache_slot::evict): ...this new function.
19237 (evicted_cache_tab_entry): Convert to...
19238 (file_cache::evicted_cache_tab_entry): ...this.
19239 (add_file_to_cache_tab): Convert to...
19240 (file_cache::add_file): ...this, moving bulk of implementation
19242 (file_cache_slot::create): ..this new function.
19243 (file_cache::file_cache): New.
19244 (file_cache::~file_cache): New.
19245 (lookup_or_add_file_to_cache_tab): Convert to...
19246 (file_cache::lookup_or_add_file): ..this new function.
19247 (fcache::fcache): Rename to...
19248 (file_cache_slot::file_cache_slot): ...this, adding "m_" prefixes
19250 (fcache::~fcache): Rename to...
19251 (file_cache_slot::~file_cache_slot): ...this, adding "m_" prefixes
19253 (needs_read): Convert to...
19254 (file_cache_slot::needs_read_p): ...this.
19255 (needs_grow): Convert to...
19256 (file_cache_slot::needs_grow_p): ...this.
19257 (maybe_grow): Convert to...
19258 (file_cache_slot::maybe_grow): ...this.
19259 (read_data): Convert to...
19260 (file_cache_slot::read_data): ...this.
19261 (maybe_read_data): Convert to...
19262 (file_cache_slot::maybe_read_data): ...this.
19263 (get_next_line): Convert to...
19264 (file_cache_slot::get_next_line): ...this.
19265 (goto_next_line): Convert to...
19266 (file_cache_slot::goto_next_line): ...this.
19267 (read_line_num): Convert to...
19268 (file_cache_slot::read_line_num): ...this.
19269 (location_get_source_line): Update for moving of globals to
19270 global_dc->m_file_cache.
19271 (location_missing_trailing_newline): Likewise.
19272 * input.h (class file_cache_slot): New forward decl.
19273 (class file_cache): New.
19275 2021-07-01 Michael Meissner <meissner@linux.ibm.com>
19277 * config/rs6000/rs6000.c (rs6000_maybe_emit_fp_cmove): Add IEEE
19278 128-bit floating point conditional move support.
19279 (have_compare_and_set_mask): Add IEEE 128-bit floating point
19281 * config/rs6000/rs6000.md (mov<mode>cc, IEEE128 iterator): New insn.
19282 (mov<mode>cc_p10, IEEE128 iterator): New insn.
19283 (mov<mode>cc_invert_p10, IEEE128 iterator): New insn.
19284 (fpmask<mode>, IEEE128 iterator): New insn.
19285 (xxsel<mode>, IEEE128 iterator): New insn.
19287 2021-07-01 Iain Sandoe <iain@sandoe.co.uk>
19290 * config/darwin.h (CTF_INFO_SECTION_NAME): New.
19292 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
19294 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
19296 * config/i386/i386-protos.h (ix86_expand_vector_init_duplicate):
19298 * config/i386/sse.md (INT_BROADCAST_MODE): New mode iterator.
19299 (vec_duplicate<mode>): New expander.
19301 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
19304 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
19306 (ix86_byte_broadcast): New function.
19307 (ix86_convert_const_wide_int_to_broadcast): Likewise.
19308 (ix86_expand_move): Convert CONST_WIDE_INT to broadcast if mode
19309 size is 16 bytes or bigger.
19310 (ix86_broadcast_from_integer_constant): New function.
19311 (ix86_expand_vector_move): Convert CONST_WIDE_INT and CONST_VECTOR
19312 to broadcast if mode size is 16 bytes or bigger.
19313 * config/i386/i386-protos.h (ix86_gen_scratch_sse_rtx): New
19315 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): New function.
19317 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
19319 * config/i386/predicates.md (ix86_endbr_immediate_operand):
19320 Return true/false instead of 1/0.
19321 (movq_parallel): Ditto.
19323 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
19325 * recog.c (general_operand): Return true/false instead of 1/0.
19326 (register_operand): Ditto.
19327 (immediate_operand): Ditto.
19328 (const_int_operand): Ditto.
19329 (const_scalar_int_operand): Ditto.
19330 (const_double_operand): Ditto.
19331 (push_operand): Ditto.
19332 (pop_operand): Ditto.
19333 (memory_operand): Ditto.
19334 (indirect_operand): Ditto.
19336 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
19338 * genpreds.c (write_predicate_subfunction):
19339 Change the type of written subfunction to bool.
19340 (write_one_predicate_function):
19341 Change the type of written function to bool.
19342 (write_tm_preds_h): Ditto.
19343 * recog.h (*insn_operand_predicate_fn): Change the type to bool.
19344 * recog.c (general_operand): Change the type to bool.
19345 (address_operand): Ditto.
19346 (register_operand): Ditto.
19347 (pmode_register_operand): Ditto.
19348 (scratch_operand): Ditto.
19349 (immediate_operand): Ditto.
19350 (const_int_operand): Ditto.
19351 (const_scalar_int_operand): Ditto.
19352 (const_double_operand): Ditto.
19353 (nonimmediate_operand): Ditto.
19354 (nonmemory_operand): Ditto.
19355 (push_operand): Ditto.
19356 (pop_operand): Ditto.
19357 (memory_operand): Ditto.
19358 (indirect_operand): Ditto.
19359 (ordered_comparison_operator): Ditto.
19360 (comparison_operator): Ditto.
19361 * config/i386/i386-expand.c (ix86_expand_sse_cmp):
19362 Change the type of indirect predicate function to bool.
19363 * config/rs6000/rs6000.c (easy_vector_constant):
19364 Change the type to bool.
19365 * config/mips/mips-protos.h (m16_based_address_p):
19366 Change the type of operand 3 to bool.
19368 2021-07-01 Richard Biener <rguenther@suse.de>
19370 PR tree-optimization/101280
19371 PR tree-optimization/101173
19372 * gimple-loop-interchange.cc
19373 (tree_loop_interchange::valid_data_dependences): Revert
19374 previous change and instead correctly handle DDR_REVERSED_P
19377 2021-07-01 Richard Biener <rguenther@suse.de>
19379 PR tree-optimization/101278
19380 * tree-ssa-dse.c (dse_classify_store): First check for
19381 uses, then ignore stmt for chaining purposes.
19383 2021-07-01 Richard Biener <rguenther@suse.de>
19385 PR tree-optimization/100778
19386 * tree-vect-slp.c (vect_schedule_slp_node): Do not place trapping
19387 vectorized ops ahead of their scalar BB.
19389 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
19392 * config/i386/i386.md (*nabs<dwi>2_doubleword):
19393 New insn_and_split pattern.
19394 (*nabs<dwi>2_1): Ditto.
19395 * config/i386/i386-features.c
19396 (general_scalar_chain::compute_convert_gain):
19397 Handle (NEG (ABS (...))) RTX. Rewrite src code
19398 scanner as switch statement.
19399 (general_scalar_chain::convert_insn):
19400 Handle (NEG (ABS (...))) RTX.
19401 (general_scalar_to_vector_candidate_p):
19402 Detect (NEG (ABS (...))) RTX. Reorder case statements
19403 for (AND (NOT (...) ...)) fallthrough.
19405 2021-07-01 Richard Biener <rguenther@suse.de>
19407 PR tree-optimization/101178
19408 * tree-vect-slp.c (slpg_vertex::materialize): Remove.
19409 (slpg::perm_in): Add.
19410 (slpg::get_perm_in): Remove.
19411 (slpg::get_perm_materialized): Add.
19412 (vect_optimize_slp): Handle VEC_PERM nodes more optimally
19413 during permute propagation and materialization.
19415 2021-07-01 Jakub Jelinek <jakub@redhat.com>
19418 * dwarf2out.c (loc_list_from_tree_1): Handle COMPOUND_LITERAL_EXPR.
19420 2021-07-01 Jakub Jelinek <jakub@redhat.com>
19422 PR middle-end/94366
19423 * omp-low.c (lower_rec_input_clauses): Rename is_fp_and_or to
19424 is_truth_op, set it for TRUTH_*IF_EXPR regardless of new_var's type,
19425 use boolean_type_node instead of integer_type_node as NE_EXPR type.
19426 (lower_reduction_clauses): Likewise.
19428 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
19430 * config/gcn/gcn.c: Include dwarf2.h.
19431 (gcn_addr_space_debug): New function.
19432 (TARGET_ADDR_SPACE_DEBUG): New hook.
19434 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
19436 * common/config/gcn/gcn-common.c
19437 (gcn_option_optimization_table): Change OPT_fomit_frame_pointer to -O3.
19438 * config/gcn/gcn.c (gcn_expand_prologue): Prefer the frame pointer
19440 (gcn_expand_prologue): Prefer the frame pointer when emitting CFI.
19441 (gcn_frame_pointer_rqd): New function.
19442 (TARGET_FRAME_POINTER_REQUIRED): New hook.
19444 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
19446 * config/gcn/gcn.c (move_callee_saved_registers): Emit CFI notes for
19447 prologue register saves.
19448 (gcn_debug_unwind_info): Use UI_DWARF2.
19449 (gcn_dwarf_register_number): Map DWARF_LINK_REGISTER to DWARF PC.
19450 (gcn_dwarf_register_span): DWARF_LINK_REGISTER doesn't span.
19451 * config/gcn/gcn.h: (DWARF_FRAME_RETURN_COLUMN): New define.
19452 (DWARF_LINK_REGISTER): New define.
19453 (FIRST_PSEUDO_REGISTER): Increment.
19454 (FIXED_REGISTERS): Add entry for DWARF_LINK_REGISTER.
19455 (CALL_USED_REGISTERS): Likewise.
19456 (REGISTER_NAMES): Likewise.
19458 2021-06-30 Richard Biener <rguenther@suse.de>
19460 PR tree-optimization/101267
19461 * tree-vect-stmts.c (vect_check_scalar_mask): Adjust
19462 API and use SLP compatible interface of vect_is_simple_use.
19463 Reject not vectorized SLP defs for callers that do not support
19465 (vect_check_store_rhs): Handle masked stores and pass down
19466 the appropriate operator index.
19467 (vectorizable_call): Adjust.
19468 (vectorizable_store): Likewise.
19469 (vectorizable_load): Likewise. Handle SLP pecularity of
19471 (vect_is_simple_use): Remove special-casing of masked stores.
19473 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
19475 * common.opt (foffload): Remove help as Driver only.
19476 * gcc.c (display_help): Add -foffload.
19478 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
19480 * gcc.c (close_at_file, execute): Replace alloca by XALLOCAVEC.
19481 (check_offload_target_name): Fix splitting OFFLOAD_TARGETS into
19482 a candidate list; better inform no offload target is configured
19483 and fix hint extraction when passed target is not '\0' at [len].
19484 * common.opt (foffload): Add tailing '.'.
19485 (foffload-options): Likewise; fix flag name in the help string.
19487 2021-06-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
19490 * config/arm/arm_neon.h: Move vabs intrinsics before vcage_f32.
19491 (vcage_f32): Gate comparison on __FAST_MATH__.
19492 (vcageq_f32): Likewise.
19493 (vcale_f32): Likewise.
19494 (vcaleq_f32): Likewise.
19495 (vcagt_f32): Likewise.
19496 (vcagtq_f32): Likewise.
19497 (vcalt_f32): Likewise.
19498 (vcaltq_f32): Likewise.
19499 (vcage_f16): Likewise.
19500 (vcageq_f16): Likewise.
19501 (vcale_f16): Likewise.
19502 (vcaleq_f16): Likewise.
19503 (vcagt_f16): Likewise.
19504 (vcagtq_f16): Likewise.
19505 (vcalt_f16): Likewise.
19506 (vcaltq_f16): Likewise.
19508 2021-06-30 Richard Biener <rguenther@suse.de>
19510 PR tree-optimization/101264
19511 * tree-vect-slp.c (vect_optimize_slp): Propagate the
19512 computed perm_in to all "any" permute successors
19513 we cannot de-duplicate immediately.
19515 2021-06-30 liuhongt <hongtao.liu@intel.com>
19518 * config/i386/sse.md
19519 (avx512f_sfixupimm<mode><sd_maskz_name><round_saeonly_name>):
19521 (avx512f_sfixupimm<mode><maskz_scalar_name><round_saeonly_name>):
19523 (avx512f_sfixupimm<mode>_mask<round_saeonly_name>"): Refined.
19524 * config/i386/subst.md (maskz_scalar): New define_subst.
19525 (maskz_scalar_name): New subst_attr.
19526 (maskz_scalar_op5): Ditto.
19527 (round_saeonly_maskz_scalar_op5): Ditto.
19528 (round_saeonly_maskz_scalar_operand5): Ditto.
19530 2021-06-30 David Edelsohn <dje.gcc@gmail.com>
19532 * config/rs6000/rs6000.c (rs6000_xcoff_section_type_flags):
19533 Increase code CSECT alignment to at least 32 bytes.
19534 * config/rs6000/xcoff.h (TEXT_SECTION_ASM_OP): Add 32 byte
19535 alignment designation.
19537 2021-06-29 Sergei Trofimovich <siarheit@google.com>
19539 * doc/generic.texi: Fix s/net yet/not yet/ typo.
19541 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
19543 PR tree-optimization/101254
19544 * range-op.cc (operator_minus::op1_op2_relation_effect): Check for
19545 wrapping/non-wrapping when setting the result range.
19547 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
19549 * value-query.cc (gimple_range_global): Allow phis.
19551 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
19553 * vr-values.c (vr_values::vrp_stmt_computes_nonzero): Use stmt.
19554 (simplify_using_ranges::op_with_boolean_value_range_p): Add a
19555 statement for location context.
19556 (check_for_binary_op_overflow): Ditto.
19557 (simplify_using_ranges::get_vr_for_comparison): Ditto.
19558 (simplify_using_ranges::compare_name_with_value): Ditto.
19559 (simplify_using_ranges::compare_names): Ditto.
19560 (vrp_evaluate_conditional_warnv_with_ops_using_ranges): Ditto.
19561 (simplify_using_ranges::simplify_truth_ops_using_ranges): Ditto.
19562 (simplify_using_ranges::simplify_min_or_max_using_ranges): Ditto.
19563 (simplify_using_ranges::simplify_internal_call_using_ranges): Ditto.
19564 (simplify_using_ranges::two_valued_val_range_p): Ditto.
19565 (simplify_using_ranges::simplify): Ditto.
19566 * vr-values.h: Adjust prototypes.
19568 2021-06-29 Uroš Bizjak <ubizjak@gmail.com>
19571 * config/i386/mmx.md (vec_addsubv2sf3): New insn pattern.
19573 2021-06-29 Julian Brown <julian@codesourcery.com>
19575 * config/gcn/gcn.c (gcn_init_libfuncs): New function.
19576 (TARGET_INIT_LIBFUNCS): Define target hook using above function.
19577 * config/gcn/gcn.h (UNITS_PER_WORD): Define to 8 for IN_LIBGCC2, 4
19579 (LIBGCC2_UNITS_PER_WORD, BITS_PER_WORD): Remove definitions.
19580 (MAX_FIXED_MODE_SIZE): Change to 128.
19582 2021-06-29 Julian Brown <julian@codesourcery.com>
19584 * config/gcn/gcn.md (UNSPEC_FLBIT_INT): New unspec constant.
19585 (s_mnemonic): Add clrsb.
19586 (gcn_flbit<mode>_int): Add insn pattern for SImode/DImode.
19587 (clrsb<mode>2): Add expander for SImode/DImode.
19589 2021-06-29 Julian Brown <julian@codesourcery.com>
19591 * config/gcn/gcn.md (<su>mulsidi3, <su>mulsidi3_reg, <su>mulsidi3_imm,
19592 muldi3): Add patterns.
19594 2021-06-29 Julian Brown <julian@codesourcery.com>
19596 * config/gcn/gcn.md (<su>mulsi3_highpart): Change to expander.
19597 (<su>mulsi3_highpart_reg, <su>mulsi3_highpart_imm): New patterns.
19599 2021-06-29 Julian Brown <julian@codesourcery.com>
19601 * config/gcn/gcn.md (mulsi3): Make s_mulk_i32 variant clobber SCC.
19603 2021-06-29 Joseph Myers <joseph@codesourcery.com>
19605 * btfout.c, ctfout.c: Include "memmodel.h".
19607 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
19609 * gcc.c (check_offload_target_name): Cast len argument to
19610 %q.*s to 'int'; avoid -Wstringop-truncation warning.
19612 2021-06-29 Richard Biener <rguenther@suse.de>
19614 * tree-vect-slp.c (vect_optimize_slp): Forward propagate
19615 to "any" permute nodes and relax "any" permute proapgation
19616 during iterative backward propagation.
19618 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
19621 * common.opt (-foffload=): Update description.
19622 (-foffload-options=): New.
19623 * doc/invoke.texi (C Language Options): Document
19624 -foffload and -foffload-options.
19625 * gcc.c (check_offload_target_name): New, split off from
19626 handle_foffload_option.
19627 (check_foffload_target_names): New.
19628 (handle_foffload_option): Handle -foffload=default.
19629 (driver_handle_option): Update for -foffload-options.
19630 * lto-opts.c (lto_write_options): Use -foffload-options
19631 instead of -foffload.
19632 * lto-wrapper.c (merge_and_complain, append_offload_options):
19634 * opts.c (common_handle_option): Likewise.
19636 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
19638 * doc/invoke.texi (C Language Options): Sort options
19639 alphabetically in optlist and also the description itself.
19640 Remove leftover -fallow-single-precision from and add missing
19641 -fgnu-tm to the optlist.
19643 2021-06-29 Richard Biener <rguenther@suse.de>
19645 * tree-vect-slp.c (slpg_vertex::visited): Remove.
19646 (vect_slp_perms_eq): Handle -1 permutes.
19647 (vect_optimize_slp): Rewrite permute propagation.
19649 2021-06-29 Jakub Jelinek <jakub@redhat.com>
19652 * match.pd ((intptr_t)x eq/ne CST to x eq/ne (typeof x) CST): Don't
19653 perform the optimization in GENERIC when sanitizing and x has a
19656 2021-06-29 Richard Biener <rguenther@suse.de>
19658 PR tree-optimization/101242
19659 * tree-vect-slp.c (vect_slp_build_vertices): Force-add
19660 PHIs with not represented initial values as leafs.
19662 2021-06-29 Jan-Benedict Glaw <jbglaw@getslash.de>
19664 * config/pdp11/pdp11.h (ASM_OUTPUT_SKIP): Fix signedness warning.
19665 * config/pdp11/pdp11.c (pdp11_asm_print_operand_punct_valid_p): Remove
19666 "register" keyword.
19667 (pdp11_initial_elimination_offset) Remove unused variable.
19668 (pdp11_cmp_length) Ditto.
19669 (pdp11_insn_cost): Ditto, and fix signedness warning.
19671 2021-06-29 David Edelsohn <dje.gcc@gmail.com>
19673 * btfout.c: Include tm_p.h.
19676 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
19678 * config/bpf/bpf.c (bpf_expand_prologue): Do not mark insns as
19680 (bpf_expand_epilogue): Likewise.
19681 * config/bpf/bpf.h (DWARF2_FRAME_INFO): Define to 0.
19682 Do not define DBX_DEBUGGING_INFO.
19684 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
19686 * doc/invoke.texi: Document the CTF and BTF debug info options.
19688 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
19689 David Faust <david.faust@oracle.com>
19690 Jose E. Marchesi <jose.marchesi@oracle.com>
19691 Weimin Pan <weimin.pan@oracle.com>
19693 * Makefile.in: Add ctfc.*, ctfout.c and btfout.c files to
19694 GTFILES. Add new object files.
19695 * common.opt: Add CTF and BTF debug info options.
19696 * btfout.c: New file.
19697 * ctfc.c: Likewise.
19698 * ctfc.h: Likewise.
19699 * ctfout.c: Likewise.
19700 * dwarf2ctf.c: Likewise.
19701 * dwarf2ctf.h: Likewise.
19702 * dwarf2cfi.c (dwarf2out_do_frame): Acknowledge CTF_DEBUG and
19704 * dwarf2out.c (dwarf2out_source_line): Likewise.
19705 (dwarf2out_finish): Skip emitting DWARF if CTF or BTF are to
19707 (debug_format_do_cu): New function.
19708 (dwarf2out_early_finish): Traverse DIEs and emit CTF/BTF for
19710 Include dwarf2ctf.c.
19711 * final.c (dwarf2_debug_info_emitted_p): Acknowledge DWARF-based debug
19713 * flag-types.h (enum debug_info_type): Add CTF_DEBUG and BTF_DEBUG.
19714 (CTF_DEBUG): New bitmask.
19715 (BTF_DEBUG): Likewise.
19716 (enum ctf_debug_info_levels): New enum.
19717 * gengtype.c (open_base_files): Handle ctfc.h.
19718 (main): Handle uint32_t type.
19719 * flags.h (btf_debuginfo_p): New definition.
19720 (dwarf_based_debuginfo_p): Likewise.
19721 * opts.c (debug_type_names): Add entries for CTF and BTF.
19722 (btf_debuginfo_p): New function.
19723 (dwarf_based_debuginfo_p): Likewise.
19724 (common_handle_option): Handle -gctfN and -gbtf options.
19725 (set_debug_level): Set CTF_DEBUG, BTF_DEBUG whenever appropriate.
19726 * toplev.c (process_options): Inform the user and ignore -gctfLEVEL if
19729 2021-06-28 Jose E. Marchesi <jose.marchesi@oracle.com>
19731 * dwarf2out.c (AT_class): Function is no longer static.
19732 (AT_int): Likewise.
19733 (AT_unsigned): Likewise.
19734 (AT_loc): Likewise.
19735 (get_AT): Likewise.
19736 (get_AT_string): Likewise.
19737 (get_AT_flag): Likewise.
19738 (get_AT_unsigned): Likewise.
19739 (get_AT_ref): Likewise.
19740 (new_die_raw): Likewise.
19741 (lookup_decl_die): Likewise.
19742 (base_type_die): Likewise.
19743 (add_name_attribute): Likewise.
19744 (add_AT_int): Likewise.
19745 (add_AT_unsigned): Likewise.
19746 (add_AT_loc): Likewise.
19747 (dw_get_die_tag): New function.
19748 (dw_get_die_child): Likewise.
19749 (dw_get_die_sib): Likewise.
19750 (struct dwarf_file_data): Move from here to dwarf2out.h
19751 (struct dw_attr_struct): Likewise.
19752 * dwarf2out.h: Analogous changes.
19754 2021-06-28 Martin Jambor <mjambor@suse.cz>
19757 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
19758 members m_dead_stmts and m_dead_ssas.
19759 * ipa-param-manipulation.c
19760 (ipa_param_body_adjustments::mark_dead_statements): New function.
19761 (ipa_param_body_adjustments::common_initialization): Call it on
19762 all removed but not split parameters.
19763 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
19765 (ipa_param_body_adjustments::modify_call_stmt): Remove arguments that
19767 * tree-inline.c (remap_gimple_stmt): Do not copy dead statements, reset
19768 dead debug statements.
19769 (copy_phis_for_bb): Do not copy dead PHI nodes.
19771 2021-06-28 Martin Jambor <mjambor@suse.cz>
19774 * symtab-clones.h (clone_info): Removed member param_adjustments.
19775 * ipa-param-manipulation.h: Adjust initial comment to reflect how we
19776 deal with pass-through splits now.
19777 (ipa_param_performed_split): Removed.
19778 (ipa_param_adjustments::modify_call): Adjusted parameters.
19779 (class ipa_param_body_adjustments): Adjusted parameters of
19780 register_replacement, modify_gimple_stmt and modify_call_stmt.
19781 (ipa_verify_edge_has_no_modifications): Declare.
19782 (ipa_edge_modifications_finalize): Declare.
19783 * cgraph.c (cgraph_edge::redirect_call_stmt_to_callee): Remove
19784 performed_splits processing, pas only edge to padjs->modify_call,
19785 check that call arguments were not modified if they should not have
19787 * cgraphclones.c (cgraph_node::create_clone): Do not copy performed
19789 * ipa-param-manipulation.c (struct pass_through_split_map): New type.
19790 (ipa_edge_modification_info): Likewise.
19791 (ipa_edge_modification_sum): Likewise.
19792 (ipa_edge_modifications): New edge summary.
19793 (ipa_verify_edge_has_no_modifications): New function.
19794 (transitive_split_p): Removed.
19795 (transitive_split_map): Likewise.
19796 (init_transitive_splits): Likewise.
19797 (ipa_param_adjustments::modify_call): Adjusted to use the new edge
19798 summary instead of performed_splits.
19799 (ipa_param_body_adjustments::register_replacement): Drop dummy
19800 parameter, set base_index of the created ipa_param_body_replacement.
19801 (phi_arg_will_live_p): New function.
19802 (ipa_param_body_adjustments::common_initialization): Do not create
19803 IPA_SRA dummy decls.
19804 (simple_tree_swap_info): Removed.
19805 (remap_split_decl_to_dummy): Likewise.
19806 (record_argument_state_1): New function.
19807 (record_argument_state): Likewise.
19808 (ipa_param_body_adjustments::modify_call_stmt): New parameter
19809 orig_stmt. Do not work with dummy decls, save necessary info about
19810 changes to ipa_edge_modifications.
19811 (ipa_param_body_adjustments::modify_gimple_stmt): New parameter
19812 orig_stmt, pass it to modify_call_stmt.
19813 (ipa_param_body_adjustments::modify_cfun_body): Adjust call to
19814 modify_gimple_stmt.
19815 (ipa_edge_modifications_finalize): New function.
19816 * tree-inline.c (remap_gimple_stmt): Pass original statement to
19817 modify_gimple_stmt.
19818 (copy_phis_for_bb): Do not copy dead PHI nodes.
19819 (expand_call_inline): Do not remap performed_splits.
19820 (update_clone_info): Likewise.
19821 * toplev.c: Include ipa-param-manipulation.h.
19822 (toplev::finalize): Call ipa_edge_modifications_finalize.
19824 2021-06-28 Andrew Pinski <apinski@marvell.com>
19826 * tree-ssa-phiopt.c (replace_phi_edge_with_variable): Duplicate range
19827 info if we're the only things setting the target PHI.
19828 (value_replacement): Don't duplicate range here.
19829 (minmax_replacement): Likewise.
19831 2021-06-28 Richard Biener <rguenther@suse.de>
19833 PR tree-optimization/101229
19834 * gimple-walk.c (gimple_walk_op): Handle PHIs.
19836 2021-06-28 Martin Liska <mliska@suse.cz>
19838 * config/v850/v850.c (construct_dispose_instruction): Allocate
19840 (construct_prepare_instruction): Likewise.
19842 2021-06-28 Martin Liska <mliska@suse.cz>
19844 * config/v850/v850.c (v850_option_override): Build default
19846 (v850_can_inline_p): New. Allow MASK_PROLOG_FUNCTION to be
19847 ignored for inlining.
19848 (TARGET_CAN_INLINE_P): New.
19850 2021-06-28 Richard Biener <rguenther@suse.de>
19852 PR tree-optimization/101207
19853 * tree-vect-slp.c (vect_optimize_slp): Do BB reduction
19854 permute eliding for load permutations properly.
19856 2021-06-28 Richard Biener <rguenther@suse.de>
19858 PR tree-optimization/101173
19859 * gimple-loop-interchange.cc
19860 (tree_loop_interchange::valid_data_dependences): Disallow outer
19861 loop dependence distance of zero.
19863 2021-06-28 liuhongt <hongtao.liu@intel.com>
19866 * config/i386/sse.md (*avx_cmp<mode>3_lt): New
19867 define_insn_and_split.
19868 (*avx_cmp<mode>3_ltint): Ditto.
19869 (*avx2_pcmp<mode>3_3): Ditto.
19870 (*avx2_pcmp<mode>3_4): Ditto.
19871 (*avx2_pcmp<mode>3_5): Ditto.
19873 2021-06-28 liuhongt <hongtao.liu@intel.com>
19875 * config/i386/i386-builtin.def (IX86_BUILTIN_BLENDVPD256,
19876 IX86_BUILTIN_BLENDVPS256, IX86_BUILTIN_PBLENDVB256,
19877 IX86_BUILTIN_BLENDVPD, IX86_BUILTIN_BLENDVPS,
19878 IX86_BUILTIN_PBLENDVB128): Replace icode with
19880 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold blendv
19882 * config/i386/sse.md (*<sse4_1_avx2>_pblendvb_lt_subreg_not):
19883 New pre_reload splitter.
19885 2021-06-27 Andrew Pinski <apinski@marvell.com>
19887 PR middle-end/101230
19888 * fold-const.c (fold_ternary_loc): Check
19889 the return value of invert_tree_comparison.
19891 2021-06-27 David Edelsohn <dje.gcc@gmail.com>
19893 * config.gcc: Add SPDX License Identifier.
19894 (powerpc-ibm-aix789): Default to aix73.h.
19895 (powerpc-ibm-aix7.2.*.*): New stanza.
19896 * config/rs6000/aix72.h: Add SPDX License Identifier.
19897 * config/rs6000/aix73.h: New file.
19899 2021-06-26 Jason Merrill <jason@redhat.com>
19901 * except.c: #include "dwarf2.h" instead of "dwarf2out.h".
19903 2021-06-26 Andrew Pinski <apinski@marvell.com>
19905 * genmatch.c (lower_cond): Copy for_subst_vec
19906 for the simplify also.
19907 (lower): Swap the order for lower_for and lower_cond.
19909 2021-06-26 Andrew Pinski <apinski@marvell.com>
19911 * tree-ssa-phiopt.c (match_simplify_replacement): Reset
19912 flow senatitive info on the moved ssa set.
19914 2021-06-26 Andrew Pinski <apinski@marvell.com>
19916 * fold-const.c (fold_cond_expr_with_comparison):
19917 Exand arg0 into comp_code, arg00, and arg01.
19918 (fold_ternary_loc): Use invert_tree_comparison
19919 instead of fold_invert_truthvalue for the case
19920 where we have A CMP B ? C : A.
19922 2021-06-25 Martin Sebor <msebor@redhat.com>
19924 PR middle-end/101216
19925 * calls.c (maybe_warn_rdwr_sizes): Use the no_warning constant.
19927 2021-06-25 Jeff Law <jeffreyalaw@gmail.com>
19929 * config/h8300/h8300.c (select_cc_mode): Handle ASHIFTRT and LSHIFTRT.
19931 2021-06-25 Richard Biener <rguenther@suse.de>
19933 PR tree-optimization/101202
19934 * tree-vect-slp.c (vect_optimize_slp): Explicitely handle
19937 2021-06-25 Richard Biener <rguenther@suse.de>
19939 * tree-vect-slp-patterns.c (addsub_pattern::build): Copy
19940 STMT_VINFO_REDUC_DEF from the original representative.
19942 2021-06-25 Martin Sebor <msebor@redhat.com>
19944 * builtins.c (warn_string_no_nul): Replace uses of TREE_NO_WARNING,
19945 gimple_no_warning_p and gimple_set_no_warning with
19946 warning_suppressed_p, and suppress_warning.
19948 (maybe_warn_for_bound): Same.
19949 (warn_for_access): Same.
19950 (check_access): Same.
19951 (expand_builtin_strncmp): Same.
19952 (fold_builtin_varargs): Same.
19953 * calls.c (maybe_warn_nonstring_arg): Same.
19954 (maybe_warn_rdwr_sizes): Same.
19955 * cfgexpand.c (expand_call_stmt): Same.
19956 * cgraphunit.c (check_global_declaration): Same.
19957 * fold-const.c (fold_undefer_overflow_warnings): Same.
19958 (fold_truth_not_expr): Same.
19959 (fold_unary_loc): Same.
19960 (fold_checksum_tree): Same.
19961 * gimple-array-bounds.cc (array_bounds_checker::check_array_ref): Same.
19962 (array_bounds_checker::check_mem_ref): Same.
19963 (array_bounds_checker::check_addr_expr): Same.
19964 (array_bounds_checker::check_array_bounds): Same.
19965 * gimple-expr.c (copy_var_decl): Same.
19966 * gimple-fold.c (gimple_fold_builtin_strcpy): Same.
19967 (gimple_fold_builtin_strncat): Same.
19968 (gimple_fold_builtin_stxcpy_chk): Same.
19969 (gimple_fold_builtin_stpcpy): Same.
19970 (gimple_fold_builtin_sprintf): Same.
19971 (fold_stmt_1): Same.
19972 * gimple-ssa-isolate-paths.c (diag_returned_locals): Same.
19973 * gimple-ssa-nonnull-compare.c (do_warn_nonnull_compare): Same.
19974 * gimple-ssa-sprintf.c (handle_printf_call): Same.
19975 * gimple-ssa-store-merging.c (imm_store_chain_info::output_merged_store): Same.
19976 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
19977 * gimple-ssa-warn-restrict.h: Adjust declarations.
19978 (maybe_diag_access_bounds): Replace uses of TREE_NO_WARNING,
19979 gimple_no_warning_p and gimple_set_no_warning with
19980 warning_suppressed_p, and suppress_warning.
19981 (check_call): Same.
19982 (check_bounds_or_overlap): Same.
19983 * gimple.c (gimple_build_call_from_tree): Same.
19984 * gimplify.c (gimplify_return_expr): Same.
19985 (gimplify_cond_expr): Same.
19986 (gimplify_modify_expr_complex_part): Same.
19987 (gimplify_modify_expr): Same.
19988 (gimple_push_cleanup): Same.
19989 (gimplify_expr): Same.
19990 * omp-expand.c (expand_omp_for_generic): Same.
19991 (expand_omp_taskloop_for_outer): Same.
19992 * omp-low.c (lower_rec_input_clauses): Same.
19993 (lower_lastprivate_clauses): Same.
19994 (lower_send_clauses): Same.
19995 (lower_omp_target): Same.
19996 * tree-cfg.c (pass_warn_function_return::execute): Same.
19997 * tree-complex.c (create_one_component_var): Same.
19998 * tree-inline.c (remap_gimple_op_r): Same.
19999 (copy_tree_body_r): Same.
20000 (declare_return_variable): Same.
20001 (expand_call_inline): Same.
20002 * tree-nested.c (lookup_field_for_decl): Same.
20003 * tree-sra.c (create_access_replacement): Same.
20004 (generate_subtree_copies): Same.
20005 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Same.
20006 * tree-ssa-forwprop.c (combine_cond_expr_cond): Same.
20007 * tree-ssa-loop-ch.c (ch_base::copy_headers): Same.
20008 * tree-ssa-loop-im.c (execute_sm): Same.
20009 * tree-ssa-phiopt.c (cond_store_replacement): Same.
20010 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
20011 (handle_builtin_strcpy): Same.
20012 (maybe_diag_stxncpy_trunc): Same.
20013 (handle_builtin_stxncpy_strncat): Same.
20014 (handle_builtin_strcat): Same.
20015 * tree-ssa-uninit.c (get_no_uninit_warning): Same.
20016 (set_no_uninit_warning): Same.
20017 (uninit_undefined_value_p): Same.
20018 (warn_uninit): Same.
20019 (maybe_warn_operand): Same.
20020 * tree-vrp.c (compare_values_warnv): Same.
20021 * vr-values.c (vr_values::extract_range_for_var_from_comparison_expr): Same.
20022 (test_for_singularity): Same.
20023 * gimple.h (warning_suppressed_p): New function.
20024 (suppress_warning): Same.
20025 (copy_no_warning): Same.
20026 (gimple_set_block): Call gimple_set_location.
20027 (gimple_set_location): Call copy_warning.
20029 2021-06-25 Martin Sebor <msebor@redhat.com>
20031 * tree.h (warning_suppressed_at, copy_warning,
20032 warning_suppressed_p, suppress_warning): New functions.
20034 2021-06-25 Martin Sebor <msebor@redhat.com>
20036 * Makefile.in (OBJS-libcommon): Add diagnostic-spec.o.
20037 * gengtype.c (open_base_files): Add diagnostic-spec.h.
20038 * diagnostic-spec.c: New file.
20039 * diagnostic-spec.h: New file.
20040 * tree.h (no_warning, all_warnings, suppress_warning_at): New
20042 * warning-control.cc: New file.
20044 2021-06-25 liuhongt <hongtao.liu@intel.com>
20047 * config/i386/i386.c (x86_order_regs_for_local_alloc):
20050 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20052 PR tree-optimization/101189
20053 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Pass
20054 LHS range of condition to postfold routine.
20055 (fold_using_range::postfold_gcond_edges): Only process the TRUE or
20056 FALSE edge if the LHS range supports it being taken.
20057 * gimple-range-fold.h (postfold_gcond_edges): Add range parameter.
20059 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20061 * value-relation.cc (equiv_oracle::dump): Do not dump NULL blocks.
20062 (relation_oracle::find_relation_block): Check correct bitmap.
20063 (relation_oracle::dump): Do not dump NULL blocks.
20065 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20067 * gimple-range-cache.cc (ranger_cache::propagate_cache): Call
20068 range_on_edge instead of manually calculating.
20070 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
20072 * range-op.cc: Fix comment.
20074 2021-06-24 Uroš Bizjak <ubizjak@gmail.com>
20077 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
20078 Handle V8QI and V4HI modes.
20079 * config/i386/mmx.md (sse4_1_<any_extend:code>v4qiv4hi2):
20081 (sse4_1_<any_extend:code>v4qiv4hi2): Ditto.
20082 (mmxpackmode): New mode attribute.
20083 (vec_pack_trunc_<mmxpackmode:mode>): New expander.
20084 (mmxunpackmode): New mode attribute.
20085 (vec_unpacks_lo_<mmxunpackmode:mode>): New expander.
20086 (vec_unpacks_hi_<mmxunpackmode:mode>): Ditto.
20087 (vec_unpacku_lo_<mmxunpackmode:mode>): Ditto.
20088 (vec_unpacku_hi_<mmxunpackmode:mode>): Ditto.
20089 * config/i386/i386.md (extsuffix): Move from ...
20090 * config/i386/sse.md: ... here.
20092 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
20094 * dwarf2out.c (dwarf2out_assembly_start): Emit .file 0 marker here..
20095 (dwarf2out_finish): ...instead of here.
20097 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
20099 * configure.ac (--gdwarf-5 option): Use objdump instead of readelf.
20100 (working --gdwarf-4/--gdwarf-5 for all sources): Likewise.
20101 (--gdwarf-4 not refusing generated .debug_line): Adjust for Windows.
20102 * configure: Regenerate.
20104 2021-06-24 Richard Biener <rguenther@suse.de>
20106 * config/i386/sse.md (vec_addsubv4df3, vec_addsubv2df3,
20107 vec_addsubv8sf3, vec_addsubv4sf3): Merge into ...
20108 (vec_addsub<mode>3): ... using a new addsub_cst mode attribute.
20110 2021-06-24 Richard Biener <rguenther@suse.de>
20112 * config/i386/sse.md (avx_addsubv4df3): Rename to
20114 (avx_addsubv8sf3): Rename to vec_addsubv8sf3.
20115 (sse3_addsubv2df3): Rename to vec_addsubv2df3.
20116 (sse3_addsubv4sf3): Rename to vec_addsubv4sf3.
20117 * config/i386/i386-builtin.def: Adjust.
20118 * internal-fn.def (VEC_ADDSUB): New internal optab fn.
20119 * optabs.def (vec_addsub_optab): New optab.
20120 * tree-vect-slp-patterns.c (class addsub_pattern): New.
20121 (slp_patterns): Add addsub_pattern.
20122 * tree-vect-slp.c (vect_optimize_slp): Disable propagation
20123 across CFN_VEC_ADDSUB.
20124 * tree-vectorizer.h (vect_pattern::vect_pattern): Make
20126 * doc/md.texi (vec_addsub<mode>3): Document.
20128 2021-06-24 Jakub Jelinek <jakub@redhat.com>
20130 PR middle-end/101170
20131 * df-scan.c (df_ref_record): For paradoxical big-endian SUBREGs
20132 where regno + subreg_regno_offset wraps around use 0 as starting
20135 2021-06-24 Jakub Jelinek <jakub@redhat.com>
20137 PR middle-end/101172
20138 * stor-layout.c (finish_bitfield_representative): If nextf has
20139 error_mark_node type, set repr type to error_mark_node too.
20141 2021-06-24 Ilya Leoshkevich <iii@linux.ibm.com>
20143 * config/s390/s390.c (s390_function_profiler): Ignore labelno
20145 * config/s390/s390.h (NO_PROFILE_COUNTERS): Define.
20147 2021-06-24 Richard Biener <rguenther@suse.de>
20149 * tree-vect-slp.c (vect_optimize_slp): Do not propagate
20150 across operations that have different semantics on different
20153 2021-06-24 Jakub Jelinek <jakub@redhat.com>
20155 * tree.h (OMP_CLAUSE_MAP_IN_REDUCTION): Document meaning for OpenMP.
20156 * gimplify.c (gimplify_scan_omp_clauses): For OpenMP map clauses
20157 with OMP_CLAUSE_MAP_IN_REDUCTION flag partially defer gimplification
20158 of non-decl OMP_CLAUSE_DECL. For OMP_CLAUSE_IN_REDUCTION on
20159 OMP_TARGET user outer_ctx instead of ctx for placeholders and
20160 initializer/combiner gimplification.
20161 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_MAP_IN_REDUCTION
20162 on target constructs.
20163 (lower_rec_input_clauses): Likewise.
20164 (lower_omp_target): Likewise.
20165 * omp-expand.c (expand_omp_target): Temporarily ignore nowait clause
20166 on target if in_reduction is present.
20168 2021-06-24 Kewen Lin <linkw@linux.ibm.com>
20170 * tree-predcom.c (class pcom_worker): New class.
20171 (release_chain): Renamed to...
20172 (pcom_worker::release_chain): ...this.
20173 (release_chains): Renamed to...
20174 (pcom_worker::release_chains): ...this.
20175 (aff_combination_dr_offset): Renamed to...
20176 (pcom_worker::aff_combination_dr_offset): ...this.
20177 (determine_offset): Renamed to...
20178 (pcom_worker::determine_offset): ...this.
20179 (class comp_ptrs): New class.
20180 (split_data_refs_to_components): Renamed to...
20181 (pcom_worker::split_data_refs_to_components): ...this,
20182 and update with class comp_ptrs.
20183 (suitable_component_p): Renamed to...
20184 (pcom_worker::suitable_component_p): ...this.
20185 (filter_suitable_components): Renamed to...
20186 (pcom_worker::filter_suitable_components): ...this.
20187 (valid_initializer_p): Renamed to...
20188 (pcom_worker::valid_initializer_p): ...this.
20189 (find_looparound_phi): Renamed to...
20190 (pcom_worker::find_looparound_phi): ...this.
20191 (add_looparound_copies): Renamed to...
20192 (pcom_worker::add_looparound_copies): ...this.
20193 (determine_roots_comp): Renamed to...
20194 (pcom_worker::determine_roots_comp): ...this.
20195 (determine_roots): Renamed to...
20196 (pcom_worker::determine_roots): ...this.
20197 (single_nonlooparound_use): Renamed to...
20198 (pcom_worker::single_nonlooparound_use): ...this.
20199 (remove_stmt): Renamed to...
20200 (pcom_worker::remove_stmt): ...this.
20201 (execute_pred_commoning_chain): Renamed to...
20202 (pcom_worker::execute_pred_commoning_chain): ...this.
20203 (execute_pred_commoning): Renamed to...
20204 (pcom_worker::execute_pred_commoning): ...this.
20205 (struct epcc_data): New member worker.
20206 (execute_pred_commoning_cbck): Call execute_pred_commoning
20207 with pcom_worker pointer.
20208 (find_use_stmt): Renamed to...
20209 (pcom_worker::find_use_stmt): ...this.
20210 (find_associative_operation_root): Renamed to...
20211 (pcom_worker::find_associative_operation_root): ...this.
20212 (find_common_use_stmt): Renamed to...
20213 (pcom_worker::find_common_use_stmt): ...this.
20214 (combinable_refs_p): Renamed to...
20215 (pcom_worker::combinable_refs_p): ...this.
20216 (reassociate_to_the_same_stmt): Renamed to...
20217 (pcom_worker::reassociate_to_the_same_stmt): ...this.
20218 (stmt_combining_refs): Renamed to...
20219 (pcom_worker::stmt_combining_refs): ...this.
20220 (combine_chains): Renamed to...
20221 (pcom_worker::combine_chains): ...this.
20222 (try_combine_chains): Renamed to...
20223 (pcom_worker::try_combine_chains): ...this.
20224 (prepare_initializers_chain): Renamed to...
20225 (pcom_worker::prepare_initializers_chain): ...this.
20226 (prepare_initializers): Renamed to...
20227 (pcom_worker::prepare_initializers): ...this.
20228 (prepare_finalizers_chain): Renamed to...
20229 (pcom_worker::prepare_finalizers_chain): ...this.
20230 (prepare_finalizers): Renamed to...
20231 (pcom_worker::prepare_finalizers): ...this.
20232 (tree_predictive_commoning_loop): Renamed to...
20233 (pcom_worker::tree_predictive_commoning_loop): ...this, adjust
20234 some calls and remove some cleanup code.
20235 (tree_predictive_commoning): Adjusted to use pcom_worker instance.
20236 (static variable looparound_phis): Remove.
20237 (static variable name_expansions): Remove.
20239 2021-06-24 Richard Biener <rguenther@suse.de>
20241 * tree-vect-slp.c (slpg_vertex): New struct.
20242 (vect_slp_build_vertices): Adjust.
20243 (vect_optimize_slp): Likewise. Maintain an outgoing permute
20244 and a materialized one.
20246 2021-06-24 Richard Biener <rguenther@suse.de>
20248 PR tree-optimization/101105
20249 * tree-vect-data-refs.c (vect_prune_runtime_alias_test_list):
20250 Only ignore steps when they are equal or scalar order is preserved.
20252 2021-06-24 liuhongt <hongtao.liu@intel.com>
20255 * config/i386/i386-expand.c (ix86_expand_vec_interleave):
20256 Adjust comments for ix86_expand_vecop_qihi2.
20257 (ix86_expand_vecmul_qihi): Renamed to ..
20258 (ix86_expand_vecop_qihi2): Adjust function prototype to
20259 support shift operation, add static to definition.
20260 (ix86_expand_vec_shift_qihi_constant): Add static to definition.
20261 (ix86_expand_vecop_qihi): Call ix86_expand_vecop_qihi2 and
20262 ix86_expand_vec_shift_qihi_constant.
20263 * config/i386/i386-protos.h (ix86_expand_vecmul_qihi): Deleted.
20264 (ix86_expand_vec_shift_qihi_constant): Deleted.
20265 * config/i386/sse.md (VI12_256_512_AVX512VL): New mode
20267 (mulv8qi3): Call ix86_expand_vecop_qihi directly, add
20268 condition TARGET_64BIT.
20269 (mul<mode>3): Ditto.
20270 (<insn><mode>3): Ditto.
20271 (vlshr<mode>3): Extend to support avx512 vlshr.
20272 (v<insn><mode>3): New expander for
20274 (v<insn>v8qi3): Ditto.
20275 (vashrv8hi3<mask_name>): Renamed to ..
20276 (vashr<mode>3): And extend to support V16QImode for avx512.
20277 (vashrv16qi3): Deleted.
20278 (vashrv2di3<mask_name>): Extend expander to support avx512
20281 2021-06-23 Dimitar Dimitrov <dimitar@dinux.eu>
20283 * doc/lto.texi (Design Overview): Update that slim objects are
20286 2021-06-23 Aaron Sawdey <acsawdey@linux.ibm.com>
20288 * config/rs6000/rs6000-cpus.def: Take OPTION_MASK_PCREL_OPT out
20289 of OTHER_POWER10_MASKS so it will not be enabled by default.
20291 2021-06-23 Richard Biener <rguenther@suse.de>
20292 Martin Jambor <mjambor@suse.cz>
20294 * tree-inline.c (setup_one_parameter): Set TREE_READONLY of the
20295 param replacement unconditionally. Adjust comment.
20297 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
20299 * Makefile.in (OBJS): Add gimple-range-fold.o
20300 * gimple-range-fold.cc: New.
20301 * gimple-range-fold.h: New.
20302 * gimple-range-gori.cc (gimple_range_calc_op1): Move to here.
20303 (gimple_range_calc_op2): Ditto.
20304 * gimple-range-gori.h: Move prototypes to here.
20305 * gimple-range.cc: Adjust include files.
20306 (fur_source:fur_source): Relocate to gimple-range-fold.cc.
20307 (fur_source::get_operand): Ditto.
20308 (fur_source::get_phi_operand): Ditto.
20309 (fur_source::query_relation): Ditto.
20310 (fur_source::register_relation): Ditto.
20311 (class fur_edge): Ditto.
20312 (fur_edge::fur_edge): Ditto.
20313 (fur_edge::get_operand): Ditto.
20314 (fur_edge::get_phi_operand): Ditto.
20315 (fur_stmt::fur_stmt): Ditto.
20316 (fur_stmt::get_operand): Ditto.
20317 (fur_stmt::get_phi_operand): Ditto.
20318 (fur_stmt::query_relation): Ditto.
20319 (class fur_depend): Relocate to gimple-range-fold.h.
20320 (fur_depend::fur_depend): Relocate to gimple-range-fold.cc.
20321 (fur_depend::register_relation): Ditto.
20322 (fur_depend::register_relation): Ditto.
20323 (class fur_list): Ditto.
20324 (fur_list::fur_list): Ditto.
20325 (fur_list::get_operand): Ditto.
20326 (fur_list::get_phi_operand): Ditto.
20327 (fold_range): Ditto.
20328 (adjust_pointer_diff_expr): Ditto.
20329 (gimple_range_adjustment): Ditto.
20330 (gimple_range_base_of_assignment): Ditto.
20331 (gimple_range_operand1): Ditto.
20332 (gimple_range_operand2): Ditto.
20333 (gimple_range_calc_op1): Relocate to gimple-range-gori.cc.
20334 (gimple_range_calc_op2): Ditto.
20335 (fold_using_range::fold_stmt): Relocate to gimple-range-fold.cc.
20336 (fold_using_range::range_of_range_op): Ditto.
20337 (fold_using_range::range_of_address): Ditto.
20338 (fold_using_range::range_of_phi): Ditto.
20339 (fold_using_range::range_of_call): Ditto.
20340 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
20341 (fold_using_range::range_of_builtin_call): Ditto.
20342 (fold_using_range::range_of_cond_expr): Ditto.
20343 (fold_using_range::range_of_ssa_name_with_loop_info): Ditto.
20344 (fold_using_range::relation_fold_and_or): Ditto.
20345 (fold_using_range::postfold_gcond_edges): Ditto.
20346 * gimple-range.h: Add gimple-range-fold.h to include files. Change
20347 GIMPLE_RANGE_STMT_H to GIMPLE_RANGE_H.
20348 (gimple_range_handler): Relocate to gimple-range-fold.h.
20349 (gimple_range_ssa_p): Ditto.
20350 (range_compatible_p): Ditto.
20351 (class fur_source): Ditto.
20352 (class fur_stmt): Ditto.
20353 (class fold_using_range): Ditto.
20354 (gimple_range_calc_op1): Relocate to gimple-range-gori.h
20355 (gimple_range_calc_op2): Ditto.
20357 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
20359 PR tree-optimization/101148
20360 PR tree-optimization/101014
20361 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
20362 (ranger_cache::~ranger_cache): Adjust.
20363 (ranger_cache::block_range): Check if propagation disallowed.
20364 (ranger_cache::propagate_cache): Disallow propagation if new value
20365 can't be stored properly.
20366 * gimple-range-cache.h (ranger_cache::m_propfail): New member.
20368 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
20370 * gimple-range-cache.cc (class ssa_block_ranges): Adjust prototype.
20371 (sbr_vector::set_bb_range): Return true.
20372 (class sbr_sparse_bitmap): Adjust.
20373 (sbr_sparse_bitmap::set_bb_range): Return value.
20374 (block_range_cache::set_bb_range): Return value.
20375 (ranger_cache::propagate_cache): Use return value to print msg.
20376 * gimple-range-cache.h (class block_range_cache): Adjust.
20378 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
20380 * gimple-range.cc (dump_bb): Use range_on_edge from the cache.
20382 2021-06-23 Jeff Law <jeffreyalaw@gmail.com>
20384 * config/h8300/logical.md (<code><mode>3<ccnz>): Use <cczn>
20385 so this pattern can be used for test/compare removal. Pass
20386 current insn to compute_logical_op_length and output_logical_op.
20387 * config/h8300/h8300.c (compute_logical_op_cc): Remove.
20388 (h8300_and_costs): Add argument to compute_logical_op_length.
20389 (output_logical_op): Add new argument. Use it to determine if the
20390 condition codes are used and adjust the output accordingly.
20391 (compute_logical_op_length): Add new argument and update length
20392 computations when condition codes are used.
20393 * config/h8300/h8300-protos.h (compute_logical_op_length): Update
20395 (output_logical_op): Likewise.
20397 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
20400 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
20401 Handle 64bit modes for TARGET_XOP. Use indirect gen_* functions.
20402 * config/i386/mmx.md (mmx_ppermv64): New insn pattern.
20403 * config/i386/i386.md (unspec): Move UNSPEC_XOP_PERMUTE from ...
20404 * config/i386/sse.md (unspec): ... here.
20406 2021-06-23 Martin Liska <mliska@suse.cz>
20409 * optc-save-gen.awk: Put back arm_fp16_format to
20412 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
20415 * config/i386/i386.md (bsr_rex64): Add zero-flag setting RTX.
20418 (clz<mode>2): Update RTX pattern for additions.
20420 2021-06-23 Jakub Jelinek <jakub@redhat.com>
20422 PR middle-end/101167
20423 * omp-low.c (lower_omp_regimplify_p): Regimplify also PARM_DECLs
20424 and RESULT_DECLs that have DECL_HAS_VALUE_EXPR_P set.
20426 2021-06-22 Sergei Trofimovich <siarheit@google.com>
20428 * doc/rtl.texi: drop unbalanced parenthesis.
20430 2021-06-22 Richard Biener <rguenther@suse.de>
20432 PR middle-end/101156
20433 * gimplify.c (gimplify_expr): Remove premature incorrect
20436 2021-06-22 Jakub Jelinek <jakub@redhat.com>
20438 PR tree-optimization/101159
20439 * tree-vect-patterns.c (vect_recog_popcount_pattern): Fix some
20442 2021-06-22 Jakub Jelinek <jakub@redhat.com>
20444 PR middle-end/101160
20445 * function.c (assign_parms): For decl_result with TYPE_EMPTY_P type
20446 clear crtl->return_rtx instead of keeping it referencing a pseudo.
20448 2021-06-22 Jakub Jelinek <jakub@redhat.com>
20449 Andrew Pinski <apinski@marvell.com>
20451 PR tree-optimization/101162
20452 * fold-const.c (range_check_type): Handle OFFSET_TYPE like pointer
20455 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20457 * range-op.cc (range_relational_tests): New.
20458 (range_op_tests): Call range_relational_tests.
20460 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20462 * range-op.cc (operator_cast::lhs_op1_relation): New.
20463 (operator_identity::lhs_op1_relation): Mew.
20465 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20467 * range-op.cc (operator_minus::op1_op2_relation_effect): New.
20469 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20471 * range-op.cc (operator_plus::lhs_op1_relation): New.
20472 (operator_plus::lhs_op2_relation): New.
20474 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20476 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a
20477 relation_oracle if dominators exist.
20478 (ranger_cache::~ranger_cache): Dispose of oracle.
20479 (ranger_cache::dump_bb): Dump oracle.
20480 * gimple-range.cc (fur_source::fur_source): New.
20481 (fur_source::get_operand): Use mmeber query.
20482 (fur_source::get_phi_operand): Use member_query.
20483 (fur_source::query_relation): New.
20484 (fur_source::register_dependency): Delete.
20485 (fur_source::register_relation): New.
20486 (fur_edge::fur_edge): Adjust.
20487 (fur_edge::get_phi_operand): Fix comment.
20488 (fur_edge::query): Delete.
20489 (fur_stmt::fur_stmt): Adjust.
20490 (fur_stmt::query): Delete.
20491 (fur_depend::fur_depend): Adjust.
20492 (fur_depend::register_relation): New.
20493 (fur_depend::register_relation): New.
20494 (fur_list::fur_list): Adjust.
20495 (fur_list::get_operand): Use member query.
20496 (fold_using_range::range_of_range_op): Process and query relations.
20497 (fold_using_range::range_of_address): Adjust dependency call.
20498 (fold_using_range::range_of_phi): Ditto.
20499 (gimple_ranger::gimple_ranger): New. Use ranger_ache oracle.
20500 (fold_using_range::relation_fold_and_or): New.
20501 (fold_using_range::postfold_gcond_edges): New.
20502 * gimple-range.h (class gimple_ranger): Adjust.
20503 (class fur_source): Adjust members.
20504 (class fur_stmt): Ditto.
20505 (class fold_using_range): Ditto.
20507 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20509 * range-op.cc (range_operator::wi_fold): Apply relation effect.
20510 (range_operator::fold_range): Adjust and apply relation effect.
20511 (*::fold_range): Add relation parameters.
20512 (*::op1_range): Ditto.
20513 (*::op2_range): Ditto.
20514 (range_operator::lhs_op1_relation): New.
20515 (range_operator::lhs_op2_relation): New.
20516 (range_operator::op1_op2_relation): New.
20517 (range_operator::op1_op2_relation_effect): New.
20518 (relop_early_resolve): New.
20519 (operator_equal::op1_op2_relation): New.
20520 (operator_equal::fold_range): Call relop_early_resolve.
20521 (operator_not_equal::op1_op2_relation): New.
20522 (operator_not_equal::fold_range): Call relop_early_resolve.
20523 (operator_lt::op1_op2_relation): New.
20524 (operator_lt::fold_range): Call relop_early_resolve.
20525 (operator_le::op1_op2_relation): New.
20526 (operator_le::fold_range): Call relop_early_resolve.
20527 (operator_gt::op1_op2_relation): New.
20528 (operator_gt::fold_range): Call relop_early_resolve.
20529 (operator_ge::op1_op2_relation): New.
20530 (operator_ge::fold_range): Call relop_early_resolve.
20531 * range-op.h (class range_operator): Adjust parameters and methods.
20533 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20535 * Makefile.in (OBJS): Add value-relation.o.
20536 * gimple-range.h: Adjust include files.
20537 * tree-data-ref.c: Adjust include file order.
20538 * value-query.cc (range_query::get_value_range): Default to no oracle.
20539 (range_query::query_relation): New.
20540 (range_query::query_relation): New.
20541 * value-query.h (class range_query): Adjust.
20542 * value-relation.cc: New.
20543 * value-relation.h: New.
20545 2021-06-22 Richard Biener <rguenther@suse.de>
20547 PR tree-optimization/101151
20548 * tree-ssa-sink.c (statement_sink_location): Expand irreducible
20551 2021-06-22 Jojo R <rjiejie@linux.alibaba.com>
20553 * config/riscv/riscv.c (thead_c906_tune_info): New.
20554 (riscv_tune_info_table): Use new tune.
20556 2021-06-22 Richard Biener <rguenther@suse.de>
20558 PR tree-optimization/101158
20559 * tree-vect-slp.c (vect_build_slp_tree_1): Move same operand
20560 checking after checking for matching operation.
20562 2021-06-22 Richard Biener <rguenther@suse.de>
20564 PR tree-optimization/101159
20565 * tree-vect-patterns.c (vect_recog_popcount_pattern): Add
20566 missing NULL vectype check.
20568 2021-06-22 Richard Biener <rguenther@suse.de>
20570 PR tree-optimization/101154
20571 * tree-vect-slp.c (vect_build_slp_tree_2): Fix out-of-bound access.
20573 2021-06-22 Jakub Jelinek <jakub@redhat.com>
20576 * config/i386/i386-protos.h (ix86_last_zero_store_uid): Declare.
20577 * config/i386/i386-expand.c (ix86_last_zero_store_uid): New variable.
20578 * config/i386/i386.c (ix86_expand_prologue): Clear it.
20579 * config/i386/i386.md (peephole2s for 1/2/4 stores of const0_rtx):
20580 Remove "" from match_operand. Emit new insns using emit_move_insn and
20581 set ix86_last_zero_store_uid to INSN_UID of the last store.
20582 Add peephole2s for 1/2/4 stores of const0_rtx following previous
20585 2021-06-22 Martin Liska <mliska@suse.cz>
20587 * auto-profile.c (AUTO_PROFILE_VERSION): Bump as string format
20590 2021-06-22 Martin Liska <mliska@suse.cz>
20592 * gcov-io.h: Remove padding entries.
20594 2021-06-22 liuhongt <hongtao.liu@intel.com>
20596 PR tree-optimization/97770
20597 * tree-vect-patterns.c (vect_recog_popcount_pattern):
20599 (vect_recog_func vect_vect_recog_func_ptrs): Add new pattern.
20601 2021-06-22 liuhongt <hongtao.liu@intel.com>
20604 * config/i386/i386-builtin.def (BDESC): Adjust builtin name.
20605 * config/i386/sse.md (<avx512>_expand<mode>_mask): Rename to ..
20606 (expand<mode>_mask): this ..
20607 (*expand<mode>_mask): New pre_reload splitter to transform
20608 v{,p}expand* to vmov* when mask is zero, all ones, or has all
20609 ones in it's lower part, otherwise still generate
20612 2021-06-22 liuhongt <hongtao.liu@intel.com>
20615 * config/i386/i386-expand.c
20616 (ix86_expand_special_args_builtin): Keep constm1_operand only
20617 if it satisfies insn's operand predicate.
20619 2021-06-21 Jason Merrill <jason@redhat.com>
20622 * df-scan.c (df_ref_record): Check that regno < endregno.
20623 * function.c (assign_parms, expand_function_end): Do nothing with a
20624 TYPE_EMPTY_P result.
20626 2021-06-21 Richard Biener <rguenther@suse.de>
20628 PR tree-optimization/101120
20629 * tree-vect-data-refs.c (bump_vector_ptr): Fold the
20631 * tree-vect-slp.c (vect_transform_slp_perm_load): Add
20632 DR chain DCE capability.
20633 * tree-vectorizer.h (vect_transform_slp_perm_load): Adjust.
20634 * tree-vect-stmts.c (vectorizable_load): Remove unused
20635 loads in the DR chain for SLP.
20637 2021-06-21 Jakub Jelinek <jakub@redhat.com>
20639 PR inline-asm/100785
20640 * gimplify.c (gimplify_asm_expr): Don't diagnose errors if
20641 output or input operands were already error_mark_node.
20642 * cfgexpand.c (expand_asm_stmt): If errors are emitted,
20643 remove all inputs, outputs and clobbers from the asm and
20644 set template to "".
20646 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
20648 * config/arm/arm_neon.h (vceq_s8): Replace builtin with __a == __b.
20649 (vceq_s16): Likewise.
20650 (vceq_s32): Likewise.
20651 (vceq_u8): Likewise.
20652 (vceq_u16): Likewise.
20653 (vceq_u32): Likewise.
20654 (vceq_p8): Likewise.
20655 (vceqq_s8): Likewise.
20656 (vceqq_s16): Likewise.
20657 (vceqq_s32): Likewise.
20658 (vceqq_u8): Likewise.
20659 (vceqq_u16): Likewise.
20660 (vceqq_u32): Likewise.
20661 (vceqq_p8): Likewise.
20662 (vceq_f32): Gate __a == __b on __FAST_MATH__.
20663 (vceqq_f32): Likewise.
20664 (vceq_f16): Likewise.
20665 (vceqq_f16): Likewise.
20667 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
20670 * config/arm/iterators.md (NEON_VACMP): Remove.
20671 * config/arm/neon.md (neon_vca<cmp_op><mode>): Use GLTE instead of GTGE
20673 (neon_vca<cmp_op><mode>_insn): Likewise.
20674 (neon_vca<cmp_op_unsp><mode>_insn_unspec): Use NEON_VAGLTE instead of
20677 2021-06-21 Richard Biener <rguenther@suse.de>
20679 PR tree-optimization/101121
20680 * tree-vect-slp.c (vect_build_slp_tree_2): To not fail fatally
20681 when we just lack a stmt with the desired op when doing permutation.
20682 (vect_build_slp_tree): When caching a failed SLP build attempt
20683 assert that at least one lane is marked as not matching.
20685 2021-06-21 liuhongt <hongtao.liu@intel.com>
20688 * config/i386/i386.md: (*anddi_1): Disparage slightly the mask
20689 register alternative.
20690 (*and<mode>_1): Ditto.
20692 (*andn<mode>_1): Ditto.
20693 (*<code><mode>_1): Ditto.
20694 (*<code>qi_1): Ditto.
20695 (*one_cmpl<mode>2_1): Ditto.
20696 (*one_cmplsi2_1_zext): Ditto.
20697 (*one_cmplqi2_1): Ditto.
20698 * config/i386/i386.c (x86_order_regs_for_local_alloc): Change
20699 the order of mask registers to be before general registers.
20701 2021-06-21 Roger Sayle <roger@nextmovesoftware.com>
20704 * config/i386/i386.md: New define_peephole2s to shrink writing
20705 1, 2 or 4 consecutive zeros to memory when optimizing for size.
20707 2021-06-18 Jeff Law <jeffreyalaw@gmail.com>
20709 * config/h8300/h8300.c (h8300_select_cc_mode): Handle SYMBOL_REF.
20710 * config/h8300/logical.md (<code><mode>3 logcial expander): Generate
20711 more efficient code when the source can be trivially simplified.
20713 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
20715 * gimple-range-cache.cc (ranger_cache::range_of_def): Calculate
20716 a range if global is not available.
20717 (ranger_cache::entry_range): Fallback to range_of_def.
20718 * gimple-range-cache.h (range_of_def): Adjust prototype.
20720 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
20722 PR tree-optimization/101014
20723 * gimple-range-cache.cc (ranger_cache::ranger_cache): Remove poor
20725 (ranger_cache::~ranger_cache): Ditto.
20726 (ranger_cache::enable_new_values): Delete.
20727 (ranger_cache::push_poor_value): Delete.
20728 (ranger_cache::range_of_def): Remove poor value processing.
20729 (ranger_cache::entry_range): Ditto.
20730 (ranger_cache::fill_block_cache): Ditto.
20731 * gimple-range-cache.h (class ranger_cache): Remove poor value members.
20732 * gimple-range.cc (gimple_ranger::range_of_expr): Remove call.
20733 * gimple-range.h (class gimple_ranger): Adjust.
20735 2021-06-18 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
20738 * common/config/arm/arm-common.c (arm_canon_arch_option_1): New function
20739 derived from arm_canon_arch.
20740 (arm_canon_arch_option): Call it.
20741 (arm_canon_arch_multilib_option): New function.
20742 * config/arm/arm-cpus.in (IGNORE_FOR_MULTILIB): New fgroup.
20743 * config/arm/arm.h (arm_canon_arch_multilib_option): New prototype.
20744 (CANON_ARCH_MULTILIB_SPEC_FUNCTION): New macro.
20745 (MULTILIB_ARCH_CANONICAL_SPECS): New macro.
20746 (DRIVER_SELF_SPECS): Add MULTILIB_ARCH_CANONICAL_SPECS.
20747 * config/arm/arm.opt (mlibarch): New option.
20748 * config/arm/t-rmprofile (MULTILIB_MATCHES): For armv8*-m, replace use
20749 of march on RHS with mlibarch.
20751 2021-06-18 Marcel Vollweiler <marcel@codesourcery.com>
20753 * config.in: Regenerate.
20754 * config/gcn/gcn.c (print_operand_address): Fix for global_load assembler
20756 * configure: Regenerate.
20757 * configure.ac: Fix for global_load assembler functions.
20759 2021-06-18 Richard Biener <rguenther@suse.de>
20761 PR tree-optimization/101112
20762 * tree-vect-slp.c (vect_slp_linearize_chain): Fix condition
20763 to lookup a pattern stmt def.
20765 2021-06-18 Jakub Jelinek <jakub@redhat.com>
20767 PR middle-end/101062
20768 * stor-layout.c (finish_bitfield_layout): Don't add bitfield
20769 representatives in QUAL_UNION_TYPE.
20771 2021-06-18 Andrew Pinski <apinski@marvell.com>
20773 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
20774 Add counting of how many times it is done.
20775 (factor_out_conditional_conversion): Likewise.
20776 (match_simplify_replacement): Likewise.
20777 (value_replacement): Likewise.
20778 (spaceship_replacement): Likewise.
20779 (cond_store_replacement): Likewise.
20780 (cond_if_else_store_replacement_1): Likewise.
20781 (hoist_adjacent_loads): Likewise.
20783 2021-06-18 Andrew Pinski <apinski@marvell.com>
20785 * tree-cfg.c (verify_gimple_assign_unary): Reject point and offset
20786 types on NEGATE_EXPR, ABS_EXPR, BIT_NOT_EXPR, PAREN_EXPR and CNONJ_EXPR.
20787 (verify_gimple_assign_binary): Reject point and offset types on
20788 MULT_EXPR, MULT_HIGHPART_EXPR, TRUNC_DIV_EXPR, CEIL_DIV_EXPR,
20789 FLOOR_DIV_EXPR, ROUND_DIV_EXPR, TRUNC_MOD_EXPR, CEIL_MOD_EXPR,
20790 FLOOR_MOD_EXPR, ROUND_MOD_EXPR, RDIV_EXPR, and EXACT_DIV_EXPR.
20792 2021-06-18 Michael Meissner <meissner@linux.ibm.com>
20794 * config/rs6000/rs6000.c (rs6000_emit_minmax): Add support for ISA
20795 3.1 IEEE 128-bit floating point xsmaxcqp/xsmincqp instructions.
20796 * config/rs6000/rs6000.md (s<minmax><mode>3, IEEE128 iterator):
20799 2021-06-17 Aaron Sawdey <acsawdey@linux.ibm.com>
20801 * config/rs6000/genfusion.pl (gen_logical_addsubf): Add
20802 earlyclobber to alts 0/1.
20803 (gen_addadd): Add earlyclobber to alts 0/1.
20804 * config/rs6000/fusion.md: Regenerate file.
20806 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20808 * cfgloopanal.c (get_loop_hot_path): Make path an auto_vec.
20810 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
20812 * gimple-range-cache.cc: Comment cleanups.
20813 * gimple-range-gori.cc: Comment cleanups.
20814 * gimple-range.cc: Comment/spacing cleanups
20815 * value-range.h: Comment cleanups.
20817 2021-06-17 H.J. Lu <hjl.tools@gmail.com>
20820 * calls.c (expand_call): Replace PUSH_ARGS with
20821 targetm.calls.push_argument (0).
20822 (emit_library_call_value_1): Likewise.
20823 * defaults.h (PUSH_ARGS): Removed.
20824 (PUSH_ARGS_REVERSED): Replace PUSH_ARGS with
20825 targetm.calls.push_argument (0).
20826 * expr.c (block_move_libcall_safe_for_call_parm): Likewise.
20827 (emit_push_insn): Pass the number bytes to push to
20828 targetm.calls.push_argument and pass 0 if ARGS_ADDR is 0.
20829 * hooks.c (hook_bool_uint_true): New.
20830 * hooks.h (hook_bool_uint_true): Likewise.
20831 * rtlanal.c (nonzero_bits1): Replace PUSH_ARGS with
20832 targetm.calls.push_argument (0).
20833 * target.def (push_argument): Add a targetm.calls hook.
20834 * targhooks.c (default_push_argument): New.
20835 * targhooks.h (default_push_argument): Likewise.
20836 * config/bpf/bpf.h (PUSH_ARGS): Removed.
20837 * config/cr16/cr16.c (TARGET_PUSH_ARGUMENT): New.
20838 * config/cr16/cr16.h (PUSH_ARGS): Removed.
20839 * config/i386/i386.c (ix86_push_argument): New.
20840 (TARGET_PUSH_ARGUMENT): Likewise.
20841 * config/i386/i386.h (PUSH_ARGS): Removed.
20842 * config/m32c/m32c.c (TARGET_PUSH_ARGUMENT): New.
20843 * config/m32c/m32c.h (PUSH_ARGS): Removed.
20844 * config/nios2/nios2.h (PUSH_ARGS): Likewise.
20845 * config/pru/pru.h (PUSH_ARGS): Likewise.
20846 * doc/tm.texi.in: Remove PUSH_ARGS documentation. Add
20847 TARGET_PUSH_ARGUMENT hook.
20848 * doc/tm.texi: Regenerated.
20850 2021-06-17 Uroš Bizjak <ubizjak@gmail.com>
20853 * config/i386/i386-expand.c (expand_vector_set_var):
20854 Handle V2FS mode remapping. Pass TARGET_MMX_WITH_SSE to
20855 ix86_expand_vector_init_duplicate.
20856 (ix86_expand_vector_init_duplicate): Emit insv_1 for
20857 QImode for !TARGET_PARTIAL_REG_STALL.
20858 * config/i386/predicates.md (vec_setm_mmx_operand): New predicate.
20859 * config/i386/mmx.md (vec_setv2sf): Use vec_setm_mmx_operand
20860 as operand 2 predicate. Call ix86_expand_vector_set_var
20861 for non-constant index operand.
20862 (vec_setv2si): Ditto.
20863 (vec_setv4hi): Ditto.
20864 (vec_setv8qi): ditto.
20866 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
20868 PR tree-optimization/100790
20869 * gimple-range.cc (range_of_builtin_call): Cleanup clz and ctz
20872 2021-06-17 Martin Liska <mliska@suse.cz>
20874 * doc/invoke.texi: Use consistently -O1 instead of -O.
20876 2021-06-17 Martin Liska <mliska@suse.cz>
20878 * gcov-io.h: Update documentation entry about string format.
20880 2021-06-17 Marius Hillenbrand <mhillen@linux.ibm.com>
20883 * config/s390/vecintrin.h (vec_doublee): Fix to use
20884 __builtin_s390_vflls.
20885 (vec_floate): Fix to use __builtin_s390_vflrd.
20887 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20889 * dominance.c (get_dominated_to_depth): Return auto_vec<basic_block>.
20890 * dominance.h (get_dominated_to_depth): Likewise.
20891 (get_all_dominated_blocks): Likewise.
20892 * cfgcleanup.c (delete_unreachable_blocks): Adjust.
20893 * gcse.c (hoist_code): Likewise.
20894 * tree-cfg.c (remove_edge_and_dominated_blocks): Likewise.
20895 * tree-parloops.c (oacc_entry_exit_ok): Likewise.
20896 * tree-ssa-dce.c (eliminate_unnecessary_stmts): Likewise.
20897 * tree-ssa-phiprop.c (pass_phiprop::execute): Likewise.
20899 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20901 * dominance.c (get_dominated_by_region): Return auto_vec<basic_block>.
20902 * dominance.h (get_dominated_by_region): Likewise.
20903 * tree-cfg.c (gimple_duplicate_sese_region): Adjust.
20904 (gimple_duplicate_sese_tail): Likewise.
20905 (move_sese_region_to_fn): Likewise.
20907 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20909 * dominance.c (get_dominated_by): Return auto_vec<basic_block>.
20910 * dominance.h (get_dominated_by): Likewise.
20911 * auto-profile.c (afdo_find_equiv_class): Adjust.
20912 * cfgloopmanip.c (duplicate_loop_to_header_edge): Likewise.
20913 * loop-unroll.c (unroll_loop_runtime_iterations): Likewise.
20914 * tree-cfg.c (test_linear_chain): Likewise.
20915 (test_diamond): Likewise.
20917 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20919 * cfgloop.h (get_loop_hot_path): Return auto_vec<basic_block>.
20920 * cfgloopanal.c (get_loop_hot_path): Likewise.
20921 * tree-ssa-loop-ivcanon.c (tree_estimate_loop_size): Likewise.
20923 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20925 * cgraph.c (cgraph_node::collect_callers): Return
20926 auto_vec<cgraph_edge *>.
20927 * cgraph.h (cgraph_node::collect_callers): Likewise.
20928 * ipa-cp.c (create_specialized_node): Adjust.
20929 (decide_about_value): Likewise.
20930 (decide_whether_version_node): Likewise.
20931 * ipa-sra.c (process_isra_node_results): Likewise.
20933 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20935 * vec.h (vl_ptr>::using_auto_storage): Handle null m_vec.
20936 (auto_vec<T, 0>::auto_vec): Define move constructor, and delete copy
20938 (auto_vec<T, 0>::operator=): Define move assignment and delete copy
20941 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
20943 * gimple-range.cc (debug_seed_ranger): New.
20944 (dump_ranger): New.
20945 (debug_ranger): New.
20947 2021-06-17 Richard Biener <rguenther@suse.de>
20949 PR tree-optimization/54400
20950 * tree-vectorizer.h (enum slp_instance_kind): Add
20951 slp_inst_kind_bb_reduc.
20952 (reduction_fn_for_scalar_code): Declare.
20953 * tree-vect-data-refs.c (vect_slp_analyze_instance_dependence):
20954 Check SLP_INSTANCE_KIND instead of looking at the
20956 (vect_slp_analyze_instance_alignment): Likewise.
20957 * tree-vect-loop.c (reduction_fn_for_scalar_code): Export.
20958 * tree-vect-slp.c (vect_slp_linearize_chain): Split out
20959 chain linearization from vect_build_slp_tree_2 and generalize
20960 for the use of BB reduction vectorization.
20961 (vect_build_slp_tree_2): Adjust accordingly.
20962 (vect_optimize_slp): Elide permutes at the root of BB reduction
20964 (vectorizable_bb_reduc_epilogue): New function.
20965 (vect_slp_prune_covered_roots): Likewise.
20966 (vect_slp_analyze_operations): Use them.
20967 (vect_slp_check_for_constructors): Recognize associatable
20968 chains for BB reduction vectorization.
20969 (vectorize_slp_instance_root_stmt): Generate code for the
20970 BB reduction epilogue.
20972 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
20974 * gimple-range-gori.cc (gori_compute::has_edge_range_p): Check with
20976 (gori_compute::may_recompute_p): New.
20977 (gori_compute::outgoing_edge_range_p): Perform recomputations.
20978 * gimple-range-gori.h (class gori_compute): Add prototype.
20980 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
20982 * gimple-range-cache.cc (ranger_cache::range_on_edge): Always return
20983 true when a range can be calculated.
20984 * gimple-range.cc (gimple_ranger::dump_bb): Check has_edge_range_p.
20986 2021-06-16 Martin Sebor <msebor@redhat.com>
20988 * doc/invoke.texi (-Wmismatched-dealloc, -Wmismatched-new-delete):
20989 Correct documented defaults.
20991 2021-06-16 Andrew MacLeod <amacleod@redhat.com>
20993 * gimple-range-cache.cc (ranger_cache::ranger_cache): Initialize
20994 m_new_value_p directly.
20996 2021-06-16 Uroš Bizjak <ubizjak@gmail.com>
20999 * config/i386/i386-expand.c (expand_vec_perm_2perm_pblendv):
21000 Handle 64bit modes for TARGET_SSE4_1.
21001 (expand_vec_perm_pshufb2): Handle 64bit modes for TARGET_SSSE3.
21002 (expand_vec_perm_even_odd_pack): Handle V4HI mode.
21003 (expand_vec_perm_even_odd_1) <case E_V4HImode>: Expand via
21004 expand_vec_perm_pshufb2 for TARGET_SSSE3 and via
21005 expand_vec_perm_even_odd_pack for TARGET_SSE4_1.
21006 * config/i386/mmx.md (mmx_packusdw): New insn pattern.
21008 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21010 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn<mode>):
21011 Change to an expander that emits the correct instruction
21012 depending on endianness.
21013 (aarch64_<sur><addsub>hn<mode>_insn_le): Define.
21014 (aarch64_<sur><addsub>hn<mode>_insn_be): Define.
21016 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21018 * config/aarch64/aarch64-simd-builtins.def: Split generator
21019 for aarch64_<su>qmovn builtins into scalar and vector
21021 * config/aarch64/aarch64-simd.md (aarch64_<su>qmovn<mode>_insn_le):
21023 (aarch64_<su>qmovn<mode>_insn_be): Define.
21024 (aarch64_<su>qmovn<mode>): Split into scalar and vector
21025 variants. Change vector variant to an expander that emits the
21026 correct instruction depending on endianness.
21028 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21030 * config/aarch64/aarch64-simd-builtins.def: Split generator
21031 for aarch64_sqmovun builtins into scalar and vector variants.
21032 * config/aarch64/aarch64-simd.md (aarch64_sqmovun<mode>):
21033 Split into scalar and vector variants. Change vector variant
21034 to an expander that emits the correct instruction depending
21036 (aarch64_sqmovun<mode>_insn_le): Define.
21037 (aarch64_sqmovun<mode>_insn_be): Define.
21039 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
21041 * config/aarch64/aarch64-simd.md (aarch64_xtn<mode>_insn_le):
21042 Define - modeling zero-high-half semantics.
21043 (aarch64_xtn<mode>): Change to an expander that emits the
21044 appropriate instruction depending on endianness.
21045 (aarch64_xtn<mode>_insn_be): Define - modeling zero-high-half
21047 (aarch64_xtn2<mode>_le): Rename to...
21048 (aarch64_xtn2<mode>_insn_le): This.
21049 (aarch64_xtn2<mode>_be): Rename to...
21050 (aarch64_xtn2<mode>_insn_be): This.
21051 (vec_pack_trunc_<mode>): Emit truncation instruction instead
21053 * config/aarch64/iterators.md (Vnarrowd): Add Vnarrowd mode
21054 attribute iterator.
21056 2021-06-16 Martin Jambor <mjambor@suse.cz>
21058 PR tree-optimization/100453
21059 * tree-sra.c (create_access): Disqualify any const candidates
21060 which are written to.
21061 (sra_modify_expr): Do not store sub-replacements back to a const base.
21062 (handle_unscalarized_data_in_subtree): Likewise.
21063 (sra_modify_assign): Likewise. Earlier, use TREE_READONLy test
21064 instead of constant_decl_p.
21066 2021-06-16 Jakub Jelinek <jakub@redhat.com>
21068 PR middle-end/101062
21069 * stor-layout.c (finish_bitfield_representative): For fields in unions
21070 assume nextf is always NULL.
21071 (finish_bitfield_layout): Compute bit field representatives also in
21072 unions, but handle it as if each bitfield was the only field in the
21075 2021-06-16 Richard Biener <rguenther@suse.de>
21077 PR tree-optimization/101088
21078 * tree-ssa-loop-im.c (sm_seq_valid_bb): Only look for
21079 supported refs on edges. Do not assert same ref but
21080 different kind stores are unsuported but mark them so.
21081 (hoist_memory_references): Only look for supported refs
21084 2021-06-16 Roger Sayle <roger@nextmovesoftware.com>
21086 PR rtl-optimization/46235
21087 * config/i386/i386.md: New define_split for bt followed by cmov.
21088 (*bt<mode>_setcqi): New define_insn_and_split for bt followed by setc.
21089 (*bt<mode>_setncqi): New define_insn_and_split for bt then setnc.
21090 (*bt<mode>_setnc<mode>): New define_insn_and_split for bt followed
21091 by setnc with zero extension.
21093 2021-06-16 Richard Biener <rguenther@suse.de>
21095 PR tree-optimization/101083
21096 * tree-vect-slp.c (vect_slp_build_two_operator_nodes): Get
21097 vectype as argument.
21098 (vect_build_slp_tree_2): Adjust.
21100 2021-06-15 Martin Sebor <msebor@redhat.com>
21102 PR middle-end/100876
21103 * builtins.c: (gimple_call_return_array): Account for size_t
21104 mangling as either unsigned int or unsigned long
21106 2021-06-15 Jeff Law <jeffreyalaw@gmail.com>
21108 * compare-elim.c (try_eliminate_compare): Run DCE to clean things
21109 up before eliminating comparisons.
21111 2021-06-15 Aldy Hernandez <aldyh@redhat.com>
21113 * range-op.cc (operator_bitwise_or::wi_fold): Make sure
21114 nonzero|X is nonzero.
21115 (range_op_bitwise_and_tests): Add tests for above.
21117 2021-06-15 Carl Love <cel@us.ibm.com>
21120 * config/rs6000/rs6000-builtin.def (VCMPEQUT): Fix the ICODE for the
21122 (VRLQ, VSLQ, VSRQ, VSRAQ): Remove unused BU_P10_OVERLOAD_2
21125 2021-06-15 Tobias Burnus <tobias@codesourcery.com>
21128 * gimplify.c (enum gimplify_defaultmap_kind): Add GDMK_SCALAR_TARGET.
21129 (struct gimplify_omp_ctx): Extend defaultmap array by one.
21130 (new_omp_context): Init defaultmap[GDMK_SCALAR_TARGET].
21131 (omp_notice_variable): Update type classification for Fortran.
21132 (gimplify_scan_omp_clauses): Update calls for new argument; handle
21133 GDMK_SCALAR_TARGET; for Fortran, GDMK_POINTER avoid GOVD_MAP_0LEN_ARRAY.
21134 * langhooks-def.h (lhd_omp_scalar_p): Add 'ptr_ok' argument.
21135 * langhooks.c (lhd_omp_scalar_p): Likewise.
21136 (LANG_HOOKS_OMP_ALLOCATABLE_P, LANG_HOOKS_OMP_SCALAR_TARGET_P): New.
21137 (LANG_HOOKS_DECLS): Add them.
21138 * langhooks.h (struct lang_hooks_for_decls): Add new hooks, update
21139 omp_scalar_p pointer type to include the new bool argument.
21141 2021-06-15 David Malcolm <dmalcolm@redhat.com>
21143 * doc/analyzer.texi
21144 (Special Functions for Debugging the Analyzer): Add
21145 __analyzer_dump_capacity.
21147 2021-06-15 Jakub Jelinek <jakub@redhat.com>
21150 * expr.c (expand_expr_real_2) <case VEC_PACK_FIX_TRUNC_EXPR,
21151 case VEC_PACK_TRUNC_EXPR>: Clear subtarget when changing mode.
21153 2021-06-15 Richard Biener <rguenther@suse.de>
21155 * cfgloopanal.c (mark_irreducible_loops): Use a dominance
21156 check to identify loop latches.
21157 * cfgloop.c (verify_loop_structure): Likewise.
21158 * loop-init.c (apply_loop_flags): Allow marked irreducible
21159 regions even with multiple latches.
21160 * predict.c (rebuild_frequencies): Simplify.
21162 2021-06-15 Richard Biener <rguenther@suse.de>
21164 * tree-ssa-threadupdate.c
21165 (jump_thread_path_registry::mark_threaded_blocks): Assert we
21166 have marked irreducible regions.
21168 2021-06-14 Martin Sebor <msebor@redhat.com>
21171 * builtins.c (gimple_call_return_array): Check for attribute fn spec.
21172 Handle calls to placement new.
21173 (ndecl_dealloc_argno): Avoid placement delete.
21175 2021-06-14 Peter Bergner <bergner@linux.ibm.com>
21178 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Use
21179 create_tmp_reg_or_ssa_name().
21181 2021-06-14 Andrew MacLeod <amacleod@redhat.com>
21183 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
21184 (ranger_cache::enable_new_values): Set to specified value and
21185 return the old value.
21186 (ranger_cache::disable_new_values): Delete.
21187 (ranger_cache::fill_block_cache): Disable non 1st order derived
21189 * gimple-range-cache.h (ranger_cache): Adjust prototypes.
21190 * gimple-range.cc (gimple_ranger::range_of_expr): Adjust.
21192 2021-06-14 Uroš Bizjak <ubizjak@gmail.com>
21195 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const):
21196 Return true early when testing with V2HImode.
21197 * config/i386/mmx.md (*punpckwd): Split to sse2_pshuflw_1.
21199 2021-06-14 Christophe Lyon <christophe.lyon@linaro.org>
21201 * config/arm/mve.md (mve_vec_unpack<US>_lo_<mode>): New pattern.
21202 (mve_vec_unpack<US>_hi_<mode>): New pattern.
21203 (@mve_vec_pack_trunc_lo_<mode>): New pattern.
21204 (mve_vmovntq_<supf><mode>): Prefix with '@'.
21205 * config/arm/neon.md (vec_unpack<US>_hi_<mode>): Move to
21207 (vec_unpack<US>_lo_<mode>): Likewise.
21208 (vec_pack_trunc_<mode>): Rename to
21209 neon_quad_vec_pack_trunc_<mode>.
21210 * config/arm/vec-common.md (vec_unpack<US>_hi_<mode>): New
21212 (vec_unpack<US>_lo_<mode>): New.
21213 (vec_pack_trunc_<mode>): New.
21215 2021-06-14 Richard Biener <rguenther@suse.de>
21217 PR tree-optimization/100934
21218 * tree-ssa-dom.c (pass_dominator::execute): Properly
21219 mark irreducible regions.
21221 2021-06-14 Martin Liska <mliska@suse.cz>
21223 * doc/invoke.texi: Put r{...} on the same line as @item.
21225 2021-06-14 Martin Liska <mliska@suse.cz>
21227 * doc/invoke.texi: Add missing newline.
21229 2021-06-14 Martin Liska <mliska@suse.cz>
21231 * doc/invoke.texi: Remove '+' charasters.
21233 2021-06-14 Claudiu Zissulescu <claziss@synopsys.com>
21235 * config.gcc (arc): Add support for with_cpu option.
21236 * config/arc/arc.h (OPTION_DEFAULT_SPECS): Add fpu.
21238 2021-06-14 Richard Biener <rguenther@suse.de>
21240 PR tree-optimization/101031
21241 * tree-ssa-strlen.c (maybe_invalidate): Increment max_size
21242 instead of size when accounting for a possibly string
21245 2021-06-14 Martin Liska <mliska@suse.cz>
21247 * gimple-ssa-evrp.c (pointer_equiv_analyzer::~pointer_equiv_analyzer): Use delete[].
21249 2021-06-14 Aldy Hernandez <aldyh@redhat.com>
21251 * value-query.cc (gimple_range_global): Call get_range_global
21252 if called after inlining.
21254 2021-06-13 Uroš Bizjak <ubizjak@gmail.com>
21257 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
21258 Emit constant permutation insn directly from here.
21260 2021-06-13 Trevor Saunders <tbsaunde@tbsaunde.org>
21262 * attribs.c (find_attribute_namespace): Iterate over vec<> with
21264 * auto-profile.c (afdo_find_equiv_class): Likewise.
21265 * gcc.c (do_specs_vec): Likewise.
21266 (do_spec_1): Likewise.
21267 (driver::set_up_specs): Likewise.
21268 * gimple-loop-jam.c (any_access_function_variant_p): Likewise.
21269 * gimple-ssa-store-merging.c (compatible_load_p): Likewise.
21270 (imm_store_chain_info::try_coalesce_bswap): Likewise.
21271 (imm_store_chain_info::coalesce_immediate_stores): Likewise.
21272 (get_location_for_stmts): Likewise.
21273 * graphite-poly.c (print_iteration_domains): Likewise.
21274 (free_poly_bb): Likewise.
21275 (remove_gbbs_in_scop): Likewise.
21276 (free_scop): Likewise.
21277 (dump_gbb_cases): Likewise.
21278 (dump_gbb_conditions): Likewise.
21279 (print_pdrs): Likewise.
21280 (print_scop): Likewise.
21281 * ifcvt.c (cond_move_process_if_block): Likewise.
21282 * lower-subreg.c (decompose_multiword_subregs): Likewise.
21283 * regcprop.c (pass_cprop_hardreg::execute): Likewise.
21284 * sanopt.c (sanitize_rewrite_addressable_params): Likewise.
21285 * sel-sched-dump.c (dump_insn_vector): Likewise.
21286 * store-motion.c (store_ops_ok): Likewise.
21287 (store_killed_in_insn): Likewise.
21288 * timevar.c (timer::named_items::print): Likewise.
21289 * tree-cfgcleanup.c (cleanup_control_flow_pre): Likewise.
21290 (cleanup_tree_cfg_noloop): Likewise.
21291 * tree-data-ref.c (dump_data_references): Likewise.
21292 (print_dir_vectors): Likewise.
21293 (print_dist_vectors): Likewise.
21294 (dump_data_dependence_relations): Likewise.
21295 (dump_dist_dir_vectors): Likewise.
21296 (dump_ddrs): Likewise.
21297 (create_runtime_alias_checks): Likewise.
21298 (free_subscripts): Likewise.
21299 (save_dist_v): Likewise.
21300 (save_dir_v): Likewise.
21301 (invariant_access_functions): Likewise.
21302 (same_access_functions): Likewise.
21303 (access_functions_are_affine_or_constant_p): Likewise.
21304 (find_data_references_in_stmt): Likewise.
21305 (graphite_find_data_references_in_stmt): Likewise.
21306 (free_dependence_relations): Likewise.
21307 (free_data_refs): Likewise.
21308 * tree-inline.c (copy_debug_stmts): Likewise.
21309 * tree-into-ssa.c (dump_currdefs): Likewise.
21310 (rewrite_update_phi_arguments): Likewise.
21311 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
21312 * tree-vect-data-refs.c (vect_analyze_possibly_independent_ddr):
21314 (vect_slp_analyze_node_dependences): Likewise.
21315 (vect_slp_analyze_instance_dependence): Likewise.
21316 (vect_record_base_alignments): Likewise.
21317 (vect_get_peeling_costs_all_drs): Likewise.
21318 (vect_peeling_supportable): Likewise.
21319 * tree-vectorizer.c (vec_info::~vec_info): Likewise.
21320 (vec_info::free_stmt_vec_infos): Likewise.
21322 2021-06-13 Jeff Law <jeffreyalaw@gmail.com>
21324 * config/h8300/logical.md (<code>qi3_1<cczn>): New pattern.
21325 (andqi3_1<cczn>): Removed.
21326 (<ors>qi3_1): Do not split for IOR/XOR a single bit.
21327 (H8/SX bit logicals): Split out from other patterns.
21328 * config/h8300/multiply.md (mulqihi3_const<cczn>): Renamed from
21329 mulqihi3_const_clobber_flags.
21330 (mulqihi3<cczn>, mulhisi3_const<cczn>, mulhisi3<cczn>): Similarly
21332 2021-06-13 H.J. Lu <hjl.tools@gmail.com>
21335 * config/i386/i386.c (ix86_expand_prologue): Set red_zone_used
21336 to true if red zone is used.
21337 (ix86_output_indirect_jmp): Replace ix86_red_zone_size with
21338 ix86_red_zone_used.
21339 * config/i386/i386.h (machine_function): Add red_zone_used.
21340 (ix86_red_zone_size): Removed.
21341 (ix86_red_zone_used): New.
21342 * config/i386/i386.md (peephole2 patterns): Replace
21343 ix86_red_zone_size with ix86_red_zone_used.
21345 2021-06-12 Jason Merrill <jason@redhat.com>
21347 * doc/extend.texi (unused variable attribute): Applies to
21348 structure fields as well.
21350 2021-06-12 Eugene Rozenfeld <erozen@microsoft.com>
21352 * auto-profile.c (read_profile): fix a typo in an error string
21354 2021-06-11 Thomas Schwinge <thomas@codesourcery.com>
21356 * tree-pretty-print.h (dump_omp_clauses): Add 'bool = true'
21358 * tree-pretty-print.c (dump_omp_clauses): Update.
21359 (dump_generic_node) <OMP_CLAUSE>: Use it.
21361 2021-06-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
21364 * config/arm/arm_mve.h (__arm_vld1q): Change __ARM_mve_coerce(p0,
21365 int8_t const *) to __ARM_mve_coerce1(p0, int8_t *) in the argument for
21366 the polymorphic variants matching code.
21367 (__arm_vld1q_z): Likewise.
21368 (__arm_vld2q): Likewise.
21369 (__arm_vld4q): Likewise.
21370 (__arm_vldrbq_gather_offset): Likewise.
21371 (__arm_vldrbq_gather_offset_z): Likewise.
21373 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
21375 PR tree-optimization/96392
21376 * fold-const.h (tree_expr_maybe_real_minus_zero_p): Fix prototype.
21378 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
21380 PR tree-optimization/96392
21381 * fold-const.c (fold_real_zero_addition_p): Take both arguments
21382 of the addition or subtraction, not just the zero. Use this
21383 other argument in tests for signaling NaNs and signed zeros.
21384 (tree_expr_maybe_real_minus_zero_p): New predicate.
21385 * fold-const.h (fold_real_zero_addition_p): Update prototype.
21386 (tree_expr_maybe_real_minus_zero_p): New function prototype.
21387 * match.pd: Update calls to fold_real_zero_addition_p.
21388 Replace HONOR_NANS with tree_expr_maybe_nan_p.
21389 Replace HONOR_SIGNED_ZEROS with tree_expr_maybe_real_minus_zero_p.
21390 Replace HONOR_SNANS with tree_expr_maybe_signaling_nan_p.
21391 * tree-ssa-reassoc.c (eliminate_using_constants): Update
21392 call to fold_real_zero_addition_p.
21394 2021-06-11 Richard Biener <rguenther@suse.de>
21396 PR tree-optimization/101025
21397 * tree-ssa-loop-im.c (sm_seq_valid_bb): Make sure to process
21398 all refs that require dependence checking.
21400 2021-06-11 Richard Biener <rguenther@suse.de>
21402 PR tree-optimization/101028
21403 * tree-vect-slp.c (vect_build_slp_tree_2): When SLP
21404 reassoc discovery fails fatally, mark appropriate lanes
21407 2021-06-11 Richard Biener <rguenther@suse.de>
21409 PR tree-optimization/101026
21410 * tree-vect-slp.c (vect_build_slp_tree_2): Make sure we
21411 have a representative for the associated chain nodes.
21413 2021-06-11 Jakub Jelinek <jakub@redhat.com>
21415 PR rtl-optimization/101008
21416 * simplify-rtx.c (relational_result): New function.
21417 (simplify_logical_relational_operation,
21418 simplify_relational_operation): Use it.
21420 2021-06-11 Jakub Jelinek <jakub@redhat.com>
21423 * config/i386/sse.md (*vec_concat<mode>_0_1): Require TARGET_SSE2.
21425 2021-06-11 Uroš Bizjak <ubizjak@gmail.com>
21428 * config/i386/i386-expand.c (expand_vec_perm_pshufb): Return
21429 false if the permutation can be implemented with constant
21430 permutation instruction in wider mode.
21431 (canonicalize_vector_int_perm): Move above expand_vec_perm_pshufb.
21432 Handle V8QImode and V4HImode.
21434 2021-06-11 Martin Liska <mliska@suse.cz>
21436 PR gcov-profile/100788
21437 * common.opt: Add new option.
21438 * coverage.c (coverage_begin_function): Emit warning instead on
21439 the internal compiler error.
21440 * doc/invoke.texi: Document the option.
21441 * toplev.c (process_options): Enable it by default.
21443 2021-06-11 Richard Biener <rguenther@suse.de>
21445 PR middle-end/101009
21446 * tree-data-ref.c (build_classic_dist_vector_1): Make sure
21447 to set *init_b to true when we encounter a constant equal
21449 (compute_affine_dependence): Also dump the actual DR_REF.
21451 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
21453 PR tree-optimization/100984
21454 * gimple-ssa-evrp.c (ssa_equiv_stack): Use auto_vec for
21455 replacements table.
21456 (ssa_equiv_stack::~ssa_equiv_stack): Remove.
21458 2021-06-11 Kewen Lin <linkw@linux.ibm.com>
21460 * config/rs6000/rs6000.md
21461 (floatsi<SFDF:mode>2_lfiwax_<QHI:mode>_mem_zext): New
21462 define_insn_and_split.
21464 2021-06-11 Richard Biener <rguenther@suse.de>
21466 * tree-vect-slp.c (vect_build_slp_tree_2): Use stablesort
21467 to sort operands of the associative chain.
21469 2021-06-11 Richard Biener <rguenther@suse.de>
21471 * system.h (gcc_stablesort_r): Declare.
21472 * sort.cc (gcc_sort_r): Support stable sort.
21473 (gcc_stablesort_r): Define.
21474 * vec.h (vec<>::stablesort): Add.
21476 2021-06-10 Uroš Bizjak <ubizjak@gmail.com>
21479 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
21480 Handle V2SF mode. Emit SHUFPS to fixup unpack-high for V2SF mode.
21481 (expand_vec_perm_blend): Handle 64bit modes for TARGET_SSE4_1.
21482 (expand_vec_perm_pshufb): Handle 64bit modes for TARGET_SSSE3.
21483 (expand_vec_perm_pblendv): Handle 64bit modes for TARGET_SSE4_1.
21484 (expand_vec_perm_interleave2): Handle 64bit modes.
21485 (expand_vec_perm_even_odd_pack): Handle V8QI mode.
21486 (expand_vec_perm_even_odd_1): Ditto.
21487 (ix86_vectorize_vec_perm_const): Ditto.
21488 * config/i386/i386.md (UNSPEC_PSHUFB): Move from ...
21489 * config/i386/sse.md: ... here.
21490 * config/i386/mmx.md (*vec_interleave_lowv2sf):
21491 New insn_and_split pattern.
21492 (*vec_interleave_highv2sf): Ditto.
21493 (mmx_pshufbv8qi3): New insn pattern.
21494 (*mmx_pblendw): Ditto.
21496 2021-06-10 Peter Bergner <bergner@linux.ibm.com>
21498 * config/rs6000/rs6000-builtin.def (build_pair): New built-in.
21499 (build_acc): Likewise.
21500 * config/rs6000/rs6000-call.c (mma_expand_builtin): Swap assemble
21501 source operands in little-endian mode.
21502 (rs6000_gimple_fold_mma_builtin): Handle VSX_BUILTIN_BUILD_PAIR.
21503 (mma_init_builtins): Likewise.
21504 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Handle endianness
21505 ordering for the MMA assemble and build source operands.
21506 * doc/extend.texi (__builtin_vsx_build_acc, __builtin_mma_build_pair):
21508 (__builtin_mma_assemble_acc, __builtin_mma_assemble_pair): Remove
21511 2021-06-10 Jeff Law <jeffreyalaw@gmail.com>
21513 * config/h8300/h8300.c (select_cc_mode): Handle MEM. Use
21515 * config/h8300/extensions.md: Replace _clobber_flags patterns
21518 2021-06-10 Robin Dapp <rdapp@linux.ibm.com>
21520 * config/s390/vector.md (vcond_mask_<mode><mode>): Change to
21521 (vcond_mask_<mode><tointvec>): this.
21523 2021-06-10 Andrew Stubbs <ams@codesourcery.com>
21524 Thomas Schwinge <thomas@codesourcery.com>
21526 * omp-builtins.def (BUILT_IN_GOACC_ENTER_EXIT_DATA): Split into...
21527 (BUILT_IN_GOACC_ENTER_DATA, BUILT_IN_GOACC_EXIT_DATA): ... these.
21528 * gimple.h (enum gf_mask): Split
21529 'GF_OMP_TARGET_KIND_OACC_ENTER_EXIT_DATA' into
21530 'GF_OMP_TARGET_KIND_OACC_ENTER_DATA' and
21531 'GF_OMP_TARGET_KIND_OACC_EXIT_DATA'.
21532 (is_gimple_omp_oacc): Update.
21533 * gimple-pretty-print.c (dump_gimple_omp_target): Likewise.
21534 * gimplify.c (gimplify_omp_target_update): Likewise.
21535 * omp-expand.c (expand_omp_target, build_omp_regions_1)
21536 (omp_make_gimple_edges): Likewise.
21537 * omp-low.c (check_omp_nesting_restrictions, lower_omp_target):
21540 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
21542 * value-query.cc (value_query::value_on_edge): Rename name to
21544 (range_query::range_on_edge): Same.
21545 (range_query::value_of_expr): Same.
21546 (range_query::value_on_edge): Same.
21547 * value-query.h (class value_query): Same.
21548 (class range_query): Same.
21550 2021-06-10 Richard Biener <rguenther@suse.de>
21552 PR tree-optimization/101003
21553 * tree-vect-slp.c (vect_build_slp_tree_2): Appropriately
21554 use the pattern stmt defs when linearizing a chain.
21556 2021-06-10 Jakub Jelinek <jakub@redhat.com>
21559 * ifcvt.c (noce_get_alt_condition, noce_try_abs): Use
21560 prev_nonnote_nondebug_insn instead of prev_nonnote_insn.
21562 2021-06-10 Clement Chigot <clement.chigot@atos.net>
21564 * config/rs6000/aix71.h (ASM_CPU_SPEC): Add Power10 directive.
21565 * config/rs6000/aix72.h (ASM_CPU_SPEC): Likewise.
21567 2021-06-09 Andrew Pinski <apinski@marvell.com>
21569 PR tree-optimization/100925
21570 * match.pd (a ? CST1 : CST2): Limit transformations
21571 that would produce a negative to integeral types only.
21572 Change !POINTER_TYPE_P to INTEGRAL_TYPE_P also.
21574 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
21577 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
21579 * doc/tm.texi: Correctly update.
21581 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
21583 * doc/tm.texi: Correctly update.
21585 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
21588 * doc/tm.texi.in (Trampolines): Add a missing blank line.
21590 2021-06-09 Paul Eggert <eggert@cs.ucla.edu>
21593 * doc/invoke.texi (Code Gen Options); Document that -fno-trampolines
21594 and -ftrampolines work only with Ada.
21595 * doc/tm.texi.in (Trampolines): Likewise.
21596 * doc/tm.texi: Regenerated.
21598 2021-06-09 Carl Love <cel@us.ibm.com>
21600 * config/rs6000/altivec.h (vec_signextll, vec_signexti, vec_signextq):
21601 Add define for new builtins.
21602 * config/rs6000/altivec.md(altivec_vreveti2): Add define_expand.
21603 * config/rs6000/rs6000-builtin.def (VSIGNEXTI, VSIGNEXTLL): Add
21604 overloaded builtin definitions.
21605 (VSIGNEXTSB2W, VSIGNEXTSH2W, VSIGNEXTSB2D, VSIGNEXTSH2D,VSIGNEXTSW2D,
21606 VSIGNEXTSD2Q): Add builtin expansions.
21607 (SIGNEXT): Add P10 overload definition.
21608 * config/rs6000/rs6000-call.c (P9V_BUILTIN_VEC_VSIGNEXTI, P9V_BUILTIN_VEC_VSIGNEXTLL,
21609 P10_BUILTIN_VEC_SIGNEXT): Add overloaded argument definitions.
21610 * config/rs6000/vsx.md (vsx_sign_extend_v2di_v1ti): Add define_insn.
21611 (vsignextend_v2di_v1ti, vsignextend_qi_<mode>, vsignextend_hi_<mode>,
21612 vsignextend_si_v2di)[VIlong]: Add define_expand.
21613 Make define_insn vsx_sign_extend_si_v2di visible.
21614 * doc/extend.texi: Add documentation for the vec_signexti,
21615 vec_signextll builtins and vec_signextq.
21617 2021-06-09 Carl Love <cel@us.ibm.com>
21619 * config/rs6000/rs6000.c (__fixkfti, __fixunskfti, __floattikf,
21620 __floatuntikf): Names changed to __fixkfti_sw, __fixunskfti_sw,
21621 __floattikf_sw, __floatuntikf_sw respectively.
21622 * config/rs6000/rs6000.md (floatti<mode>2, floatunsti<mode>2,
21623 fix_trunc<mode>ti2, fixuns_trunc<mode>ti2): Add
21624 define_insn for mode IEEE 128.
21626 2021-06-09 Carl Love <cel@us.ibm.com>
21628 * config/rs6000/altivec.md (altivec_vslq, altivec_vsrq):
21629 Rename to altivec_vslq_<mode>, altivec_vsrq_<mode>, mode VEC_TI.
21630 * config/rs6000/vector.md (VEC_TI): Was named VSX_TI in vsx.md.
21631 (vashlv1ti3): Change to vashl<mode>3, mode VEC_TI.
21632 (vlshrv1ti3): Change to vlshr<mode>3, mode VEC_TI.
21633 * config/rs6000/vsx.md (VSX_TI): Remove define_mode_iterator. Update
21634 uses of VSX_TI to VEC_TI.
21636 2021-06-09 Carl Love <cel@us.ibm.com>
21638 * config/rs6000/dfp.md (floattitd2, fixtdti2): New define_insns.
21640 2021-06-09 Carl Love <cel@us.ibm.com>
21642 * config/rs6000/altivec.h (vec_dive, vec_mod): Add define for new
21644 * config/rs6000/altivec.md (UNSPEC_VMULEUD, UNSPEC_VMULESD,
21645 UNSPEC_VMULOUD, UNSPEC_VMULOSD): New unspecs.
21646 (altivec_eqv1ti, altivec_gtv1ti, altivec_gtuv1ti, altivec_vmuleud,
21647 altivec_vmuloud, altivec_vmulesd, altivec_vmulosd, altivec_vrlq,
21648 altivec_vrlqmi, altivec_vrlqmi_inst, altivec_vrlqnm,
21649 altivec_vrlqnm_inst, altivec_vslq, altivec_vsrq, altivec_vsraq,
21650 altivec_vcmpequt_p, altivec_vcmpgtst_p, altivec_vcmpgtut_p): New
21652 (vec_widen_umult_even_v2di, vec_widen_smult_even_v2di,
21653 vec_widen_umult_odd_v2di, vec_widen_smult_odd_v2di, altivec_vrlqmi,
21654 altivec_vrlqnm): New define_expands.
21655 * config/rs6000/rs6000-builtin.def (VCMPEQUT_P, VCMPGTST_P,
21656 VCMPGTUT_P): Add macro expansions.
21657 (BU_P10V_AV_P): Add builtin predicate definition.
21658 (VCMPGTUT, VCMPGTST, VCMPEQUT, CMPNET, CMPGE_1TI,
21659 CMPGE_U1TI, CMPLE_1TI, CMPLE_U1TI, VNOR_V1TI_UNS, VNOR_V1TI, VCMPNET_P,
21660 VCMPAET_P, VMULEUD, VMULESD, VMULOUD, VMULOSD, VRLQ,
21661 VSLQ, VSRQ, VSRAQ, VRLQNM, DIV_V1TI, UDIV_V1TI, DIVES_V1TI, DIVEU_V1TI,
21662 MODS_V1TI, MODU_V1TI, VRLQMI): New macro expansions.
21663 (VRLQ, VSLQ, VSRQ, VSRAQ, DIVE, MOD): New overload expansions.
21664 * config/rs6000/rs6000-call.c (P10_BUILTIN_VCMPEQUT,
21665 P10V_BUILTIN_CMPGE_1TI, P10V_BUILTIN_CMPGE_U1TI,
21666 P10V_BUILTIN_VCMPGTUT, P10V_BUILTIN_VCMPGTST,
21667 P10V_BUILTIN_CMPLE_1TI, P10V_BUILTIN_VCMPLE_U1TI,
21668 P10V_BUILTIN_DIV_V1TI, P10V_BUILTIN_UDIV_V1TI,
21669 P10V_BUILTIN_VMULESD, P10V_BUILTIN_VMULEUD,
21670 P10V_BUILTIN_VMULOSD, P10V_BUILTIN_VMULOUD,
21671 P10V_BUILTIN_VNOR_V1TI, P10V_BUILTIN_VNOR_V1TI_UNS,
21672 P10V_BUILTIN_VRLQ, P10V_BUILTIN_VRLQMI,
21673 P10V_BUILTIN_VRLQNM, P10V_BUILTIN_VSLQ,
21674 P10V_BUILTIN_VSRQ, P10V_BUILTIN_VSRAQ,
21675 P10V_BUILTIN_VCMPGTUT_P, P10V_BUILTIN_VCMPGTST_P,
21676 P10V_BUILTIN_VCMPEQUT_P, P10V_BUILTIN_VCMPGTUT_P,
21677 P10V_BUILTIN_VCMPGTST_P, P10V_BUILTIN_CMPNET,
21678 P10V_BUILTIN_VCMPNET_P, P10V_BUILTIN_VCMPAET_P,
21679 P10V_BUILTIN_DIVES_V1TI, P10V_BUILTIN_MODS_V1TI,
21680 P10V_BUILTIN_MODU_V1TI):
21681 New overloaded definitions.
21682 (rs6000_gimple_fold_builtin) [P10V_BUILTIN_VCMPEQUT,
21683 P10V_BUILTIN_CMPNET, P10V_BUILTIN_CMPGE_1TI,
21684 P10V_BUILTIN_CMPGE_U1TI, P10V_BUILTIN_VCMPGTUT,
21685 P10V_BUILTIN_VCMPGTST, P10V_BUILTIN_CMPLE_1TI,
21686 P10V_BUILTIN_CMPLE_U1TI]: New case statements.
21687 (rs6000_init_builtins) [bool_V1TI_type_node, int_ftype_int_v1ti_v1ti]:
21689 (altivec_init_builtins): New E_V1TImode case statement.
21690 (builtin_function_type)[P10_BUILTIN_128BIT_VMULEUD,
21691 P10_BUILTIN_128BIT_VMULOUD, P10_BUILTIN_128BIT_DIVEU_V1TI,
21692 P10_BUILTIN_128BIT_MODU_V1TI, P10_BUILTIN_CMPGE_U1TI,
21693 P10_BUILTIN_VCMPGTUT, P10_BUILTIN_VCMPEQUT]: New case statements.
21694 * config/rs6000/rs6000.c (rs6000_handle_altivec_attribute) [E_TImode,
21695 E_V1TImode]: New case statements.
21696 * config/rs6000/rs6000.h (rs6000_builtin_type_index): New enum
21697 value RS6000_BTI_bool_V1TI.
21698 * config/rs6000/vector.md (vector_gtv1ti,vector_nltv1ti,
21699 vector_gtuv1ti, vector_nltuv1ti, vector_ngtv1ti, vector_ngtuv1ti,
21700 vector_eq_v1ti_p, vector_ne_v1ti_p, vector_ae_v1ti_p,
21701 vector_gt_v1ti_p, vector_gtu_v1ti_p, vrotlv1ti3, vashlv1ti3,
21702 vlshrv1ti3, vashrv1ti3): New define_expands.
21703 * config/rs6000/vsx.md (UNSPEC_VSX_DIVSQ, UNSPEC_VSX_DIVUQ,
21704 UNSPEC_VSX_DIVESQ, UNSPEC_VSX_DIVEUQ, UNSPEC_VSX_MODSQ,
21705 UNSPEC_VSX_MODUQ): New unspecs.
21706 (mulv2di3, vsx_div_v1ti, vsx_udiv_v1ti, vsx_dives_v1ti,
21707 vsx_diveu_v1ti, vsx_mods_v1ti, vsx_modu_v1ti, xxswapd_v1ti): New
21709 (vcmpnet): New define_expand.
21710 * doc/extend.texi: Add documentation for the new builtins vec_rl,
21711 vec_rlmi, vec_rlnm, vec_sl, vec_sr, vec_sra, vec_mule, vec_mulo,
21712 vec_div, vec_dive, vec_mod, vec_cmpeq, vec_cmpne, vec_cmpgt, vec_cmplt,
21713 vec_cmpge, vec_cmple, vec_all_eq, vec_all_ne, vec_all_gt, vec_all_lt,
21714 vec_all_ge, vec_all_le, vec_any_eq, vec_any_ne, vec_any_gt, vec_any_lt,
21715 vec_any_ge, vec_any_le.
21717 2021-06-09 Carl Love <cel@us.ibm.com>
21719 * config/rs6000/altivec.md (altivec_vrl<VI_char>mi): Fix
21720 bug in argument generation.
21722 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
21724 * config/arm/iterators.md (<supf>): Remove VCLZQ_U, VCLZQ_S.
21726 * config/arm/mve.md (mve_vclzq_<supf><mode>): Add '@' prefix,
21727 remove <supf> iterator.
21728 (mve_vclzq_u<mode>): New.
21729 * config/arm/neon.md (clz<mode>2): Rename to neon_vclz<mode>.
21730 (neon_vclz<mode): Move to ...
21731 * config/arm/unspecs.md (VCLZQ_U, VCLZQ_S): Remove.
21732 * config/arm/vec-common.md: ... here. Add support for MVE.
21734 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
21736 * config/arm/mve.md (mve_vhaddq_<supf><mode>): Prefix with '@'.
21737 (@mve_vrhaddq_<supf><mode): Likewise.
21738 * config/arm/neon.md (neon_v<r>hadd<sup><mode>): Likewise.
21739 * config/arm/vec-common.md (avg<mode>3_floor, uavg<mode>3_floor)
21740 (avg<mode>3_ceil", uavg<mode>3_ceil): New patterns.
21742 2021-06-09 imba-tjd <109224573@qq.com>
21744 * doc/invoke.texi: Fix typo.
21746 2021-06-09 Roger Sayle <roger@nextmovesoftware.com>
21748 PR middle-end/53267
21749 * fold-const-call.c (fold_const_call_sss) [CASE_CFN_FMOD]:
21750 Support evaluation of fmod/fmodf/fmodl at compile-time.
21752 2021-06-09 Richard Biener <rguenther@suse.de>
21754 PR tree-optimization/100981
21755 * tree-vect-loop.c (vect_create_epilog_for_reduction): Use
21756 gimple_get_lhs to also handle calls.
21757 * tree-vect-slp-patterns.c (complex_pattern::build): Transfer
21760 2021-06-09 Richard Biener <rguenther@suse.de>
21762 PR tree-optimization/97832
21763 * tree-vectorizer.h (_slp_tree::failed): New.
21764 * tree-vect-slp.c (_slp_tree::_slp_tree): Initialize
21766 (_slp_tree::~_slp_tree): Free failed.
21767 (vect_build_slp_tree): Retain failed nodes and record
21768 matches in them, copying that back out when running
21769 into a cached fail. Dump start and end of discovery.
21770 (dt_sort_cmp): New.
21771 (vect_build_slp_tree_2): Handle associatable chains
21772 together doing more aggressive operand swapping.
21774 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
21777 * config.gcc (gcc_cv_initfini_array): Set to yes for Linux and
21779 * doc/install.texi: Require glibc 2.1 and binutils 2.12 for
21780 Linux and GNU targets.
21782 2021-06-09 Richard Biener <rguenther@suse.de>
21784 * tree-vect-stmts.c (vect_is_simple_use): Always get dt
21787 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
21789 * config/arc/arc.md (loop_end): Change it to
21790 define_insn_and_split.
21792 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
21794 * config/arc/arc.md (maddhisi4): Use VMAC2H instruction.
21795 (machi): New pattern.
21796 (umaddhisi4): Use VMAC2HU instruction.
21797 (umachi): New pattern.
21799 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
21801 * config/arc/arc-protos.h (arc_split_move_p): New prototype.
21802 * config/arc/arc.c (arc_split_move_p): New function.
21803 (arc_split_move): Clean up.
21804 * config/arc/arc.md (movdi_insn): Clean up, use arc_split_move_p.
21805 (movdf_insn): Likewise.
21806 * config/arc/simdext.md (mov<VWH>_insn): Likewise.
21808 2021-06-09 Uroš Bizjak <ubizjak@gmail.com>
21811 * config/i386/i386.c (print_operand_address_as): Rename "no_rip"
21812 argument to "raw". Do not emit segment overrides when "raw" is true.
21814 2021-06-09 Martin Liska <mliska@suse.cz>
21816 * doc/gcov.texi: Create a proper JSON files.
21817 * doc/invoke.texi: Remove dots in order to make it a valid
21820 2021-06-09 Xionghu Luo <luoxhu@linux.ibm.com>
21822 * config/rs6000/rs6000-p8swap.c (pattern_is_rotate64): New.
21823 (insn_is_load_p): Use pattern_is_rotate64.
21824 (insn_is_swap_p): Likewise.
21825 (quad_aligned_load_p): Likewise.
21826 (const_load_sequence_p): Likewise.
21827 (replace_swapped_aligned_load): Likewise.
21828 (recombine_lvx_pattern): Likewise.
21829 (recombine_stvx_pattern): Likewise.
21831 2021-06-09 Andrew MacLeod <amacleod@redhat.com>
21833 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Use a
21834 fur_stmt source record.
21835 * gimple-range.cc (fur_source::get_operand): Generic range query.
21836 (fur_source::get_phi_operand): New.
21837 (fur_source::register_dependency): New.
21838 (fur_source::query): New.
21839 (class fur_edge): New. Edge source for operands.
21840 (fur_edge::fur_edge): New.
21841 (fur_edge::get_operand): New.
21842 (fur_edge::get_phi_operand): New.
21843 (fur_edge::query): New.
21844 (fur_stmt::fur_stmt): New.
21845 (fur_stmt::get_operand): New.
21846 (fur_stmt::get_phi_operand): New.
21847 (fur_stmt::query): New.
21848 (class fur_depend): New. Statement source and process dependencies.
21849 (fur_depend::fur_depend): New.
21850 (fur_depend::register_dependency): New.
21851 (class fur_list): New. List source for operands.
21852 (fur_list::fur_list): New.
21853 (fur_list::get_operand): New.
21854 (fur_list::get_phi_operand): New.
21855 (fold_range): New. Instantiate appropriate fur_source class and fold.
21856 (fold_using_range::range_of_range_op): Use new API.
21857 (fold_using_range::range_of_address): Ditto.
21858 (fold_using_range::range_of_phi): Ditto.
21859 (imple_ranger::fold_range_internal): Use fur_depend class.
21860 (fold_using_range::range_of_ssa_name_with_loop_info): Use new API.
21861 * gimple-range.h (class fur_source): Now a base class.
21862 (class fur_stmt): New.
21863 (fold_range): New prototypes.
21864 (fur_source::fur_source): Delete.
21866 2021-06-08 Andrew Pinski <apinski@marvell.com>
21868 PR tree-optimization/25290
21869 * tree-ssa-phiopt.c (xor_replacement): Delete.
21870 (tree_ssa_phiopt_worker): Delete use of xor_replacement.
21871 (match_simplify_replacement): Allow one cheap preparation
21872 statement that can be moved to before the if.
21874 2021-06-08 Pat Haugen <pthaugen@linux.ibm.com>
21876 * config/rs6000/power10.md (power10-fused-load, power10-fused-store,
21877 power10-fused_alu, power10-fused-vec, power10-fused-branch): New.
21879 2021-06-08 Jeff Law <jeffreyalaw@gmail.com>
21881 * config/h8300/logical.md (andqi3_1): Move BCLR case into define_insn_and_split.
21882 Create length attribute on define_insn_and_split. Only split for cases which we
21884 (andqi3_1<cczn>): Renamed from andqi3_1_clobber_flags. Only handle AND here and
21885 fix length computation.
21886 (b<code><mode>msx): Combine QImode and HImode H8/SX patterns using iterator.
21888 2021-06-08 Richard Biener <rguenther@suse.de>
21890 PR tree-optimization/100923
21891 * tree-ssa-sccvn.c (valueize_refs_1): Take a pointer to
21892 the operand vector to be valueized.
21893 (valueize_refs): Likewise.
21894 (valueize_shared_reference_ops_from_ref): Adjust.
21895 (valueize_shared_reference_ops_from_call): Likewise.
21896 (vn_reference_lookup_3): Likewise.
21897 (vn_reference_lookup_pieces): Likewise. Re-valueize
21898 with honoring availability when we are about to create
21899 the ao_ref and valueized before.
21900 (vn_reference_lookup): Likewise.
21901 (vn_reference_insert_pieces): Adjust.
21903 2021-06-08 Richard Biener <rguenther@suse.de>
21905 * tree-vectorizer.h (_slp_instance::root_stmt): Change to...
21906 (_slp_instance::root_stmts): ... a vector.
21907 (SLP_INSTANCE_ROOT_STMT): Rename to ...
21908 (SLP_INSTANCE_ROOT_STMTS): ... this.
21909 (slp_root::root): Change to...
21910 (slp_root::roots): ... a vector.
21911 (slp_root::slp_root): Adjust.
21912 * tree-vect-slp.c (_slp_instance::location): Adjust.
21913 (vect_free_slp_instance): Release the root stmt vector.
21914 (vect_build_slp_instance): Adjust.
21915 (vect_analyze_slp): Likewise.
21916 (_bb_vec_info::~_bb_vec_info): Likewise.
21917 (vect_slp_analyze_operations): Likewise.
21918 (vect_bb_vectorization_profitable_p): Likewise. Adjust
21919 costs for the root stmt.
21920 (vect_slp_check_for_constructors): Gather all BIT_INSERT_EXPRs
21922 (vect_slp_analyze_bb_1): Simplify by marking all root stmts
21924 (vectorize_slp_instance_root_stmt): Adjust.
21925 (vect_schedule_slp): Likewise.
21927 2021-06-08 Aldy Hernandez <aldyh@redhat.com>
21929 * gimple-ssa-evrp.c (class ssa_equiv_stack): New.
21930 (ssa_equiv_stack::ssa_equiv_stack): New.
21931 (ssa_equiv_stack::~ssa_equiv_stack): New.
21932 (ssa_equiv_stack::enter): New.
21933 (ssa_equiv_stack::leave): New.
21934 (ssa_equiv_stack::push_replacement): New.
21935 (ssa_equiv_stack::get_replacement): New.
21936 (is_pointer_ssa): New.
21937 (class pointer_equiv_analyzer): New.
21938 (pointer_equiv_analyzer::pointer_equiv_analyzer): New.
21939 (pointer_equiv_analyzer::~pointer_equiv_analyzer): New.
21940 (pointer_equiv_analyzer::set_global_equiv): New.
21941 (pointer_equiv_analyzer::set_cond_equiv): New.
21942 (pointer_equiv_analyzer::get_equiv): New.
21943 (pointer_equiv_analyzer::enter): New.
21944 (pointer_equiv_analyzer::leave): New.
21945 (pointer_equiv_analyzer::get_equiv_expr): New.
21946 (pta_valueize): New.
21947 (pointer_equiv_analyzer::visit_stmt): New.
21948 (pointer_equiv_analyzer::visit_edge): New.
21949 (hybrid_folder::value_of_expr): Call PTA.
21950 (hybrid_folder::value_on_edge): Same.
21951 (hybrid_folder::pre_fold_bb): New.
21952 (hybrid_folder::post_fold_bb): New.
21953 (hybrid_folder::pre_fold_stmt): New.
21954 (rvrp_folder::pre_fold_bb): New.
21955 (rvrp_folder::post_fold_bb): New.
21956 (rvrp_folder::pre_fold_stmt): New.
21957 (rvrp_folder::value_of_expr): Call PTA.
21958 (rvrp_folder::value_on_edge): Same.
21960 2021-06-08 Jakub Jelinek <jakub@redhat.com>
21963 * tree-inline.c (copy_tree_body_r): For OMP_CLAUSE_DEPEND don't
21964 check TREE_CODE if OMP_CLAUSE_DECL is NULL.
21966 2021-06-08 Richard Biener <rguenther@suse.de>
21968 PR middle-end/100951
21969 * tree-vect-generic.c (expand_vector_piecewise): Build a
21970 VECTOR_CST if all elements are constant.
21971 (expand_vector_condition): Likewise.
21972 (lower_vec_perm): Likewise.
21973 (expand_vector_conversion): Likewise.
21975 2021-06-08 Martin Liska <mliska@suse.cz>
21977 * doc/invoke.texi: Document new param evrp-sparse-threshold.
21979 2021-06-08 Martin Liska <mliska@suse.cz>
21981 * genautomata.c (create_automata): Fix typo.
21983 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
21985 PR tree-optimization/100794
21986 * tree-predcom.c (tree_predictive_commoning_loop): Add parameter
21987 allow_unroll_p and only allow unrolling when it's true.
21988 (tree_predictive_commoning): Add parameter allow_unroll_p and
21990 (run_tree_predictive_commoning): Likewise.
21991 (pass_predcom::gate): Check flag_tree_loop_vectorize and
21992 global_options_set.x_flag_predictive_commoning.
21993 (pass_predcom::execute): Adjust for allow_unroll_p.
21995 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
21997 * tree-predcom.c (execute_pred_commoning): Remove update_ssa call.
21998 (tree_predictive_commoning_loop): Factor some cleanup stuffs into
21999 lambda function cleanup, remove scev_reset call, and adjust return
22001 (tree_predictive_commoning): Adjust for different changed values,
22002 only set flag TODO_update_ssa_only_virtuals if changed.
22003 (pass_data pass_data_predcom): Remove TODO_update_ssa_only_virtuals
22004 from todo_flags_finish.
22006 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
22008 * gimple-range-cache.cc (class sbr_sparse_bitmap): New.
22009 (sbr_sparse_bitmap::sbr_sparse_bitmap): New.
22010 (sbr_sparse_bitmap::bitmap_set_quad): New.
22011 (sbr_sparse_bitmap::bitmap_get_quad): New.
22012 (sbr_sparse_bitmap::set_bb_range): New.
22013 (sbr_sparse_bitmap::get_bb_range): New.
22014 (sbr_sparse_bitmap::bb_range_p): New.
22015 (block_range_cache::block_range_cache): initialize bitmap obstack.
22016 (block_range_cache::~block_range_cache): Destruct obstack.
22017 (block_range_cache::set_bb_range): Decide when to utilze the
22018 sparse on entry cache.
22019 * gimple-range-cache.h (block_range_cache): Add bitmap obstack.
22020 * params.opt (-param=evrp-sparse-threshold): New.
22022 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
22024 * bitmap.c (bitmap_set_aligned_chunk): New.
22025 (bitmap_get_aligned_chunk): New.
22026 (test_aligned_chunk): New.
22027 (bitmap_c_tests): Call test_aligned_chunk.
22028 * bitmap.h (bitmap_set_aligned_chunk, bitmap_get_aligned_chunk): New.
22030 2021-06-07 Uroš Bizjak <ubizjak@gmail.com>
22033 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
22035 (ix86_expand_vector_init_one_nonzero): Ditto.
22036 (ix86_expand_vector_init_one_var): Ditto.
22037 (ix86_expand_vector_init_general): Ditto.
22038 * config/i386/mmx.md (vec_initv4qiqi): New expander.
22040 2021-06-07 Jeff Law <jeffreyalaw@gmail.com>
22042 * config/h8300/movepush.md: Change most _clobber_flags
22043 patterns to instead use <cczn> subst.
22044 (movsi_cczn): New pattern with usable CC cases split out.
22045 (movsi_h8sx_cczn): Likewise.
22047 2021-06-07 Martin Liska <mliska@suse.cz>
22049 * common/common-target.def: Split long lines and replace them
22051 * target.def: Likewise.
22052 * doc/tm.texi: Re-generated.
22054 2021-06-07 Jakub Jelinek <jakub@redhat.com>
22057 * fold-const.c (fold_read_from_vector): Return NULL if trying to
22058 read from a CONSTRUCTOR with vector type elements.
22060 2021-06-07 Jakub Jelinek <jakub@redhat.com>
22062 PR middle-end/100898
22063 * tree-inline.c (copy_bb): Only use gimple_call_arg_ptr if memcpy
22064 should copy any arguments. Don't call gimple_call_num_args
22065 on id->call_stmt or call_stmt more than once.
22067 2021-06-07 liuhongt <hongtao.liu@intel.com>
22070 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): Refine
22072 (<insn>v4siv4di2): Delete constraints for define_expand.
22074 2021-06-07 liuhongt <hongtao.liu@intel.com>
22077 * config/i386/i386-expand.c (ix86_expand_builtin): Remove
22078 assignment of cfun->machine->has_explicit_vzeroupper.
22079 * config/i386/i386-features.c
22080 (ix86_add_reg_usage_to_vzerouppers): Delete.
22081 (ix86_add_reg_usage_to_vzeroupper): Ditto.
22082 (rest_of_handle_insert_vzeroupper): Remove
22083 ix86_add_reg_usage_to_vzerouppers, add df_analyze at the end
22085 (gate): Remove cfun->machine->has_explicit_vzeroupper.
22086 * config/i386/i386-protos.h (ix86_expand_avx_vzeroupper):
22088 * config/i386/i386.c (ix86_insn_callee_abi): New function.
22089 (ix86_initialize_callee_abi): Ditto.
22090 (ix86_expand_avx_vzeroupper): Ditto.
22091 (ix86_hard_regno_call_part_clobbered): Adjust for vzeroupper
22093 (TARGET_INSN_CALLEE_ABI): Define as ix86_insn_callee_abi.
22094 (ix86_emit_mode_set): Call ix86_expand_avx_vzeroupper
22096 * config/i386/i386.h (struct GTY(()) machine_function): Delete
22097 has_explicit_vzeroupper.
22098 * config/i386/i386.md (enum unspec): New member
22100 (ABI_DEFAULT,ABI_VZEROUPPER,ABI_UNKNOWN): New
22101 define_constants for insn callee abi index.
22102 * config/i386/predicates.md (vzeroupper_pattern): Adjust.
22103 * config/i386/sse.md (UNSPECV_VZEROUPPER): Deleted.
22104 (avx_vzeroupper): Call ix86_expand_avx_vzeroupper.
22105 (*avx_vzeroupper): Rename to ..
22106 (avx_vzeroupper_callee_abi): .. this, and adjust pattern as
22107 call_insn which has a special vzeroupper ABI.
22108 (*avx_vzeroupper_1): Deleted.
22110 2021-06-07 liuhongt <hongtao.liu@intel.com>
22113 * df-scan.c (df_get_call_refs): When call_insn is a fake call,
22114 it won't use stack pointer reg.
22115 * final.c (leaf_function_p): When call_insn is a fake call, it
22116 won't affect caller as a leaf function.
22117 * reg-stack.c (callee_clobbers_any_stack_reg): New.
22118 (subst_stack_regs): When call_insn doesn't clobber any stack
22119 reg, don't clear the arguments.
22120 * rtl.c (shallow_copy_rtx): Don't clear flag used when orig is
22122 * shrink-wrap.c (requires_stack_frame_p): No need for stack
22123 frame for a fake call.
22124 * rtl.h (FAKE_CALL_P): New macro.
22126 2021-06-06 Eric Botcazou <ebotcazou@adacore.com>
22128 * config/sparc/sparc-protos.h (order_regs_for_local_alloc): Rename
22130 (sparc_order_regs_for_local_alloc): ...this.
22131 (sparc_leaf_reg_remap): Declare.
22132 * config/sparc/sparc.h (ADJUST_REG_ALLOC_ORDER): Adjust.
22133 (LEAF_REG_REMAP): Reimplement as call to sparc_leaf_reg_remap.
22134 * config/sparc/sparc.c (leaf_reg_remap): Delete.
22135 (order_regs_for_local_alloc): Rename to...
22136 (sparc_order_regs_for_local_alloc): ...this.
22137 (sparc_leaf_reg_remap): New function.
22138 (sparc_conditional_register_usage): Do not modify leaf_reg_remap.
22140 2021-06-06 David Edelsohn <dje.gcc@gmail.com>
22142 * config/rs6000/rs6000.c (rs6000_xcoff_asm_output_aligned_decl_common):
22143 Use assemble_name to output BSS section name.
22145 2021-06-06 Uroš Bizjak <ubizjak@gmail.com>
22147 * config/i386/constraints.md (Bs):
22148 Remove boolean operators from match_test RTX.
22151 (M): Use "mode" variable instead of GET_MODE (op) in match_test RTX.
22154 2021-06-06 Martin Liska <mliska@suse.cz>
22156 * doc/extend.texi: Add missing @headitem.
22157 * doc/invoke.texi: Likewise.
22158 * doc/objc.texi: Likewise.
22160 2021-06-06 Martin Liska <mliska@suse.cz>
22162 * genhooks.c (emit_findices): Remove unused function.
22163 (emit_documentation): Do not call emit_findices
22164 and do not search for @Fcode directives.
22166 2021-06-06 Martin Liska <mliska@suse.cz>
22168 * doc/invoke.texi: Remove extra character.
22170 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22172 * config/sh/sh.md (doloop_end_split): Fix empty split condition.
22174 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22176 * config/sparc/sparc.md (*snedi<W:mode>_zero_vis3,
22177 *neg_snedi<W:mode>_zero_subxc, *plus_snedi<W:mode>_zero,
22178 *plus_plus_snedi<W:mode>_zero, *minus_snedi<W:mode>_zero,
22179 *minus_minus_snedi<W:mode>_zero): Fix empty split condition.
22181 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22183 * config/or1k/or1k.md (*movdi): Fix empty split condition.
22185 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22187 * config/mips/mips.md (<anonymous>, bswapsi2, bswapdi2): Fix empty
22190 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
22192 * config/m68k/m68k.md (*zero_extend_inc, *zero_extend_dec,
22193 *zero_extendsidi2): Fix empty split condition.
22195 2021-06-05 Jeff Law <jeffreyalaw@gmail.com>
22197 * config/h8300/addsub.md: Fix split condition in define_insn_and_split
22199 * config/h8300/bitfield.md: Likewise.
22200 * config/h8300/combiner.md: Likewise.
22201 * config/h8300/divmod.md: Likewise.
22202 * config/h8300/extensions.md: Likewise.
22203 * config/h8300/jumpcall.md: Likewise.
22204 * config/h8300/movepush.md: Likewise.
22205 * config/h8300/multiply.md: Likewise.
22206 * config/h8300/other.md: Likewise.
22207 * config/h8300/shiftrotate.md: Likewise.
22208 * config/h8300/logical.md: Likewise. Fix split pattern to use
22209 code iterator that somehow slipped through.
22211 2021-06-04 Tobias Burnus <tobias@codesourcery.com>
22213 PR middle-end/100905
22214 * tree-nested.c (convert_nonlocal_omp_clauses,
22215 convert_local_omp_clauses): Handle OMP_CLAUSE_BIND.
22217 2021-06-04 Martin Sebor <msebor@redhat.com>
22219 PR middle-end/100732
22220 * gimple-fold.c (gimple_fold_builtin_sprintf): Avoid folding calls
22221 with either source or destination argument of invalid type.
22222 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Avoid checking
22223 calls with arguments of invalid type.
22225 2021-06-04 Martin Sebor <msebor@redhat.com>
22227 * attribs.c (init_attr_rdwr_indices): Use VLA bounds in the expected
22229 (attr_access::vla_bounds): Also handle VLA bounds.
22231 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
22233 * config/i386/predicates.md (GOT_memory_operand):
22234 Implement using match_code RTXes.
22235 (GOT32_symbol_operand): Ditto.
22237 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
22240 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
22242 (ix86_expand_vector_init_general): Ditto.
22243 Use SImode instead of word_mode for logic operations
22244 when GET_MODE_SIZE (mode) < UNITS_PER_WORD.
22245 (expand_vec_perm_even_odd_1): Assert that V2HI mode should be
22246 implemented by expand_vec_perm_1.
22247 (expand_vec_perm_broadcast_1): Assert that V2HI and V4HI modes
22248 should be implemented using standard shuffle patterns.
22249 (ix86_vectorize_vec_perm_const): Handle V2HImode. Add V4HI and
22250 V2HI modes to modes, implementable with shuffle for one operand.
22251 * config/i386/mmx.md (*punpckwd): New insn_and_split pattern.
22252 (*pshufw_1): New insn pattern.
22253 (*vec_dupv2hi): Ditto.
22254 (vec_initv2hihi): New expander.
22256 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
22258 * config/arm/vfp.md (no_literal_pool_df_immediate,
22259 no_literal_pool_sf_immediate): Fix empty split condition.
22261 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
22263 * config/i386/i386.md (*load_tp_x32_zext, *add_tp_x32_zext,
22264 *tls_dynamic_gnu2_combine_32): Fix empty split condition.
22265 * config/i386/sse.md (*<sse2_avx2>_pmovmskb_lt,
22266 *<sse2_avx2>_pmovmskb_zext_lt, *sse2_pmovmskb_ext_lt,
22267 *<sse4_1_avx2>_pblendvb_lt): Likewise.
22269 2021-06-04 Jakub Jelinek <jakub@redhat.com>
22272 * config/i386/i386-expand.c (ix86_expand_vector_init): Handle
22273 concatenation from half-sized modes with TImode elements.
22275 2021-06-04 Claudiu Zissulescu <claziss@synopsys.com>
22277 * config/arc/arc.c (arc_override_options): Disable millicode
22278 thunks when RF16 is on.
22280 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
22282 * config/rs6000/rs6000.h (PROMOTE_MODE): Remove.
22284 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
22286 * config/rs6000/rs6000-call.c (rs6000_promote_function_mode):
22287 Replace PROMOTE_MODE marco with its content.
22289 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
22291 * config/cris/cris.md (*addi_reload): Fix empty split condition.
22293 2021-06-03 Jim Wilson <jimw@sifive.com>
22295 * config.gcc (riscv*-*-*): If --with-riscv-attribute not used,
22296 turn it on for all riscv targets.
22298 2021-06-03 Uroš Bizjak <ubizjak@gmail.com>
22301 * config/i386/i386-expand.c (ix86_expand_vector_set):
22302 Handle V2HI and V4QI modes.
22303 (ix86_expand_vector_extract): Ditto.
22304 * config/i386/mmx.md (*pinsrw): New insn pattern.
22307 (*pextrw_zext): Ditto.
22309 (*pextrb_zext): Ditto.
22310 (vec_setv2hi): New expander.
22311 (vec_extractv2hihi): Ditto.
22312 (vec_setv4qi): Ditto.
22313 (vec_extractv4qiqi): Ditto.
22314 (vec_setv8qi): Enable only for TARGET_SSE4_1.
22315 (vec_extractv8qiqi): Ditto.
22317 2021-06-03 Aaron Sawdey <acsawdey@linux.ibm.com>
22319 * config/rs6000/genfusion.pl (gen_logical_addsubf): Fix input
22320 order to subf instruction.
22321 * config/rs6000/fusion.md: Regenerate.
22323 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
22325 * calls.c (get_size_range): Use range_of_expr instead of
22326 determine_value_range.
22327 * tree-affine.c (expr_to_aff_combination): Same.
22328 * tree-data-ref.c (split_constant_offset): Same.
22329 * tree-vrp.c (determine_value_range_1): Remove.
22330 (determine_value_range): Remove.
22331 * tree-vrp.h (determine_value_range): Remove.
22333 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
22335 * function-tests.c (test_ranges): Call gimple_range_tests.
22336 * gimple-range-cache.cc (ranger_cache::range_of_expr): Pass stmt
22338 * gimple-range.cc (fur_source::get_operand): Do not call
22339 get_tree_range or gimple_range_global.
22341 (get_tree_range): Move to value-query.cc.
22342 Call get_arith_expr_range.
22343 (gimple_ranger::range_of_expr): Add argument to get_tree_range.
22344 Include gimple-range-tests.cc.
22345 * gimple-range.h (fold_range): Add argument.
22346 (get_tree_range): Remove.
22347 * selftest.h (gimple_range_tests): New.
22348 * value-query.cc (global_range_query::range_of_expr): Add
22350 (range_query::get_tree_range): Move from gimple-range.cc.
22351 * value-query.h (class range_query): Add get_tree_range and
22352 get_arith_expr_range. Make fur_source a friend.
22353 * vr-values.c (vr_values::range_of_expr): Pass stmt to
22355 * gimple-range-tests.cc: New file.
22357 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
22359 * gimple-range.cc (gimple_ranger::export_global_ranges): Call
22360 update_global_range.
22361 * value-query.cc (update_global_range): New.
22362 * value-query.h (update_global_range): New.
22364 2021-06-03 David Malcolm <dmalcolm@redhat.com>
22366 * diagnostic-show-locus.c (diagnostic_show_locus): Don't reject
22367 printing the same location twice if there are fix-it hints,
22368 multiple locations, or a label.
22370 2021-06-03 Andre Vieira <andre.simoesdiasvieira@arm.com>
22372 * tree-vect-loop.c (vect_transform_loop): Use main loop's various'
22373 thresholds to narrow the upper bound on epilogue iterations.
22375 2021-06-03 Christophe Lyon <christophe.lyon@linaro.org>
22377 * config/arm/mve.md (mve_vabsq_f<mode>): Use 'abs' instead of unspec.
22378 (mve_vabsq_s<mode>): Likewise.
22379 * config/arm/neon.md (abs<mode>2): Rename to neon_abs<mode>2.
22380 * config/arm/unspecs.md (VABSQ_F, VABSQ_S): Delete.
22381 * config/arm/vec-common.md (neg<mode>2): Rename to
22382 <absneg_str><mode>2.
22384 2021-06-03 Claudiu Zissulescu <claziss@synopsys.com>
22386 * common/config/arc/arc-common.c (arc_option_optimization_table):
22387 Remove malign-call.
22388 * config/arc/arc.c (arc_unalign_branch_p): Remove unused function.
22389 * config/arc/arc.h (TARGET_MIXED_CODE): Remove macro.
22390 (INDEX_REG_CLASS): Only refer to GENERAL_REGS.
22391 * config/arc/arc.md (abssi2_mixed): Remove pattern.
22392 * config/arc/arc.opt (munalign-prob-threshold): Mark it obsolete.
22393 (malign-call): Likewise.
22394 (mmixed-code): Likewise.
22395 * doc/invoke.texi (ARC): Update doc.
22397 2021-06-03 Martin Liska <mliska@suse.cz>
22399 * common.opt: Use proper Enum values.
22400 * opts.c (COVERAGE_SANITIZER_OPT): Remove.
22401 (parse_sanitizer_options): Handle only sanitizer_opts.
22402 (common_handle_option): Just assign value.
22404 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
22407 * tree-inline.c (inline_forbidden_p): Remove test on return type.
22409 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
22411 * dwarf2out.c (loc_list_from_tree_1) <FUNCTION_DECL>: Also generate
22412 DW_OP_GNU_variable_value referencing an existing DIE at file scope.
22413 (type_byte_size): Inline into...
22414 (add_byte_size_attribute): ...this and call add_scalar_info.
22416 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
22418 * dwarf2out.c (mem_loc_descriptor) <UDIV>: Fix typo.
22419 (typed_binop_from_tree): New function.
22420 (loc_list_from_tree_1) <EXACT_DIV_EXPR>: For an unsigned type,
22421 turn a divide by a power of 2 into a shift.
22422 <CEIL_DIV_EXPR>: For an unsigned type, use a signed divide if the
22423 size of the mode is lower than DWARF2_ADDR_SIZE; otherwise, do a
22424 typed divide by calling typed_binop_from_tree.
22426 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
22428 * dwarf2out.c (scompare_loc_descriptor): Fix head comment.
22429 (is_handled_procedure_type): Likewise.
22430 (struct loc_descr_context): Add strict_signedness field.
22431 (resolve_args_picking_1): Deal with DW_OP_[GNU_]deref_type,
22432 DW_OP_[GNU_]convert and DW_OP_[GNU_]reinterpret.
22433 (resolve_args_picking): Minor tweak.
22434 (function_to_dwarf_procedure): Initialize strict_signedness field.
22435 (type_byte_size): Likewise.
22436 (field_byte_offset): Likewise.
22437 (gen_descr_array_type_die): Likewise.
22438 (gen_variant_part): Likewise.
22439 (loc_list_from_tree_1) <CALL_EXPR>: Tidy up and set strict_signedness
22440 to true when a context is present before evaluating the arguments.
22441 <COND_EXPR>: Do not generate a useless comparison with zero.
22442 When dereferencing an address, if strict_signedness is true and the
22443 type is small and signed, use DW_OP_deref_type to do the dereference
22444 and then DW_OP_convert to convert back to the generic type.
22446 2021-06-03 Jakub Jelinek <jakub@redhat.com>
22449 * tree-inline.c (copy_tree_body_r): Handle iterators on
22450 OMP_CLAUSE_AFFINITY or OMP_CLAUSE_DEPEND.
22452 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
22454 * config/arc/arc.md (*bbit_di): Remove.
22456 2021-06-02 Christoph Muellner <cmuellner@gcc.gnu.org>
22458 PR rtl-optimization/100264
22459 * ree.c (get_sub_rtx): Ignore SET expressions without register
22460 destinations and remove assertion, as it is not valid anymore
22461 with this new behaviour.
22462 (merge_def_and_ext): Eliminate destination check for register
22463 as such SET expressions can't occur anymore.
22464 (combine_reaching_defs): Likewise.
22466 2021-06-02 Jakub Jelinek <jakub@redhat.com>
22469 * config/xtensa/xtensa.h (LEAF_REG_REMAP): Cast REGNO to int to avoid
22470 -Wtype-limits warnings.
22471 (DWARF_FRAME_REGISTER): Rewrite into ternary operator with addition
22472 in operands to avoid -Wsign-compare warnings.
22474 2021-06-02 Pat Haugen <pthaugen@linux.ibm.com>
22476 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): Use
22479 2021-06-02 Vineet Gupta <vgupta@synopsys.com>
22481 * config/arc/arc.h (TARGET_CPU_DEFAULT): Change to hs38_linux.
22483 2021-06-02 Ilya Leoshkevich <iii@linux.ibm.com>
22485 * config/s390/s390.md(*ashrdi3_31<setcc><cconly>): Use a single
22487 * config/s390/subst.md(cconly_subst): Use a single constraint
22488 in (match_scratch).
22490 2021-06-02 Martin Liska <mliska@suse.cz>
22492 * ipa-icf.h: Use auto_vec for memory_access_types.
22494 2021-06-02 Jeff Law <jeffreyalaw@gmail.com>
22496 * config/h8300/h8300-protos.h (compute_a_shift_length): Drop unused
22497 argument from prototype.
22498 (output_logical_op): Add rtx_code argument.
22499 (compute_logical_op_length): Likewise.
22500 * config/h8300/h8300.c (h8300_and_costs): Pass additional argument
22501 to compute_a_shift_length.
22502 (output_logical_op); New argument with the rtx code rather than
22503 extracting it from an operand. Handle QImode too.
22504 (compute_logical_op_length): Similary.
22505 (compute_a_shift_length): Drop unused argument.
22506 * config/h8300/h8300.md (logicals): New code iterator.
22507 * config/h8300/logical.md (<code><mode>3 expander): Combine
22508 the "and" expander with the "ior"/"xor" expander.
22509 (bclr<mode>msx): Combine the QI/HI mode patterns.
22510 (<logical><mode>3 insns): Use code iterator rather than match_operator.
22511 Handle QImode as well. Update call to output_logical_op and
22512 compute_logical_op_length to pass in rtx_code
22513 Fix split condition on all define_insn_and_split patterns.
22514 (one_cmpl<mode>2<cczn>): Use <cczn> to support both clobbering
22515 the flags and setting ZN via existing define_subst.
22516 * config/h8300/shiftrotate.md: Drop unused argument from
22517 calls to compute_a_shift_length.
22518 Signed-off-by: Jeff Law <jeffreyalaw@gmail.com>
22520 2021-06-01 Andrew Pinski <apinski@marvell.com>
22522 PR tree-optimization/25290
22523 * tree-ssa-phiopt.c (match_simplify_replacement):
22525 (tree_ssa_phiopt_worker): Use match_simplify_replacement.
22526 (two_value_replacement): Change the comment about
22527 conditional_replacement.
22528 (conditional_replacement): Delete.
22530 2021-06-01 Andrew Pinski <apinski@marvell.com>
22532 PR tree-optimization/95481
22533 * tree-tailcall.c (find_tail_calls): Handle empty typed
22536 2021-06-01 Andrew Pinski <apinski@marvell.com>
22538 * gimplify.c (zero_sized_field_decl): Delete
22539 (zero_sized_type): Delete
22540 (gimplify_init_ctor_eval): Use is_empty_type instead
22541 of zero_sized_field_decl.
22542 (gimplify_modify_expr): Use is_empty_type instead of
22545 2021-06-01 Jason Merrill <jason@redhat.com>
22548 * tree.h (CALL_FROM_NEW_OR_DELETE_P): Adjust comment.
22550 2021-06-01 Jason Merrill <jason@redhat.com>
22553 * diagnostic.h (warning_enabled_at): Declare.
22554 * diagnostic.c (diagnostic_enabled): Factor out from...
22555 (diagnostic_report_diagnostic): ...here.
22556 (warning_enabled_at): New.
22558 2021-06-01 Aldy Hernandez <aldyh@redhat.com>
22560 * gimple-ssa-evrp.c: Enable exporting of global ranges.
22562 2021-06-01 Martin Liska <mliska@suse.cz>
22565 * doc/invoke.texi: Mention that -fgcse-after-reload
22566 is enabled with -O3.
22568 2021-06-01 liuhongt <hongtao.liu@intel.com>
22570 PR tree-optimization/98365
22571 * tree-if-conv.c (strip_nop_cond_scalar_reduction): New function.
22572 (is_cond_scalar_reduction): Handle nop_expr in cond scalar reduction.
22573 (convert_scalar_cond_reduction): Ditto.
22574 (predicate_scalar_phi): Ditto.
22576 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22578 PR tree-optimization/100781
22579 * gimple-range-cache.cc (ranger_cache::ranger_cache): Enable new
22580 value calculation by default.
22581 (ranger_cache::enable_new_values): New.
22582 (ranger_cache::disable_new_values): New.
22583 (ranger_cache::push_poor_value): Check if new values are allowed.
22584 * gimple-range-cache.h (class ranger_cache): New member/methods.
22585 * gimple-range.cc (gimple_ranger::range_of_expr): Check for debug
22586 statement, and disable/renable new value calculation.
22588 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22590 * gimple-range-cache.cc (ranger_cache::ssa_range_in_bb): Delete.
22591 (ranger_cache::range_of_def): New.
22592 (ranger_cache::entry_range): New.
22593 (ranger_cache::exit_range): New.
22594 (ranger_cache::range_of_expr): Adjust.
22595 (ranger_cache::range_on_edge): Adjust.
22596 (ranger_cache::propagate_cache): Call exit_range directly.
22597 * gimple-range-cache.h (class ranger_cache): Adjust.
22599 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22601 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust for
22602 gori_compute being a member rather than base class.
22603 dervied call to member call.
22604 (ranger_cache::dump): No longer dump gori_map.
22605 (ranger_cache::dump_bb): New.
22606 (ranger_cache::get_non_stale_global_range): Adjust for gori_compute
22607 being a member rather than base class.
22608 (ranger_cache::set_global_range): Ditto.
22609 (ranger_cache::ssa_range_in_bb): Ditto.
22610 (ranger_cache::range_of_expr): New.
22611 (ranger_cache::range_on_edge): New.
22612 (ranger_cache::block_range): Adjust for gori_computes. Debug changes.
22613 (ranger_cache::propagate_cache): Adjust debugging output.
22614 (ranger_cache::fill_block_cache): Adjust for gori_computes. Debug
22616 * gimple-range-cache.h (class ranger_cache): Make gori_compute a
22617 member, and inherit from range_query instead.
22618 (ranger_cache::dump_bb): New. split from dump.
22619 * gimple-range-gori.cc (gori_compute::ssa_range_in_bb): Delete.
22620 (gori_compute::expr_range_at_stmt): Delete.
22621 (gori_compute::compute_name_range_op): Delete.
22622 (gori_compute::compute_operand_range_switch): Add fur_source.
22623 (gori_compute::compute_operand_range): Add fur_source param, inline
22624 old compute_name_range_op and optimize_logical_operands.
22625 (struct tf_range): Delete.
22626 (gori_compute::logical_combine): Adjust
22627 (gori_compute::optimize_logical_operands): Delete.
22628 (gori_compute::compute_logical_operands_in_chain): Delete.
22629 (gori_compute::compute_logical_operands): Adjust.
22630 (gori_compute::compute_operand1_range): Adjust to fur_source.
22631 (gori_compute::compute_operand2_range): Ditto.
22632 (gori_compute::compute_operand1_and_operand2_range): Ditto.
22633 (gori_compute::outgoing_edge_range_p): Add range_query parameter,
22634 and adjust to fur_source.
22635 * gimple-range-gori.h (class gori_compute): Simplify and adjust to
22636 range_query and fur_source.
22637 * gimple-range.cc (gimple_ranger::range_on_edge): Query range_on_edge
22638 from the ranger_cache..
22639 (gimple_ranger::fold_range_internal): Adjust to base class change of
22641 (gimple_ranger::dump_bb): Adjust dump.
22642 * gimple-range.h (gimple_ranger):export gori computes object.
22644 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22646 PR tree-optimization/100774
22647 * gimple-range-cache.cc (ranger_cache::get_non_stale_global_range):
22648 Constant values are also not stale.
22649 (ranger_cache::set_global_range): Range invariant values should also
22650 have the correct timestamp.
22652 2021-05-31 Martin Liska <mliska@suse.cz>
22654 * tree-streamer-in.c (unpack_ts_function_decl_value_fields):
22655 Unpack FUNCTION_DECL_DECL_TYPE.
22656 * tree-streamer-out.c (pack_ts_function_decl_value_fields):
22657 Stream FUNCTION_DECL_DECL_TYPE instead of
22658 DECL_IS_OPERATOR_NEW_P.
22659 * tree.h (set_function_decl_type): Use FUNCTION_DECL_DECL_TYPE
22661 (DECL_IS_OPERATOR_NEW_P): Likewise.
22662 (DECL_IS_OPERATOR_DELETE_P): Likewise.
22663 (DECL_LAMBDA_FUNCTION_P): Likewise.
22665 2021-05-31 Richard Biener <rguenther@suse.de>
22668 * internal-fn.c (expand_SHUFFLEVECTOR): Define.
22669 * internal-fn.def (SHUFFLEVECTOR): New.
22670 * internal-fn.h (expand_SHUFFLEVECTOR): Declare.
22671 * doc/extend.texi: Document __builtin_shufflevector.
22673 2021-05-31 Peter Bergner <bergner@linux.ibm.com>
22676 * config/rs6000/predicates.md(mma_assemble_input_operand): Allow
22677 indexed form addresses.
22679 2021-05-29 Jeff Law <jlaw@tachyum.com>
22681 * config/h8300/h8300.c (h8300_emit_stack_adjustment): Drop unused
22682 parameter. Call callers fixed.
22684 (output_plussi): Add FALLTHRU markers.
22685 (h8300_shift_needs_scratch_p): Add gcc_unreachable marker.
22687 2021-05-29 Jakub Jelinek <jakub@redhat.com>
22689 PR middle-end/99928
22690 * gimplify.c (gimplify_scan_omp_clauses): For taskloop simd
22691 combined with parallel, make sure to add shared clause to
22692 parallel for explicit linear clause.
22694 2021-05-29 Aldy Hernandez <aldyh@redhat.com>
22696 PR tree-optimization/100787
22697 * gimple-ssa-evrp.c: Disable exporting of global ranges.
22699 2021-05-28 Jason Merrill <jason@redhat.com>
22701 * tree-iterator.h (struct tree_stmt_iterator): Add operator++,
22702 operator--, operator*, operator==, and operator!=.
22703 (class tsi_range): New.
22705 2021-05-28 Richard Biener <rguenther@suse.de>
22707 PR tree-optimization/100778
22708 * tree-vect-slp.c (vect_build_slp_tree_1): Prevent possibly
22709 trapping ops in different BBs.
22711 2021-05-28 Richard Biener <rguenther@suse.de>
22714 * tree-inline.c (copy_bb): When processing __builtin_va_arg_pack
22715 copy fntype from original call.
22717 2021-05-28 Martin Liska <mliska@suse.cz>
22719 PR gcov-profile/100751
22720 * doc/gcov.texi: Revert partially a hunk that was wrong.
22722 2021-05-28 Cooper Qu <cooper.qu@linux.alibaba.com>
22724 * config/csky/csky-linux-elf.h (HAVE_sync_compare_and_swapqi):
22726 (HAVE_sync_compare_and_swaphi): Likewise.
22727 (HAVE_sync_compare_and_swapsi): Likewise.
22729 2021-05-28 Jakub Jelinek <jakub@redhat.com>
22731 PR middle-end/99928
22732 * tree.h (OMP_CLAUSE_MAP_IMPLICIT): Define.
22734 2021-05-28 Tobias Burnus <tobias@codesourcery.com>
22736 * gimplify.c (gimplify_omp_affinity): New.
22737 (gimplify_scan_omp_clauses): Call it; remove affinity clause afterwards.
22738 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_AFFINITY.
22739 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_AFFINITY.
22740 * tree.c (omp_clause_num_ops, omp_clause_code_name): Add clause.
22741 (walk_tree_1): Handle OMP_CLAUSE_AFFINITY.
22743 2021-05-28 Joern Rennecke <joern.rennecke@riscy-ip.com>
22744 Richard Biener <rguenther@suse.de>
22746 * match.pd <popcount & / + pattern matching>:
22747 When generating popcount directly fails, try doing it in two halves.
22749 2021-05-28 Bernd Edlinger <bernd.edlinger@hotmail.de>
22751 * Makefile.in (generated_files): Add gimple-match.c and
22754 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
22756 * gensupport.c (alter_predicate_for_insn): Handle MATCH_DUP.
22758 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
22760 * gensupport.c (alter_constraints): Add MATCH_SCRATCH case.
22762 2021-05-28 Kewen Lin <linkw@linux.ibm.com>
22764 PR tree-optimization/99398
22765 * tree-ssa-forwprop.c (simplify_permutation): Optimize some cases
22766 where the fed operands are CTOR/CST and propagated through
22767 VIEW_CONVERT_EXPR. Call vec_perm_indices::new_shrunk_vector.
22768 * vec-perm-indices.c (vec_perm_indices::new_shrunk_vector): New
22770 * vec-perm-indices.h (vec_perm_indices::new_shrunk_vector): New
22773 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
22775 * config/i386/mmx.md (addv2sf3): Do not call
22776 ix86_fixup_binary_operands_no_copy.
22779 (<smaxmin:code>v2sf3): Ditto.
22780 (<plusminus:insn><MMXMODEI:mode>3): Ditto.
22781 (<plusminus:insn><VI_32:mode>3): Remove expander.
22782 (<plusminus:insn><VI_32:mode>3): Rename from
22783 "*<plusminus:insn><VI_32:mode>3".
22784 (mulv4hi): Do not call ix86_fixup_binary_operands_no_copy.
22785 (mulv2hi3): Remove expander.
22786 (mulv2hi3): Rename from *mulv2hi3.
22787 (<s>mulv2hi3_highpart): Remove expander.
22788 (<s>mulv2hi3_highpart): Rename from *<s>mulv2hi3_highpart.
22789 (<smaxmin:code><MMXMODE14:mode>3): Rename from
22790 "*mmx_<smaxmin:code><MMXMODE14:mode>3".
22791 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Remove expander.
22792 (SMAXMIN_MMXMODEI): Remove mode iterator.
22793 (<smaxmin:code>v4hi3): New expander.
22794 (<smaxmin:code>v4qi3): Rename from *<smaxmin:code>v4qi3.
22795 (<smaxmin:code>v2hi3): Rename from *<smaxmin:code>v2hi3.
22796 (<smaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
22797 (SMAXMIN_VI_32): Remove mode iterator.
22798 (<umaxmin:code><MMXMODE24:mode>3): Rename from
22799 "*mmx_<umaxmin:code><MMXMODE24:mode>3".
22800 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Remove expander.
22801 (UMAXMIN_MMXMODEI): Remove mode iterator.
22802 (<umaxmin:code>v8qi3): New expander.
22803 (<umaxmin:code>v4qi3): Rename from *<umaxmin:code>v4qi3.
22804 (<umaxmin:code>v2hi3): Rename from *<umaxmin:code>v2hi3.
22805 (<umaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
22806 (UMAXMIN_VI_32): Remove mode iterator.
22807 (<any_shift:insn>v2hi3): Remove expander.
22808 (<any_shift:insn>v2hi3): Rename from *<any_shift:insn>v2hi3.
22809 (<any_logic:code><MMXMODEI:mode>3): Do not call
22810 ix86_fixup_binary_operands_no_copy.
22811 (<any_logic:code><VI_32:mode>3): Remove expander.
22812 (<any_logic:code><VI_32:mode>3): Rename from
22813 "*<any_logic:code><VI_32:mode>3".
22814 (uavg<mode>3_ceil): Do not call ix86_fixup_binary_operands_no_copy.
22815 * config/i386/sse.md (div<VF2:mode>3): Do not call
22816 ix86_fixup_binary_operands_no_copy.
22817 (div<VF1:mode>3): Ditto.
22818 (<maxmin:code><VI8_AVX2_AVX512F:mode>3): Ditto.
22819 (smulhrsv4hi3): Ditto.
22820 (smulhrsv2hi3): Ditto.
22822 2021-05-27 Martin Sebor <msebor@redhat.com>
22824 * ggc.h (gt_ggc_mx): Add overloads for all integers.
22826 * hash-map.h (class hash_map): Add pch_nx_helper overloads for all
22828 (hash_map::operator==): New function.
22830 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
22833 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
22834 For TARGET_XOP bypass SSE comparisons for all supported vector modes.
22835 * config/i386/mmx.md (*xop_maskcmp<MMXMODEI:mode>3): New insn pattern.
22836 (*xop_maskcmp<VI_32:mode>3): Ditto.
22837 (*xop_maskcmp_uns<MMXMODEI:mode>3): Ditto.
22838 (*xop_maskcmp_uns<VI_32:mode>3): Ditto.
22840 2021-05-27 Richard Earnshaw <rearnsha@arm.com>
22843 * config/arm/arm.c (arm_configure_build_target): Remove parameter
22844 opts_set, directly check opts parameters for being non-null.
22845 (arm_option_restore): Update call to arm_configure_build_target.
22846 (arm_option_override): Likewise.
22847 (arm_can_inline_p): Likewise.
22848 (arm_valid_target_attribute_tree): Likewise.
22849 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
22850 * config/arm/arm-protos.h (arm_configure_build_target): Adjust
22853 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
22855 * vr-values.c (simplify_conversion_using_ranges): Use
22856 get_range_query instead of get_global_range_query.
22858 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
22860 * gimple-range.cc (get_range_global): Move to value-query.cc.
22861 (gimple_range_global): Same.
22862 (get_global_range_query): Same.
22863 (global_range_query::range_of_expr): Same.
22864 * gimple-range.h (class global_range_query): Move to
22866 (gimple_range_global): Same.
22867 * tree-ssanames.c (get_range_info): Move to value-query.cc.
22868 (get_ptr_nonnull): Same.
22869 * tree-ssanames.h (get_range_info): Remove.
22870 (get_ptr_nonnull): Remove.
22871 * value-query.cc (get_ssa_name_range_info): Move from
22873 (get_ssa_name_ptr_info_nonnull): Same.
22874 (get_range_global): Move from gimple-range.cc.
22875 (gimple_range_global): Same.
22876 (get_global_range_query): Same.
22877 (global_range_query::range_of_expr): Same.
22878 * value-query.h (class global_range_query): Move from
22880 (gimple_range_global): Same.
22882 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
22885 * config/i386/mmx.md (uavgv4qi3_ceil): New insn pattern.
22886 (uavgv2hi3_ceil): Ditto.
22888 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
22891 * doc/extend.texi (scalar_storage_order): Rephrase slightly.
22893 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
22895 * tree-ssanames.c (get_range_info): Merge both copies of
22896 get_range_info into one that works with irange.
22897 * tree-ssanames.h (get_range_info): Remove version that works on
22900 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
22902 * builtins.c (check_nul_terminated_array): Convert to get_range_query.
22903 (expand_builtin_strnlen): Same.
22904 (determine_block_size): Same.
22905 * fold-const.c (expr_not_equal_to): Same.
22906 * gimple-fold.c (size_must_be_zero_p): Same.
22907 * gimple-match-head.c: Include gimple-range.h.
22908 * gimple-pretty-print.c (dump_ssaname_info): Convert to get_range_query.
22909 * gimple-ssa-warn-restrict.c
22910 (builtin_memref::extend_offset_range): Same.
22911 * graphite-sese-to-poly.c (add_param_constraints): Same.
22912 * internal-fn.c (get_min_precision): Same.
22913 * ipa-fnsummary.c (set_switch_stmt_execution_predicate): Same.
22914 * ipa-prop.c (ipa_compute_jump_functions_for_edge): Same.
22916 * tree-data-ref.c (split_constant_offset): Same.
22917 (dr_step_indicator): Same.
22918 * tree-dfa.c (get_ref_base_and_extent): Same.
22919 * tree-scalar-evolution.c (iv_can_overflow_p): Same.
22920 * tree-ssa-loop-niter.c (refine_value_range_using_guard): Same.
22921 (determine_value_range): Same.
22922 (record_nonwrapping_iv): Same.
22923 (infer_loop_bounds_from_signedness): Same.
22924 (scev_var_range_cant_overflow): Same.
22925 * tree-ssa-phiopt.c (two_value_replacement): Same.
22926 * tree-ssa-pre.c (insert_into_preds_of_block): Same.
22927 * tree-ssa-reassoc.c (optimize_range_tests_to_bit_test): Same.
22928 * tree-ssa-strlen.c (handle_builtin_stxncpy_strncat): Same.
22930 (dump_strlen_info): Same.
22931 (set_strlen_range): Same.
22932 (maybe_diag_stxncpy_trunc): Same.
22933 (get_len_or_size): Same.
22934 (handle_integral_assign): Same.
22935 * tree-ssa-structalias.c (find_what_p_points_to): Same.
22936 * tree-ssa-uninit.c (find_var_cmp_const): Same.
22937 * tree-switch-conversion.c (bit_test_cluster::emit): Same.
22938 * tree-vect-patterns.c (vect_get_range_info): Same.
22939 (vect_recog_divmod_pattern): Same.
22940 * tree-vrp.c (intersect_range_with_nonzero_bits): Same.
22941 (register_edge_assert_for_2): Same.
22942 (determine_value_range_1): Same.
22943 * tree.c (get_range_pos_neg): Same.
22944 * vr-values.c (vr_values::get_lattice_entry): Same.
22945 (vr_values::update_value_range): Same.
22946 (simplify_conversion_using_ranges): Same.
22948 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
22950 * gimple-ssa-warn-alloca.c (alloca_call_type): Use
22951 get_range_query instead of query argument.
22952 (pass_walloca::execute): Enable and disable global ranger.
22954 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
22956 * gimple-ssa-evrp.c (rvrp_folder::rvrp_folder): Call
22958 (rvrp_folder::~rvrp_folder): Call disable_ranger.
22959 (hybrid_folder::hybrid_folder): Call enable_ranger.
22960 (hybrid_folder::~hybrid_folder): Call disable_ranger.
22962 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
22964 * function.c (allocate_struct_function): Set cfun->x_range_query.
22965 * function.h (struct function): Declare x_range_query.
22966 (get_range_query): New.
22967 (get_global_range_query): New.
22968 * gimple-range-cache.cc (ssa_global_cache::ssa_global_cache):
22969 Remove call to safe_grow_cleared.
22970 * gimple-range.cc (get_range_global): New.
22971 (gimple_range_global): Move from gimple-range.h.
22972 (get_global_range_query): New.
22973 (global_range_query::range_of_expr): New.
22974 (enable_ranger): New.
22975 (disable_ranger): New.
22976 * gimple-range.h (gimple_range_global): Move to gimple-range.cc.
22977 (class global_range_query): New.
22978 (enable_ranger): New.
22979 (disable_ranger): New.
22980 * gimple-ssa-evrp.c (evrp_folder::~evrp_folder): Rename
22981 dump_all_value_ranges to dump.
22982 * tree-vrp.c (vrp_prop::finalize): Same.
22983 * value-query.cc (range_query::dump): New.
22984 * value-query.h (range_query::dump): New.
22985 * vr-values.c (vr_values::dump_all_value_ranges): Rename to...
22986 (vr_values::dump): ...this.
22987 * vr-values.h (class vr_values): Rename dump_all_value_ranges to
22988 dump and make virtual.
22990 2021-05-26 Uroš Bizjak <ubizjak@gmail.com>
22992 * config/i386/i386.c (ix86_autovectorize_vector_modes):
22993 Add V4QImode and V16QImode for TARGET_SSE2.
22994 * doc/sourcebuild.texi (Vector-specific attributes):
22995 Add vect64 and vect32 description.
22997 2021-05-26 Bernd Edlinger <bernd.edlinger@hotmail.de>
22999 * gimple-range-gori.cc (range_def_chain::register_dependency):
23000 Resize m_def_chain when needed.
23002 2021-05-26 Christophe Lyon <christophe.lyon@linaro.org>
23004 * config/arm/mve.md (mve_vaddvq_<supf><mode>): Prefix with '@'.
23005 * config/arm/neon.md (reduc_plus_scal_<mode>): Move to ..
23006 * config/arm/vec-common.md: .. here. Add support for MVE.
23008 2021-05-26 Jakub Jelinek <jakub@redhat.com>
23010 * config/epiphany/epiphany.c (epiphany_print_operand_address): Remove
23012 * config/microblaze/microblaze.c (microblaze_legitimize_address,
23014 microblaze_option_override, print_operand): Likewise.
23015 * config/microblaze/microblaze.md (call_internal_plt,
23016 call_value_intern_plt, call_value_intern): Likewise.
23017 * config/arm/aout.h (ASM_OUTPUT_ALIGN): Likewise.
23018 * config/iq2000/iq2000.md (call_internal1, call_value_internal1,
23019 call_value_multiple_internal1): Likewise.
23020 * config/bfin/bfin.c (symbolic_reference_mentioned_p): Likewise.
23022 2021-05-26 Jan-Benedict Glaw <jbglaw@lug-owl.de>
23024 * config/arc/arc.c (arc_address_cost, arc_print_operand_address,
23025 arc_ccfsm_advance, symbolic_reference_mentioned_p,
23026 arc_raw_symbolic_reference_mentioned_p): Remove register
23029 2021-05-26 Jakub Jelinek <jakub@redhat.com>
23032 * omp-low.c: Include omp-offload.h.
23033 (create_omp_child_function): If current_function_decl has
23034 "omp declare target" attribute and is_gimple_omp_offloaded,
23035 remove that attribute from the copy of attribute list and
23036 add "omp target entrypoint" attribute instead.
23037 (lower_omp_target): Mark .omp_data_sizes.* and .omp_data_kinds.*
23038 variables for offloading if in omp_maybe_offloaded_ctx.
23039 * omp-offload.c (pass_omp_target_link::execute): Nullify second
23040 argument to GOMP_target_data_ext in offloaded code.
23042 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23044 * config/csky/csky.c (csky_can_change_mode_class): Delete.
23045 For csky, HF/SF mode use the low bits of VREGS.
23047 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
23049 * gimplify.c (gimplify_decl_expr): Do not clear TREE_READONLY on a
23050 DECL which is a reference for OMP.
23052 2021-05-26 Martin Liska <mliska@suse.cz>
23054 PR gcov-profile/100751
23055 * doc/gcov.texi: Document that __gcov_dump can be called just
23056 once and that __gcov_reset resets run-time counters.
23058 2021-05-26 Martin Liska <mliska@suse.cz>
23060 * doc/install.texi: Port relevant part from install-old.texi
23061 and re-generate list of CPUs and systems.
23063 2021-05-26 Martin Liska <mliska@suse.cz>
23065 * Makefile.in: Remove it.
23066 * doc/include/fdl.texi: Update next/previous chapters.
23067 * doc/install.texi: Likewise.
23068 * doc/install-old.texi: Removed.
23070 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23072 * config/csky/csky.c (ck810_legitimate_index_p): Support
23073 "base + index" with DF mode.
23074 * config/csky/constraints.md ("Y"): New constraint for memory operands
23075 without index register.
23076 * config/csky/csky_insn_fpuv2.md (fpuv3_movdf): Use "Y" instead of "m"
23077 when mov between memory and general registers, and lower their priority.
23078 * config/csky/csky_insn_fpuv3.md (fpuv2_movdf): Likewise.
23080 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23082 * config/csky/csky.c (TARGET_PROMOTE_PROTOTYPES): Delete.
23084 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23086 * config/csky/csky.md (untyped_call): Emit clobber for return
23087 registers to mark them used.
23089 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
23091 * config/csky/csky.md (cskyv2_sextend_ldbs): New.
23093 2021-05-26 Andrew Pinski <apinski@marvell.com>
23095 * match.pd (x < 0 ? ~y : y): New patterns.
23097 2021-05-26 Andrew Pinski <apinski@marvell.com>
23099 * match.pd (A?CST1:CST2): Add simplifcations for A?0:+-1, A?+-1:0,
23100 A?POW2:0 and A?0:POW2.
23102 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23104 * gimple-range-gori.cc (class logical_stmt_cache): Delete
23105 (logical_stmt_cache::logical_stmt_cache ): Delete.
23106 (logical_stmt_cache::~logical_stmt_cache): Delete.
23107 (logical_stmt_cache::cache_entry::dump): Delete.
23108 (logical_stmt_cache::get_range): Delete.
23109 (logical_stmt_cache::cached_name ): Delete.
23110 (logical_stmt_cache::same_cached_name): Delete.
23111 (logical_stmt_cache::cacheable_p): Delete.
23112 (logical_stmt_cache::slot_diagnostics ): Delete.
23113 (logical_stmt_cache::dump): Delete.
23114 (gori_compute_cache::gori_compute_cache): Delete.
23115 (gori_compute_cache::~gori_compute_cache): Delete.
23116 (gori_compute_cache::compute_operand_range): Delete.
23117 (gori_compute_cache::cache_stmt): Delete.
23118 * gimple-range-gori.h (gori_compute::compute_operand_range): Remove
23120 (class gori_compute_cache): Delete.
23122 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23124 * gimple-range.cc (fold_using_range::range_of_range_op): Use m_gori
23126 (fold_using_range::range_of_address): Adjust.
23127 (fold_using_range::range_of_phi): Adjust.
23128 * gimple-range.h (class fur_source): Adjust.
23129 (fur_source::fur_source): Adjust.
23131 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23133 * gimple-range-gori.cc (gori_compute::expr_range_at_stmt): Rename
23134 from expr_range_in_bb and adjust.
23135 (gori_compute::compute_name_range_op): Adjust.
23136 (gori_compute::optimize_logical_operands): Adjust.
23137 (gori_compute::compute_logical_operands_in_chain): Adjust.
23138 (gori_compute::compute_operand1_range): Adjust.
23139 (gori_compute::compute_operand2_range): Adjust.
23140 (ori_compute_cache::cache_stmt): Adjust.
23141 * gimple-range-gori.h (gori_compute): Rename prototype.
23143 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23145 * gimple-range.cc (gimple_ranger::range_of_expr): Non-null should be
23146 checked only after range_of_stmt, not range_on_entry.
23147 (gimple_ranger::range_on_entry): Check for non-null in any
23148 predecessor block, if it is not already non-null.
23149 (gimple_ranger::range_on_exit): DOnt check for non-null after
23150 range on entry call.
23151 (gimple_ranger::dump_bb): New. Split from dump.
23152 (gimple_ranger::dump): Adjust.
23153 * gimple-range.h (class gimple_ranger): Adjust.
23155 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23157 * gimple-range-cache.cc (struct range_timestamp): Delete.
23158 (class temporal_cache): Adjust.
23159 (temporal_cache::get_timestamp): Delete.
23160 (temporal_cache::set_dependency): Delete.
23161 (temporal_cache::temporal_value): Adjust.
23162 (temporal_cache::current_p): Take dependencies as params.
23163 (temporal_cache::set_timestamp): Adjust.
23164 (temporal_cache::set_always_current): Adjust.
23165 (ranger_cache::get_non_stale_global_range): Adjust.
23166 (ranger_cache::register_dependency): Delete.
23167 * gimple-range-cache.h (class range_cache): Adjust.
23169 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23171 * gimple-range-gori.cc (range_def_chain::range_def_chain): init
23173 (range_def_chain::~range_def_chain): Dispose of obstack rather than
23174 each individual bitmap.
23175 (range_def_chain::set_import): New.
23176 (range_def_chain::get_imports): New.
23177 (range_def_chain::chain_import_p): New.
23178 (range_def_chain::register_dependency): Rename from build_def_chain
23180 (range_def_chain::def_chain_in_bitmap_p): New.
23181 (range_def_chain::add_def_chain_to_bitmap): New.
23182 (range_def_chain::has_def_chain): Just check first depenedence.
23183 (range_def_chain::get_def_chain): Process imports, use generic
23184 register_dependency routine.
23185 (range_def_chain::dump): New.
23186 (gori_map::gori_map): Allocate import list.
23187 (gori_map::~gori_map): Release imports.
23188 (gori_map::exports): Check for past allocated block size.
23189 (gori_map::imports): New.
23190 (gori_map::def_chain_in_export_p): Delete.
23191 (gori_map::is_import_p): New.
23192 (gori_map::maybe_add_gori): Handle imports.
23193 (gori_map::dump): Adjust output, add imports.
23194 (gori_compute::has_edge_range_p): Remove def_chain_in_export call.
23195 (gori_export_iterator::gori_export_iterator): New.
23196 (gori_export_iterator::next): New.
23197 (gori_export_iterator::get_name): New.
23198 * gimple-range-gori.h (range_def_chain): Add imports and direct
23199 dependecies via struct rdc.
23200 (range_def_chain::depend1): New.
23201 (range_def_chain::depend2): New.
23202 (class gori_map): Adjust.
23203 (FOR_EACH_GORI_IMPORT_NAME): New.
23204 (FOR_EACH_GORI_EXPORT_NAME): New.
23205 (class gori_export_iterator): New.
23207 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23209 * gimple-range-cache.cc (ranger_cache::ranger_cache): Move initial
23210 export cache filling to here.
23211 * gimple-range-gori.cc (gori_compute::gori_compute) : From Here.
23213 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
23215 * gimple-range-gori.cc (range_def_chain): Move to gimple-range-gori.h.
23216 (gori_map): Move to gimple-range-gori.h.
23217 (gori_compute::gori_compute): Adjust.
23218 (gori_compute::~gori_compute): Delete.
23219 (gori_compute::compute_operand_range_switch): Adjust.
23220 (gori_compute::compute_operand_range): Adjust.
23221 (gori_compute::compute_logical_operands): Adjust.
23222 (gori_compute::has_edge_range_p ): Adjust.
23223 (gori_compute::set_range_invariant): Delete.
23224 (gori_compute::dump): Adjust.
23225 (gori_compute::outgoing_edge_range_p): Adjust.
23226 * gimple-range-gori.h (class range_def_chain): Relocate here.
23227 (class gori_map): Relocate here.
23228 (class gori_compute): Inherit from gori_map, and adjust.
23230 2021-05-25 Aldy Hernandez <aldyh@redhat.com>
23232 * value-range.cc (range_tests_legacy): Use
23233 build_nonstandard_integer_type instead of int and short.
23235 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
23237 * gimplify.c (gimplify_decl_expr): Clear TREE_READONLY on the DECL
23238 when really creating an initialization statement for it.
23240 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
23242 * tree-inline.c (setup_one_parameter): Fix thinko in new condition.
23244 2021-05-25 Kito Cheng <kito.cheng@sifive.com>
23246 * config/riscv/riscv.h (ASM_SPEC): Pass -mno-relax.
23248 2021-05-25 Martin Liska <mliska@suse.cz>
23250 PR tree-optimization/92860
23252 * optc-save-gen.awk: Remove exceptions.
23254 2021-05-25 Martin Liska <mliska@suse.cz>
23256 * asan.h (sanitize_coverage_p): New function.
23257 * doc/extend.texi: Document it.
23258 * fold-const.c (fold_range_test): Use sanitize_flags_p
23259 instead of flag_sanitize_coverage.
23260 (fold_truth_andor): Likewise.
23261 * sancov.c: Likewise.
23262 * tree-ssa-ifcombine.c (ifcombine_ifandif): Likewise.
23263 * ipa-inline.c (sanitize_attrs_match_for_inline_p): Handle
23264 -fsanitize-coverage when inlining.
23266 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
23268 * config/csky/csky-modes.def : Fix copyright.
23270 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
23272 * config/csky/csky-modes.def : Amend copyright.
23273 * config/csky/csky_insn_fpuv2.md : Likewise.
23274 * config/csky/csky_insn_fpuv3.md : Likewise.
23276 2021-05-25 Richard Biener <rguenther@suse.de>
23278 PR middle-end/100727
23279 * calls.c (initialize_argument_information): Explicitely test
23280 for WITH_SIZE_EXPR.
23281 * gimple-expr.c (mark_addressable): Skip outer WITH_SIZE_EXPR.
23283 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
23285 * config/csky/csky.h (FRAME_POINTER_REGNUM): Use
23286 HARD_FRAME_POINTER_REGNUM and FRAME_POINTER_REGNUM instead of
23287 the signle definition. The signle definition may not work well
23288 at simplify_subreg_regno().
23289 (HARD_FRAME_POINTER_REGNUM): New.
23290 (ELIMINABLE_REGS): Add for HARD_FRAME_POINTER_REGNUM.
23291 * config/csky/csky.c (get_csky_live_regs, csky_can_eliminate,
23292 csky_initial_elimination_offset, csky_expand_prologue,
23293 csky_expand_epilogue): Add for HARD_FRAME_POINTER_REGNUM.
23295 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
23297 * config/csky/csky.c (csky_option_override):
23298 Init csky_arch_isa_features[] in advance, so TARGET_DSP
23299 and TARGET_DIV can be set well.
23301 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
23303 * config/csky/constraints.md ("l", "h"): Delete.
23304 * config/csky/csky.h (reg_class, REG_CLASS_NAMES,
23305 REG_CLASS_CONTENTS): Delete LO_REGS and HI_REGS.
23306 * config/csky/csky.c (regno_reg_classm,
23307 csky_secondary_reload, csky_register_move_cost):
23308 Use HILO_REGS instead of LO_REGS and HI_REGS.
23310 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
23312 * config/csky/constraints.md ("W"): New constriant for mem operand
23313 with base reg, index register.
23314 ("Q"): Renamed and modified "csky_valid_fpuv2_mem_operand" to
23315 "csky_valid_mem_constraint_operand" to deal with both "Q" and "W"
23317 ("Dv"): New constraint for const double value that can be used at
23319 * config/csky/csky-modes.def (HFmode): New mode.
23320 * config/csky/csky-protos.h (csky_valid_fpuv2_mem_operand): Rename
23321 to "csky_valid_mem_constraint_operand" and support new constraint
23323 (csky_get_movedouble_length): New.
23324 (fpuv3_output_move): New.
23325 (fpuv3_const_double): New.
23326 * config/csky/csky.c (csky_option_override): New arch CK860 with fpv3.
23327 (decompose_csky_address): Refine.
23328 (csky_print_operand): New "CONST_DOUBLE" operand.
23329 (csky_output_move): Support fpv3 instructions.
23330 (csky_get_movedouble_length): New.
23331 (fpuv3_output_move): New.
23332 (fpuv3_const_double): New.
23333 (csky_emit_compare): Cover float comparsion.
23334 (csky_emit_compare_float): Refine.
23335 (csky_vaild_fpuv2_mem_operand): Rename to
23336 "csky_valid_mem_constraint_operand" and support new constraint "W".
23337 (ck860_rtx_costs): New.
23338 (csky_rtx_costs): Add the cost calculation of CK860.
23339 (regno_reg_class): New vregs for fpuv3.
23340 (csky_dbx_regno): Likewise.
23341 (csky_cpu_cpp_builtins): New builtin macro for fpuv3.
23342 (csky_conditional_register_usage): Suporrot fpuv3.
23343 (csky_dwarf_register_span): Suporrot fpuv3.
23344 (csky_init_builtins, csky_mangle_type): Support "__fp16" type.
23345 (ck810_legitimate_index_p): Support fp16.
23346 * config/csky/csky.h (TARGET_TLS): ADD CK860.
23347 (CSKY_VREG_P, CSKY_VREG_LO_P, CSKY_VREG_HI_P): Support fpuv3.
23348 (TARGET_SINGLE_FPU): Support fpuv3.
23349 (TARGET_SUPPORT_FPV3): New.
23350 (FIRST_PSEUDO_REGISTER): Change to 202 to hold the new fpuv3 registers.
23351 (FIXED_REGISTERS, CALL_REALLY_USED_REGISTERS, REGISTER_NAMES,
23352 REG_CLASS_CONTENTS): Support fpuv3.
23353 * config/csky/csky.md (movsf): Move to cksy_insn_fpu.md and refine.
23354 (csky_movsf_fpv2): Likewise.
23355 (ck801_movsf): Likewise.
23356 (csky_movsf): Likewise.
23358 (csky_movdf_fpv2): Likewise.
23359 (ck801_movdf): Likewise.
23360 (csky_movdf): Likewise.
23361 (movsicc): Refine. Use "comparison_operatior" instead of
23362 "ordered_comparison_operatior".
23363 (addsicc): Likewise.
23364 (CSKY_FIRST_VFP3_REGNUM, CSKY_LAST_VFP3_REGNUM): New constant.
23365 (call_value_internal_vh): New.
23366 * config/csky/csky_cores.def (CK860): New arch and cpu.
23371 * config/csky/csky_insn_fpu.md: Refactor. Separate all float patterns
23372 into emit-patterns and match-patterns, remain the emit-patterns here,
23373 and move the match-patterns to csky_insn_fpuv2.md or
23374 csky_insn_fpuv3.md.
23375 * config/csky/csky_insn_fpuv2.md: New file for fpuv2 instructions.
23376 * config/csky/csky_insn_fpuv3.md: New file and new patterns for fpuv3
23378 * config/csky/csky_isa.def (fcr): New.
23383 (CK860): New definition for ck860.
23384 * config/csky/csky_tables.opt (ck860): New processors ck860,
23385 ck860f. And new arch ck860.
23390 * config/csky/predicates.md (csky_float_comparsion_operator): Delete
23391 "geu", "gtu", "leu", "ltu", which will never appear at float comparison.
23392 * config/csky/t-csky-elf: Support 860.
23393 * config/csky/t-csky-linux: Likewise.
23394 * doc/md.texi: Add "Q" and "W" constraints for C-SKY.
23396 2021-05-24 Aaron Sawdey <acsawdey@linux.ibm.com>
23398 * config/rs6000/genfusion.pl (gen_logical_addsubf): Refactor to
23399 add generation of logical-add and add-logical fusion pairs.
23400 * config/rs6000/rs6000-cpus.def: Add new fusion to ISA 3.1 mask
23402 * config/rs6000/rs6000.c (rs6000_option_override_internal): Turn on
23403 logical-add and add-logical fusion by default.
23404 * config/rs6000/rs6000.opt: Add -mpower10-fusion-logical-add and
23405 -mpower10-fusion-add-logical options.
23406 * config/rs6000/fusion.md: Regenerate file.
23408 2021-05-24 Aldy Hernandez <aldyh@redhat.com>
23410 * value-range.cc (irange::legacy_equal_p): Check type when
23411 comparing VR_VARYING types.
23412 (range_tests_legacy): Test comparing VARYING ranges of different
23415 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
23417 * config/aarch64/aarch64.c (neoversen1_tunings):
23418 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
23420 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
23422 * config/aarch64/aarch64.c (aarch64_classify_symbol): Use GOT for
23423 extern weak symbols. Limit symbol offsets for non-GOT symbols with
23426 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
23428 * config/arm/neon.md (vec_load_lanesxi<mode>)
23429 (vec_store_lanexoi<mode>): Move ...
23430 * config/arm/vec-common.md: here.
23432 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
23434 * config/arm/neon.md (vec_load_lanesoi<mode>)
23435 (vec_store_lanesoi<mode>): Move ...
23436 * config/arm/vec-common.md: here.
23438 2021-05-24 liuhongt <hongtao.liu@intel.com>
23441 * config/i386/i386.c (ix86_gimple_fold_builtin): Replacing
23442 stmt with GIMPLE_NOP when lhs doesn't exist.
23444 2021-05-23 Uroš Bizjak <ubizjak@gmail.com>
23447 * config/i386/mmx.md (*push<VI_32:mode>2_rex64):
23448 New instruction pattern.
23449 (*push<VI_32:mode>2): Ditto.
23450 (push splitter for SSE registers): New splitter.
23452 2021-05-23 Andrew Pinski <apinski@marvell.com>
23454 * match.pd ((A & C) != 0 ? D : 0): Limit to non pointer types.
23456 2021-05-22 Aaron Sawdey <acsawdey@linux.ibm.com>
23458 * config/rs6000/genfusion.pl (gen_addadd): Fix incorrect attr types.
23459 * config/rs6000/fusion.md: Regenerate file.
23461 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
23463 * config/rs6000/genfusion.pl (gen_addadd): New function.
23464 * config/rs6000/fusion.md: Regenerate file.
23465 * config/rs6000/rs6000-cpus.def: Add
23466 OPTION_MASK_P10_FUSION_2ADD to masks.
23467 * config/rs6000/rs6000.c (rs6000_option_override_internal):
23468 Handle default value of OPTION_MASK_P10_FUSION_2ADD.
23469 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2add.
23471 2021-05-21 Jakub Jelinek <jakub@redhat.com>
23473 PR middle-end/99928
23474 * tree.h (OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET): Define.
23475 * gimplify.c (enum gimplify_omp_var_data): Fix up
23476 GOVD_MAP_HAS_ATTACHMENTS value, add GOVD_FIRSTPRIVATE_IMPLICIT.
23477 (omp_lastprivate_for_combined_outer_constructs): If combined target
23478 has GOVD_FIRSTPRIVATE_IMPLICIT set for the decl, change it to
23479 GOVD_MAP | GOVD_SEEN.
23480 (gimplify_scan_omp_clauses): Set GOVD_FIRSTPRIVATE_IMPLICIT for
23481 firstprivate clauses with OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT.
23482 (gimplify_adjust_omp_clauses): For firstprivate clauses with
23483 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT either clear that bit and
23484 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET too, or remove it and
23485 let it be replaced by implicit map clause.
23487 2021-05-21 Jakub Jelinek <jakub@redhat.com>
23489 PR middle-end/99928
23490 * gimplify.c (omp_lastprivate_for_combined_outer_constructs): New
23492 (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LASTPRIVATE>: Use it.
23493 (gimplify_omp_for): Likewise.
23495 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23497 PR middle-end/90115
23498 * omp-low.c (oacc_privatization_candidate_p): Reject 'static',
23499 'external' in blocks.
23501 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23503 PR middle-end/90115
23504 * flag-types.h (enum openacc_privatization): New.
23505 * params.opt (-param=openacc-privatization): New.
23506 * doc/invoke.texi (openacc-privatization): Document it.
23507 * omp-general.h (get_openacc_privatization_dump_flags): New
23509 * omp-low.c (oacc_privatization_candidate_p): Add diagnostics.
23510 * omp-offload.c (execute_oacc_device_lower)
23511 <IFN_UNIQUE_OACC_PRIVATE>: Re-work diagnostics.
23512 * target.def (goacc.adjust_private_decl): Add 'location_t'
23514 * doc/tm.texi: Regenerate.
23515 * config/gcn/gcn-protos.h (gcn_goacc_adjust_private_decl): Adjust.
23516 * config/gcn/gcn-tree.c (gcn_goacc_adjust_private_decl): Likewise.
23517 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl):
23518 Likewise. Preserve it for...
23519 (nvptx_goacc_expand_var_decl): ... use here.
23521 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23523 * doc/sourcebuild.texi (Other attributes): Document '__OPTIMIZE__'
23526 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23528 PR middle-end/90115
23529 * omp-low.c (oacc_privatization_candidate_p): New function.
23530 (oacc_privatization_scan_clause_chain)
23531 (oacc_privatization_scan_decl_chain): Use it. Also
23532 'gcc_checking_assert' that we're not seeing duplicates.
23534 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23536 PR middle-end/90115
23537 * omp-offload.c (execute_oacc_device_lower): Skip processing if no
23540 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23542 PR middle-end/90115
23543 * omp-offload.c (execute_oacc_device_lower): Explain.
23545 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23547 PR middle-end/90115
23548 * omp-offload.c (execute_oacc_device_lower)
23549 <IFN_UNIQUE_OACC_PRIVATE>: Diagnose and handle for 'level == -1'
23551 * internal-fn.c (expand_UNIQUE): Don't expect
23552 'IFN_UNIQUE_OACC_PRIVATE'.
23554 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23556 PR middle-end/90115
23557 * omp-low.c (lower_omp_for): Don't evaluate OpenMP 'for' clauses.
23559 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23561 PR middle-end/90115
23562 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl)
23563 (nvptx_goacc_expand_var_decl): Tighten.
23565 2021-05-21 Julian Brown <julian@codesourcery.com>
23566 Chung-Lin Tang <cltang@codesourcery.com>
23567 Thomas Schwinge <thomas@codesourcery.com>
23569 PR middle-end/90115
23570 * doc/tm.texi.in (TARGET_GOACC_EXPAND_VAR_DECL)
23571 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Add documentation hooks.
23572 * doc/tm.texi: Regenerate.
23573 * expr.c (expand_expr_real_1): Expand decls using the
23574 expand_var_decl OpenACC hook if defined.
23575 * internal-fn.c (expand_UNIQUE): Handle IFN_UNIQUE_OACC_PRIVATE.
23576 * internal-fn.h (IFN_UNIQUE_CODES): Add OACC_PRIVATE.
23577 * omp-low.c (omp_context): Add oacc_privatization_candidates
23579 (lower_oacc_reductions): Add PRIVATE_MARKER parameter. Insert
23581 (lower_oacc_head_tail): Add PRIVATE_MARKER parameter. Modify
23582 private marker's gimple call arguments, and pass it to
23583 lower_oacc_reductions.
23584 (oacc_privatization_scan_clause_chain)
23585 (oacc_privatization_scan_decl_chain, lower_oacc_private_marker):
23587 (lower_omp_for, lower_omp_target, lower_omp_1): Use these.
23588 * omp-offload.c (convert.h): Include.
23589 (oacc_loop_xform_head_tail): Treat private-variable markers like
23590 fork/join when transforming head/tail sequences.
23591 (struct var_decl_rewrite_info): Add struct.
23592 (oacc_rewrite_var_decl, is_sync_builtin_call): New functions.
23593 (execute_oacc_device_lower): Support rewriting gang-private
23594 variables using target hook, and fix up addr_expr and var_decl
23596 * target.def (adjust_private_decl, expand_var_decl): New hooks.
23597 * config/gcn/gcn-protos.h (gcn_goacc_adjust_gangprivate_decl):
23599 (gcn_goacc_adjust_private_decl): ...this.
23600 * config/gcn/gcn-tree.c (gcn_goacc_adjust_gangprivate_decl):
23602 (gcn_goacc_adjust_private_decl): ...this. Add LEVEL parameter.
23603 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_GANGPRIVATE_DECL): Rename
23604 definition using gcn_goacc_adjust_gangprivate_decl...
23605 (TARGET_GOACC_ADJUST_PRIVATE_DECL): ...to this, using
23606 gcn_goacc_adjust_private_decl.
23607 * config/nvptx/nvptx.c (tree-pretty-print.h): Include.
23608 (gang_private_shared_size): New global variable.
23609 (gang_private_shared_align): Likewise.
23610 (gang_private_shared_sym): Likewise.
23611 (gang_private_shared_hmap): Likewise.
23612 (nvptx_option_override): Initialize these.
23613 (nvptx_file_end): Output gang_private_shared_sym.
23614 (nvptx_goacc_adjust_private_decl, nvptx_goacc_expand_var_decl):
23616 (nvptx_set_current_function): Clear gang_private_shared_hmap.
23617 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Define hook.
23618 (TARGET_GOACC_EXPAND_VAR_DECL): Likewise.
23620 2021-05-21 H.J. Lu <hjl.tools@gmail.com>
23622 * config/i386/i386-modes.def (MAX_BITSIZE_MODE_ANY_INT): Removed.
23624 2021-05-21 Richard Biener <rguenther@suse.de>
23625 H.J. Lu <hjl.tools@gmail.com>
23627 PR middle-end/90773
23628 * expr.c (expand_constructor): Elide expand_constructor if
23629 move by pieces is preferred.
23631 2021-05-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
23633 * config/aarch64/aarch64-builtins.c (aarch64_call_properties):
23634 Take a flag and mode value as arguments.
23635 (aarch64_modifies_global_state_p): Likewise.
23636 (aarch64_reads_global_state_p): Likewise.
23637 (aarch64_could_trap_p): Likewise.
23638 (aarch64_get_attributes): Likewise.
23639 (aarch64_init_simd_builtins): Adjust callsite of above.
23640 (aarch64_init_fcmla_laneq_builtins): Use aarch64_get_attributes to get
23641 function attributes to apply to builtins.
23642 (aarch64_init_crc32_builtins): Likewise.
23643 (aarch64_init_builtin_rsqrt): Likewise.
23645 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
23647 * config/rs6000/rs6000.md (define_attr "type"): Add types for fusion.
23648 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10): Use new fusion types.
23649 (gen_2logical): Use new fusion types.
23650 * config/rs6000/fusion.md: Regenerate.
23652 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
23655 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
23656 Handle V4QI and V2HI modes.
23657 (ix86_expand_sse_movcc): Ditto.
23658 * config/i386/mmx.md (*<sat_plusminus:insn><VI_32:mode>3):
23659 New instruction pattern.
23660 (*eq<VI_32:mode>3): Ditto.
23661 (*gt<VI_32:mode>3): Ditto.
23662 (*xop_pcmov_<VI_32:mode>): Ditto.
23663 (mmx_pblendvb32): Ditto.
23664 (mmx_pblendvb64): Rename from mmx_pblendvb.
23665 (vec_cmp<VI_32:mode><VI_32:mode>): New expander.
23666 (vec_cmpu<VI_32:mode><VI_32:mode>): Ditto.
23667 (vcond<VI_32:mode><VI_32:mode>): Ditto.
23668 (vcondu<VI_32:mode><VI_32:mode>): Ditto.
23669 (vcond_mask_<VI_32:mode><VI_32:mode>): Ditto.
23671 2021-05-21 Jakub Jelinek <jakub@redhat.com>
23673 PR tree-optimization/94589
23674 * tree-ssa-phiopt.c (spaceship_replacement): For integral rhs1 and
23675 rhs2, treat x <= 4 equivalently to x < 5 etc. In cmp1 and cmp2 (if
23676 not the same as cmp3) treat <= the same as < and >= the same as >.
23677 Don't require that cond2_phi_edge is true edge, instead take
23678 false/true edges into account based on cmp1/cmp2 comparison kinds.
23680 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
23683 * config/i386/mmx.md (SMAXMIN_MMXMODEI): New mode iterator.
23684 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Macroize expander
23685 from <smaxmin:code>v4hi3> and <smaxmin:code><MMXMODE14:mode>3
23686 using SMAXMIN_MMXMODEI mode iterator.
23687 (*<smaxmin:code>v4qi3): New insn pattern.
23688 (*<smaxmin:code>v2hi3): Ditto.
23689 (SMAXMIN_VI_32): New mode iterator.
23690 (<smaxmin:code><SMAXMIN_VI_32>mode3): New expander.
23691 (UMAXMIN_MMXMODEI): New mode iterator.
23692 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Macroize expander
23693 from <umaxmin:code>v8qi3> and <umaxmin:code><MMXMODE24:mode>3
23694 using UMAXMIN_MMXMODEI mode iterator.
23695 (*<umaxmin:code>v4qi3): New insn pattern.
23696 (*<umaxmin:code>v2hi3): Ditto.
23697 (UMAXMIN_VI_32): New mode iterator.
23698 (<umaxmin:code><UMAXMIN_VI_32>mode3): New expander.
23699 (abs<VI_32:mode>2): New insn pattern.
23700 (ssse3_abs<MMXMODEI:mode>2, abs<MMXMODEI:mode>2): Move from ...
23701 * config/i386/sse.md: ... here.
23703 2021-05-20 Clement Chigot <clement.chigot@atos.net>
23704 David Edelsohn <dje.gcc@gmail.com>
23706 * collect2.c (scan_prog_file): Issue non-fatal warning for
23709 2021-05-20 Jonathan Wakely <jwakely@redhat.com>
23711 * doc/invoke.texi (-Wno-c++11-extensions)
23712 (-Wno-c++14-extensions, -Wno-c++17-extensions)
23713 (-Wno-c++20-extensions, -Wno-c++23-extensions): Document
23716 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
23718 * config/c6x/c6x.c (c6x_output_file_unwind): Use dwarf_debuginfo_p.
23719 * config/darwin.c (darwin_override_options): Likewise.
23720 * config/i386/cygming.h (DBX_REGISTER_NUMBER): Likewise.
23721 * config/i386/darwin.h (DBX_REGISTER_NUMBER): Likewise.
23722 (DWARF2_FRAME_REG_OUT): Likewise.
23723 * config/mips/mips.c (mips_output_filename): Likewise.
23724 * config/rs6000/rs6000.c (rs6000_xcoff_declare_function_name):
23726 (rs6000_dbx_register_number): Likewise.
23727 * dbxout.c: Include flags.h.
23728 * dwarf2cfi.c (cfi_label_required_p): Likewise.
23729 (dwarf2out_do_frame): Likewise.
23730 * except.c: Include flags.h.
23731 * final.c (dwarf2_debug_info_emitted_p): Likewise.
23732 (final_scan_insn_1): Likewise.
23733 * flags.h (dwarf_debuginfo_p): New function declaration.
23734 * opts.c (dwarf_debuginfo_p): New function definition.
23735 * targhooks.c (default_debug_unwind_info): Use dwarf_debuginfo_p.
23736 * toplev.c (process_options): Likewise.
23738 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
23740 * common.opt: Change type to support bitmasks.
23741 * flag-types.h (enum debug_info_type): Rename enumerator constants.
23742 (NO_DEBUG): New bitmask.
23743 (DBX_DEBUG): Likewise.
23744 (DWARF2_DEBUG): Likewise.
23745 (XCOFF_DEBUG): Likewise.
23746 (VMS_DEBUG): Likewise.
23747 (VMS_AND_DWARF2_DEBUG): Likewise.
23748 * flags.h (debug_set_to_format): New function declaration.
23749 (debug_set_count): Likewise.
23750 (debug_set_names): Likewise.
23751 * opts.c (debug_type_masks): Array of bitmasks for debug formats.
23752 (debug_set_to_format): New function definition.
23753 (debug_set_count): Likewise.
23754 (debug_set_names): Likewise.
23755 (set_debug_level): Update access to debug_type_names.
23756 * toplev.c: Likewise.
23758 2021-05-20 Martin Sebor <msebor@redhat.com>
23760 PR middle-end/100684
23761 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Handle C++ lambda.
23763 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
23766 * config/i386/i386.md (isa): Remove x64_bmi.
23767 (enabled): Remove x64_bmi.
23768 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
23769 Remove general register alternative.
23770 (*andnot<VI_32:mode>3): Ditto.
23771 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
23772 (*<any_logic:code><VI_32:mode>3): Ditto.
23774 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
23776 * config/arm/arm.c: Include head files tree-vectorizer.h and
23779 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
23782 * config/i386/mmx.md (Yv_Yw): Revert adding V4QI and V2HI modes.
23783 (*<plusminus:insn><VI_32:mode>3): Use Yw instad of <Yv_Yw> constrint.
23784 (<s>mulv4hi3_highpart): New expander.
23785 (*<s>mulv2hi3_highpart): New insn pattern.
23786 (<s>mulv2hi3_higpart): New expander.
23787 (*<any_shift:insn>v2hi3): New insn pattern.
23788 (<any_shift:insn>v2hi3): New expander.
23789 * config/i386/sse.md (smulhrsv2hi3): New expander.
23790 (*smulhrsv2hi3): New insn pattern.
23792 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
23794 * doc/invoke.texi (vect-inner-loop-cost-factor): Document new
23796 * params.opt (vect-inner-loop-cost-factor): New.
23797 * targhooks.c (default_add_stmt_cost): Replace hardcoded factor
23798 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR, include head file
23799 tree-vectorizer.h and its required ones.
23800 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Replace
23801 hardcoded factor 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR.
23802 * config/arm/arm.c (arm_add_stmt_cost): Likewise.
23803 * config/i386/i386.c (ix86_add_stmt_cost): Likewise.
23804 * config/rs6000/rs6000.c (rs6000_add_stmt_cost): Likewise.
23805 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
23807 (_loop_vec_info::_loop_vec_info): Init inner_loop_cost_factor.
23808 * tree-vectorizer.h (_loop_vec_info): Add inner_loop_cost_factor.
23809 (LOOP_VINFO_INNER_LOOP_COST_FACTOR): New macro.
23811 2021-05-20 Christophe Lyon <christophe.lyon@linaro.org>
23812 Torbjörn Svensson <torbjorn.svensson@st.com>
23815 * doc/cpp.texi (Common Predefined Macros): Document __FILE_NAME__.
23817 2021-05-20 Jakub Jelinek <jakub@redhat.com>
23819 PR middle-end/99928
23820 * gimplify.c (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LINEAR>: For
23821 explicit linear clause when combined with target, make it map(tofrom:)
23822 instead of no clause or firstprivate.
23824 2021-05-20 Jakub Jelinek <jakub@redhat.com>
23826 PR tree-optimization/94589
23827 * match.pd ((X & Y) == X -> (X & ~Y) == 0): Simplify even in presence
23828 of integral conversions.
23830 2021-05-19 Andrew MacLeod <amacleod@redhat.com>
23832 * gimple-range.cc (fur_source::get_operand): New.
23833 (gimple_range_fold): Delete.
23834 (fold_using_range::fold_stmt): Move from gimple_ranger::calc_stmt.
23835 (fold_using_range::range_of_range_op): Move from gimple_ranger.
23836 (fold_using_range::range_of_address): Ditto.
23837 (fold_using_range::range_of_phi): Ditto.
23838 (fold_using_range::range_of_call): Ditto.
23839 (fold_using_range::range_of_builtin_ubsan_call): Move from
23840 range_of_builtin_ubsan_call.
23841 (fold_using_range::range_of_builtin_call): Move from
23842 range_of_builtin_call.
23843 (gimple_ranger::range_of_builtin_call): Delete.
23844 (fold_using_range::range_of_cond_expr): Move from gimple_ranger.
23845 (gimple_ranger::fold_range_internal): New.
23846 (gimple_ranger::range_of_stmt): Use new fold_using_range API.
23847 (fold_using_range::range_of_ssa_name_with_loop_info): Move from
23848 gimple_ranger. Improve ranges of SSA_NAMES when possible.
23849 * gimple-range.h (gimple_ranger): Remove various range_of routines.
23850 (class fur_source): New.
23851 (class fold_using_range): New.
23852 (fur_source::fur_source): New.
23854 * vr-values.c (vr_values::extract_range_basic): Use fold_using_range
23855 instead of range_of_builtin_call.
23857 2021-05-19 Jonathan Wakely <jwakely@redhat.com>
23859 * doc/cpp.texi (Common Predefined Macros): Update documentation
23860 for the __GXX_EXPERIMENTAL_CXX0X__ macro.
23862 2021-05-19 Alex Coplan <alex.coplan@arm.com>
23865 * config/arm/arm.md (nonsecure_call_internal): Always ensure
23866 callee's address is in a register.
23868 2021-05-19 Geng Qi <gengqi@linux.alibaba.com>
23870 * common/config/riscv/riscv-common.c
23871 (riscv_subset_list::parsing_subset_version): Properly parse the letter
23873 (riscv_subset_list::parse_std_ext,
23874 riscv_subset_list::parse_multiletter_ext): To handle errors generated
23875 in riscv_subset_list::parsing_subset_version.
23877 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
23879 * config/aarch64/aarch64-simd.md: Use "neon_move_narrow_q"
23880 type attribute in patterns generating XTN(2).
23882 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
23884 * config/aarch64/aarch64-simd.md (aarch64_simd_vec_pack_trunc_<mode>):
23885 Remove as duplicate of...
23886 (aarch64_xtn<mode>): This.
23887 (aarch64_xtn2<mode>_le): Move position in file.
23888 (aarch64_xtn2<mode>_be): Move position in file.
23889 (aarch64_xtn2<mode>): Move position in file.
23890 (vec_pack_trunc_<mode>): Define as an expander.
23892 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
23894 * config/aarch64/aarch64-simd-builtins.def: Split builtin
23895 generation for aarch64_<sur>q<r>shr<u>n_n<mode> pattern into
23896 separate scalar and vector generators.
23897 * config/aarch64/aarch64-simd.md
23898 (aarch64_<sur>q<r>shr<u>n_n<mode>): Define as an expander and
23900 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_le): This and...
23901 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_be): This.
23902 * config/aarch64/iterators.md: Define SD_HSDI iterator.
23904 2021-05-19 Jonathn Wright <jonathan.wright@arm.com>
23906 * config/aarch64/aarch64-simd.md: Use UNSPEC_SQXTUN instead
23908 * config/aarch64/iterators.md: Remove UNSPEC_SQXTUN2.
23910 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
23912 * config/aarch64/aarch64-simd.md (aarch64_<sur>q<r>shr<u>n2_n<mode>):
23913 Implement as an expand emitting a big/little endian
23914 instruction pattern.
23915 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_le): Define.
23916 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_be): Define.
23918 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
23920 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn2<mode>):
23921 Implement as an expand emitting a big/little endian
23922 instruction pattern.
23923 (aarch64_<sur><addsub>hn2<mode>_insn_le): Define.
23924 (aarch64_<sur><addsub>hn2<mode>_insn_be): Define.
23925 * config/aarch64/iterators.md: Remove UNSPEC_[R]ADDHN2 and
23926 UNSPEC_[R]SUBHN2 unspecs and ADDSUBHN2 iterator.
23928 2021-05-19 Richard Biener <rguenther@suse.de>
23930 PR middle-end/100672
23931 * fold-const.c (fold_negate_expr_1): Use element_precision.
23932 (negate_expr_p): Likewise.
23934 2021-05-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
23936 * config/aarch64/iterators.md (SVE_PRED_LOAD): New iterator.
23937 (pred_load): New int attribute.
23938 * config/aarch64/aarch64-sve.md
23939 (aarch64_load_<ANY_EXTEND:optab><SVE_HSDI:mode><SVE_PARTIAL_I:mode>): Use
23940 SVE_PRED_LOAD enum iterator and corresponding pred_load attribute.
23941 * config/aarch64/aarch64-sve-builtins-base.cc (expand): Update call to
23942 code_for_aarch64_load.
23944 2021-05-19 Richard Biener <rguenther@suse.de>
23946 * cfgexpand.c (discover_nonconstant_array_refs_r): Make
23947 sure TARGET_MEM_REF bases are expanded as memory.
23948 * tree-ssa-operands.c (operands_scanner::get_tmr_operands):
23949 Do not mark TARGET_MEM_REF bases addressable.
23950 * tree-ssa.c (non_rewritable_mem_ref_base): Handle
23951 TARGET_MEM_REF bases as never rewritable.
23952 * gimple-walk.c (walk_stmt_load_store_addr_ops): Do not
23953 walk TARGET_MEM_REF bases as address-takens.
23954 * tree-ssa-dce.c (ref_may_be_aliased): Handle TARGET_MEM_REF.
23956 2021-05-19 Richard Biener <rguenther@suse.de>
23958 * builtins.c (get_object_alignment_1): Strip outer
23960 * tree-dfa.c (get_ref_base_and_extent): Handle outer
23961 WITH_SIZE_EXPR for size processing and process the
23963 * tree-ssa-alias.c (ao_ref_base_alias_set): Strip
23964 outer WITH_SIZE_EXPR.
23965 (ao_ref_base_alias_ptr_type): Likewise.
23966 (refs_may_alias_p_2): Allow WITH_SIZE_EXPR in ref->ref
23967 and handle that accordingly, stripping it for the
23968 core alias workers.
23969 * tree.c (get_base_address): Handle WITH_SIZE_EXPR by
23970 looking through it instead of returning NULL.
23972 2021-05-19 Jakub Jelinek <jakub@redhat.com>
23974 PR middle-end/100576
23975 * builtins.c (check_read_access): Convert bound to size_type_node if
23978 2021-05-19 Richard Biener <rguenther@suse.de>
23980 * tree-cfg.c (verify_types_in_gimple_min_lval): Inline...
23981 (verify_types_in_gimple_reference): ... here. Sanitize.
23982 (verify_gimple_call): Verify references in LHS and arguments.
23983 (verify_gimple_assign_single): Reject WITH_SIZE_EXPR.
23985 2021-05-19 Uroš Bizjak <ubizjak@gmail.com>
23987 * config/i386/i386.h (VALID_INT_MODE_P):
23988 Add V8QI, V4HI and V2SI modes for TARGET_64BIT.
23989 * config/i386/i386.md (isa): Add x64_bmi.
23990 (enabled): Handle x64_bmi.
23991 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
23992 Add alternative using 64bit general registers.
23993 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
23995 2021-05-19 Jakub Jelinek <jakub@redhat.com>
23997 PR middle-end/99928
23998 * tree.h (OMP_MASTER_COMBINED): Define.
23999 * gimplify.c (gimplify_scan_omp_clauses): Rewrite lastprivate
24000 handling for outer combined/composite constructs to a loop.
24001 Handle lastprivate on combined target.
24002 (gimplify_expr): Formatting fix.
24004 2021-05-19 Xionghu Luo <luoxhu@linux.ibm.com>
24006 * passes.def: Add sink_code pass before store_merging.
24007 * tree-ssa-sink.c (pass_sink_code:clone): New.
24009 2021-05-18 Bill Schmidt <wschmidt@linux.ibm.com>
24011 * config/rs6000/freebsd64.h (ADJUST_FIELD_ALIGN): Remove call to
24012 rs6000_special_adjust_field_align_p.
24013 * config/rs6000/linux64.h (ADJUST_FIELD_ALIGN): Likewise.
24014 * config/rs6000/rs6000-call.c (rs6000_function_arg_boundary):
24015 Remove ABI warning.
24016 (rs6000_function_arg): Likewise.
24017 * config/rs6000/rs6000-protos.h
24018 (rs6000_special_adjust_field_align_p): Remove prototype.
24019 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align_p):
24021 * config/rs6000/sysv4.h (ADJUST_FIELD_ALIGN): Remove call to
24022 rs6000_special_adjust_field_align_p.
24024 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
24027 * config/i386/i386.h (VALID_SSE2_REG_MODE):
24028 Add V4QI and V2HI modes.
24029 (VALID_INT_MODE_P): Ditto.
24030 * config/i386/mmx.md (VI_32): New mode iterator.
24031 (mmxvecsize): Handle V4QI and V2HI.
24033 (mov<VI_32:mode>): New expander.
24034 (*mov<mode>_internal): New insn pattern.
24035 (movmisalign<VI_32:mode>): New expander.
24036 (neg<VI_32:mode>): New expander.
24037 (<plusminus:insn><VI_32:mode>3): New expander.
24038 (*<plusminus:insn><VI_32:mode>3): New insn pattern.
24039 (mulv2hi3): New expander.
24040 (*mulv2hi3): New insn pattern.
24041 (one_cmpl<VI_32:mode>2): New expander.
24042 (*andnot<VI_32:mode>3): New insn pattern.
24043 (<any_logic:code><VI_32:mode>3): New expander.
24044 (*<any_logic:code><VI_32:mode>3): New insn pattern.
24046 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
24048 * config/i386/sse.md (<any_extend:insn>v4qiv4di2):
24049 Fix a mode mismatch with operand 1.
24051 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
24054 * config/i386/i386-expand.c (split_double_mode): Return
24055 temporary register when simplify_gen_subreg fails with
24056 the high half od the paradoxical subreg.
24058 2021-05-18 Richard Biener <rguenther@suse.de>
24060 * cfgexpand.c (expand_one_var): Pass in forced_stack_var
24061 and honor it when expanding.
24062 (expand_used_vars_for_block): Pass through forced_stack_var.
24063 (expand_used_vars): Likewise.
24064 (discover_nonconstant_array_refs_r): Set bits in
24065 forced_stack_vars instead of marking vars TREE_ADDRESSABLE.
24066 (avoid_type_punning_on_regs): Likewise.
24067 (discover_nonconstant_array_refs): Likewise.
24068 (pass_expand::execute): Create and pass down forced_stack_var
24069 bitmap. For parameters and returns temporarily set
24070 TREE_ADDRESSABLE when expand_function_start.
24072 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
24074 * doc/sourcebuild.texi: Document 'dg-note'.
24076 2021-05-18 Tobias Burnus <tobias@codesourcery.com>
24079 * configure: Regenerate.
24080 * configure.ac (BUILD_CFLAG, BUILD_CXXFLAGS): Add $(CFLAGS-$@).
24082 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
24084 * gimple.h (is_gimple_omp_oacc): Tighten.
24085 * omp-low.c (check_omp_nesting_restrictions): Adjust.
24087 2021-05-18 Richard Biener <rguenther@suse.de>
24089 * tree-ssa-operands.c (mark_address_taken): Simplify.
24091 2021-05-18 Martin Liska <mliska@suse.cz>
24093 * config/gcn/mkoffload.c (STR): Redefine.
24094 * config/i386/intelmic-mkoffload.c (STR): Likewise.
24095 * config/nvptx/mkoffload.c (STR): Likewise.
24097 2021-05-18 Martin Liska <mliska@suse.cz>
24099 * common/config/aarch64/aarch64-common.c (aarch64_parse_extension):
24100 Use startswith function instead of strncmp.
24101 * common/config/bfin/bfin-common.c (bfin_handle_option): Likewise.
24102 * common/config/riscv/riscv-common.c (riscv_subset_list::parse): Likewise.
24103 * config/aarch64/aarch64-sve-builtins-shapes.cc (parse_type): Likewise.
24104 * config/aarch64/aarch64.c (aarch64_process_one_target_attr): Likewise.
24105 * config/alpha/alpha.c (alpha_elf_section_type_flags): Likewise.
24106 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
24107 * config/arm/arm.c (arm_file_start): Likewise.
24108 (arm_valid_target_attribute_rec): Likewise.
24109 (thumb1_md_asm_adjust): Likewise.
24110 * config/arm/driver-arm.c (host_detect_local_cpu): Likewise.
24111 * config/avr/avr.c (STR_PREFIX_P): Likewise.
24112 (avr_set_current_function): Likewise.
24113 (avr_handle_addr_attribute): Likewise.
24114 (avr_asm_output_aligned_decl_common): Likewise.
24115 (avr_asm_named_section): Likewise.
24116 (avr_section_type_flags): Likewise.
24117 (avr_asm_select_section): Likewise.
24118 * config/c6x/c6x.c (c6x_in_small_data_p): Likewise.
24119 (c6x_section_type_flags): Likewise.
24120 * config/darwin-c.c (darwin_cfstring_ref_p): Likewise.
24121 (darwin_objc_declare_unresolved_class_reference): Likewise.
24122 (darwin_objc_declare_class_definition): Likewise.
24123 * config/darwin.c (indirect_data): Likewise.
24124 (darwin_encode_section_info): Likewise.
24125 (darwin_objc2_section): Likewise.
24126 (darwin_objc1_section): Likewise.
24127 (machopic_select_section): Likewise.
24128 (darwin_globalize_label): Likewise.
24129 (darwin_label_is_anonymous_local_objc_name): Likewise.
24130 (darwin_asm_named_section): Likewise.
24131 (darwin_asm_output_dwarf_offset): Likewise.
24132 * config/frv/frv.c (frv_string_begins_with): Likewise.
24133 (frv_in_small_data_p): Likewise.
24134 * config/gcn/mkoffload.c (STR): Likewise.
24136 * config/i386/i386-builtins.c (get_builtin_code_for_version): Likewise.
24137 * config/i386/i386-options.c (ix86_option_override_internal): Likewise.
24138 * config/i386/i386.c (x86_64_elf_section_type_flags): Likewise.
24139 (ix86_md_asm_adjust): Likewise.
24140 * config/i386/intelmic-mkoffload.c (STR): Likewise.
24141 * config/i386/winnt.c (i386_pe_asm_named_section): Likewise.
24142 (i386_pe_file_end): Likewise.
24143 * config/ia64/ia64.c (ia64_in_small_data_p): Likewise.
24144 (ia64_section_type_flags): Likewise.
24145 * config/mips/driver-native.c (host_detect_local_cpu): Likewise.
24146 * config/mips/mips.c (mips_handle_interrupt_attr): Likewise.
24147 (mips16_stub_function_p): Likewise.
24148 (mips_function_rodata_section): Likewise.
24149 * config/msp430/msp430.c (msp430_mcu_name): Likewise.
24150 (msp430_function_section): Likewise.
24151 (msp430_section_type_flags): Likewise.
24152 (msp430_expand_helper): Likewise.
24153 * config/nios2/nios2.c (nios2_small_section_name_p): Likewise.
24154 (nios2_valid_target_attribute_rec): Likewise.
24155 * config/nvptx/mkoffload.c (process): Likewise.
24157 * config/pa/som.h: Likewise.
24158 * config/pdp11/pdp11.c (pdp11_output_ident): Likewise.
24159 * config/riscv/riscv.c (riscv_elf_select_rtx_section): Likewise.
24160 * config/rs6000/rs6000.c (VTABLE_NAME_P): Likewise.
24161 (rs6000_inner_target_options): Likewise.
24162 * config/s390/driver-native.c (s390_host_detect_local_cpu): Likewise.
24163 * config/sparc/driver-sparc.c (host_detect_local_cpu): Likewise.
24164 * config/vax/vax.c (vax_output_int_move): Likewise.
24165 * config/vms/vms-ld.c (startswith): Likewise.
24166 (process_args): Likewise.
24168 * config/vms/vms.c: Likewise.
24170 2021-05-18 Jakub Jelinek <jakub@redhat.com>
24172 PR rtl-optimization/100590
24173 * regcprop.c (copyprop_hardreg_forward_1): Only DCE dead sets if
24174 they are NONJUMP_INSN_P.
24176 2021-05-18 Jakub Jelinek <jakub@redhat.com>
24179 * function.c (push_dummy_function): Set DECL_ARTIFICIAL and
24180 DECL_ASSEMBLER_NAME on the fn_decl.
24182 2021-05-18 Jakub Jelinek <jakub@redhat.com>
24184 PR tree-optimization/94589
24185 * tree-ssa-phiopt.c (spaceship_replacement): Pattern match
24186 phi result used in (res & ~1) == 0 comparison as res >= 0 as
24187 res == 2 would be UB with -ffinite-math-only.
24189 2021-05-18 Martin Liska <mliska@suse.cz>
24191 * Makefile.in: genversion.o should depend on DATESTAMP.
24193 2021-05-18 Claudiu Zissulescu <claziss@synopsys.com>
24195 * config/arc/simdext.md (negv2si2): Remove round bracket.
24197 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
24199 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Define
24200 _Bool as macro expanding to _Bool.
24202 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
24205 * tree.c (build_reference_type_for_mode)
24206 (build_pointer_type_for_mode): Pick pointer mode if MODE argument
24208 (build_reference_type, build_pointer_type): Invoke
24209 build_*_type_for_mode with VOIDmode.
24211 2021-05-17 Andrew MacLeod <amacleod@redhat.com>
24213 PR tree-optimization/100512
24214 * gimple-range-cache.cc (ranger_cache::set_global_range): Mark const
24215 and non-zero pointer ranges as invariant.
24216 * gimple-range.cc (gimple_ranger::range_of_stmt): Remove pointer
24217 processing from here.
24219 2021-05-17 Tom de Vries <tdevries@suse.de>
24222 * config/nvptx/nvptx-protos.h (nvptx_output_atomic_insn): Declare
24223 * config/nvptx/nvptx.c (nvptx_output_barrier)
24224 (nvptx_output_atomic_insn): New function.
24225 (nvptx_print_operand): Add support for 'B'.
24226 * config/nvptx/nvptx.md: Use nvptx_output_atomic_insn for atomic
24229 2021-05-17 Aldy Hernandez <aldyh@redhat.com>
24231 PR tree-optimization/100349
24232 * vr-values.c (bounds_of_var_in_loop): Bail if scev returns
24235 2021-05-17 Tamar Christina <tamar.christina@arm.com>
24237 * config/aarch64/driver-aarch64.c (DEFAULT_ARCH): New.
24238 (host_detect_local_cpu): Use it.
24240 2021-05-17 Martin Liska <mliska@suse.cz>
24242 * doc/invoke.texi: Add 2 missing dots.
24244 2021-05-17 Marius Hillenbrand <mhillen@linux.ibm.com>
24246 PR bootstrap/100552
24247 * configure.ac: Replace pattern substitution with call to sed.
24248 * configure: Regenerate.
24250 2021-05-17 Richard Biener <rguenther@suse.de>
24252 PR middle-end/100582
24253 * tree.c (array_at_struct_end_p): Get to the base of the
24254 reference before looking for the underlying decl.
24256 2021-05-17 Joern Rennecke <joern.rennecke@embecosm.com>
24258 * genoutput.c (validate_insn_alternatives) Make "wrong number of
24259 alternatives" message more specific, and remove assumption on where
24262 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
24264 * config/arm/iterators.md (V16): New iterator.
24265 (VH_cvtto): New iterator.
24266 (v_cmp_result): Added V4HF and V8HF support.
24267 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result>): Use VDQWH.
24268 (vcond<mode><mode>): Likewise.
24269 (vcond_mask_<mode><v_cmp_result>): Likewise.
24270 (vcond<VH_cvtto><mode>): New expander.
24272 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
24274 * config/arm/arm-protos.h (arm_expand_vector_compare): Update
24276 * config/arm/arm.c (arm_expand_vector_compare): Add support for
24278 (arm_expand_vcond): Likewise.
24279 * config/arm/iterators.md (supf): Remove VCMPNEQ_S, VCMPEQQ_S,
24280 VCMPEQQ_N_S, VCMPNEQ_N_S.
24281 (VCMPNEQ, VCMPEQQ, VCMPEQQ_N, VCMPNEQ_N): Remove.
24282 * config/arm/mve.md (@mve_vcmp<mve_cmp_op>q_<mode>): Add '@' prefix.
24283 (@mve_vcmp<mve_cmp_op>q_f<mode>): Likewise.
24284 (@mve_vcmp<mve_cmp_op>q_n_f<mode>): Likewise.
24285 (@mve_vpselq_<supf><mode>): Likewise.
24286 (@mve_vpselq_f<mode>"): Likewise.
24287 * config/arm/neon.md (vec_cmp<mode><v_cmp_result): Enable for MVE
24288 and move to vec-common.md.
24289 (vec_cmpu<mode><mode>): Likewise.
24290 (vcond<mode><mode>): Likewise.
24291 (vcond<V_cvtto><mode>): Likewise.
24292 (vcondu<mode><v_cmp_result>): Likewise.
24293 (vcond_mask_<mode><v_cmp_result>): Likewise.
24294 * config/arm/unspecs.md (VCMPNEQ_U, VCMPNEQ_S, VCMPEQQ_S)
24295 (VCMPEQQ_N_S, VCMPNEQ_N_S, VCMPEQQ_U, CMPEQQ_N_U, VCMPNEQ_N_U)
24296 (VCMPGEQ_N_S, VCMPGEQ_S, VCMPGTQ_N_S, VCMPGTQ_S, VCMPLEQ_N_S)
24297 (VCMPLEQ_S, VCMPLTQ_N_S, VCMPLTQ_S, VCMPCSQ_N_U, VCMPCSQ_U)
24298 (VCMPHIQ_N_U, VCMPHIQ_U): Remove.
24299 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result): Moved
24301 (vec_cmpu<mode><mode>): Likewise.
24302 (vcond<mode><mode>): Likewise.
24303 (vcond<V_cvtto><mode>): Likewise.
24304 (vcondu<mode><v_cmp_result>): Likewise.
24305 (vcond_mask_<mode><v_cmp_result>): Likewise. Added unsafe math
24308 2021-05-17 liuhongt <hongtao.liu@intel.com>
24311 * config/i386/i386.c (ix86_gimple_fold_builtin): Use
24312 gsi_insert_seq_before instead.
24314 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
24316 * doc/sourcebuild.texi (arm_qbit_ok): Rename into...
24317 (arm_sat_ok): ...this.
24319 2021-05-17 Martin Liska <mliska@suse.cz>
24321 * lto-wrapper.c (merge_flto_options): Factor out a new function.
24322 (merge_and_complain): Use it.
24323 (run_gcc): Merge also linker command line -flto=foo argument
24326 2021-05-16 Christophe Lyon <christophe.lyon@linaro.org>
24328 * config/arm/arm.h (CPP_SPEC): Remove error message about
24329 -mlittle-endian/-mbig-endian conflict.
24331 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
24333 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
24334 __ROP_PROTECT__ if -mrop-protect is selected.
24336 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
24338 * config/rs6000/rs6000-internal.h (rs6000_stack): Add
24339 rop_hash_save_offset and rop_hash_size.
24340 * config/rs6000/rs6000-logue.c (rs6000_stack_info): Compute
24341 rop_hash_size and rop_hash_save_offset.
24342 (debug_stack_info): Dump rop_hash_save_offset and rop_hash_size.
24343 (rs6000_emit_prologue): Emit hashst[p] in prologue.
24344 (rs6000_emit_epilogue): Emit hashchk[p] in epilogue.
24345 * config/rs6000/rs6000.md (unspec): Add UNSPEC_HASHST and
24347 (hashst): New define_insn.
24348 (hashchk): Likewise.
24350 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
24352 * config/rs6000/rs6000.c (rs6000_option_override_internal):
24353 Disable shrink wrap when inserting ROP-protect instructions.
24354 * config/rs6000/rs6000.opt (mrop-protect): New option.
24355 (mprivileged): Likewise.
24356 * doc/invoke.texi: Document mrop-protect and mprivileged.
24358 2021-05-15 Hans-Peter Nilsson <hp@axis.com>
24360 * reorg.c (fill_slots_from_thread): Reinstate code typoed out in
24363 2021-05-15 Martin Jambor <mjambor@suse.cz>
24366 2021-05-13 Martin Jambor <mjambor@suse.cz>
24368 PR tree-optimization/100453
24369 * tree-sra.c (sra_modify_assign): All const base accesses do not
24370 need refreshing, not just those from decl_pool.
24371 (sra_modify_assign): Do not refresh into a const base decl.
24373 2021-05-15 Jakub Jelinek <jakub@redhat.com>
24375 PR rtl-optimization/100342
24376 * regcprop.c (copy_value): When copying a source reg in a wider
24377 mode than it has recorded for the value, adjust recorded destination
24378 mode too or punt if !REG_CAN_CHANGE_MODE_P.
24380 2021-05-14 Jason Merrill <jason@redhat.com>
24382 * intl.h: Add comments.
24384 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
24386 * config/aarch64/aarch64-simd.md
24387 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>_internal): Split into...
24388 (aarch64_sqdmlsl2_lane<mode>_internal): ... This...
24389 (aarch64_sqdmlal2_lane<mode>_internal): ... And this.
24390 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>_internal): Split into ...
24391 (aarch64_sqdmlsl2_laneq<mode>_internal): ... This...
24392 (aarch64_sqdmlal2_laneq<mode>_internal): ... And this.
24393 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>_internal): Split into...
24394 (aarch64_sqdmlsl2_n<mode>_internal): ... This...
24395 (aarch64_sqdmlal2_n<mode>_internal): ... And this.
24397 2021-05-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
24400 * config/arm/arm_neon.h (vtst_s8): Replace call to vtst builtin with it's
24401 boolean logic equivalent.
24402 (vtst_s16): Likewise.
24403 (vtst_s32): Likewise.
24404 (vtst_u8): Likewise.
24405 (vtst_u16): Likewise.
24406 (vtst_u32): Likewise.
24407 (vtst_p8): Likewise.
24408 (vtst_p16): Likewise.
24409 (vtstq_s8): Likewise.
24410 (vtstq_s16): Likewise.
24411 (vtstq_s32): Likewise.
24412 (vtstq_u8): Likewise.
24413 (vtstq_u16): Likewise.
24414 (vtstq_u32): Likewise.
24415 (vtstq_p8): Likewise.
24416 (vtstq_p16): Likewise.
24417 * config/arm/arm_neon_builtins.def: Remove entry for vtst.
24418 * config/arm/neon.md (neon_vtst<mode>): Remove pattern.
24420 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
24422 * config/aarch64/aarch64-simd.md (aarch64_sqdmlal2<mode>): Merge into...
24423 (aarch64_sqdml<SBINQOPS:as>l2<mode>): ... This.
24424 (aarch64_sqdmlsl2<mode>): Delete.
24425 (aarch64_sqdmlal2_lane<mode>): Merge this...
24426 (aarch64_sqdmlsl2_lane<mode>): ... And this...
24427 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>): ... Into this.
24428 (aarch64_sqdmlal2_laneq<mode>): Merge this...
24429 (aarch64_sqdmlsl2_laneq<mode>): ... And this...
24430 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>): ... Into this.
24431 (aarch64_sqdmlal2_n<mode>): Merge this...
24432 (aarch64_sqdmlsl2_n<mode>): ... And this...
24433 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>): ... Into this.
24435 2021-05-13 Martin Sebor <msebor@redhat.com>
24437 PR middle-end/100574
24438 * builtins.c (access_ref::get_ref): Improve detection of PHIs with
24439 all null arguments.
24441 2021-05-13 Martin Sebor <msebor@redhat.com>
24443 PR tree-optimization/93100
24444 PR middle-end/98583
24445 * tree-ssa-uninit.c (check_defs): Exclude intrinsic functions that
24446 don't modify referenced objects.
24448 2021-05-13 Martin Jambor <mjambor@suse.cz>
24450 PR tree-optimization/100453
24451 * tree-sra.c (sra_modify_assign): All const base accesses do not
24452 need refreshing, not just those from decl_pool.
24453 (sra_modify_assign): Do not refresh into a const base decl.
24455 2021-05-13 Martin Liska <mliska@suse.cz>
24457 * tree-ssa-dom.c: Remove m_simplifier.
24459 2021-05-13 Richard Earnshaw <rearnsha@arm.com>
24462 * config/arm/arm.c (arm_canonicalize_comparison): Correctly
24463 canonicalize DImode inequality comparisons against the
24464 maximum integral value.
24466 2021-05-13 Jakub Jelinek <jakub@redhat.com>
24468 PR tree-optimization/98856
24469 * config/i386/i386.c (ix86_shift_rotate_cost): Add CODE argument.
24470 Expect V2DI and V4DI arithmetic right shifts to be emulated.
24471 (ix86_rtx_costs, ix86_add_stmt_cost): Adjust ix86_shift_rotate_cost
24473 * config/i386/i386-expand.c (expand_vec_perm_2perm_interleave,
24474 expand_vec_perm_2perm_pblendv): New functions.
24475 (ix86_expand_vec_perm_const_1): Use them.
24476 * config/i386/sse.md (ashr<mode>3<mask_name>): Rename to ...
24477 (<mask_codefor>ashr<mode>3<mask_name>): ... this.
24478 (ashr<mode>3): New define_expand with VI248_AVX512BW iterator.
24479 (ashrv4di3): New define_expand.
24480 (ashrv2di3): Change condition to TARGET_SSE2, handle !TARGET_XOP
24481 and !TARGET_AVX512VL expansion.
24483 2021-05-13 Uroš Bizjak <ubizjak@gmail.com>
24486 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Force mode
24487 sizes < 16 to a register when constructing vpcmov pattern.
24488 * config/i386/mmx.md (*xop_pcmov_<mode>): Use MMXMODE124 mode.
24490 2021-05-13 Martin Liska <mliska@suse.cz>
24492 * gcov-io.c (gcov_write_block): Remove.
24493 (gcov_write_words): Likewise.
24494 (gcov_read_words): Re-implement using gcov_read_bytes.
24495 (gcov_allocate): Remove.
24496 (GCOV_BLOCK_SIZE): Likewise.
24497 (struct gcov_var): Remove most of the fields.
24498 (gcov_position): Implement with ftell.
24499 (gcov_rewrite): Remove setting of start and offset fields.
24500 (from_file): Re-format.
24501 (gcov_open): Remove setbuf call. It should not be needed.
24502 (gcov_close): Remove internal buffer handling.
24503 (gcov_magic): Use __builtin_bswap32.
24504 (gcov_write_counter): Use directly gcov_write_unsigned.
24505 (gcov_write_string): Use direct fwrite and do not round
24507 (gcov_seek): Use directly fseek.
24508 (gcov_write_tag): Use gcov_write_unsigned directly.
24509 (gcov_write_length): Likewise.
24510 (gcov_write_tag_length): Likewise.
24511 (gcov_read_bytes): Use directly fread.
24512 (gcov_read_unsigned): Use gcov_read_words.
24513 (gcov_read_counter): Likewise.
24514 (gcov_read_string): Use gcov_read_bytes.
24515 * gcov-io.h (GCOV_WORD_SIZE): Adjust to reflect
24516 that size is not in bytes, but words (4B).
24517 (GCOV_TAG_FUNCTION_LENGTH): Likewise.
24518 (GCOV_TAG_ARCS_LENGTH): Likewise.
24519 (GCOV_TAG_ARCS_NUM): Likewise.
24520 (GCOV_TAG_COUNTER_LENGTH): Likewise.
24521 (GCOV_TAG_COUNTER_NUM): Likewise.
24522 (GCOV_TAG_SUMMARY_LENGTH): Likewise.
24524 2021-05-13 liuhongt <hongtao.liu@intel.com>
24527 * config/i386/sse.md (ssedoublevecmode): Add attribute for
24528 V64QI/V32HI/V16SI/V4DI.
24529 (ssehalfvecmode): Add attribute for V2DI/V2DF.
24530 (*vec_concatv4si_0): Extend to VI124_128.
24531 (*vec_concat<mode>_0): New pre-reload splitter.
24532 * config/i386/predicates.md (movq_parallel): New predicate.
24534 2021-05-13 Alexandre Oliva <oliva@adacore.com>
24536 * targhooks.c (default_zero_call_used_regs): Retry using
24537 successfully-zeroed registers as sources.
24539 2021-05-12 Tobias Burnus <tobias@codesourcery.com>
24541 * omp-low.c (finish_taskreg_scan): Use the proper detach decl.
24543 2021-05-12 Aldy Hernandez <aldyh@redhat.com>
24546 * gimple-range.cc (range_of_builtin_call): Skip out on
24547 processing __builtin_clz when varying.
24549 2021-05-12 Tom de Vries <tdevries@suse.de>
24552 * config/nvptx/nvptx-opts.h (enum ptx_version): New enum.
24553 * config/nvptx/nvptx.c (nvptx_file_start): Print .version according
24554 to ptx_version_option.
24555 * config/nvptx/nvptx.h (TARGET_PTX_6_3): Define.
24556 * config/nvptx/nvptx.md (define_insn "nvptx_shuffle<mode>")
24557 (define_insn "nvptx_vote_ballot"): Use sync variant for
24559 * config/nvptx/nvptx.opt (ptx_version): Add enum.
24560 (mptx): Add option.
24561 * doc/invoke.texi (Nvidia PTX Options): Add mptx item.
24563 2021-05-12 Richard Biener <rguenther@suse.de>
24565 PR tree-optimization/100566
24566 * tree-ssa-sccvn.c (dominated_by_p_w_unex): Properly handle
24567 allow_back for all edge queries.
24569 2021-05-12 liuhongt <hongtao.liu@intel.com>
24572 * config/i386/sse.md (<sse4_1_avx2>_pblendvb): Add
24573 splitters for pblendvb of NOT mask register.
24575 2021-05-12 Richard Biener <rguenther@suse.de>
24577 PR tree-optimization/100519
24578 * tree-ssa-reassoc.c (can_associate_p): Split into...
24579 (can_associate_op_p): ... this
24580 (can_associate_type_p): ... and this.
24581 (is_reassociable_op): Call can_associate_op_p.
24582 (break_up_subtract_bb): Call the appropriate predicates.
24583 (reassociate_bb): Likewise.
24585 2021-05-12 Martin Liska <mliska@suse.cz>
24587 * lto-wrapper.c (merge_and_complain): Merge -flto=arg options.
24588 (run_gcc): Use -flto argument detection for merged
24591 2021-05-12 Martin Liska <mliska@suse.cz>
24593 * lto-wrapper.c (print_lto_docs_link): New function.
24594 (run_gcc): Print warning about missing job server detection
24595 after we know NR of partitions. Do the same for -flto{,=1}.
24596 * opts.c (get_option_html_page): Support -flto option.
24598 2021-05-12 Martin Liska <mliska@suse.cz>
24600 * lto-wrapper.c (get_options_from_collect_gcc_options): Change
24602 (append_option): Remove.
24603 (find_option): Rework to use the vector type.
24604 (remove_option): Remove.
24605 (merge_and_complain): Use vectors for cl_decoded_option data
24607 (append_compiler_options): Likewise.
24608 (append_diag_options): Likewise.
24609 (append_linker_options): Likewise.
24610 (append_offload_options): Likewise.
24611 (compile_offload_image): Likewise.
24612 (compile_images_for_offload_targets): Likewise.
24613 (find_and_merge_options): Likewise.
24614 (run_gcc): Likewise.
24616 2021-05-12 Bernd Edlinger <bernd.edlinger@hotmail.de>
24619 * dwarf2out.c (dwarf2out_finish): Set
24620 have_multiple_function_sections with multi-range text_section.
24622 2021-05-12 Martin Liska <mliska@suse.cz>
24624 PR bootstrap/100560
24625 * Makefile.in: Remove version.h from linker command line.
24627 2021-05-12 Richard Biener <rguenther@suse.de>
24629 PR middle-end/100547
24630 * rtl.h (rtvec_alloc): Make argument size_t.
24631 * rtl.c (rtvec_alloc): Verify the count is less than INT_MAX.
24633 2021-05-12 Jakub Jelinek <jakub@redhat.com>
24635 PR middle-end/100508
24636 * cfgexpand.c (expand_debug_expr): For DEBUG_EXPR_DECL with vector
24637 type, don't reuse DECL_RTL if it has different mode, instead force
24638 creation of a new DEBUG_EXPR.
24640 2021-05-12 Jakub Jelinek <jakub@redhat.com>
24641 Marc Glisse <marc.glisse@inria.fr>
24643 PR tree-optimization/94589
24644 * match.pd ((X & Y) == X -> (X & ~Y) == 0,
24645 (X | Y) == Y -> (X & ~Y) == 0): New GIMPLE simplifications.
24647 2021-05-12 Uroš Bizjak <ubizjak@gmail.com>
24650 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Handle V2SF mode.
24651 * config/i386/mmx.md (MMXMODE124): New mode iterator.
24653 (mmxintvecmode): New mode attribute.
24654 (mmxintvecmodelower): Ditto.
24655 (*mmx_maskcmpv2sf3_comm): New insn pattern.
24656 (*mmx_maskcmpv2sf3): Ditto.
24657 (vec_cmpv2sfv2si): New expander.
24658 (vcond<V2FI:mode>v2si): Ditto.
24659 (mmx_vlendvps): New insn pattern.
24660 (vcond<MMXMODE124:mode><MMXMODEI:mode>): Also handle V2SFmode.
24661 (vcondu<MMXMODE124:mode><MMXMODEI:mode>): Ditto.
24662 (vcond_mask_<mode><mmxintvecmodelower>): Ditto.
24664 2021-05-11 Martin Sebor <msebor@redhat.com>
24666 PR middle-end/21433
24667 * expr.c (expand_expr_real_1): Replace unreachable code with an assert.
24669 2021-05-11 Richard Biener <rguenther@suse.de>
24671 * gimple-fold.c (gimple_fold_call): Do not call
24672 maybe_fold_reference on call arguments or the static chain.
24673 (fold_stmt_1): Do not call maybe_fold_reference on GIMPLE_ASM
24676 2021-05-11 Martin Liska <mliska@suse.cz>
24678 * builtins.def (DEF_HSAIL_BUILTIN): Remove.
24679 (DEF_HSAIL_ATOMIC_BUILTIN): Likewise.
24680 (DEF_HSAIL_SAT_BUILTIN): Likewise.
24681 (DEF_HSAIL_INTR_BUILTIN): Likewise.
24682 (DEF_HSAIL_CVT_ZEROI_SAT_BUILTIN): Likewise.
24683 * doc/frontends.texi: Remove BRIG.
24684 * doc/install.texi: Likewise.
24685 * doc/invoke.texi: Likewise.
24686 * doc/standards.texi: Likewise.
24687 * brig-builtins.def: Removed.
24688 * brig/ChangeLog: Removed.
24689 * brig/Make-lang.in: Removed.
24690 * brig/brig-builtins.h: Removed.
24691 * brig/brig-c.h: Removed.
24692 * brig/brig-lang.c: Removed.
24693 * brig/brigfrontend/brig-arg-block-handler.cc: Removed.
24694 * brig/brigfrontend/brig-atomic-inst-handler.cc: Removed.
24695 * brig/brigfrontend/brig-basic-inst-handler.cc: Removed.
24696 * brig/brigfrontend/brig-branch-inst-handler.cc: Removed.
24697 * brig/brigfrontend/brig-cmp-inst-handler.cc: Removed.
24698 * brig/brigfrontend/brig-code-entry-handler.cc: Removed.
24699 * brig/brigfrontend/brig-code-entry-handler.h: Removed.
24700 * brig/brigfrontend/brig-comment-handler.cc: Removed.
24701 * brig/brigfrontend/brig-control-handler.cc: Removed.
24702 * brig/brigfrontend/brig-copy-move-inst-handler.cc: Removed.
24703 * brig/brigfrontend/brig-cvt-inst-handler.cc: Removed.
24704 * brig/brigfrontend/brig-fbarrier-handler.cc: Removed.
24705 * brig/brigfrontend/brig-function-handler.cc: Removed.
24706 * brig/brigfrontend/brig-function.cc: Removed.
24707 * brig/brigfrontend/brig-function.h: Removed.
24708 * brig/brigfrontend/brig-inst-mod-handler.cc: Removed.
24709 * brig/brigfrontend/brig-label-handler.cc: Removed.
24710 * brig/brigfrontend/brig-lane-inst-handler.cc: Removed.
24711 * brig/brigfrontend/brig-machine.c: Removed.
24712 * brig/brigfrontend/brig-machine.h: Removed.
24713 * brig/brigfrontend/brig-mem-inst-handler.cc: Removed.
24714 * brig/brigfrontend/brig-module-handler.cc: Removed.
24715 * brig/brigfrontend/brig-queue-inst-handler.cc: Removed.
24716 * brig/brigfrontend/brig-seg-inst-handler.cc: Removed.
24717 * brig/brigfrontend/brig-signal-inst-handler.cc: Removed.
24718 * brig/brigfrontend/brig-to-generic.cc: Removed.
24719 * brig/brigfrontend/brig-to-generic.h: Removed.
24720 * brig/brigfrontend/brig-util.cc: Removed.
24721 * brig/brigfrontend/brig-util.h: Removed.
24722 * brig/brigfrontend/brig-variable-handler.cc: Removed.
24723 * brig/brigfrontend/hsa-brig-format.h: Removed.
24724 * brig/brigfrontend/phsa.h: Removed.
24725 * brig/brigspec.c: Removed.
24726 * brig/config-lang.in: Removed.
24727 * brig/gccbrig.texi: Removed.
24728 * brig/lang-specs.h: Removed.
24729 * brig/lang.opt: Removed.
24731 2021-05-11 Richard Biener <rguenther@suse.de>
24734 * ipa-param-manipulation.c
24735 (ipa_param_body_adjustments::modify_call_stmt): Avoid
24736 altering SSA_NAME_DEF_STMT by adjusting the calls LHS
24737 via gimple_call_lhs_ptr.
24739 2021-05-11 Alex Coplan <alex.coplan@arm.com>
24742 * config/arm/arm.c (cmse_nonsecure_call_inline_register_clear):
24743 Avoid emitting CFA adjusts on the sp if we have the fp.
24745 2021-05-11 Richard Sandiford <richard.sandiford@arm.com>
24747 * config/aarch64/iterators.md (VMUL_CHANGE_NLANES): Delete.
24748 (VMULD): New iterator.
24749 (VCOND): Handle V4HF and V8HF.
24750 (VCONQ): Fix entry for V2SF.
24751 * config/aarch64/aarch64-simd.md (mul_lane<mode>3): Use VMULD
24752 instead of VMUL. Use a 64-bit vector mode for the indexed operand.
24753 (*aarch64_mul3_elt_<vswap_width_name><mode>): Merge with...
24754 (mul_laneq<mode>3): ...this define_insn. Use VMUL instead of VDQSF.
24755 Use a 128-bit vector mode for the indexed operand. Use stype for
24756 the scheduling type.
24758 2021-05-11 Richard Biener <rguenther@suse.de>
24760 * gimple-fold.c (maybe_fold_reference): Only return
24761 is_gimple_min_invariant values.
24763 2021-05-11 Richard Biener <rguenther@suse.de>
24765 PR middle-end/100509
24766 * gimple-fold.c (fold_gimple_assign): Only call
24767 get_symbol_constant_value on register type symbols.
24769 2021-05-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
24770 Joe Ramsay <joe.ramsay@arm.com>
24773 * config/arm/arm_mve.h (__arm_vstrwq_scatter_offset): Fix wrong arguments.
24774 (__arm_vcmpneq): Remove duplicate definition.
24775 (__arm_vstrwq_scatter_offset_p): Likewise.
24776 (__arm_vmaxq_x): Likewise.
24777 (__arm_vmlsdavaq): Likewise.
24778 (__arm_vmlsdavaxq): Likewise.
24779 (__arm_vmlsdavq_p): Likewise.
24780 (__arm_vmlsdavxq_p): Likewise.
24781 (__arm_vrmlaldavhaq): Likewise.
24782 (__arm_vstrbq_p): Likewise.
24783 (__arm_vstrbq_scatter_offset): Likewise.
24784 (__arm_vstrbq_scatter_offset_p): Likewise.
24785 (__arm_vstrdq_scatter_offset): Likewise.
24786 (__arm_vstrdq_scatter_offset_p): Likewise.
24787 (__arm_vstrdq_scatter_shifted_offset): Likewise.
24788 (__arm_vstrdq_scatter_shifted_offset_p): Likewise.
24790 2021-05-11 Jakub Jelinek <jakub@redhat.com>
24792 PR middle-end/100471
24793 * omp-low.c (lower_omp_task_reductions): For OMP_TASKLOOP, if data
24794 is 0, bypass the reduction loop including
24795 GOMP_taskgroup_reduction_unregister call.
24797 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
24799 * config/rs6000/rs6000.c (struct rs6000_cost_data): New member
24800 costing_for_scalar.
24801 (rs6000_density_test): Early return if costing_for_scalar is true.
24802 (rs6000_init_cost): Init costing_for_scalar of rs6000_cost_data.
24804 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
24806 * doc/tm.texi: Regenerated.
24807 * target.def (init_cost): Add new parameter costing_for_scalar.
24808 * targhooks.c (default_init_cost): Adjust for new parameter.
24809 * targhooks.h (default_init_cost): Likewise.
24810 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Likewise.
24811 (vect_compute_single_scalar_iteration_cost): Likewise.
24812 (vect_analyze_loop_2): Likewise.
24813 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Likewise.
24814 (vect_bb_vectorization_profitable_p): Likewise.
24815 * tree-vectorizer.h (init_cost): Likewise.
24816 * config/aarch64/aarch64.c (aarch64_init_cost): Likewise.
24817 * config/i386/i386.c (ix86_init_cost): Likewise.
24818 * config/rs6000/rs6000.c (rs6000_init_cost): Likewise.
24820 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
24822 * config/rs6000/rs6000.c (rs6000_vect_nonmem): Renamed to
24823 vect_nonmem and moved into...
24824 (struct rs6000_cost_data): ...here.
24825 (rs6000_init_cost): Use vect_nonmem of cost_data instead.
24826 (rs6000_add_stmt_cost): Likewise.
24827 (rs6000_finish_cost): Likewise.
24829 2021-05-10 Eric Botcazou <ebotcazou@adacore.com>
24831 * range-op.cc (get_bool_state): Adjust head comment.
24832 (operator_not_equal::op1_range): Fix comment.
24833 (operator_bitwise_xor::op1_range): Remove call to gcc_unreachable.
24835 2021-05-10 Martin Sebor <msebor@redhat.com>
24837 PR middle-end/100425
24838 PR middle-end/100510
24839 * gimple-ssa-warn-alloca.c (pass_walloca::firast_time_p): Rename...
24840 (pass_walloca::xlimit_certain_p): ...to this.
24841 (pass_walloca::gate): Execute for any kind of handled warning.
24842 (pass_walloca::execute): Avoid issuing "maybe" and "unbounded"
24843 warnings when xlimit_certain_p is set.
24845 2021-05-10 Pat Haugen <pthaugen@linux.ibm.com>
24847 * config/rs6000/rs6000.c (rs6000_ira_change_pseudo_allocno_class):
24848 Return ALTIVEC_REGS if that is best_class.
24849 (rs6000_compute_pressure_classes): Add ALTIVEC_REGS.
24851 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
24853 * config/arm/arm.h (CPP_SPEC): Remove error message about
24856 2021-05-10 Martin Jambor <mjambor@suse.cz>
24858 * ipa-prop.h (IPA_NODE_REF): Removed.
24859 (IPA_NODE_REF_GET_CREATE): Likewise.
24860 (IPA_EDGE_REF): Likewise.
24861 (IPA_EDGE_REF_GET_CREATE): Likewise.
24862 (IS_VALID_JUMP_FUNC_INDEX): Likewise.
24863 * ipa-cp.c (print_all_lattices): Replaced IPA_NODE_REF with a direct
24864 use of ipa_node_params_sum.
24865 (ipcp_versionable_function_p): Likewise.
24866 (push_node_to_stack): Likewise.
24867 (pop_node_from_stack): Likewise.
24868 (set_single_call_flag): Replaced two IPA_NODE_REF with one single
24869 direct use of ipa_node_params_sum.
24870 (initialize_node_lattices): Replaced IPA_NODE_REF with a direct use of
24871 ipa_node_params_sum.
24872 (ipa_context_from_jfunc): Replaced IPA_EDGE_REF with a direct use of
24874 (ipcp_verify_propagated_values): Replaced IPA_NODE_REF with a direct
24875 use of ipa_node_params_sum.
24876 (self_recursively_generated_p): Likewise.
24877 (propagate_scalar_across_jump_function): Likewise.
24878 (propagate_context_across_jump_function): Replaced IPA_EDGE_REF with a
24879 direct use of ipa_edge_args_sum, moved the lookup after the early
24880 exit. Replaced IPA_NODE_REF with a direct use of ipa_node_params_sum.
24881 (propagate_bits_across_jump_function): Replaced IPA_NODE_REF with
24882 direct uses of ipa_node_params_sum.
24883 (propagate_vr_across_jump_function): Likewise.
24884 (propagate_aggregate_lattice): Likewise.
24885 (propagate_aggs_across_jump_function): Likewise.
24886 (propagate_constants_across_call): Likewise, also replaced
24887 IPA_EDGE_REF with a direct use of ipa_edge_args_sum.
24888 (good_cloning_opportunity_p): Replaced IPA_NODE_REF with a direct use
24889 of ipa_node_params_sum.
24890 (estimate_local_effects): Likewise.
24891 (add_all_node_vals_to_toposort): Likewise.
24892 (propagate_constants_topo): Likewise.
24893 (ipcp_propagate_stage): Likewise.
24894 (ipcp_discover_new_direct_edges): Likewise.
24895 (calls_same_node_or_its_all_contexts_clone_p): Likewise.
24896 (cgraph_edge_brings_value_p): Likewise (in both overloaded functions).
24897 (get_info_about_necessary_edges): Likewise.
24898 (want_remove_some_param_p): Likewise.
24899 (create_specialized_node): Likewise.
24900 (self_recursive_pass_through_p): Likewise.
24901 (self_recursive_agg_pass_through_p): Likewise.
24902 (find_more_scalar_values_for_callers_subset): Likewise and also
24903 replaced IPA_EDGE_REF with direct uses of ipa_edge_args_sum, in one
24904 case replacing two of those with a single query.
24905 (find_more_contexts_for_caller_subset): Likewise for the
24906 ipa_polymorphic_call_context overload.
24907 (intersect_aggregates_with_edge): Replaced IPA_EDGE_REF with a direct
24908 use of ipa_edge_args_sum. Replaced IPA_NODE_REF with direct uses of
24909 ipa_node_params_sum.
24910 (find_aggregate_values_for_callers_subset): Likewise, also reusing
24911 results of ipa_edge_args_sum->get.
24912 (cgraph_edge_brings_all_scalars_for_node): Replaced IPA_NODE_REF with
24913 direct uses of ipa_node_params_sum, replaced IPA_EDGE_REF with a
24914 direct use of ipa_edge_args_sum.
24915 (cgraph_edge_brings_all_agg_vals_for_node): Likewise, moved node
24916 summary query after the early exit and reused the result later.
24917 (decide_about_value): Replaced IPA_NODE_REF with a direct use of
24918 ipa_node_params_sum.
24919 (decide_whether_version_node): Likewise. Removed re-querying for
24920 summaries after cloning.
24921 (spread_undeadness): Replaced IPA_NODE_REF with a direct use of
24922 ipa_node_params_sum.
24923 (has_undead_caller_from_outside_scc_p): Likewise, reusing results of
24925 (identify_dead_nodes): Likewise.
24926 (ipcp_store_bits_results): Replaced IPA_NODE_REF with direct uses of
24927 ipa_node_params_sum.
24928 (ipcp_store_vr_results): Likewise.
24929 * ipa-fnsummary.c (evaluate_properties_for_edge): Likewise.
24930 (ipa_fn_summary_t::duplicate): Likewise.
24931 (analyze_function_body): Likewise.
24932 (estimate_calls_size_and_time): Likewise.
24933 (ipa_cached_call_context::duplicate_from): Likewise.
24934 (ipa_call_context::equal_to): Likewise.
24935 (remap_edge_params): Likewise.
24936 (ipa_merge_fn_summary_after_inlining): Likewise.
24937 (inline_read_section): Likewise.
24938 * ipa-icf.c (sem_function::param_used_p): Likewise.
24939 * ipa-modref.c (compute_parm_map): Likewise.
24940 (compute_parm_map): Replaced IPA_EDGE_REF with a direct use of
24942 (get_access_for_fnspec): Replaced IPA_NODE_REF with a direct use of
24943 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
24945 * ipa-profile.c (check_argument_count): Likewise.
24946 * ipa-prop.c (ipa_alloc_node_params): Replaced IPA_NODE_REF_GET_CREATE
24947 with a direct use of ipa_node_params_sum.
24948 (ipa_initialize_node_params): Likewise.
24949 (ipa_print_node_jump_functions_for_edge): Replaced IPA_EDGE_REF with a
24950 direct use of ipa_edge_args_sum and reused the query result.
24951 (ipa_compute_jump_functions_for_edge): Replaced IPA_NODE_REF with a
24952 direct use of ipa_node_params_sum and replaced IPA_EDGE_REF with a
24953 direct use of ipa_edge_args_sum.
24954 (ipa_note_param_call): Replaced IPA_NODE_REF with a direct use of
24955 ipa_node_params_sum and reused the result of the query.
24956 (ipa_analyze_node): Likewise.
24957 (ipa_analyze_controlled_uses): Replaced IPA_NODE_REF with a direct use
24958 of ipa_node_params_sum.
24959 (update_jump_functions_after_inlining): Replaced IPA_EDGE_REF with
24960 direct uses of ipa_edge_args_sum.
24961 (update_indirect_edges_after_inlining): Replaced IPA_NODE_REF with
24962 direct uses of ipa_node_params_sum and replaced IPA_EDGE_REF with a
24963 direct use of ipa_edge_args_sum. Removed superficial re-querying the
24965 (propagate_controlled_uses): Replaced IPA_NODE_REF with direct uses of
24966 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
24968 (ipa_propagate_indirect_call_infos): Replaced IPA_EDGE_REF with a
24969 direct use of ipa_edge_args_sum.
24970 (ipa_edge_args_sum_t::duplicate): Replaced IPA_NODE_REF with a direct
24971 use of ipa_node_params_sum.
24972 (ipa_print_node_params): Likewise.
24973 (ipa_write_node_info): Likewise and also replaced IPA_EDGE_REF with
24974 direct uses of ipa_edge_args_sum.
24975 (ipa_read_edge_info): Replaced IPA_EDGE_REF with a direct use of
24977 (ipa_read_node_info): Replaced IPA_NODE_REF with a direct use of
24978 ipa_node_params_sum.
24979 (ipa_prop_write_jump_functions): Likewise. Move variable node to the
24980 scopes where it is used.
24982 2021-05-10 Uroš Bizjak <ubizjak@gmail.com>
24984 * config/i386/i386-expand.c (ix86_expand_sse_movcc)
24985 <case E_V2SImode>: Force op_true to register.
24987 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
24989 * config/arm/iterators.md (MVE_FP_COMPARISONS): New.
24990 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_f<mode>)
24991 (mve_vcmp<mve_cmp_op>q_n_f<mode>): New, merge all vcmp_*f*
24993 (mve_vcmpeqq_f<mode>, mve_vcmpeqq_n_f<mode>, mve_vcmpgeq_f<mode>)
24994 (mve_vcmpgeq_n_f<mode>, mve_vcmpgtq_f<mode>)
24995 (mve_vcmpgtq_n_f<mode>, mve_vcmpleq_f<mode>)
24996 (mve_vcmpleq_n_f<mode>, mve_vcmpltq_f<mode>)
24997 (mve_vcmpltq_n_f<mode>, mve_vcmpneq_f<mode>)
24998 (mve_vcmpneq_n_f<mode>): Remove.
24999 * config/arm/unspecs.md (VCMPEQQ_F, VCMPEQQ_N_F, VCMPGEQ_F)
25000 (VCMPGEQ_N_F, VCMPGTQ_F, VCMPGTQ_N_F, VCMPLEQ_F, VCMPLEQ_N_F)
25001 (VCMPLTQ_F, VCMPLTQ_N_F, VCMPNEQ_F, VCMPNEQ_N_F): Remove.
25003 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25005 * config/arm/iterators.md (MVE_COMPARISONS): New.
25007 (mve_cmp_type): New.
25008 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_<mode>): New, merge all
25010 (mve_vcmpneq_<mode>, mve_vcmpcsq_n_<mode>, mve_vcmpcsq_<mode>)
25011 (mve_vcmpeqq_n_<mode>, mve_vcmpeqq_<mode>, mve_vcmpgeq_n_<mode>)
25012 (mve_vcmpgeq_<mode>, mve_vcmpgtq_n_<mode>, mve_vcmpgtq_<mode>)
25013 (mve_vcmphiq_n_<mode>, mve_vcmphiq_<mode>, mve_vcmpleq_n_<mode>)
25014 (mve_vcmpleq_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
25015 (mve_vcmpneq_n_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
25016 (mve_vcmpneq_n_<mode>): Remove.
25018 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25020 * config/arm/arm_mve.h (__arm_vcmp*): Remove 's' suffix.
25021 * config/arm/arm_mve_builtins.def (vcmp*): Remove 's' suffix.
25022 * config/arm/mve.md (mve_vcmp*): Remove 's' suffix in pattern
25025 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25027 * config/arm/arm_mve_builtins.def (vcmpneq_u): Remove.
25028 (vcmpneq_n_u): Likewise.
25029 (vcmpeqq_u,): Likewise.
25030 (vcmpeqq_n_u): Likewise.
25031 * config/arm/iterators.md (supf): Remove VCMPNEQ_U, VCMPEQQ_U,
25032 VCMPEQQ_N_U and VCMPNEQ_N_U.
25033 * config/arm/mve.md (mve_vcmpneq): Remove <supf> iteration.
25034 (mve_vcmpeqq_n): Likewise.
25035 (mve_vcmpeqq): Likewise.
25036 (mve_vcmpneq_n): Likewise.
25038 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
25040 * config/arm/arm_mve.h (__arm_vcmpeq*u*, __arm_vcmpne*u*): Call
25041 the 's' version of the builtin.
25043 2021-05-10 Richard Biener <rguenther@suse.de>
25045 PR tree-optimization/100492
25046 * tree-loop-distribution.c (find_seed_stmts_for_distribution):
25047 Find nothing when the loop contains an irreducible region.
25049 2021-05-10 Richard Biener <rguenther@suse.de>
25051 PR middle-end/100464
25053 * gimple-fold.c (canonicalize_constructor_val): Do not set
25056 2021-05-10 Richard Biener <rguenther@suse.de>
25058 PR tree-optimization/100434
25059 * tree-ssa-dse.c (initialize_ao_ref_for_dse): Handle
25061 (dse_optimize_stmt): Handle call LHS by dropping the
25062 LHS or the whole call if it doesn't have other
25064 (pass_dse::execute): Adjust.
25066 2021-05-10 Martin Liska <mliska@suse.cz>
25068 * Makefile.in: Add missing genversion rule.
25070 2021-05-10 Alex Coplan <alex.coplan@arm.com>
25073 * config/arm/mve.md (*mve_mov<mode>): Simplify output code. Use
25074 vldrw.u32 and vstrw.32 for V2D[IF]mode loads and stores.
25076 2021-05-10 Martin Liska <mliska@suse.cz>
25078 * builtins.c (is_builtin_name): Use startswith
25079 function instead of strncmp.
25080 * collect2.c (main): Likewise.
25081 (has_lto_section): Likewise.
25082 (scan_libraries): Likewise.
25083 * coverage.c (coverage_checksum_string): Likewise.
25084 (coverage_init): Likewise.
25085 * dwarf2out.c (is_cxx): Likewise.
25086 (gen_compile_unit_die): Likewise.
25087 * gcc-ar.c (main): Likewise.
25088 * gcc.c (init_spec): Likewise.
25089 (read_specs): Likewise.
25090 (execute): Likewise.
25091 (check_live_switch): Likewise.
25092 * genattrtab.c (write_attr_case): Likewise.
25093 (IS_ATTR_GROUP): Likewise.
25094 * gencfn-macros.c (main): Likewise.
25095 * gengtype.c (type_for_name): Likewise.
25096 (gen_rtx_next): Likewise.
25097 (get_file_langdir): Likewise.
25098 (write_local): Likewise.
25099 * genmatch.c (get_operator): Likewise.
25100 (get_operand_type): Likewise.
25101 (expr::gen_transform): Likewise.
25102 * genoutput.c (validate_optab_operands): Likewise.
25103 * incpath.c (add_sysroot_to_chain): Likewise.
25104 * langhooks.c (lang_GNU_C): Likewise.
25105 (lang_GNU_CXX): Likewise.
25106 (lang_GNU_Fortran): Likewise.
25107 (lang_GNU_OBJC): Likewise.
25108 * lto-wrapper.c (run_gcc): Likewise.
25109 * omp-general.c (omp_max_simt_vf): Likewise.
25110 * omp-low.c (omp_runtime_api_call): Likewise.
25111 * opts-common.c (parse_options_from_collect_gcc_options): Likewise.
25112 * read-rtl-function.c (function_reader::read_rtx_operand_r): Likewise.
25113 * real.c (real_from_string): Likewise.
25114 * selftest.c (assert_str_startswith): Likewise.
25115 * timevar.c (timer::validate_phases): Likewise.
25116 * tree.c (get_file_function_name): Likewise.
25117 * ubsan.c (ubsan_use_new_style_p): Likewise.
25118 * varasm.c (default_function_rodata_section): Likewise.
25119 (incorporeal_function_p): Likewise.
25120 (default_section_type_flags): Likewise.
25121 * system.h (startswith): Define startswith.
25123 2021-05-10 Martin Liska <mliska@suse.cz>
25125 * bitmap.h (class auto_bitmap): Remove
25126 __cplusplus >= 201103.
25127 * config/aarch64/aarch64.c: Likewise.
25128 * gimple-ssa-store-merging.c (store_immediate_info::store_immediate_info):
25130 * sbitmap.h: Likewise.
25132 2021-05-10 Martin Liska <mliska@suse.cz>
25134 * Makefile.in: Rename gcov-iov to genversion and depend
25135 on version.h (instead of gcov-iov.h).
25136 * gcov-io.h: Include version.h instread of gcov-iov.h.
25137 * gengtype-state.c (read_state_version): Likewise.
25138 * gcov-iov.c: Moved to...
25139 * genversion.c: ...here.
25140 * lto-streamer.h (LTO_major_version): Define it with
25142 * version.c: Removed.
25143 * version.h: Removed.
25145 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25147 * config/arc/arc.md (UNSPEC_ARC_DMPYWH): Define.
25148 * config/arc/simdext.md (VCT): Add predicates for iterator
25151 (voptab): Likewise.
25152 (vec_widen_<V_US>mult_hi_v4hi): Change pattern predicate.
25153 (<voptab>v2si3): New patterns.
25155 (reduc_plus_scal_v4hi): Likewise.
25156 (reduc_plus_scal_v2si): Likewise.
25157 (vec_duplicatev2si): Likewise.
25158 (vec_duplicatev4hi): Likewise.
25160 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25162 * config/arc/simdext.md: Format and cleanup file.
25164 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25166 * config/arc/simdext.md (movmisalignv2hi): Allow misaligned access
25167 only when munaligned-access option is on.
25168 (movmisalign<mode>): Likewise.
25170 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25172 * common/config/arc/arc-common.c (arc_handle_option): Remove dot
25174 * config/arc/arc.c (arc_reorg): Remove underscore from string.
25176 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25178 * config/arc/arc.h (CLZ_DEFINED_VALUE_AT_ZERO): Define.
25179 (CTZ_DEFINED_VALUE_AT_ZERO): Likewise.
25180 * config/arc/arc.md (clrsbsi2): Cleanup pattern.
25181 (norm_f): Likewise.
25184 (clzsi2): Use fls instruction when available.
25185 (arc_clzsi2): Likewise.
25187 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25189 * config/arc/arc.h (ADDITIONAL_REGISTER_NAMES): Add r26 and r27.
25191 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
25193 * doc/extend.texi (__builtin_arc_sr): Swap arguments.
25195 2021-05-10 Bernd Edlinger <bernd.edlinger@hotmail.de>
25197 PR middle-end/100467
25198 * toplev.c (compile_file): Call insn_locations_init before
25199 targetm.asm_out.code_end.
25201 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
25204 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
25206 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
25208 2021-05-07 Jakub Jelinek <jakub@redhat.com>
25209 Andrew Stubbs <amd@codesourcery.com>
25212 * builtins.c (try_store_by_multiple_pieces): Use force_operand for
25213 emit_move_insn operands.
25215 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
25217 * cfgexpand.c (expand_gimple_basic_block): Do not inherit a current
25218 location for the outgoing edges of an empty block.
25219 * dwarf2out.c (add_subscript_info): Retrieve the bounds and index
25220 type by means of the get_array_descr_info langhook, if it is set and
25221 returns true. Remove obsolete code dealing with unnamed subtypes.
25223 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25225 * gimple-range-cache.cc (ssa_block_ranges): Virtualize.
25226 (sbr_vector): Renamed from ssa_block_cache.
25227 (sbr_vector::sbr_vector): Allocate from obstack abd initialize.
25228 (ssa_block_ranges::~ssa_block_ranges): Remove.
25229 (sbr_vector::set_bb_range): Use varying and undefined cached values.
25230 (ssa_block_ranges::set_bb_varying): Remove.
25231 (sbr_vector::get_bb_range): Adjust assert.
25232 (sbr_vector::bb_range_p): Adjust assert.
25233 (~block_range_cache): No freeing loop required.
25234 (block_range_cache::get_block_ranges): Remove.
25235 (block_range_cache::set_bb_range): Inline get_block_ranges.
25236 (block_range_cache::set_bb_varying): Remove.
25237 * gimple-range-cache.h (set_bb_varying): Remove prototype.
25238 * value-range.h (irange_allocator::get_memory): New.
25240 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25242 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Search
25243 dominator tree is available and requested.
25244 (ranger_cache::ssa_range_in_bb): Don't search dom tree here.
25245 (ranger_cache::fill_block_cache): Don't search dom tree here either.
25246 * gimple-range-cache.h (non_null_deref_p): Add dom_search param.
25248 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25250 * gimple-range.cc (gimple_ranger::range_on_exit): Handle block with
25251 only PHI nodes better.
25253 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25255 * gimple-range-edge.h (gimple_outgoing_range): Rename from
25257 (gcond_edge_range): Export prototype.
25258 * gimple-range-edge.cc (gcond_edge_range): New.
25259 (gimple_outgoing_range::edge_range_p): Use gcond_edge_range.
25260 * gimple-range-gori.h (gori_compute): Use gimple_outgoing_range.
25262 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25264 * gimple-range-edge.cc (outgoing_range::calc_switch_ranges): Compute
25265 default range into a temp and allocate only what is needed.
25267 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25269 * range-op.cc (operator_trunc_mod::wi_fold): x % 0 is UNDEFINED.
25271 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25273 * gimple-range.h (gimple_range_global): Pick up parameter initial
25274 values, and use-before defined locals are UNDEFINED.
25276 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
25278 * doc/extend.texi (scalar_storage_order): Mention effect on pointer
25280 * tree.h (reverse_storage_order_for_component_p): Return false if
25281 the type is a pointer.
25283 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
25285 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
25287 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
25290 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
25291 Handle V8QI, V4HI and V2SI modes.
25292 * config/i386/mmx.md (mmx_pblendvb): New insn pattern.
25293 * config/i386/sse.md (unspec): Move UNSPEC_BLENDV ...
25294 * config/i386/i386.md (unspec): ... here.
25296 2021-05-07 Tobias Burnus <tobias@codesourcery.com>
25297 Tom de Vries <tdevries@suse.de>
25299 * omp-low.c (lower_rec_simd_input_clauses): Set max_vf = 1 if
25300 a truth_value_p reduction variable is nonintegral.
25302 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
25305 * config/i386/i386-expand.c (ix86_use_mask_cmp_p):
25306 Return false for mode sizes < 16.
25308 2021-05-07 Jakub Jelinek <jakub@redhat.com>
25311 * config/i386/mmx.md (*xop_pcmov_<mode>): New define_insn.
25313 2021-05-06 Martin Jambor <mjambor@suse.cz>
25315 * ipa-sra.c (ipa_sra_dump_all_summaries): Dump edge summaries even
25316 when there is no function summary.
25317 (ipa_sra_summarize_function): produce edge summaries even when
25320 2021-05-06 Tom Tromey <tom@tromey.com>
25322 * godump.c (string_hash_eq): Remove.
25323 (go_finish): Use htab_eq_string.
25325 2021-05-06 Tom Tromey <tom@tromey.com>
25327 * gengtype-state.c (read_state): Use htab_eq_string.
25328 (string_eq): Remove.
25330 2021-05-06 Tom Tromey <tom@tromey.com>
25332 * gensupport.c (htab_eq_string): Remove.
25334 2021-05-06 Bernd Edlinger <bernd.edlinger@hotmail.de>
25337 * debug.h (gcc_debug_hooks): Add set_ignored_loc function pointer.
25338 * dwarf2out.h (dw_fde_node::ignored_debug): New data item.
25339 * dbxout.c (dbx_debug_hooks, xcoff_debug_hooks): Add dummy
25340 set_ignored_loc callbacks.
25341 * debug.c (do_nothing_debug_hooks): Likewise.
25342 * vmsdbgout.c (vmsdbg_debug_hooks): Likewise.
25343 * dwarf2out.c (text_section_used, cold_text_section_used): Remove.
25344 (in_text_section_p, last_text_label, last_cold_label,
25345 switch_text_ranges, switch_cold_ranges): New data items.
25346 (dwarf2out_note_section_used): Remove.
25347 (dwarf2out_begin_prologue): Set fde->ignored_debug and
25349 (mark_ignored_debug_section): New helper function.
25350 (dwarf2out_end_epilogue, dwarf2out_switch_text_section): Call
25351 mark_ignored_debug_section.
25352 (dwarf2_debug_hooks): Use dwarf2out_set_ignored_loc.
25353 (dwarf2_lineno_debug_hooks): Use dummy for set_ignored_loc.
25354 (size_of_aranges): Adjust formula for multi-part text ranges size.
25355 (output_aranges): Output multi-part text ranges.
25356 (dwarf2out_set_ignored_loc): New callback function.
25357 (dwarf2out_finish): Output multi-part text ranges.
25358 (dwarf2out_c_finalize): Clear new data items.
25359 * final.c (final_start_function_1): Call set_ignored_loc callback.
25360 (final_scan_insn_1): Likewise.
25361 * ggc-page.c (gt_ggc_mx): New helper function.
25362 * stringpool.c (gt_pch_nx): Likewise.
25364 2021-05-06 Richard Biener <rguenther@suse.de>
25366 * timevar.def (TV_TREE_INSERT_PHI_NODES): Remove.
25367 (TV_TREE_SSA_REWRITE_BLOCKS): Likewise.
25368 (TV_TREE_INTO_SSA): New.
25369 * tree-into-ssa.c (insert_phi_nodes): Do not account separately.
25370 (rewrite_blocks): Likewise.
25371 (pass_data_build_ssa): Account to TV_TREE_INTO_SSA.
25373 2021-05-06 Jakub Jelinek <jakub@redhat.com>
25375 * tree-ssa-phiopt.c (value_replacement, minmax_replacement,
25376 abs_replacement, xor_replacement,
25377 cond_removal_in_popcount_clz_ctz_pattern,
25378 replace_phi_edge_with_variable): Change type of phi argument from
25379 gimple * to gphi *.
25381 2021-05-06 Richard Biener <rguenther@suse.de>
25383 * tree-ssa-loop-split.c (split_loop): Delay updating SSA form.
25384 Output an opt-info message.
25385 (do_split_loop_on_cond): Likewise.
25386 (tree_ssa_split_loops): Update SSA form here.
25388 2021-05-06 Richard Biener <rguenther@suse.de>
25390 * tree-inline.c (tree_function_versioning): Fix DECL_BY_REFERENCE
25391 return variable removal.
25393 2021-05-06 Marius Hillenbrand <mhillen@linux.ibm.com>
25395 * config/s390/s390-builtins.def (O_M5, O1_M5, ...): Remove unused macros.
25396 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
25397 (s390_vec_permi_dbl, s390_vpdi): Use the O3_U2 type for the immediate
25399 * config/s390/s390.c (s390_const_operand_ok): Remove unused
25402 2021-05-06 Jakub Jelinek <jakub@redhat.com>
25404 PR tree-optimization/94589
25405 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Call
25406 spaceship_replacement.
25407 (cond_only_block_p, spaceship_replacement): New functions.
25409 2021-05-06 Richard Biener <rguenther@suse.de>
25412 * tree-emutls.c (gen_emutls_addr): Pass in whether we're
25413 dealing with a debug use and only query existing addresses
25415 (lower_emutls_1): Avoid splitting out addresses for debug
25416 stmts, reset the debug stmt when we fail to find existing
25418 (lower_emutls_phi_arg): Set wi.stmt.
25420 2021-05-06 Christoph Muellner <cmuellner@gcc.gnu.org>
25423 * config/riscv/riscv.c (riscv_block_move_loop): Use cbranch helper.
25424 * config/riscv/riscv.md (cbranch<mode>4): Generate helpers.
25425 (stack_protect_test): Use cbranch helper.
25427 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
25430 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
25431 always return the establisher frame for __builtin_frame_address (0).
25433 2021-05-05 Ivan Sorokin <vanyacpp@gmail.com>
25436 * config/i386/i386-builtins.c (ix86_cpu_model_type_node): New.
25437 (ix86_cpu_model_var): Likewise.
25438 (ix86_cpu_features2_type_node): Likewise.
25439 (ix86_cpu_features2_var): Likewise.
25440 (fold_builtin_cpu): Cache __cpu_model and __cpu_features2 with
25443 2021-05-05 Martin Sebor <msebor@redhat.com>
25445 * passes.def (pass_warn_printf): Run after SSA.
25447 2021-05-05 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
25449 * config/arm/neon.md (neon_vtst_combine<mode>): New pattern.
25450 * config/arm/predicates.md (minus_one_operand): New predicate.
25452 2021-05-05 Jeff Law <jlaw@tachyum.com>
25454 * config/avr/avr.md: Remove references to CC_STATUS_INIT.
25456 2021-05-05 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
25458 PR rtl-optimization/100263
25459 * postreload.c (move2add_valid_value_p): Ensure register can
25462 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
25464 PR rtl-optimization/100411
25465 * cfgcleanup.c (try_crossjump_to_edge): Also skip end of prologue
25466 and beginning of function markers.
25468 2021-05-05 Jeff Law <jlaw@tachyum.com>
25470 * config/cr16/cr16.h (NOTICE_UPDATE_CC): Remove.
25471 * config/cr16/cr16.c (notice_update_cc): Remove.
25472 * config/cr16/cr16-protos.h (notice_update_cc): Remove.
25474 2021-05-05 Uroš Bizjak <ubizjak@gmail.com>
25477 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
25478 Handle V8QI, V4HI and V2SI modes.
25479 * config/i386/i386.c (ix86_build_const_vector): Handle V2SImode.
25480 (ix86_build_signbit_mask): Ditto.
25481 * config/i386/mmx.md (MMXMODE14): New mode iterator.
25482 (<smaxmin:code><MMXMODE14:mode>3): New expander.
25483 (*mmx_<smaxmin:code><MMXMODE14:mode>3): New insn pattern.
25484 (<umaxmin:code><MMXMODE24:mode>3): New expander.
25485 (*mmx_<umaxmin:code><MMXMODE24:mode>3): New insn pattern.
25486 (vec_cmp<MMXMODEI:mode><MMXMODEI:mode>): New expander.
25487 (vec_cmpu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
25488 (vcond<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
25489 (vcondu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
25490 (vcond_mask_<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
25492 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
25494 * dwarf2out.c (loc_list_from_tree_1) <DECL>: During early DWARF, do
25495 not expand the VALUE_EXPR of variables put in the non-local frame.
25496 * gimplify.c (gimplify_type_sizes) <RECORD_TYPE>: If the type is not
25497 to be ignored for debug info, ensure its variable offsets are not.
25499 2021-05-05 Richard Biener <rguenther@suse.de>
25501 PR tree-optimization/79333
25502 * tree-ssa-sccvn.c (eliminate_dom_walker::eliminate_stmt):
25503 Fold stmt following SSA edges.
25505 2021-05-05 Richard Biener <rguenther@suse.de>
25507 PR middle-end/100394
25508 * calls.c (expand_call): Preserve possibly throwing calls.
25509 * cfgexpand.c (expand_call_stmt): When a call can throw signal
25510 RTL expansion there are side-effects.
25511 * tree-ssa-dce.c (mark_stmt_if_obviously_necessary): Simplify,
25512 mark all possibly throwing stmts necessary unless we can elide
25514 * tree-ssa-dse.c (pass_dse::execute): Preserve exceptions unless
25515 -fdelete-dead-exceptions.
25516 * tree.h (DECL_PURE_P): Add note about exceptions.
25518 2021-05-05 Alexandre Oliva <oliva@adacore.com>
25520 * config/i386/vxworks.h (DBX_REGISTER_NUMBER): Make it
25523 2021-05-04 David Edelsohn <dje.gcc@gmail.com>
25525 * config/rs6000/rs6000-call.c (rs6000_output_mi_thunk): Use
25526 get_fnname_from_decl for name of thunk.
25527 * config/rs6000/rs6000.c (rs6000_declare_alias): Use assemble_name
25528 and ASM_OUTPUT_LABEL.
25529 (rs6000_xcoff_declare_function_name): Use assemble_name and
25531 (rs6000_xcoff_declare_object_name): Use ASM_OUTPUT_LABEL.
25532 (rs6000_xcoff_encode_section_info): Don't add mapping class
25533 for aliases. Always add [DS] mapping class to primary
25535 (rs6000_asm_weaken_decl): Don't explicitly add [DS].
25537 2021-05-04 Martin Sebor <msebor@redhat.com>
25539 PR middle-end/100307
25540 * builtins.c (compute_objsize_r): Clear base0 for pointers.
25542 2021-05-04 Jeff Law <jlaw@tachyum.com>
25544 * config/bfin/bfin.h (NOTICE_UPDATE_CC): Remove.
25546 2021-05-04 Segher Boessenkool <segher@kernel.crashing.org>
25548 * caller-save.c: Remove CC0.
25549 * cfgcleanup.c: Remove CC0.
25550 * cfgrtl.c: Remove CC0.
25551 * combine.c: Remove CC0.
25552 * compare-elim.c: Remove CC0.
25553 * conditions.h: Remove CC0.
25554 * config/h8300/h8300.h: Remove CC0.
25555 * config/h8300/h8300-protos.h: Remove CC0.
25556 * config/h8300/peepholes.md: Remove CC0.
25557 * config/i386/x86-tune-sched.c: Remove CC0.
25558 * config/m68k/m68k.c: Remove CC0.
25559 * config/rl78/rl78.c: Remove CC0.
25560 * config/sparc/sparc.c: Remove CC0.
25561 * config/xtensa/xtensa.c: Remove CC0.
25562 (gen_conditional_move): Use pc_rtx instead of cc0_rtx in a piece of
25563 RTL where that is used as a placeholder only.
25564 * cprop.c: Remove CC0.
25565 * cse.c: Remove CC0.
25566 * cselib.c: Remove CC0.
25567 * df-problems.c: Remove CC0.
25568 * df-scan.c: Remove CC0.
25569 * doc/md.texi: Remove CC0. Adjust an example.
25570 * doc/rtl.texi: Remove CC0. Adjust an example.
25571 * doc/tm.texi: Regenerate.
25572 * doc/tm.texi.in: Remove CC0.
25573 * emit-rtl.c: Remove CC0.
25574 * final.c: Remove CC0.
25575 * fwprop.c: Remove CC0.
25576 * gcse-common.c: Remove CC0.
25577 * gcse.c: Remove CC0.
25578 * genattrtab.c: Remove CC0.
25579 * genconfig.c: Remove CC0.
25580 * genemit.c: Remove CC0.
25581 * genextract.c: Remove CC0.
25582 * gengenrtl.c: Remove CC0.
25583 * genrecog.c: Remove CC0.
25584 * haifa-sched.c: Remove CC0.
25585 * ifcvt.c: Remove CC0.
25586 * ira-costs.c: Remove CC0.
25587 * ira.c: Remove CC0.
25588 * jump.c: Remove CC0.
25589 * loop-invariant.c: Remove CC0.
25590 * lra-constraints.c: Remove CC0.
25591 * lra-eliminations.c: Remove CC0.
25592 * optabs.c: Remove CC0.
25593 * postreload-gcse.c: Remove CC0.
25594 * postreload.c: Remove CC0.
25595 * print-rtl.c: Remove CC0.
25596 * read-rtl-function.c: Remove CC0.
25597 * reg-notes.def: Remove CC0.
25598 * reg-stack.c: Remove CC0.
25599 * reginfo.c: Remove CC0.
25600 * regrename.c: Remove CC0.
25601 * reload.c: Remove CC0.
25602 * reload1.c: Remove CC0.
25603 * reorg.c: Remove CC0.
25604 * resource.c: Remove CC0.
25605 * rtl.c: Remove CC0.
25606 * rtl.def: Remove CC0.
25607 * rtl.h: Remove CC0.
25608 * rtlanal.c: Remove CC0.
25609 * sched-deps.c: Remove CC0.
25610 * sched-rgn.c: Remove CC0.
25611 * shrink-wrap.c: Remove CC0.
25612 * simplify-rtx.c: Remove CC0.
25613 * system.h: Remove CC0. Poison NOTICE_UPDATE_CC, CC_STATUS_MDEP_INIT,
25614 CC_STATUS_MDEP, and CC_STATUS.
25615 * target.def: Remove CC0.
25616 * valtrack.c: Remove CC0.
25617 * var-tracking.c: Remove CC0.
25619 2021-05-04 Richard Biener <rguenther@suse.de>
25621 PR tree-optimization/100414
25622 * tree-ssa-phiopt.c (get_non_trapping): Do not compute dominance
25624 (tree_ssa_phiopt_worker): But unconditionally here.
25626 2021-05-04 Tobias Burnus <tobias@codesourcery.com>
25628 * omp-low.c (lower_rec_input_clauses, lower_reduction_clauses): Handle
25629 && and || with floating-point and complex arguments.
25631 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
25633 * tree-inline.c (insert_debug_decl_map): Delete.
25634 (copy_debug_stmt): Minor tweak.
25635 (setup_one_parameter): Do not use a variable if the value is either
25636 a read-only DECL or a non-addressable local variable in the caller.
25637 In this case, insert the debug-only variable in the map manually.
25638 (expand_call_inline): Do not generate a CLOBBER for these values.
25639 * tree-inline.h (debug_map): Minor tweak.
25641 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
25643 * builtins.c (builtin_with_linkage_p): Return true for stp[n]cpy.
25644 * symtab.c (symtab_node::output_to_lto_symbol_table_p): Tidy up.
25646 2021-05-04 Richard Biener <rguenther@suse.de>
25648 PR tree-optimization/100329
25649 * tree-ssa-reassoc.c (can_reassociate_p): Do not reassociate
25651 (insert_stmt_after): Assert we're not running into asm goto.
25653 2021-05-04 Richard Biener <rguenther@suse.de>
25655 PR tree-optimization/100398
25656 * tree-ssa-dse.c (pass_dse::execute): Preserve control
25659 2021-05-04 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
25661 * builtins.c (try_store_by_multiple_pieces): Fix constfun's prototype.
25663 2021-05-04 Alexandre Oliva <oliva@adacore.com>
25665 * builtins.c (try_store_by_multiple_pieces): New.
25666 (expand_builtin_memset_args): Use it. If target_char_cast
25667 fails, proceed as for non-constant val. Pass len's ctz to...
25668 * expr.c (clear_storage_hints): ... this. Try store by
25669 multiple pieces after setmem.
25670 (clear_storage): Adjust.
25671 * expr.h (clear_storage_hints): Likewise.
25672 (try_store_by_multiple_pieces): Declare.
25673 * passes.def: Replace the last copy_prop with ccp.
25675 2021-05-03 Tom de Vries <tdevries@suse.de>
25678 * omp-low.c (lower_rec_input_clauses): Disable SIMT for user-defined
25681 2021-05-03 Richard Biener <rguenther@suse.de>
25683 * tree-ssa-dse.c (dse_classify_store): Track two PHI defs.
25685 2021-05-03 Richard Biener <rguenther@suse.de>
25687 * tree-ssa-dse.c: Do not include domwalk.h but cfganal.h.
25688 (dse_dom_walker): Remove.
25689 (dse_dom_walker::dse_optimize_stmt): Rename...
25690 (dse_optimize_stmt): ... to this, pass in live_bytes sbitmap.
25691 (dse_dom_walker::before_dom_children): Inline ...
25692 (pass_dse::execute): ... here. Perform a reverse program
25695 2021-05-03 H.J. Lu <hjl.tools@gmail.com>
25698 * configure: Regenerated.
25700 2021-05-03 Ilya Leoshkevich <iii@linux.ibm.com>
25703 * config/s390/s390.c (s390_hard_fp_reg_p): New function.
25704 (s390_md_asm_adjust): Handle hard registers.
25706 2021-05-03 Jakub Jelinek <jakub@redhat.com>
25708 PR tree-optimization/100382
25709 * tree-ssa-dse.c: Include tree-eh.h.
25710 (dse_dom_walker::before_dom_children): Don't remove stmts if
25711 stmt_unremovable_because_of_non_call_eh_p is true.
25713 2021-05-02 David Edelsohn <dje.gcc@gmail.com>
25715 * varasm.c (compute_reloc_for_var): Split out from...
25716 (get_variable_section): Use it.
25717 * output.h (compute_reloc_for_var): Declare.
25718 * config/rs6000/rs6000-protos.h
25719 (rs6000_xcoff_asm_output_aligned_decl_common): Change alignment to
25721 * config/rs6000/rs6000.c (rs6000_legitimize_tls_address_aix):
25722 Don't append storage mapping class to symbol.
25723 (rs6000_xcoff_asm_named_section): Add BS and UL mapping classes.
25724 Don't convert TLS BSS to common.
25725 (rs6000_xcoff_unique_section): Don't fall back to select_secton.
25726 (rs6000_xcoff_section_type_flags): Add SECTION_BSS if DECL is
25728 (rs6000_xcoff_asm_globalize_decl_name): Don't strip storage
25730 (rs6000_xcoff_asm_output_aligned_decl_common): Align is unsigned int.
25731 If align is 0 from TLS class, use the same rules as varasm.c
25732 If not common, switch to BSS section manually.
25733 If common, emit appropriate comm or lcomm directive.
25734 (rs6000_xcoff_encode_section_info): Add logic to append all
25735 storage mapping classes.
25736 (rs6000_asm_weaken_decl): Adjust for qualname symbols.
25737 * config/rs6000/xcoff.h (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
25738 rs6000_xcoff_asm_output_aligned_decl_common.
25739 (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
25740 rs6000_xcoff_asm_output_aligned_decl_common.
25741 (ASM_OUTPUT_TLS_COMMON): Use
25742 rs6000_xcoff_asm_output_aligned_decl_common.
25744 2021-05-02 Jakub Jelinek <jakub@redhat.com>
25747 * config/nvptx/nvptx.c (nvptx_sese_pseudo): Use nullptr instead of 0
25748 as first argument of pseudo_node_t constructors.
25750 2021-05-02 Jakub Jelinek <jakub@redhat.com>
25753 * config/i386/t-i386 (TM_H): Add $(srcdir)/config/i386/i386-isa.def.
25755 2021-05-01 Aldy Hernandez <aldyh@redhat.com>
25757 * value-range.cc (DEFINE_INT_RANGE_GC_STUBS): Remove.
25758 (gt_pch_nx (int_range<1> *&)): New.
25759 (gt_ggc_mx (int_range<1> *&)): New.
25760 * value-range.h (class irange): Add GTY support for
25763 2021-05-01 Geng Qi <gengqi@linux.alibaba.com>
25765 * doc/options.texi (Negative): Change either or to both and.
25767 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
25769 * config/aarch64/aarch64-simd-builtins.def: Add
25770 float_ml[as][q]_laneq builtin generator macros.
25771 * config/aarch64/aarch64-simd.md (mul_laneq<mode>3): Define.
25772 (aarch64_float_mla_laneq<mode>): Define.
25773 (aarch64_float_mls_laneq<mode>): Define.
25774 * config/aarch64/arm_neon.h (vmla_laneq_f32): Use RTL builtin
25775 instead of GCC vector extensions.
25776 (vmlaq_laneq_f32): Likewise.
25777 (vmls_laneq_f32): Likewise.
25778 (vmlsq_laneq_f32): Likewise.
25780 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
25782 * config/aarch64/aarch64-simd-builtins.def: Add
25783 float_ml[as]_lane builtin generator macros.
25784 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt<mode>):
25786 (mul_lane<mode>3): This, and re-order arguments.
25787 (aarch64_float_mla_lane<mode>): Define.
25788 (aarch64_float_mls_lane<mode>): Define.
25789 * config/aarch64/arm_neon.h (vmla_lane_f32): Use RTL builtin
25790 instead of GCC vector extensions.
25791 (vmlaq_lane_f32): Likewise.
25792 (vmls_lane_f32): Likewise.
25793 (vmlsq_lane_f32): Likewise.
25795 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
25797 * config/aarch64/aarch64-simd-builtins.def: Add float_ml[as]
25798 builtin generator macros.
25799 * config/aarch64/aarch64-simd.md (aarch64_float_mla<mode>):
25801 (aarch64_float_mls<mode>): Define.
25802 * config/aarch64/arm_neon.h (vmla_f32): Use RTL builtin
25803 instead of relying on GCC vector extensions.
25804 (vmla_f64): Likewise.
25805 (vmlaq_f32): Likewise.
25806 (vmlaq_f64): Likewise.
25807 (vmls_f32): Likewise.
25808 (vmls_f64): Likewise.
25809 (vmlsq_f32): Likewise.
25810 (vmlsq_f64): Likewise.
25811 * config/aarch64/iterators.md: Define VDQF_DF mode iterator.
25813 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
25815 * config/aarch64/aarch64-simd-builtins.def: Add
25816 float_ml[as]_n_builtin generator macros.
25817 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt_from_dup<mode>):
25819 (mul_n<mode>3): This, and re-order arguments.
25820 (aarch64_float_mla_n<mode>): Define.
25821 (aarch64_float_mls_n<mode>): Define.
25822 * config/aarch64/arm_neon.h (vmla_n_f32): Use RTL builtin
25823 instead of inline asm.
25824 (vmlaq_n_f32): Likewise.
25825 (vmls_n_f32): Likewise.
25826 (vmlsq_n_f32): Likewise.
25828 2021-04-30 Jonathan Wright <joanthan.wright@arm.com>
25830 * config/aarch64/aarch64-simd-builtins.def: Add pmull[2]
25831 builtin generator macros.
25832 * config/aarch64/aarch64-simd.md (aarch64_pmullv8qi): Define.
25833 (aarch64_pmull_hiv16qi_insn): Define.
25834 (aarch64_pmull_hiv16qi): Define.
25835 * config/aarch64/arm_neon.h (vmull_high_p8): Use RTL builtin
25836 instead of inline asm.
25837 (vmull_p8): Likewise.
25839 2021-04-30 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
25841 * config/avr/avr.md: Adjust peepholes to match and
25842 generate parallels with clobber of REG_CC.
25843 (mov<mode>_insn): Rename to mov<mode>_insn_split.
25844 (*mov<mode>_insn): Rename to mov<mode>_insn.
25846 2021-04-30 David Edelsohn <dje.gcc@gmail.com>
25848 * varasm.c (use_blocks_for_decl_p): Don't use section anchors
25849 for VAR_DECLs if -fdata-sections enabled.
25851 2021-04-30 Michael Meissner <meissner@linux.ibm.com>
25853 PR bootstrap/100327
25854 * config/rs6000/rs6000.c
25855 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Define.
25856 (rs6000_libgcc_floating_mode_supported_p): New target hook.
25858 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
25860 * tree-ssa-threadbackward.c (class thread_jumps): Split out code
25862 (class back_threader_registry): ...to here...
25863 (class back_threader_profitability): ...and here...
25864 (thread_jumps::thread_through_all_blocks): Remove argument.
25865 (back_threader_registry::back_threader_registry): New.
25866 (back_threader_registry::~back_threader_registry): New.
25867 (back_threader_registry::thread_through_all_blocks): New.
25868 (thread_jumps::profitable_jump_thread_path): Move from here...
25869 (back_threader_profitability::profitable_path_p): ...to here.
25870 (thread_jumps::find_taken_edge): New.
25871 (thread_jumps::convert_and_register_current_path): Move...
25872 (back_threader_registry::register_path): ...to here.
25873 (thread_jumps::register_jump_thread_path_if_profitable): Move...
25874 (thread_jumps::maybe_register_path): ...to here.
25875 (thread_jumps::handle_phi): Call find_taken_edge and
25876 maybe_register_path.
25877 (thread_jumps::handle_assignment): Same.
25878 (thread_jumps::fsm_find_control_statement_thread_paths): Remove
25879 tree argument to handle_phi and handle_assignment.
25880 (thread_jumps::find_jump_threads_backwards): Set m_name. Remove
25881 set of m_speed_p and m_max_threaded_paths.
25882 (pass_thread_jumps::execute): Remove second argument from
25883 find_jump_threads_backwards.
25884 (pass_early_thread_jumps::execute): Same.
25886 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
25888 * tree-ssa-dom.c (class dom_jump_threader_simplifier): New.
25889 (class dom_opt_dom_walker): Initialize some class variables.
25890 (pass_dominator::execute): Pass evrp_range_analyzer and
25891 dom_jump_threader_simplifier to dom_opt_dom_walker.
25892 Adjust for some functions moving into classes.
25893 (simplify_stmt_for_jump_threading): Adjust and move to...
25894 (jump_threader_simplifier::simplify): ...here.
25895 (dom_opt_dom_walker::before_dom_children): Adjust for
25896 m_evrp_range_analyzer.
25897 (dom_opt_dom_walker::after_dom_children): Remove x_vr_values hack.
25898 (test_for_singularity): Place in dom_opt_dom_walker class.
25899 (dom_opt_dom_walker::optimize_stmt): The argument
25900 evrp_range_analyzer is now a class field.
25901 * tree-ssa-threadbackward.c (class thread_jumps): Add m_registry.
25902 (thread_jumps::thread_through_all_blocks): New.
25903 (thread_jumps::convert_and_register_current_path): Use m_registry.
25904 (pass_thread_jumps::execute): Adjust for thread_through_all_blocks
25905 being in the threader class.
25906 (pass_early_thread_jumps::execute): Same.
25907 * tree-ssa-threadedge.c (threadedge_initialize_values): Move...
25908 (jump_threader::jump_threader): ...here.
25909 (threadedge_finalize_values): Move...
25910 (jump_threader::~jump_threader): ...here.
25911 (jump_threader::remove_jump_threads_including): New.
25912 (jump_threader::thread_through_all_blocks): New.
25913 (record_temporary_equivalences_from_phis): Move...
25914 (jump_threader::record_temporary_equivalences_from_phis): ...here.
25915 (record_temporary_equivalences_from_stmts_at_dest): Move...
25916 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
25918 (simplify_control_stmt_condition_1): Move to jump_threader class.
25919 (simplify_control_stmt_condition): Move...
25920 (jump_threader::simplify_control_stmt_condition): ...here.
25921 (thread_around_empty_blocks): Move...
25922 (jump_threader::thread_around_empty_blocks): ...here.
25923 (thread_through_normal_block): Move...
25924 (jump_threader::thread_through_normal_block): ...here.
25925 (thread_across_edge): Move...
25926 (jump_threader::thread_across_edge): ...here.
25927 (thread_outgoing_edges): Move...
25928 (jump_threader::thread_outgoing_edges): ...here.
25929 * tree-ssa-threadedge.h: Move externally facing functings...
25930 (class jump_threader): ...here...
25931 (class jump_threader_simplifier): ...and here.
25932 * tree-ssa-threadupdate.c (struct redirection_data): Remove comment.
25933 (jump_thread_path_allocator::jump_thread_path_allocator): New.
25934 (jump_thread_path_allocator::~jump_thread_path_allocator): New.
25935 (jump_thread_path_allocator::allocate_thread_edge): New.
25936 (jump_thread_path_allocator::allocate_thread_path): New.
25937 (jump_thread_path_registry::jump_thread_path_registry): New.
25938 (jump_thread_path_registry::~jump_thread_path_registry): New.
25939 (jump_thread_path_registry::allocate_thread_edge): New.
25940 (jump_thread_path_registry::allocate_thread_path): New.
25941 (dump_jump_thread_path): Make extern.
25942 (debug (const vec<jump_thread_edge *> &path)): New.
25943 (struct removed_edges): Move to tree-ssa-threadupdate.h.
25944 (struct thread_stats_d): Remove.
25945 (remove_ctrl_stmt_and_useless_edges): Make static.
25946 (lookup_redirection_data): Move...
25947 (jump_thread_path_registry::lookup_redirection_data): ...here.
25948 (ssa_redirect_edges): Make static.
25949 (thread_block_1): Move...
25950 (jump_thread_path_registry::thread_block_1): ...here.
25951 (thread_block): Move...
25952 (jump_thread_path_registry::thread_block): ...here.
25953 (thread_through_loop_header): Move...
25954 (jump_thread_path_registry::thread_through_loop_header): ...here.
25955 (mark_threaded_blocks): Move...
25956 (jump_thread_path_registry::mark_threaded_blocks): ...here.
25957 (debug_path): Move...
25958 (jump_thread_path_registry::debug_path): ...here.
25959 (debug_all_paths): Move...
25960 (jump_thread_path_registry::dump): ..here.
25961 (rewire_first_differing_edge): Move...
25962 (jump_thread_path_registry::rewire_first_differing_edge): ...here.
25963 (adjust_paths_after_duplication): Move...
25964 (jump_thread_path_registry::adjust_paths_after_duplication): ...here.
25965 (duplicate_thread_path): Move...
25966 (jump_thread_path_registry::duplicate_thread_path): ..here.
25967 (remove_jump_threads_including): Move...
25968 (jump_thread_path_registry::remove_jump_threads_including): ...here.
25969 (thread_through_all_blocks): Move to...
25970 (jump_thread_path_registry::thread_through_all_blocks): ...here.
25971 (delete_jump_thread_path): Remove.
25972 (register_jump_thread): Move...
25973 (jump_thread_path_registry::register_jump_thread): ...here.
25974 * tree-ssa-threadupdate.h: Move externally facing functions...
25975 (class jump_thread_path_allocator): ...here...
25976 (class jump_thread_path_registry): ...and here.
25977 (thread_through_all_blocks): Remove.
25978 (struct removed_edges): New.
25979 (register_jump_thread): Remove.
25980 (remove_jump_threads_including): Remove.
25981 (delete_jump_thread_path): Remove.
25982 (remove_ctrl_stmt_and_useless_edges): Remove.
25983 (free_dom_edge_info): New prototype.
25984 * tree-vrp.c: Remove x_vr_values hack.
25985 (class vrp_jump_threader_simplifier): New.
25986 (vrp_jump_threader_simplifier::simplify): New.
25987 (vrp_jump_threader::vrp_jump_threader): Adjust method signature.
25988 Remove m_dummy_cond.
25989 Instantiate m_simplifier and m_threader.
25990 (vrp_jump_threader::thread_through_all_blocks): New.
25991 (vrp_jump_threader::simplify_stmt): Remove.
25992 (vrp_jump_threader::after_dom_children): Do not set m_dummy_cond.
25993 Remove x_vr_values hack.
25994 (execute_vrp): Adjust for thread_through_all_blocks being in a
25997 2021-04-30 Christophe Lyon <christophe.lyon@linaro.org>
25999 * genflags.c (gen_insn): Print failed expansion string.
26001 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
26003 * expr.c (alignment_for_piecewise_move): Call mode_for_size
26004 without limit to MAX_FIXED_MODE_SIZE.
26006 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
26008 PR middle-end/90773
26009 * builtins.c (builtin_memset_gen_str): Don't use return from
26010 simplify_gen_subreg.
26012 2021-04-30 Uroš Bizjak <ubizjak@gmail.com>
26015 * config/i386/i386.md (*add<mode>3_carry_0r): New insn pattern.
26016 (*addsi3_carry_zext_0r): Ditto.
26017 (*sub<mode>3_carry_0): Ditto.
26018 (*subsi3_carry_zext_0r): Ditto.
26019 * config/i386/predicates.md (ix86_carry_flag_unset_operator):
26021 * config/i386/i386.c (ix86_rtx_costs) <case PLUS, case MINUS>:
26022 Also consider ix86_carry_flag_unset_operator to calculate
26023 the cost of adc/sbb insn.
26025 2021-04-30 Roman Zhuykov <zhroma@ispras.ru>
26027 PR rtl-optimization/100225
26028 PR rtl-optimization/84878
26029 * modulo-sched.c (sms_schedule): Use note_stores to skip loops
26030 where we have an instruction which touches (writes) any hard
26031 register from df->regular_block_artificial_uses set.
26032 Allow not-single-set instruction only right before basic block
26035 2021-04-30 Geng Qi <gengqi@linux.alibaba.com>
26037 * config/riscv/riscv.opt (march=,mabi=): Negative itself.
26039 2021-04-30 LevyHsu <admin@levyhsu.com>
26041 * config/riscv/riscv.c (riscv_min_arithmetic_precision): New.
26042 * config/riscv/riscv.h (TARGET_MIN_ARITHMETIC_PRECISION): New.
26043 * config/riscv/riscv.md (addv<mode>4, uaddv<mode>4): New.
26044 (subv<mode>4, usubv<mode>4, mulv<mode>4, umulv<mode>4): New.
26046 2021-04-29 Alexandre Oliva <oliva@adacore.com>
26048 * config.gcc: Merged x86 and x86_64 cpu_type-setting cases.
26050 2021-04-29 Alexandre Oliva <oliva@adacore.com>
26052 * config/i386/i386.h (ASM_OUTPUT_MAX_SKIP_PAD): Rename to...
26053 (ASM_OUTPUT_MAX_SKIP_ALIGN): ... this. Enclose in do/while(0).
26054 * config/i386/i386.c: Adjust.
26055 * config/i386/i386.md: Adjust.
26056 * config/i386/darwin.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Drop.
26057 * config/i386/dragonfly.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26058 * config/i386/freebsd.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26059 * config/i386/gas.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26060 * config/i386/gnu-user.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26061 * config/i386/iamcu.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26062 * config/i386/lynx.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26063 * config/i386/netbsd-elf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26064 * config/i386/openbsdelf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26065 * config/i386/x86-64.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
26066 (ASM_OUTPUT_MAX_SKIP_PAD): Likewise.
26068 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
26070 * config/i386/i386-expand.c (ix86_expand_int_compare):
26071 Swap operands of GTU and LEU comparison to emit carry flag comparison.
26072 * config/i386/i386.md (*add<mode>3_carry_0): Change insn
26073 predicate to allow more combine opportunities with memory operands.
26074 (*sub<mode>3_carry_0): Ditto.
26076 2021-04-29 Richard Sandiford <richard.sandiford@arm.com>
26078 PR rtl-optimization/100303
26079 * rtl-ssa/accesses.cc (function_info::make_use_available): Take a
26080 boolean that indicates whether the use will only be used in
26081 debug instructions. Treat it in the same way that existing
26082 cross-EBB debug references would be handled if so.
26083 (function_info::make_uses_available): Likewise.
26084 * rtl-ssa/functions.h (function_info::make_uses_available): Update
26085 prototype accordingly.
26086 (function_info::make_uses_available): Likewise.
26087 * fwprop.c (try_fwprop_subst): Update call accordingly.
26089 2021-04-29 Jeff Law <jlaw@tachyum.com>
26091 * config/nios2/nios2-protos.h (nios2_fpu_insn_enabled): Move outside
26094 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
26095 Richard Biener <rguenther@suse.de>
26098 * config/i386/i386-builtin.def (IX86_BUILTIN_MASKLOADPD)
26099 (IX86_BUILTIN_MASKLOADPS, IX86_BUILTIN_MASKLOADPD256)
26100 (IX86_BUILTIN_MASKLOADPS256, IX86_BUILTIN_MASKLOADD)
26101 (IX86_BUILTIN_MASKLOADQ, IX86_BUILTIN_MASKLOADD256)
26102 (IX86_BUILTIN_MASKLOADQ256): Move from SPECIAL_ARGS
26103 to PURE_ARGS category.
26104 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
26105 Handle PURE_ARGS category.
26106 * config/i386/i386-expand.c (ix86_expand_builtin): Ditto.
26108 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
26110 * configure.ac: Check for the presence of sys/locking.h header and
26111 for whether _LK_LOCK is supported by _locking.
26112 * configure: Regenerate.
26113 * config.in: Likewise.
26114 * gcov-io.h: Define GCOV_LOCKED_WITH_LOCKING if HOST_HAS_LK_LOCK.
26115 * gcov-io.c (gcov_open): Add support for GCOV_LOCKED_WITH_LOCKING.
26116 * system.h: Include <sys/locking.h> if HAVE_SYS_LOCKING_H.
26118 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
26120 * config/i386/predicates.md (fcmov_comparison_operator):
26121 Do not check for trivial FP comparison operator.
26122 <case GEU, case LTU>: Allow CCGZmode.
26123 <case GTU, case LEU>: Do not allow CCCmode.
26124 (ix86_comparison_operator) <case GTU, case LEU>: Allow only CCmode.
26125 (ix86_carry_flag_operator): Match only LTU and UNLT code.
26126 Do not check for trivial FP comparison operator. Allow CCGZmode.
26128 2021-04-29 Tom de Vries <tdevries@suse.de>
26130 * omp-expand.c (expand_omp_simd): Add step_orig, and replace uses of
26131 fd->loop.step by either step or orig_step.
26133 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
26135 * config/sparc/sparc.c (gen_load_pcrel_sym): Delete.
26136 (load_got_register): Do the PIC dance here.
26137 (sparc_legitimize_tls_address): Simplify.
26138 (sparc_emit_probe_stack_range): Likewise.
26139 (sparc32_initialize_trampoline): Likewise.
26140 (sparc64_initialize_trampoline): Likewise.
26141 * config/sparc/sparc.md (load_pcrel_sym<P:mode>): Add @ marker.
26142 (probe_stack_range<P:mode>): Likewise.
26143 (flush<P:mode>): Likewise.
26144 (tgd_hi22<P:mode>): Likewise.
26145 (tgd_lo10<P:mode>): Likewise.
26146 (tgd_add<P:mode>): Likewise.
26147 (tgd_call<P:mode>): Likewise.
26148 (tldm_hi22<P:mode>): Likewise.
26149 (tldm_lo10<P:mode>): Likewise.
26150 (tldm_add<P:mode>): Likewise.
26151 (tldm_call<P:mode>): Likewise.
26152 (tldo_hix22<P:mode>): Likewise.
26153 (tldo_lox10<P:mode>): Likewise.
26154 (tldo_add<P:mode>): Likewise.
26155 (tie_hi22<P:mode>): Likewise.
26156 (tie_lo10<P:mode>): Likewise.
26157 (tie_add<P:mode>): Likewise.
26158 (tle_hix22<P:mode>): Likewise.
26159 (tle_lox10<P:mode>): Likewise.
26160 (stack_protect_setsi): Rename to...
26161 (stack_protect_set32): ...this.
26162 (stack_protect_setdi): Rename to...
26163 (stack_protect_set64): ...this.
26164 (stack_protect_set): Adjust calls to above.
26165 (stack_protect_testsi): Rename to...
26166 (stack_protect_test32): ...this.
26167 (stack_protect_testdi): Rename to...
26168 (stack_protect_test64): ...this.
26169 (stack_protect_test): Adjust calls to above.
26171 2021-04-29 H.J. Lu <hjl.tools@gmail.com>
26173 PR middle-end/90773
26174 * builtins.c (builtin_memcpy_read_str): Add a dummy argument.
26175 (builtin_strncpy_read_str): Likewise.
26176 (builtin_memset_read_str): Add an argument for the previous RTL
26177 information and generate the new RTL from the previous RTL info.
26178 (builtin_memset_gen_str): Likewise.
26179 * builtins.h (builtin_strncpy_read_str): Update the prototype.
26180 (builtin_memset_read_str): Likewise.
26181 * expr.c (by_pieces_ninsns): If targetm.overlap_op_by_pieces_p()
26182 returns true, round up size and alignment to the widest integer
26183 mode for maximum size.
26184 (pieces_addr::adjust): Add a pointer to by_pieces_prev argument
26185 and pass it to m_constfn.
26186 (op_by_pieces_d): Add m_push and m_overlap_op_by_pieces.
26187 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
26188 initialize m_push. Initialize m_overlap_op_by_pieces with
26189 targetm.overlap_op_by_pieces_p ().
26190 (op_by_pieces_d::run): Pass the previous RTL information to
26191 pieces_addr::adjust and generate overlapping operations if
26192 m_overlap_op_by_pieces is true.
26194 (move_by_pieces_d::move_by_pieces_d): Updated for op_by_pieces_d
26196 (store_by_pieces_d::store_by_pieces_d): Updated for op_by_pieces_d
26198 (can_store_by_pieces): Use by_pieces_constfn on constfun.
26199 (store_by_pieces): Use by_pieces_constfn on constfun. Updated
26200 for op_by_pieces_d change.
26201 (clear_by_pieces_1): Add a dummy argument.
26202 (clear_by_pieces): Updated for op_by_pieces_d change.
26203 (compare_by_pieces_d::compare_by_pieces_d): Likewise.
26204 (string_cst_read_str): Add a dummy argument.
26205 * expr.h (by_pieces_constfn): Add a dummy argument.
26206 (by_pieces_prev): New.
26207 * target.def (overlap_op_by_pieces_p): New target hook.
26208 * config/i386/i386.c (TARGET_OVERLAP_OP_BY_PIECES_P): New.
26209 * doc/tm.texi.in: Add TARGET_OVERLAP_OP_BY_PIECES_P.
26210 * doc/tm.texi: Regenerated.
26212 2021-04-29 Richard Biener <rguenther@suse.de>
26214 PR tree-optimization/100253
26215 * tree-vect-stmts.c (vectorizable_load): Do not assume
26216 element alignment when DR_MISALIGNMENT is -1.
26217 (vectorizable_store): Likewise.
26219 2021-04-29 Jakub Jelinek <jakub@redhat.com>
26222 * config/aarch64/aarch64.c (aarch64_add_offset_1_temporaries): Use
26223 absu_hwi instead of abs_hwi.
26225 2021-04-29 Richard Biener <rguenther@suse.de>
26227 PR middle-end/38474
26228 * tree-ssa-structalias.c (add_graph_edge): Avoid direct
26229 forwarding when indirect forwarding through ESCAPED
26232 2021-04-29 Tom de Vries <tdevries@suse.de>
26235 * internal-fn.c (expand_GOMP_SIMT_ENTER_ALLOC)
26236 (expand_GOMP_SIMT_LAST_LANE, expand_GOMP_SIMT_ORDERED_PRED)
26237 (expand_GOMP_SIMT_VOTE_ANY, expand_GOMP_SIMT_XCHG_BFLY)
26238 (expand_GOMP_SIMT_XCHG_IDX): Ensure target is assigned to.
26240 2021-04-29 Richard Biener <rguenther@suse.de>
26242 PR tree-optimization/99912
26243 * tree-ssa-dse.c (dse_dom_walker::m_need_cfg_cleanup): New.
26244 (dse_dom_walker::todo): Likewise.
26245 (dse_dom_walker::dse_optimize_stmt): Move VDEF check to the
26247 (dse_dom_walker::before_dom_children): Remove trivially
26248 dead SSA defs and schedule CFG cleanup if we removed all
26250 (pass_dse::execute): Get TODO as computed by the DOM walker
26251 and return it. Wipe dominator info earlier.
26253 2021-04-29 Richard Biener <rguenther@suse.de>
26256 * ipa-prop.c (ipcp_modif_dom_walker::before_dom_children):
26257 Track blocks to cleanup EH in new m_need_eh_cleanup.
26258 (ipcp_modif_dom_walker::cleanup_eh): New.
26259 (ipcp_transform_function): Release dominator info before
26262 2021-04-29 Martin Sebor <msebor@redhat.com>
26264 PR middle-end/100250
26265 * attribs.c (attr_access::array_as_string): Avoid dereferencing
26266 a pointer when it's null.
26268 2021-04-29 Martin Sebor <msebor@redhat.com>
26270 * Makefile.in (OBJS): Add ipa-free-lang-data.o.
26271 * ipa-free-lang-data.cc: New file.
26272 * tree.c: Move pass free_lang_data to file above.
26273 (build_array_type_1): Declare extern.
26274 * tree.h (build_array_type_1): Declare.
26276 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26278 * config/aarch64/aarch64-simd-builtins.def: Modify comment to
26279 make consistent with updated RTL pattern.
26280 * config/aarch64/aarch64-simd.md (aarch64_<sur>qmovn<mode>):
26281 Implement using ss_truncate and us_truncate rather than
26283 * config/aarch64/iterators.md: Remove redundant unspecs and
26284 iterator: UNSPEC_[SU]QXTN and SUQMOVN respectively.
26286 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26288 * config/aarch64/arm_acle.h (__attribute__): Make intrinsic
26289 attributes consistent with those defined in arm_neon.h.
26291 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26293 * config/aarch64/arm_fp16.h (__attribute__): Make intrinsic
26294 attributes consistent with those defined in arm_neon.h.
26296 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26298 * config/aarch64/aarch64-simd-builtins.def: Add
26299 float_trunc_rodd builtin generator macros.
26300 * config/aarch64/aarch64-simd.md (aarch64_float_trunc_rodd_df):
26302 (aarch64_float_trunc_rodd_lo_v2sf): Define.
26303 (aarch64_float_trunc_rodd_hi_v4sf_le): Define.
26304 (aarch64_float_trunc_rodd_hi_v4sf_be): Define.
26305 (aarch64_float_trunc_rodd_hi_v4sf): Define.
26306 * config/aarch64/arm_neon.h (vcvtx_f32_f64): Use RTL builtin
26307 instead of inline asm.
26308 (vcvtx_high_f32_f64): Likewise.
26309 (vcvtxd_f32_f64): Likewise.
26310 * config/aarch64/iterators.md: Add FCVTXN unspec.
26312 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26314 * config/aarch64/aarch64-simd-builtins.def: Add tbx1 builtin
26316 * config/aarch64/aarch64-simd.md (aarch64_tbx1<mode>):
26318 * config/aarch64/arm_neon.h (vqtbx1_s8): USE RTL builtin
26319 instead of inline asm.
26320 (vqtbx1_u8): Likewise.
26321 (vqtbx1_p8): Likewise.
26322 (vqtbx1q_s8): Likewise.
26323 (vqtbx1q_u8): Likewise.
26324 (vqtbx1q_p8): Likewise.
26325 (vtbx2_s8): Likewise.
26326 (vtbx2_u8): Likewise.
26327 (vtbx2_p8): Likewise.
26329 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26331 * config/aarch64/aarch64-simd-builtins.def: Add tbl1 builtin
26333 * config/aarch64/arm_neon.h (vqtbl1_p8): Use RTL builtin
26334 instead of inline asm.
26335 (vqtbl1_s8): Likewise.
26336 (vqtbl1_u8): Likewise.
26337 (vqtbl1q_p8): Likewise.
26338 (vqtbl1q_s8): Likewise.
26339 (vqtbl1q_u8): Likewise.
26340 (vtbl1_s8): Likewise.
26341 (vtbl1_u8): Likewise.
26342 (vtbl1_p8): Likewise.
26343 (vtbl2_s8): Likewise.
26344 (vtbl2_u8): Likewise.
26345 (vtbl2_p8): Likewise.
26347 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26349 * config/aarch64/aarch64-simd-builtins.def: Add polynomial
26350 ssri_n buitin generator macro.
26351 * config/aarch64/arm_neon.h (vsri_n_p8): Use RTL builtin
26352 instead of inline asm.
26353 (vsri_n_p16): Likewise.
26354 (vsri_n_p64): Likewise.
26355 (vsriq_n_p8): Likewise.
26356 (vsriq_n_p16): Likewise.
26357 (vsriq_n_p64): Likewise.
26359 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26361 * config/aarch64/aarch64-simd-builtins.def: Use VALLP mode
26362 iterator for polynomial ssli_n builtin generator macro.
26363 * config/aarch64/arm_neon.h (vsli_n_p8): Use RTL builtin
26364 instead of inline asm.
26365 (vsli_n_p16): Likewise.
26366 (vsliq_n_p8): Likewise.
26367 (vsliq_n_p16): Likewise.
26368 * config/aarch64/iterators.md: Define VALLP mode iterator.
26370 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26372 * config/aarch64/aarch64-simd-builtins.def: Use VDQV_L
26373 iterator to generate [su]adalp RTL builtins.
26374 * config/aarch64/aarch64-simd.md: Use VDQV_L iterator in
26375 [su]adalp RTL pattern.
26376 * config/aarch64/arm_neon.h (vpadal_s32): Use RTL builtin
26377 instead of inline asm.
26378 (vpadal_u32): Likewise.
26380 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26382 * config/aarch64/aarch64-simd-builtins.def: Add [su]addlp
26383 builtin generator macros.
26384 * config/aarch64/aarch64-simd.md (aarch64_<su>addlp<mode>):
26386 * config/aarch64/arm_neon.h (vpaddl_s8): Use RTL builtin
26387 instead of inline asm.
26388 (vpaddl_s16): Likewise.
26389 (vpaddl_s32): Likewise.
26390 (vpaddl_u8): Likewise.
26391 (vpaddl_u16): Likewise.
26392 (vpaddl_u32): Likewise.
26393 (vpaddlq_s8): Likewise.
26394 (vpaddlq_s16): Likewise.
26395 (vpaddlq_s32): Likewise.
26396 (vpaddlq_u8): Likewise.
26397 (vpaddlq_u16): Likewise.
26398 (vpaddlq_u32): Liwewise.
26399 * config/aarch64/iterators.md: Define [SU]ADDLP unspecs with
26400 appropriate attributes.
26402 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26404 * config/aarch64/aarch64-simd-builtins.def: Use VDQ_I iterator
26405 for aarch64_addp<mode> builtin macro generator.
26406 * config/aarch64/aarch64-simd.md: Use VDQ_I iterator in
26407 aarch64_addp<mode> RTL pattern.
26408 * config/aarch64/arm_neon.h (vpaddq_s8): Use RTL builtin
26409 instead of inline asm.
26410 (vpaddq_s16): Likewise.
26411 (vpaddq_s32): Likewise.
26412 (vpaddq_s64): Likewise.
26413 (vpaddq_u8): Likewise.
26414 (vpaddq_u16): Likewise.
26415 (vpaddq_u32): Likewise.
26416 (vpaddq_u64): Likewise.
26418 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26420 * config/aarch64/aarch64-simd-builtins.def: Add sq[r]dmulh_n
26421 builtin generator macros.
26422 * config/aarch64/aarch64-simd.md (aarch64_sq<r>dmulh_n<mode>):
26424 * config/aarch64/arm_neon.h (vqdmulh_n_s16): Use RTL builtin
26425 instead of inline asm.
26426 (vqdmulh_n_s32): Likewise.
26427 (vqdmulhq_n_s16): Likewise.
26428 (vqdmulhq_n_s32): Likewise.
26429 (vqrdmulh_n_s16): Likewise.
26430 (vqrdmulh_n_s32): Likewise.
26431 (vqrdmulhq_n_s16): Likewise.
26432 (vqrdmulhq_n_s32): Likewise.
26434 2021-04-28 Tobias Burnus <tobias@codesourcery.com>
26436 * doc/install.texi (--enable-offload-defaulted): Document.
26438 2021-04-28 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
26440 * config/avr/avr-dimode.md: Turn existing patterns into
26441 define_insn_and_split style patterns where the splitter
26442 adds a clobber of the condition code register. Drop "cc"
26443 attribute. Add new patterns to match output of
26445 * config/avr/avr-fixed.md: Likewise.
26446 * config/avr/avr.c (cc_reg_rtx): New.
26447 (avr_parallel_insn_from_insns): Adjust insn count
26448 for removal of set of cc0.
26449 (avr_is_casesi_sequence): Likewise.
26450 (avr_casei_sequence_check_operands): Likewise.
26451 (avr_optimize_casesi): Likewise. Also insert
26452 new insns after jump_insn.
26453 (avr_pass_casesi::avr_rest_of_handle_casesi): Adjust
26454 for removal of set of cc0.
26455 (avr_init_expanders): Initialize cc_reg_rtx.
26456 (avr_regno_reg_class): Handle REG_CC.
26457 (cond_string): Remove usage of CC_OVERFLOW_UNUSABLE.
26458 (avr_notice_update_cc): Remove function.
26459 (ret_cond_branch): Remove usage of CC_OVERFLOW_UNUSABLE.
26460 (compare_condition): Adjust for PARALLEL with
26462 (out_shift_with_cnt): Likewise.
26463 (ashlhi3_out): Likewise.
26464 (ashrhi3_out): Likewise.
26465 (lshrhi3_out): Likewise.
26466 (avr_class_max_nregs): Return single reg for REG_CC.
26467 (avr_compare_pattern): Check for REG_CC instead
26469 (avr_reorg_remove_redundant_compare): Likewise.
26470 (avr_reorg):Adjust for PARALLEL with REG_CC clobber.
26471 (avr_hard_regno_nregs): Return single reg for REG_CC.
26472 (avr_hard_regno_mode_ok): Allow only CCmode for REG_CC.
26473 (avr_md_asm_adjust): Clobber REG_CC.
26474 (TARGET_HARD_REGNO_NREGS): Define.
26475 (TARGET_CLASS_MAX_NREGS): Define.
26476 (TARGET_MD_ASM_ADJUST): Define.
26477 * config/avr/avr.h (FIRST_PSEUDO_REGISTER): Adjust
26479 (enum reg_class): Add CC_REG class.
26480 (NOTICE_UPDATE_CC): Remove.
26481 (CC_OVERFLOW_UNUSABLE): Remove.
26482 (CC_NO_CARRY): Remove.
26483 * config/avr/avr.md: Turn existing patterns into
26484 define_insn_and_split style patterns where the splitter
26485 adds a clobber of the condition code register. Drop "cc"
26486 attribute. Add new patterns to match output of
26488 (sez): Remove unused pattern.
26490 2021-04-28 Richard Earnshaw <rearnsha@arm.com>
26493 * config/arm/arm.c (arm_hard_regno_mode_ok): Only allow VPR to be
26496 2021-04-28 Richard Sandiford <richard.sandiford@arm.com>
26499 * config/aarch64/constraints.md (Utq): Require the address to
26500 be valid for both the element mode and for V2DImode.
26502 2021-04-28 Jakub Jelinek <jakub@redhat.com>
26503 Tobias Burnus <tobias@codesourcery.com>
26505 * configure.ac (OFFLOAD_DEFAULTED): AC_DEFINE if offload-defaulted.
26506 * gcc.c (process_command): New variable.
26507 (driver::maybe_putenv_OFFLOAD_TARGETS): If OFFLOAD_DEFAULTED,
26508 set it if -foffload is defaulted.
26509 * lto-wrapper.c (OFFLOAD_TARGET_DEFAULT_ENV): Define.
26510 (compile_offload_image): If OFFLOAD_DEFAULTED and
26511 OFFLOAD_TARGET_DEFAULT is in the environment, don't fail
26512 if corresponding mkoffload can't be found.
26513 (compile_images_for_offload_targets): Likewise. Free and clear
26514 offload_names if no valid offload is found.
26515 * config.in: Regenerate.
26516 * configure: Regenerate.
26518 2021-04-28 Richard Biener <rguenther@suse.de>
26520 PR tree-optimization/100292
26521 * tree-vect-generic.c (expand_vector_condition): Do not fold
26524 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
26526 * config/rs6000/aix.h (SUBTARGET_DRIVER_SELF_SPECS): New.
26527 * config/rs6000/aix64.opt (m64): New.
26530 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
26532 * config/vax/vax.c (print_operand_address, vax_address_cost_1)
26533 (index_term_p): Handle ASHIFT too.
26535 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
26537 * config/vax/builtins.md (jbb<ccss>i<mode>): Remove operand #3.
26538 (sync_lock_test_and_set<mode>): Adjust accordingly.
26539 (sync_lock_release<mode>): Likewise.
26541 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
26543 * config/vax/vax-protos.h (adjacent_operands_p): Remove
26545 * config/vax/vax.c (adjacent_operands_p): Remove.
26547 2021-04-27 Maciej W. Rozycki <macro@linux-mips.org>
26549 * ifcvt.c (dead_or_predicable) [!IFCVT_MODIFY_TESTS]: Fall
26550 through to the non-conditional execution case if getting the
26551 condition for conditional execution has failed.
26553 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
26555 PR middle-end/100284
26556 * gimple.c (gimple_could_trap_p_1): Remove VEC_COND_EXPR test.
26557 * tree-eh.c (operation_could_trap_p): Handle VEC_COND_EXPR rather
26558 than asserting on it.
26560 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
26562 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): Protect
26563 with TARGET_AIX_OS.
26565 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
26568 * calls.c (precompute_register_parameters): Additionally test
26569 targetm.precompute_tls_p to pre-compute argument.
26570 * config/rs6000/aix.h (TARGET_PRECOMPUTE_TLS_P): Define.
26571 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): New.
26572 * target.def (precompute_tls_p): New.
26573 * doc/tm.texi.in (TARGET_PRECOMPUTE_TLS_P): Add hook documentation.
26574 * doc/tm.texi: Regenerated.
26576 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26579 * config/aarch64/aarch64.c (aarch64_print_operand): Cast -UINTVAL
26580 back to HOST_WIDE_INT.
26582 2021-04-27 Bernd Edlinger <bernd.edlinger@hotmail.de>
26585 * simplify-rtx.c (simplify_context::simplify_subreg): Check the
26586 memory alignment for the outer mode.
26588 2021-04-27 H.J. Lu <hjl.tools@gmail.com>
26590 PR middle-end/90773
26591 * expr.c (op_by_pieces_d::get_usable_mode): New member function.
26592 (op_by_pieces_d::run): Cange a while loop to a do-while loop.
26594 2021-04-27 Alex Coplan <alex.coplan@arm.com>
26597 * config/arm/arm.c (arm_split_compare_and_swap): Fix up codegen
26598 with negative immediates: ensure we expand cbranchsi4_scratch
26599 correctly and ensure we satisfy its constraints.
26600 * config/arm/sync.md
26601 (@atomic_compare_and_swap<CCSI:arch><NARROW:mode>_1): Don't
26602 attempt to tie two output operands together with constraints;
26603 collapse two alternatives.
26604 (@atomic_compare_and_swap<CCSI:arch><SIDI:mode>_1): Likewise.
26605 * config/arm/thumb1.md (cbranchsi4_neg_late): New.
26607 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26610 * config/aarch64/predicates.md (aarch64_sub_immediate,
26611 aarch64_plus_immediate): Use -UINTVAL instead of -INTVAL.
26612 * config/aarch64/aarch64.md (casesi, rotl<mode>3): Likewise.
26613 * config/aarch64/aarch64.c (aarch64_print_operand,
26614 aarch64_split_atomic_op, aarch64_expand_subvti): Likewise.
26616 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26618 PR tree-optimization/100239
26619 * tree-vect-generic.c (lower_vec_perm): Don't accept constant
26620 permutations with all indices from the first zero element as vec_shl.
26622 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26624 PR rtl-optimization/100254
26625 * cfgcleanup.c (outgoing_edges_match): Check REG_EH_REGION on
26626 last1 and last2 insns rather than BB_END (bb1) and BB_END (bb2) insns.
26628 2021-04-27 Richard Biener <rguenther@suse.de>
26630 PR tree-optimization/99912
26631 * passes.def: Add comment about new TODO_remove_unused_locals.
26632 * tree-stdarg.c (pass_data_stdarg): Run TODO_remove_unused_locals
26635 2021-04-27 Richard Biener <rguenther@suse.de>
26637 PR tree-optimization/99912
26638 * passes.def (pass_all_optimizations): Add pass_dse before
26639 the first pass_dce, move the first pass_dse before the
26640 pass_dce following pass_pre.
26642 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26644 PR tree-optimization/95527
26645 * generic-match-head.c: Include tm.h.
26646 * gimple-match-head.c: Include tm.h.
26647 * match.pd (CLZ == INTEGER_CST): Don't use
26648 #ifdef CLZ_DEFINED_VALUE_AT_ZERO, only test CLZ_DEFINED_VALUE_AT_ZERO
26649 if clz == CFN_CLZ. Add missing val declaration.
26650 (CTZ cmp CST): New simplifications.
26652 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26654 PR tree-optimization/96696
26655 * expr.c (expand_expr_divmod): New function.
26656 (expand_expr_real_2) <case TRUNC_DIV_EXPR>: Use it for truncations and
26657 divisions. Formatting fixes.
26658 <case MULT_EXPR>: Optimize x / y * y as x - x % y if the latter is
26661 2021-04-27 Martin Jambor <mjambor@suse.cz>
26664 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
26665 If removing a call statement LHS SSA name, release it.
26667 2021-04-27 Richard Earnshaw <rearnsha@arm.com>
26670 * config/arm/arm.c (THUMB2_WORK_REGS): Check PIC_OFFSET_TABLE_REGNUM
26671 is valid before including it in the mask.
26673 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
26676 * config/aarch64/aarch64.c (aarch64_comp_type_attributes): Handle
26679 2021-04-27 Richard Biener <rguenther@suse.de>
26681 PR tree-optimization/100051
26682 * tree-ssa-alias.c (indirect_ref_may_alias_decl_p): Add
26683 disambiguator based on access size vs. decl size.
26685 2021-04-27 Richard Biener <rguenther@suse.de>
26687 PR tree-optimization/100278
26688 * tree-ssa-pre.c (compute_avail): Give up when we cannot
26689 adjust TBAA beacuse of mismatching bases.
26691 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26694 * config/i386/i386.md (*<insn><mode>3_mask, *<insn><mode>3_mask_1):
26695 For any_rotate define_insn_split and following splitters, use
26696 SWI iterator instead of SWI48.
26698 2021-04-27 Richard Biener <rguenther@suse.de>
26700 PR tree-optimization/99776
26701 * match.pd (bit_field_ref (ctor)): Relax element extract
26702 type compatibility checks.
26704 2021-04-27 Cui,Lili <lili.cui@intel.com>
26706 * common/config/i386/i386-common.c (processor_names):
26707 Sync processor_names with processor_type.
26708 * config/i386/i386-options.c (processor_cost_table):
26709 Sync processor_cost_table with processor_type.
26711 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26713 * value-range.cc (irange::irange_set_1bit_anti_range): Add assert.
26714 (irange::set): Call irange_set_1bit_anti_range for handling all
26715 1-bit ranges. Fall through on ~[MIN,MAX].
26717 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26719 * value-range.cc (irange::legacy_num_pairs): Remove.
26720 (irange::invert): Change gcc_assert to gcc_checking_assert.
26721 * value-range.h (irange::num_pairs): Adjust for a cached
26722 num_pairs(). Also, rename all gcc_assert's to
26723 gcc_checking_assert's.
26725 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26727 * value-range.cc (irange::operator=): Set m_kind.
26728 (irange::copy_to_legacy): Handle varying and undefined sources
26729 as a legacy copy since they can be easily copied.
26730 (irange::irange_set): Set m_kind.
26731 (irange::irange_set_anti_range): Same.
26732 (irange::set): Rename normalize_min_max to normalize_kind.
26733 (irange::verify_range): Adjust for multi-ranges having the
26735 (irange::irange_union): Set m_kind.
26736 (irange::irange_intersect): Same.
26737 (irange::invert): Same.
26738 * value-range.h (irange::kind): Always return m_kind.
26739 (irange::varying_p): Rename to...
26740 (irange::varying_comptaible_p): ...this.
26741 (irange::undefined_p): Only look at m_kind.
26742 (irange::irange): Always set VR_UNDEFINED if applicable.
26743 (irange::set_undefined): Always set VR_UNDEFINED.
26744 (irange::set_varying): Always set m_kind to VR_VARYING.
26745 (irange::normalize_min_max): Rename to...
26746 (irange::normalize_kind): ...this.
26748 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26750 * gimple-ssa-evrp-analyze.c (evrp_range_analyzer::set_ssa_range_info):
26751 Adjust for constant_p including varying_p.
26752 * tree-vrp.c (vrp_prop::finalize): Same.
26753 (determine_value_range): Same.
26754 * vr-values.c (vr_values::range_of_expr): Same.
26755 * value-range.cc (irange::symbolic_p): Do not check varying_p.
26756 (irange::constant_p): Same.
26758 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26760 * value-range.cc (irange::legacy_lower_bound): Replace
26761 !undefined_p check with num_ranges > 0.
26762 (irange::legacy_upper_bound): Same.
26763 * value-range.h (irange::type): Same.
26764 (irange::lower_bound): Same.
26765 (irange::upper_bound): Same.
26767 2021-04-26 Richard Biener <rguenther@suse.de>
26769 PR tree-optimization/99956
26770 * gimple-loop-interchange.cc (compute_access_stride):
26771 Try instantiating the access in a shallower loop nest
26772 if instantiating failed.
26773 (compute_access_strides): Pass adjustable loop_nest
26774 to compute_access_stride.
26776 2021-04-26 Christophe Lyon <christophe.lyon@linaro.org>
26778 * doc/sourcebuild.texi (arm_cmse_hw): Document.
26780 2021-04-26 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26782 * config/aarch64/iterators.md (vwcore): Handle V4BF, V8BF.
26784 2021-04-26 Thomas Schwinge <thomas@codesourcery.com>
26785 Nathan Sidwell <nathan@codesourcery.com>
26786 Tom de Vries <vries@codesourcery.com>
26787 Julian Brown <julian@codesourcery.com>
26788 Kwok Cheung Yeung <kcy@codesourcery.com>
26790 * omp-offload.c (oacc_validate_dims): Implement
26791 '-Wopenacc-parallelism'.
26792 * doc/invoke.texi (-Wopenacc-parallelism): Document.
26794 2021-04-26 Richard Biener <rguenther@suse.de>
26796 * tree-cfg.h (gimplify_build1): Remove.
26797 (gimplify_build2): Likewise.
26798 (gimplify_build3): Likewise.
26799 * tree-cfg.c (gimplify_build1): Move to tree-vect-generic.c.
26800 (gimplify_build2): Likewise.
26801 (gimplify_build3): Likewise.
26802 * tree-vect-generic.c (gimplify_build1): Move from tree-cfg.c.
26804 (gimplify_build2): Likewise.
26805 (gimplify_build3): Likewise.
26806 (tree_vec_extract): Use resimplify with following SSA edges.
26807 (expand_vector_parallel): Avoid passing NULL size/bitpos
26808 to tree_vec_extract.
26809 * expr.c (store_constructor): Deal with zero-element CTORs.
26810 * match.pd (bit_field_ref <vector CTOR>): Make sure to
26811 produce vector constants when possible.
26813 2021-04-26 Richard Biener <rguenther@suse.de>
26815 * tree-complex.c: Include gimple-fold.h.
26816 (expand_complex_addition): Use gimple_build.
26817 (expand_complex_multiplication_components): Likewise.
26818 (expand_complex_multiplication): Likewise.
26819 (expand_complex_div_straight): Likewise.
26820 (expand_complex_div_wide): Likewise.
26821 (expand_complex_division): Likewise.
26822 (expand_complex_conjugate): Likewise.
26823 (expand_complex_comparison): Likewise.
26825 2021-04-26 Richard Biener <rguenther@suse.de>
26827 * tree-ssa-phiopt.c (two_value_replacement): Remove use
26828 of legacy gimplify_buildN API.
26830 2021-04-26 Richard Biener <rguenther@suse.de>
26832 PR tree-optimization/99473
26833 * tree-ssa-phiopt.c (cond_store_replacement): Handle all
26836 2021-04-26 Richard Biener <rguenther@suse.de>
26838 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin):
26839 Use replace_call_with_value.
26841 2021-04-26 Richard Biener <rguenther@suse.de>
26843 * tree-ssa-propagate.h (valid_gimple_rhs_p): Remove.
26844 (update_gimple_call): Likewise.
26845 (update_call_from_tree): Likewise.
26846 * tree-ssa-propagate.c (valid_gimple_rhs_p): Remove.
26847 (valid_gimple_call_p): Likewise.
26848 (move_ssa_defining_stmt_for_defs): Likewise.
26849 (finish_update_gimple_call): Likewise.
26850 (update_gimple_call): Likewise.
26851 (update_call_from_tree): Likewise.
26852 (propagate_tree_value_into_stmt): Use replace_call_with_value.
26853 * gimple-fold.h (update_gimple_call): Declare.
26854 * gimple-fold.c (valid_gimple_rhs_p): Move here from
26855 tree-ssa-propagate.c.
26856 (update_gimple_call): Likewise.
26857 (valid_gimple_call_p): Likewise.
26858 (finish_update_gimple_call): Likewise, and simplify.
26859 (gimplify_and_update_call_from_tree): Implement
26860 update_call_from_tree functionality, avoid excessive
26861 push/pop_gimplify_context.
26862 (gimple_fold_builtin): Use only gimplify_and_update_call_from_tree.
26863 (gimple_fold_call): Likewise.
26864 * gimple-ssa-sprintf.c (try_substitute_return_value): Likewise.
26865 * tree-ssa-ccp.c (ccp_folder::fold_stmt): Likewise.
26866 (pass_fold_builtins::execute): Likewise.
26867 (optimize_stack_restore): Use replace_call_with_value.
26868 * tree-cfg.c (fold_loop_internal_call): Likewise.
26869 * tree-ssa-dce.c (maybe_optimize_arith_overflow): Use
26870 only gimplify_and_update_call_from_tree.
26871 * tree-ssa-strlen.c (handle_builtin_strlen): Likewise.
26872 (handle_builtin_strchr): Likewise.
26873 * tsan.c: Include gimple-fold.h instead of tree-ssa-propagate.h.
26875 2021-04-26 Jakub Jelinek <jakub@redhat.com>
26878 * vmsdbgout.c (ASM_OUTPUT_DEBUG_STRING, vmsdbgout_begin_block,
26879 vmsdbgout_end_block, lookup_filename, vmsdbgout_source_line): Remove
26882 2021-04-25 liuhongt <hongtao.liu@intel.com>
26885 * config/i386/i386-builtin.def (BDESC): Change the icode of
26886 the following builtins to CODE_FOR_nothing.
26887 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
26888 IX86_BUILTIN_PCMPEQB128, IX86_BUILTIN_PCMPEQW128,
26889 IX86_BUILTIN_PCMPEQD128, IX86_BUILTIN_PCMPEQQ,
26890 IX86_BUILTIN_PCMPEQB256, IX86_BUILTIN_PCMPEQW256,
26891 IX86_BUILTIN_PCMPEQD256, IX86_BUILTIN_PCMPEQQ256,
26892 IX86_BUILTIN_PCMPGTB128, IX86_BUILTIN_PCMPGTW128,
26893 IX86_BUILTIN_PCMPGTD128, IX86_BUILTIN_PCMPGTQ,
26894 IX86_BUILTIN_PCMPGTB256, IX86_BUILTIN_PCMPGTW256,
26895 IX86_BUILTIN_PCMPGTD256, IX86_BUILTIN_PCMPGTQ256.
26896 * config/i386/sse.md (avx2_eq<mode>3): Deleted.
26897 (sse2_eq<mode>3): Ditto.
26898 (sse4_1_eqv2di3): Ditto.
26899 (sse2_gt<mode>3): Rename to ..
26900 (*sse2_gt<mode>3): .. this.
26902 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
26905 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
26908 * config/darwin.c (darwin_binds_local_p): Assume that any
26909 public symbol might be interposed for PIC code. Update function
26910 header comment to reflect current Darwin capability.
26912 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
26915 * config/darwin.c (darwin_binds_local_p): Assume that any
26916 public symbol might be interposed for PIC code. Update function
26917 header comment to reflect current Darwin capability.
26919 2021-04-24 Richard Sandiford <richard.sandiford@arm.com>
26921 * doc/sourcebuild.texi: Document no-opts and any-opts target
26924 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
26926 * config/bpf/bpf.h (ASM_OUTPUT_ALIGNED_BSS): Use .type and .lcomm.
26928 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
26930 * config/bpf/bpf.h (FUNCTION_BOUNDARY): Set to 64.
26932 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
26935 * config/i386/i386-options.c (ix86_option_override_internal):
26936 Error out when -m96bit-long-double is used with 64bit targets.
26937 * config/i386/i386.md (*pushxf_rounded): Remove pattern.
26939 2021-04-23 Martin Liska <mliska@suse.cz>
26941 * lto-wrapper.c: Remove FIXME about usage of
26942 hardware_concurrency. The function is not on par with
26945 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
26948 * config/i386/sync.md (FILD_ATOMIC/FIST_ATOMIC FP load peephole2):
26949 Copy operand 3 to operand 4. Use sse_reg_operand
26950 as operand 3 predicate.
26951 (FILD_ATOMIC/FIST_ATOMIC FP load peephole2 with mem blockage): Ditto.
26952 (LDX_ATOMIC/STX_ATOMIC FP load peephole2): Ditto.
26953 (LDX_ATOMIC/LDX_ATOMIC FP load peephole2 with mem blockage): Ditto.
26954 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2):
26955 Copy operand 1 to operand 0.
26956 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2 with mem blockage): Ditto.
26957 (LDX_ATOMIC/STX_ATOMIC FP store peephole2): Ditto.
26958 (LDX_ATOMIC/LDX_ATOMIC FP store peephole2 with mem blockage): Ditto.
26960 2021-04-23 Alex Coplan <alex.coplan@arm.com>
26962 PR rtl-optimization/100230
26963 * early-remat.c (early_remat::sort_candidates): Use delete[]
26964 instead of delete for array allocated with new[].
26966 2021-04-23 Richard Biener <rguenther@suse.de>
26968 * genmatch.c (lower_cond): Remove VEC_COND_EXPR special-casing.
26969 (capture_info::capture_info): Likewise.
26970 (capture_info::walk_match): Likewise.
26971 (expr::gen_transform): Likewise.
26972 (dt_simplify::gen_1): Likewise.
26973 * gimple-match-head.c (maybe_resimplify_conditional_op):
26974 Remove VEC_COND_EXPR special-casing.
26975 (gimple_simplify): Likewise.
26976 * gimple.c (gimple_could_trap_p_1): Adjust.
26977 * tree-ssa-pre.c (compute_avail): Allow VEC_COND_EXPR
26978 to participate in PRE.
26980 2021-04-23 Richard Biener <rguenther@suse.de>
26982 * cfganal.c (connect_infinite_loops_to_exit): First call
26983 add_noreturn_fake_exit_edges.
26984 * ipa-sra.c (process_scan_results): Do not call the now redundant
26985 add_noreturn_fake_exit_edges.
26986 * predict.c (tree_estimate_probability): Likewise.
26987 (rebuild_frequencies): Likewise.
26988 * store-motion.c (one_store_motion_pass): Likewise.
26990 2021-04-23 Richard Biener <rguenther@suse.de>
26992 PR tree-optimization/100222
26993 * predict.c (pass_profile::execute): Remove redundant call to
26994 mark_irreducible_loops.
26995 (report_predictor_hitrates): Likewise.
26997 2021-04-23 Richard Biener <rguenther@suse.de>
26999 * tree-ssa-loop-ivopts.c (rewrite_use_nonlinear_expr): Avoid
27000 valid_gimple_rhs_p by instead gimplifying to one.
27002 2021-04-23 Richard Biener <rguenther@suse.de>
27004 PR tree-optimization/99971
27005 * tree-vect-data-refs.c (vect_slp_analyze_node_dependences):
27006 Always use TBAA for loads.
27008 2021-04-23 liuhongt <hongtao.liu@intel.com>
27011 * config/i386/i386-options.c (ix86_option_override_internal):
27012 Clear MASK_AVX256_SPLIT_UNALIGNED_LOAD/STORE in x_target_flags
27013 when X86_TUNE_AVX256_UNALIGNED_LOAD/STORE_OPTIMAL is enabled
27014 by target attribute.
27016 2021-04-23 David Edelsohn <dje.gcc@gmail.com>
27018 * config/rs6000/aix71.h (PREFERRED_DEBUGGING_TYPE): Change to
27020 * config/rs6000/aix72.h (PREFERRED_DEBUGGING_TYPE): Same.
27022 2021-04-22 David Edelsohn <dje.gcc@gmail.com>
27024 * config.gcc (powerpc-ibm-aix6.*): Remove.
27025 * config/rs6000/aix61.h: Delete.
27027 2021-04-22 Martin Liska <mliska@suse.cz>
27029 PR testsuite/100159
27030 PR testsuite/100192
27031 * builtins.c (expand_builtin): Fix typos and missing comments.
27032 * dwarf2out.c (gen_subprogram_die): Likewise.
27033 (gen_struct_or_union_type_die): Likewise.
27035 2021-04-22 Uroš Bizjak <ubizjak@gmail.com>
27038 * config/i386/i386-expand.c (ix86_expand_convert_uns_sidf_sse):
27039 Remove the sign with FE_DOWNWARD, where x - x = -0.0.
27041 2021-04-21 Iain Sandoe <iain@sandoe.co.uk>
27043 * config/i386/darwin.h (TARGET_64BIT): Remove definition
27044 based on TARGET_ISA_64BIT.
27045 (TARGET_64BIT_P): Remove definition based on
27046 TARGET_ISA_64BIT_P().
27048 2021-04-21 Martin Liska <mliska@suse.cz>
27051 2021-04-21 Martin Liska <mliska@suse.cz>
27053 * lto-wrapper.c (cpuset_popcount): Remove.
27054 (init_num_threads): Remove and use hardware_concurrency.
27056 2021-04-21 Martin Liska <mliska@suse.cz>
27059 * main.c (main): Call toplev::finalize in CHECKING_P mode.
27060 * ipa-modref.c (ipa_modref_c_finalize): summaries are NULL
27061 when incremental LTO linking happens.
27063 2021-04-21 Martin Liska <mliska@suse.cz>
27065 * lto-wrapper.c (run_gcc): When -flto=jobserver is used, but the
27066 makeserver cannot be detected, then use -flto=N fallback.
27068 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
27070 * acinclude.m4 (gcc_AC_INITFINI_ARRAY): When cross-compiling,
27071 default to yes for aarch64-linux-gnu.
27072 * configure: Regenerate.
27074 2021-04-21 Martin Liska <mliska@suse.cz>
27076 * lto-wrapper.c (cpuset_popcount): Remove.
27077 (init_num_threads): Remove and use hardware_concurrency.
27079 2021-04-21 Martin Liska <mliska@suse.cz>
27081 * config/i386/i386.c: Remove superfluous || TARGET_MACHO
27082 which remains to be '(... || 0)' and clang complains about it.
27083 * dwarf2out.c (AT_vms_delta): Declare conditionally.
27084 (add_AT_vms_delta): Likewise.
27085 * tree.c (fld_simplified_type): Use rather more common pattern
27086 for disabling of something (#if 0).
27087 (get_tree_code_name): Likewise.
27088 (verify_type_variant): Likewise.
27090 2021-04-21 Martin Liska <mliska@suse.cz>
27092 * config/i386/i386-expand.c (decide_alignment): Use newly named
27093 macro TARGET_CPU_P.
27094 * config/i386/i386.c (ix86_decompose_address): Likewise.
27095 (ix86_address_cost): Likewise.
27096 (ix86_lea_outperforms): Likewise.
27097 (ix86_avoid_lea_for_addr): Likewise.
27098 (ix86_add_stmt_cost): Likewise.
27099 * config/i386/i386.h (TARGET_*): Remove.
27100 (TARGET_CPU_P): New macro.
27101 * config/i386/i386.md: Use newly named macro TARGET_CPU_P.
27102 * config/i386/x86-tune-sched-atom.c (do_reorder_for_imul): Likewise.
27103 (swap_top_of_ready_list): Likewise.
27104 (ix86_atom_sched_reorder): Likewise.
27105 * config/i386/x86-tune-sched-bd.c (ix86_bd_has_dispatch): Likewise.
27106 * config/i386/x86-tune-sched.c (ix86_adjust_cost): Likewise.
27108 2021-04-21 Martin Liska <mliska@suse.cz>
27110 * config/i386/i386-options.c (TARGET_EXPLICIT_NO_SAHF_P):
27112 (SET_TARGET_NO_SAHF): Likewise.
27113 (TARGET_EXPLICIT_PREFETCH_SSE_P): Likewise.
27114 (SET_TARGET_PREFETCH_SSE): Likewise.
27115 (TARGET_EXPLICIT_NO_TUNE_P): Likewise.
27116 (SET_TARGET_NO_TUNE): Likewise.
27117 (TARGET_EXPLICIT_NO_80387_P): Likewise.
27118 (SET_TARGET_NO_80387): Likewise.
27120 * config/i386/i386.h (TARGET_*): Remove.
27121 * opth-gen.awk: Generate new used macros.
27123 2021-04-21 Martin Liska <mliska@suse.cz>
27125 * config/i386/i386.h (PTA_*): Remove.
27126 (enum pta_flag): New.
27127 (DEF_PTA): Generate PTA_* values from i386-isa.def.
27128 * config/i386/i386-isa.def: New file.
27130 2021-04-21 Alex Coplan <alex.coplan@arm.com>
27133 * config/aarch64/aarch64-bti-insert.c (aarch64_bti_j_insn_p): New.
27134 (rest_of_insert_bti): Avoid inserting duplicate bti j insns for
27135 jump table targets.
27137 2021-04-21 H.J. Lu <hjl.tools@gmail.com>
27139 * config.gcc: Install mwaitintrin.h for i[34567]86-*-* and
27140 x86_64-*-* targets.
27141 * common/config/i386/i386-common.c (OPTION_MASK_ISA2_MWAIT_SET):
27143 (OPTION_MASK_ISA2_MWAIT_UNSET): Likewise.
27144 (ix86_handle_option): Handle -mmwait.
27145 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
27146 Replace OPTION_MASK_ISA_SSE3 with OPTION_MASK_ISA2_MWAIT on
27147 __builtin_ia32_monitor and __builtin_ia32_mwait.
27148 * config/i386/i386-options.c (isa2_opts): Add -mmwait.
27149 (ix86_valid_target_attribute_inner_p): Likewise.
27150 (ix86_option_override_internal): Enable mwait/monitor
27151 instructions for -msse3.
27152 * config/i386/i386.h (TARGET_MWAIT): New.
27153 (TARGET_MWAIT_P): Likewise.
27154 * config/i386/i386.opt: Add -mmwait.
27155 * config/i386/mwaitintrin.h: New file.
27156 * config/i386/pmmintrin.h: Include <mwaitintrin.h>.
27157 * config/i386/sse.md (sse3_mwait): Replace TARGET_SSE3 with
27159 (@sse3_monitor_<mode>): Likewise.
27160 * config/i386/x86gprintrin.h: Include <mwaitintrin.h>.
27161 * doc/extend.texi: Document mwait target attribute.
27162 * doc/invoke.texi: Document -mmwait.
27164 2021-04-21 Martin Liska <mliska@suse.cz>
27166 * config/i386/i386-options.c (DEF_ENUM): Remove it.
27167 * config/i386/i386-opts.h (DEF_ENUM): Likewise.
27168 * config/i386/stringop.def (DEF_ENUM): Likewise.
27170 2021-04-21 Martin Liska <mliska@suse.cz>
27172 * tree-cfg.c (gimple_verify_flow_info): Use qD instead
27173 of print_generic_expr.
27175 2021-04-21 Jakub Jelinek <jakub@redhat.com>
27177 PR rtl-optimization/100148
27178 * cprop.c (constprop_register): Use next_nondebug_insn instead of
27181 2021-04-21 Martin Liska <mliska@suse.cz>
27184 * cgraphunit.c (cgraph_node::analyze): Remove duplicate
27185 free_dominance_info calls.
27187 2021-04-21 Richard Biener <rguenther@suse.de>
27189 * gimple-fold.c (maybe_fold_reference): Remove is_lhs
27190 parameter (and assume it to be false).
27191 (fold_gimple_assign): Adjust, remove all callers of
27192 maybe_fold_reference calling it with is_lhs true.
27193 (gimple_fold_call): Likewise.
27194 (fold_stmt_1): Likewise.
27196 2021-04-21 Richard Biener <rguenther@suse.de>
27198 * fold-const.c (pedantic_non_lvalue_loc): Remove.
27199 (fold_binary_loc): Adjust.
27200 (fold_ternary_loc): Likewise.
27202 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
27204 PR middle-end/100130
27205 * varasm.c (get_block_for_decl): Make sure that any use of the
27206 retain attribute matches the section's retain flag.
27207 (switch_to_section): Check for retain mismatches even when
27208 changing sections, but do not warn if the given decl is the
27209 section's named.decl.
27210 (output_object_block): Pass the first decl in the block (if any)
27211 to switch_to_section.
27213 2021-04-20 H.J. Lu <hjl.tools@gmail.com>
27215 * config/i386/i386-c.c (ix86_target_macros_internal): Define
27216 __CRC32__ for -mcrc32.
27217 * config/i386/i386-options.c (ix86_option_override_internal):
27218 Enable crc32 instruction for -msse4.2.
27219 * config/i386/i386.md (sse4_2_crc32<mode>): Remove TARGET_SSE4_2
27221 (sse4_2_crc32di): Likewise.
27222 * config/i386/ia32intrin.h: Use crc32 target option for CRC32
27225 2021-04-20 Segher Boessenkool <segher@kernel.crashing.org>
27228 * config/rs6000/rs6000.c (rs6000_machine_from_flags): Do not consider
27231 2021-04-20 Martin Liska <mliska@suse.cz>
27233 * doc/invoke.texi: Fix typo.
27234 * params.opt: Likewise.
27236 2021-04-20 Martin Liska <mliska@suse.cz>
27238 * doc/invoke.texi: Document new param.
27240 2021-04-19 Andrew MacLeod <amacleod@redhat.com>
27242 PR tree-optimization/100081
27243 * gimple-range-cache.h (ranger_cache): Inherit from gori_compute
27244 rather than gori_compute_cache.
27245 * gimple-range-gori.cc (is_gimple_logical_p): Move to top of file.
27246 (range_def_chain::m_logical_depth): New member.
27247 (range_def_chain::range_def_chain): Initialize m_logical_depth.
27248 (range_def_chain::get_def_chain): Don't build defchains through more
27249 than LOGICAL_LIMIT logical expressions.
27250 * params.opt (param_ranger_logical_depth): New.
27252 2021-04-19 Richard Earnshaw <rearnsha@arm.com>
27255 * config/arm/arm.c (arm_configure_build_target): Do not strip
27256 extended FPU/SIMD feature bits from the target ISA when -mfpu
27257 is specified (partial revert of r11-8168).
27259 2021-04-19 Thomas Schwinge <thomas@codesourcery.com>
27261 * params.opt (-param=openacc-kernels=): Add.
27262 * omp-oacc-kernels-decompose.cc
27263 (pass_omp_oacc_kernels_decompose::gate): Use it.
27264 * doc/invoke.texi (-fopenacc-kernels=@var{mode}): Move...
27265 (--param): ... here, 'openacc-kernels'.
27267 2021-04-19 Martin Liska <mliska@suse.cz>
27270 * gengtype.c (finish_root_table): Align function arguments
27271 in between declaration and definition.
27273 2021-04-19 Eric Botcazou <ebotcazou@adacore.com>
27275 * config/i386/winnt.c (i386_pe_seh_cold_init): Properly deal with
27276 frames larger than the SEH maximum frame size.
27278 2021-04-18 Segher Boessenkool <segher@kernel.crashing.org>
27280 PR rtl-optimization/99927
27281 * combine.c (distribute_notes) [REG_UNUSED]: If the register already
27282 is dead, just drop it.
27284 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
27287 * config/i386/winnt-d.c (TARGET_D_TEMPLATES_ALWAYS_COMDAT): Define.
27288 * doc/tm.texi: Regenerate.
27289 * doc/tm.texi.in (D language and ABI): Add @hook for
27290 TARGET_D_TEMPLATES_ALWAYS_COMDAT.
27292 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
27294 * config/darwin-d.c (darwin_d_handle_target_object_format): New
27296 (darwin_d_register_target_info): New function.
27297 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27298 * config/dragonfly-d.c (dragonfly_d_handle_target_object_format): New
27300 (dragonfly_d_register_target_info): New function.
27301 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27302 * config/freebsd-d.c (freebsd_d_handle_target_object_format): New
27304 (freebsd_d_register_target_info): New function.
27305 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27306 * config/glibc-d.c (glibc_d_handle_target_object_format): New
27308 (glibc_d_register_target_info): New function.
27309 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27310 * config/i386/i386-d.c (ix86_d_handle_target_object_format): New
27312 (ix86_d_register_target_info): Add ix86_d_handle_target_object_format
27313 as handler for objectFormat key.
27314 * config/i386/winnt-d.c (winnt_d_handle_target_object_format): New
27316 (winnt_d_register_target_info): New function.
27317 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27318 * config/netbsd-d.c (netbsd_d_handle_target_object_format): New
27320 (netbsd_d_register_target_info): New function.
27321 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27322 * config/openbsd-d.c (openbsd_d_handle_target_object_format): New
27324 (openbsd_d_register_target_info): New function.
27325 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27326 * config/pa/pa-d.c (pa_d_handle_target_object_format): New function.
27327 (pa_d_register_target_info): Add pa_d_handle_target_object_format as
27328 handler for objectFormat key.
27329 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_object_format): New
27331 (rs6000_d_register_target_info): Add
27332 rs6000_d_handle_target_object_format as handler for objectFormat key.
27333 * config/sol2-d.c (solaris_d_handle_target_object_format): New
27335 (solaris_d_register_target_info): New function.
27336 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27338 2021-04-16 Jakub Jelinek <jakub@redhat.com>
27341 * config/aarch64/aarch64.c (aarch64_function_arg_alignment): Change
27342 abi_break argument from bool * to unsigned *, store there the pre-GCC 9
27344 (aarch64_layout_arg, aarch64_gimplify_va_arg_expr): Adjust callers.
27345 (aarch64_function_arg_regno_p): Likewise. Only emit -Wpsabi note if
27346 the old and new alignment after applying MIN/MAX to it is different.
27348 2021-04-16 Tamar Christina <tamar.christina@arm.com>
27351 * config/aarch64/aarch64-sve.md (@aarch64_sve_trn1_conv<mode>): New.
27352 * config/aarch64/aarch64.c (aarch64_expand_sve_const_pred_trn): Use new
27354 * config/aarch64/iterators.md (UNSPEC_TRN1_CONV): New.
27356 2021-04-16 Bill Schmidt <wschmidt@linux.ibm.com>
27358 * doc/extend.texi (PowerPC AltiVec/VSX Built-in Functions): Revise
27359 this section and its subsections.
27361 2021-04-16 Jakub Jelinek <jakub@redhat.com>
27364 * config/aarch64/aarch64.md (*neg_asr_si2_extr, *extrsi5_insn_di): New
27365 define_insn patterns.
27367 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
27369 PR rtl-optimization/98689
27370 * reg-notes.def (UNTYPED_CALL): New note.
27371 * combine.c (distribute_notes): Handle it.
27372 * emit-rtl.c (try_split): Likewise.
27373 * rtlanal.c (rtx_properties::try_to_add_insn): Likewise. Assume
27374 that calls with the note implicitly set all return value registers.
27375 * builtins.c (expand_builtin_apply): Add a REG_UNTYPED_CALL
27378 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
27380 PR rtl-optimization/99596
27381 * rtlanal.c (rtx_properties::try_to_add_insn): Don't add global
27382 register accesses for const calls. Assume that pure functions
27383 can only read from global registers. Ignore cases in which
27384 the stack pointer has been marked global.
27386 2021-04-16 Jakub Jelinek <jakub@redhat.com>
27389 * tree-vect-loop.c (vect_transform_loop): Don't remove just
27390 dead scalar .MASK_LOAD calls, but also dead .COND_* calls - replace
27391 them by their last argument.
27393 2021-04-15 Martin Liska <mliska@suse.cz>
27395 * doc/invoke.texi: Other params don't use it, remove it.
27397 2021-04-15 Richard Biener <rguenther@suse.de>
27399 * gimple-builder.h: Add deprecation note.
27401 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
27404 * attribs.h (restrict_type_identity_attributes_to): Declare.
27405 * attribs.c (restrict_type_identity_attributes_to): New function.
27407 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
27410 * attribs.h (affects_type_identity_attributes): Declare.
27411 * attribs.c (remove_attributes_matching): New function.
27412 (affects_type_identity_attributes): Likewise.
27414 2021-04-15 Jakub Jelinek <jakub@redhat.com>
27417 * config/aarch64/aarch64.md (*<LOGICAL:optab>_<SHIFT:optab><mode>3):
27418 Add combine splitters for *<LOGICAL:optab>_ashl<mode>3 with
27419 ZERO_EXTEND, SIGN_EXTEND or AND.
27421 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
27423 PR rtl-optimization/99929
27424 * rtl.h (same_vector_encodings_p): New function.
27425 * cse.c (exp_equiv_p): Check that CONST_VECTORs have the same encoding.
27426 * cselib.c (rtx_equal_for_cselib_1): Likewise.
27427 * jump.c (rtx_renumbered_equal_p): Likewise.
27428 * lra-constraints.c (operands_match_p): Likewise.
27429 * reload.c (operands_match_p): Likewise.
27430 * rtl.c (rtx_equal_p_cb, rtx_equal_p): Likewise.
27432 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
27434 * print-rtl.c (rtx_writer::print_rtx_operand_codes_E_and_V): Print
27435 more information about variable-length CONST_VECTORs.
27437 2021-04-14 Vladimir N. Makarov <vmakarov@redhat.com>
27439 PR rtl-optimization/100066
27440 * lra-constraints.c (split_reg): Check paradoxical_subreg_p for
27441 ordered modes when choosing splitting mode for hard reg.
27443 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
27446 * config/aarch64/aarch64.c (aarch64_expand_sve_const_vector_sel):
27448 (aarch64_expand_sve_const_vector): Use it for nelts_per_pattern==2.
27450 2021-04-14 Andreas Krebbel <krebbel@linux.ibm.com>
27452 * config/s390/s390-builtins.def (O_M5, O_M12, ...): Add new macros
27453 for mask operand types.
27454 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
27455 (s390_vec_permi_dbl, s390_vpdi): Use the M5 type for the immediate
27457 (s390_vec_msum_u128, s390_vmslg): Use the M12 type for the
27459 * config/s390/s390.c (s390_const_operand_ok): Check the new
27460 operand types and generate a list of valid values.
27462 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
27464 * doc/tm.texi: Regenerate.
27465 * doc/tm.texi.in (D language and ABI): Add @hook for
27466 TARGET_D_REGISTER_OS_TARGET_INFO.
27468 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
27470 * config/aarch64/aarch64-d.c (aarch64_d_handle_target_float_abi): New
27472 (aarch64_d_register_target_info): New function.
27473 * config/aarch64/aarch64-protos.h (aarch64_d_register_target_info):
27475 * config/aarch64/aarch64.h (TARGET_D_REGISTER_CPU_TARGET_INFO):
27477 * config/arm/arm-d.c (arm_d_handle_target_float_abi): New function.
27478 (arm_d_register_target_info): New function.
27479 * config/arm/arm-protos.h (arm_d_register_target_info): Declare.
27480 * config/arm/arm.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27481 * config/i386/i386-d.c (ix86_d_handle_target_float_abi): New function.
27482 (ix86_d_register_target_info): New function.
27483 * config/i386/i386-protos.h (ix86_d_register_target_info): Declare.
27484 * config/i386/i386.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27485 * config/mips/mips-d.c (mips_d_handle_target_float_abi): New function.
27486 (mips_d_register_target_info): New function.
27487 * config/mips/mips-protos.h (mips_d_register_target_info): Declare.
27488 * config/mips/mips.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27489 * config/pa/pa-d.c (pa_d_handle_target_float_abi): New function.
27490 (pa_d_register_target_info): New function.
27491 * config/pa/pa-protos.h (pa_d_register_target_info): Declare.
27492 * config/pa/pa.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27493 * config/riscv/riscv-d.c (riscv_d_handle_target_float_abi): New
27495 (riscv_d_register_target_info): New function.
27496 * config/riscv/riscv-protos.h (riscv_d_register_target_info): Declare.
27497 * config/riscv/riscv.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27498 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_float_abi): New
27500 (rs6000_d_register_target_info): New function.
27501 * config/rs6000/rs6000-protos.h (rs6000_d_register_target_info):
27503 * config/rs6000/rs6000.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27504 * config/s390/s390-d.c (s390_d_handle_target_float_abi): New function.
27505 (s390_d_register_target_info): New function.
27506 * config/s390/s390-protos.h (s390_d_register_target_info): Declare.
27507 * config/s390/s390.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27508 * config/sparc/sparc-d.c (sparc_d_handle_target_float_abi): New
27510 (sparc_d_register_target_info): New function.
27511 * config/sparc/sparc-protos.h (sparc_d_register_target_info): Declare.
27512 * config/sparc/sparc.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27513 * doc/tm.texi: Regenerate.
27514 * doc/tm.texi.in (D language and ABI): Add @hook for
27515 TARGET_D_REGISTER_CPU_TARGET_INFO.
27517 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
27519 * config/i386/i386-d.c (ix86_d_has_stdcall_convention): New function.
27520 * config/i386/i386-protos.h (ix86_d_has_stdcall_convention): Declare.
27521 * config/i386/i386.h (TARGET_D_HAS_STDCALL_CONVENTION): Define.
27522 * doc/tm.texi: Regenerate.
27523 * doc/tm.texi.in (D language and ABI): Add @hook for
27524 TARGET_D_HAS_STDCALL_CONVENTION.
27526 2021-04-14 Richard Biener <rguenther@suse.de>
27528 * tree-cfg.c (verify_gimple_assign_ternary): Verify that
27529 VEC_COND_EXPRs have a gimple_val condition.
27530 * tree-ssa-propagate.c (valid_gimple_rhs_p): VEC_COND_EXPR
27531 can no longer have a GENERIC condition.
27533 2021-04-14 Richard Earnshaw <rearnsha@arm.com>
27536 * config/arm/arm.c (arm_configure_build_target): Strip isa_all_fpbits
27537 from the isa_delta when -mfpu has been used.
27538 (arm_options_perform_arch_sanity_checks): It's the architecture that
27539 lacks an FPU not the processor.
27541 2021-04-13 Richard Biener <rguenther@suse.de>
27543 PR tree-optimization/100053
27544 * tree-ssa-sccvn.c (vn_nary_op_get_predicated_value): Do
27545 not use optimistic dominance queries for backedges to validate
27547 (dominated_by_p_w_unex): Add parameter to ignore executable
27548 state on backedges.
27549 (rpo_elim::eliminate_avail): Adjust.
27551 2021-04-13 Jakub Jelinek <jakub@redhat.com>
27554 * config/aarch64/aarch64.md (*aarch64_bfxil<mode>_extr,
27555 *aarch64_bfxilsi_extrdi): New define_insn patterns.
27557 2021-04-13 Jakub Jelinek <jakub@redhat.com>
27560 * simplify-rtx.c (simplify_immed_subreg): For MODE_COMPOSITE_P
27561 outermode, return NULL if the result doesn't encode back to the
27562 original byte sequence.
27563 (simplify_gen_subreg): Don't create SUBREGs from constants to
27564 MODE_COMPOSITE_P outermode.
27566 2021-04-12 Jakub Jelinek <jakub@redhat.com>
27568 PR rtl-optimization/99905
27569 * combine.c (expand_compound_operation): If pos + len > modewidth,
27570 perform the right shift by pos in inner_mode and then convert to mode,
27571 instead of trying to simplify a shift of rtx with inner_mode by pos
27572 as if it was a shift in mode.
27574 2021-04-12 Jakub Jelinek <jakub@redhat.com>
27577 * combine.c (simplify_and_const_int_1): Don't optimize varop
27578 away if it has side-effects.
27580 2021-04-12 Martin Liska <mliska@suse.cz>
27582 * doc/extend.texi: Escape @smallexample content.
27584 2021-04-12 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
27586 * config/s390/s390.md ("*movdi_31", "*movdi_64"): Add
27587 alternative in order to load a DFP zero.
27589 2021-04-12 Martin Liska <mliska@suse.cz>
27591 * doc/extend.texi: Be more precise in documentation
27592 of symver attribute.
27594 2021-04-12 Martin Liska <mliska@suse.cz>
27597 * gimplify.c (gimplify_expr): Right now, we unpoison all
27598 variables before a goto <dest>. We should not do it if we are
27601 2021-04-12 Cui,Lili <lili.cui@intel.com>
27603 * common/config/i386/cpuinfo.h (get_intel_cpu): Handle
27605 * common/config/i386/i386-common.c (processor_names): Add
27607 (processor_alias_table): Add rocketlake.
27608 * common/config/i386/i386-cpuinfo.h (processor_subtypes): Add
27609 INTEL_COREI7_ROCKETLAKE.
27610 * config.gcc: Add -march=rocketlake.
27611 * config/i386/i386-c.c (ix86_target_macros_internal): Handle
27613 * config/i386/i386-options.c (m_ROCKETLAKE) : Define.
27614 (processor_cost_table): Add rocketlake cost.
27615 * config/i386/i386.h (ix86_size_cost) : Define
27617 (processor_type) : Add PROCESSOR_ROCKETLAKE.
27618 (PTA_ROCKETLAKE): Ditto.
27619 * doc/extend.texi: Add rocketlake.
27620 * doc/invoke.texi: Add rocketlake.
27622 2021-04-12 Cui,Lili <lili.cui@intel.com>
27624 * config/i386/i386.h (PTA_ALDERLAKE): Change alderlake ISA list.
27625 * config/i386/i386-options.c (m_CORE_AVX2): Add m_ALDERLAKE.
27626 * common/config/i386/cpuinfo.h (get_intel_cpu): Add AlderLake model.
27627 * doc/invoke.texi: Change alderlake ISA list.
27629 2021-04-11 Hafiz Abid Qadeer <abidh@codesourcery.com>
27631 PR middle-end/98088
27632 * omp-expand.c (expand_oacc_collapse_init): Update condition in
27635 2021-04-10 H.J. Lu <hjl.tools@gmail.com>
27638 * config/i386/serializeintrin.h (_serialize): Defined as macro.
27640 2021-04-10 Jakub Jelinek <jakub@redhat.com>
27643 * expr.c (expand_expr_addr_expr_1): Test is_global_var rather than
27644 just TREE_STATIC on COMPOUND_LITERAL_EXPR_DECLs.
27646 2021-04-10 Jakub Jelinek <jakub@redhat.com>
27648 PR middle-end/99989
27649 * gimple-ssa-warn-alloca.c
27650 (alloca_type_and_limit::alloca_type_and_limit): Initialize limit to
27651 0 with integer precision unconditionally.
27653 2021-04-10 Jakub Jelinek <jakub@redhat.com>
27655 PR rtl-optimization/98601
27656 * rtlanal.c (rtx_addr_can_trap_p_1): Allow in assert unknown size
27657 not just for BLKmode, but also for VOIDmode. For STRICT_ALIGNMENT
27658 unaligned_mems handle VOIDmode like BLKmode.
27660 2021-04-10 Jan Hubicka <hubicka@ucw.cz>
27663 * tree.c (free_lang_data_in_decl): Do not release body of
27664 declare_variant_alt.
27666 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
27668 * config/aarch64/aarch64.c (aarch64_option_restore): If the
27669 architecture was specified explicitly and the tuning wasn't,
27670 tune for the architecture rather than the configured default CPU.
27672 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
27674 * config/aarch64/aarch64.md (tlsdesc_small_sve_<mode>): Use X30
27675 as the temporary register.
27677 2021-04-09 Martin Liska <mliska@suse.cz>
27679 * doc/extend.texi: Move non-target attributes on the top level.
27681 2021-04-09 Martin Liska <mliska@suse.cz>
27683 * doc/invoke.texi: Document minimum and maximum value of the
27684 argument for both supported compression algorithms.
27686 2021-04-08 David Edelsohn <dje.gcc@gmail.com>
27688 * config/rs6000/rs6000.c (rs6000_xcoff_select_section): Select
27689 TLS BSS before TLS data.
27690 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .comm.
27692 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
27694 * doc/sourcebuild.texi (stdint_types_mbig_endian): Document.
27696 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
27698 * match.pd: Extend vec_cond folds to handle shifts.
27700 2021-04-08 Maciej W. Rozycki <macro@orcam.me.uk>
27702 * config/vax/vax.md: Fix comment for `*bit<mode>' pattern's
27705 2021-04-08 Alex Coplan <alex.coplan@arm.com>
27708 * config/arm/iterators.md (MVE_vecs): New.
27709 (V_elem): Also handle V2DF.
27710 * config/arm/mve.md (*mve_mov<mode>): Rename to ...
27711 (*mve_vdup<mode>): ... this. Remove second alternative since
27712 vec_duplicate of const_int is not canonical RTL, and we don't
27713 want to match symbol_refs.
27714 (*mve_vec_duplicate<mode>): Delete (pattern is redundant).
27716 2021-04-08 Xionghu Luo <luoxhu@linux.ibm.com>
27718 * fold-const.c (fold_single_bit_test): Fix typo.
27719 * print-rtl.c (print_rtx_insn_vec): Call print_rtl_single
27722 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
27724 PR tree-optimization/97513
27725 * tree-vect-slp.c (vect_add_slp_permutation): New function,
27727 (vectorizable_slp_permutation): ...here. Detect cases in which
27728 all VEC_PERM_EXPRs are guaranteed to have the same stepped
27729 permute vector and only generate one permute vector for that case.
27730 Extend that case to handle variable-length vectors.
27732 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
27734 PR tree-optimization/99873
27735 * tree-vect-slp.c (vect_slp_prefer_store_lanes_p): New function.
27736 (vect_build_slp_instance): Don't split store groups that could
27737 use IFN_STORE_LANES.
27739 2021-04-07 Jakub Jelinek <jakub@redhat.com>
27742 * varasm.c (output_constant_pool_contents): Don't strip name encoding
27743 from XSTR (desc->sym, 0) or from label before passing those to
27746 2021-04-07 Richard Biener <rguenther@suse.de>
27748 PR tree-optimization/99954
27749 * tree-loop-distribution.c: Include tree-affine.h.
27750 (generate_memcpy_builtin): Try using tree-affine to prove
27752 (loop_distribution::classify_builtin_ldst): Always classify
27755 2021-04-07 Richard Biener <rguenther@suse.de>
27757 PR tree-optimization/99947
27758 * tree-vect-loop.c (vectorizable_induction): Pre-allocate
27759 steps vector to avoid pushing elements from the reallocated
27762 2021-04-07 Richard Biener <rguenther@suse.de>
27764 * tree-ssa-sccvn.h (print_vn_reference_ops): Declare.
27765 * tree-ssa-pre.c (print_pre_expr): Factor out VN reference operand
27767 * tree-ssa-sccvn.c (print_vn_reference_ops): ... into this new
27769 (debug_vn_reference_ops): New.
27771 2021-04-07 Bin Cheng <bin.cheng@linux.alibaba.com>
27773 PR tree-optimization/98736
27774 * tree-loop-distribution.c
27775 * (loop_distribution::bb_top_order_init):
27776 Compute RPO with programing order preserved by calling function
27777 rev_post_order_and_mark_dfs_back_seme.
27779 2021-04-06 Vladimir N. Makarov <vmakarov@redhat.com>
27782 * lra-constraints.c (split_reg): Don't check paradoxical_subreg_p.
27783 * lra-lives.c (clear_sparseset_regnos, regnos_in_sparseset_p): New
27785 (process_bb_lives): Don't update biggest mode of hard reg for
27786 implicit in multi-register group. Use the new functions for
27787 updating dead_set and unused_set by register notes.
27789 2021-04-06 Xianmiao Qu <xianmiao_qu@c-sky.com>
27791 * config/csky/csky_pipeline_ck802.md : Use insn reservation name
27794 2021-04-06 H.J. Lu <hjl.tools@gmail.com>
27796 * config/i386/x86-tune-costs.h (skylake_memcpy): Updated.
27797 (skylake_memset): Likewise.
27798 (skylake_cost): Change CLEAR_RATIO to 17.
27799 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
27800 Replace m_CANNONLAKE, m_ICELAKE_CLIENT, m_ICELAKE_SERVER,
27801 m_TIGERLAKE and m_SAPPHIRERAPIDS with m_SKYLAKE and m_CORE_AVX512.
27803 2021-04-06 Richard Biener <rguenther@suse.de>
27805 PR tree-optimization/99880
27806 * tree-vect-loop.c (maybe_set_vectorized_backedge_value): Only
27807 set vectorized defs of relevant PHIs.
27809 2021-04-06 Richard Biener <rguenther@suse.de>
27811 PR tree-optimization/99924
27812 * tree-vect-slp.c (vect_bb_partition_graph_r): Do not mark
27813 nodes w/o scalar stmts as visited.
27815 2021-04-06 Alex Coplan <alex.coplan@arm.com>
27818 * config/arm/arm.c (arm_libcall_uses_aapcs_base): Also use base
27819 PCS for [su]fix_optab.
27821 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
27823 * config/darwin.c (machopic_legitimize_pic_address): Check
27824 that the current pic register is one of the hard reg set
27825 before setting liveness.
27827 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
27829 * config/darwin.c (machopic_legitimize_pic_address): Fix
27830 whitespace, remove unused code.
27832 2021-04-03 Jakub Jelinek <jakub@redhat.com>
27834 PR tree-optimization/99882
27835 * gimple-ssa-store-merging.c (bswap_view_convert): Handle val with
27838 2021-04-03 Jakub Jelinek <jakub@redhat.com>
27840 PR rtl-optimization/99863
27841 * dse.c (replace_read): Drop regs_live argument. Instead of
27842 regs_live, use store_insn->fixed_regs_live if non-NULL,
27843 otherwise punt if insns sequence clobbers or sets any hard
27846 2021-04-03 Jakub Jelinek <jakub@redhat.com>
27849 * targhooks.h (default_print_patchable_function_entry_1): Declare.
27850 * targhooks.c (default_print_patchable_function_entry_1): New function,
27851 copied from default_print_patchable_function_entry with an added flags
27853 (default_print_patchable_function_entry): Rewritten into a small
27854 wrapper around default_print_patchable_function_entry_1.
27855 * config/rs6000/rs6000.c (TARGET_ASM_PRINT_PATCHABLE_FUNCTION_ENTRY):
27857 (rs6000_print_patchable_function_entry): New function.
27859 2021-04-02 Eric Botcazou <ebotcazou@adacore.com>
27861 * doc/invoke.texi (fdelete-dead-exceptions): Minor tweak.
27863 2021-04-01 Jason Merrill <jason@redhat.com>
27866 * common.opt: Document v15 and v16.
27868 2021-04-01 Richard Biener <rguenther@suse.de>
27870 PR tree-optimization/99863
27871 * gimplify.c (gimplify_init_constructor): Recompute vector
27874 2021-04-01 Jakub Jelinek <jakub@redhat.com>
27876 * doc/extend.texi (symver attribute): Fix up syntax errors
27879 2021-04-01 Jakub Jelinek <jakub@redhat.com>
27881 PR tree-optimization/96573
27882 * gimple-ssa-store-merging.c (init_symbolic_number): Handle
27883 also pointer types.
27885 2021-04-01 Richard Biener <rguenther@suse.de>
27887 PR tree-optimization/99856
27888 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Promote
27889 precision to vector element precision.
27891 2021-04-01 Martin Jambor <mjambor@suse.cz>
27893 PR tree-optimization/97009
27894 * tree-sra.c (access_or_its_child_written): New function.
27895 (propagate_subaccesses_from_rhs): Use it instead of a simple grp_write
27898 2021-03-31 Jan Hubicka <hubicka@ucw.cz>
27901 * cif-code.def (USES_COMDAT_LOCAL): Make CIF_FINAL_NORMAL.
27903 2021-03-31 Pat Haugen <pthaugen@linux.ibm.com>
27906 * config/rs6000/altivec.md (xxspltiw_v4si, xxspltiw_v4sf_inst,
27907 xxspltidp_v2df_inst, xxsplti32dx_v4si_inst, xxsplti32dx_v4sf_inst,
27908 xxblend_<mode>, xxpermx_inst, xxeval): Mark prefixed.
27909 * config/rs6000/mma.md (mma_<vvi4i4i8>, mma_<avvi4i4i8>,
27910 mma_<vvi4i4i2>, mma_<avvi4i4i2>, mma_<vvi4i4>, mma_<avvi4i4>,
27911 mma_<pvi4i2>, mma_<apvi4i2>, mma_<vvi4i4i4>, mma_<avvi4i4i4>):
27913 * config/rs6000/rs6000.c (rs6000_final_prescan_insn): Adjust test.
27914 * config/rs6000/rs6000.md (define_attr "maybe_prefixed"): New.
27915 (define_attr "prefixed"): Update initializer.
27917 2021-03-31 Jakub Jelinek <jakub@redhat.com>
27920 * dwarf2out.c (debug_ranges_dwo_section): New variable.
27921 (DW_RANGES_IDX_SKELETON): Define.
27922 (struct dw_ranges): Add begin_entry and end_entry members.
27923 (DEBUG_DWO_RNGLISTS_SECTION): Define.
27924 (add_ranges_num): Adjust r initializer for addition of *_entry
27926 (add_ranges_by_labels): For -gsplit-dwarf and force_direct,
27927 set idx to DW_RANGES_IDX_SKELETON.
27928 (use_distinct_base_address_for_range): New function.
27929 (index_rnglists): Don't set r->idx if it is equal to
27930 DW_RANGES_IDX_SKELETON. Initialize r->begin_entry and
27931 r->end_entry for -gsplit-dwarf if those will be needed by
27933 (output_rnglists): Add DWO argument. If true, switch to
27934 debug_ranges_dwo_section rather than debug_ranges_section.
27935 Adjust l1/l2 label indexes. Only output the offset table when
27936 dwo is true and don't include in there the skeleton range
27937 entry if present. For -gsplit-dwarf, skip ranges that belong
27938 to the other rnglists section. Change return type from void
27939 to bool and return true if there are any range entries for
27940 the other section. For dwarf_split_debug_info use
27941 DW_RLE_startx_endx, DW_RLE_startx_length and DW_RLE_base_addressx
27942 entries instead of DW_RLE_start_end, DW_RLE_start_length and
27943 DW_RLE_base_address. Use use_distinct_base_address_for_range.
27944 (init_sections_and_labels): Initialize debug_ranges_dwo_section
27945 if -gsplit-dwarf and DWARF >= 5. Adjust ranges_section_label
27946 and range_base_label indexes.
27947 (dwarf2out_finish): Call index_rnglists earlier before finalizing
27948 .debug_addr. Never emit DW_AT_rnglists_base attribute. For
27949 -gsplit-dwarf and DWARF >= 5 call output_rnglists up to twice
27950 with different dwo arguments.
27951 (dwarf2out_c_finalize): Clear debug_ranges_dwo_section.
27953 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
27955 PR tree-optimization/98268
27956 * gimple-fold.c (maybe_canonicalize_mem_ref_addr): Call
27957 recompute_tree_invariant_for_addr_expr after successfully
27958 folding a TARGET_MEM_REF that occurs inside an ADDR_EXPR.
27960 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
27962 PR tree-optimization/99726
27963 * tree-data-ref.c (create_intersect_range_checks_index): Bail
27964 out if there is more than one access function SCEV for the loop
27967 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
27969 PR rtl-optimization/97141
27970 PR rtl-optimization/98726
27971 * emit-rtl.c (valid_for_const_vector_p): Return true for
27973 * rtx-vector-builder.h (rtx_vector_builder::step): Return a
27974 poly_wide_int instead of a wide_int.
27975 (rtx_vector_builder::apply_set): Take a poly_wide_int instead
27977 * rtx-vector-builder.c (rtx_vector_builder::apply_set): Likewise.
27978 * config/aarch64/aarch64.c (aarch64_legitimate_constant_p): Return
27979 false for CONST_VECTORs that cannot be forced to memory.
27980 * config/aarch64/aarch64-simd.md (mov<mode>): If a CONST_VECTOR
27981 is too complex to force to memory, build it up from individual
27984 2021-03-31 Jan Hubicka <jh@suse.cz>
27987 * cgraph.c (cgraph_node::release_body): Fix overactive check.
27989 2021-03-31 Christophe Lyon <christophe.lyon@linaro.org>
27992 * config/arm/vec-common.md (mul<mode>3): Disable on iwMMXT, expect
27995 2021-03-31 H.J. Lu <hjl.tools@gmail.com>
27997 * config/i386/i386-expand.c (expand_set_or_cpymem_via_rep):
27998 For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, don't convert QImode
28000 (decide_alg): For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, use
28001 "rep movsb/stosb" only for known sizes.
28002 * config/i386/i386-options.c (processor_cost_table): Use Ice
28003 Lake cost for Cannon Lake, Ice Lake, Tiger Lake, Sapphire
28004 Rapids and Alder Lake.
28005 * config/i386/i386.h (TARGET_PREFER_KNOWN_REP_MOVSB_STOSB): New.
28006 * config/i386/x86-tune-costs.h (icelake_memcpy): New.
28007 (icelake_memset): Likewise.
28008 (icelake_cost): Likewise.
28009 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
28012 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
28015 * config/aarch64/aarch64.c
28016 (aarch64_vectorize_preferred_vector_alignment): Query the size
28017 of the provided SVE vector; do not assume that all SVE vectors
28018 have the same size.
28020 2021-03-31 Jan Hubicka <jh@suse.cz>
28023 * cgraph.c (cgraph_node::release_body): Remove all callers and
28025 * cgraphclones.c (cgraph_node::materialize_clone): Do not do it here.
28026 * cgraphunit.c (cgraph_node::expand): And here.
28028 2021-03-31 Martin Liska <mliska@suse.cz>
28030 * ipa-modref.c (analyze_ssa_name_flags): Fix coding style
28031 and one negated condition.
28033 2021-03-31 Jakub Jelinek <jakub@redhat.com>
28034 Richard Sandiford <richard.sandiford@arm.com>
28037 * config/aarch64/aarch64.md (*add<mode>3_poly_1): Swap Uai and Uav
28038 constraints on operands[2] and similarly 0 and rk constraints
28039 on operands[1] corresponding to that.
28041 2021-03-31 Jakub Jelinek <jakub@redhat.com>
28044 * configure.ac (HAVE_LD_BROKEN_PE_DWARF5): New AC_DEFINE if PECOFF
28045 linker doesn't support DWARF sections new in DWARF5.
28046 * config/i386/i386-options.c (ix86_option_override_internal): Default
28047 to dwarf_version 4 if HAVE_LD_BROKEN_PE_DWARF5 for TARGET_PECOFF
28049 * config.in: Regenerated.
28050 * configure: Regenerated.
28052 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28055 * config/aarch64/aarch64.c (aarch64_analyze_loop_vinfo): Check for
28056 available issue_info before using it.
28058 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28061 * config/aarch64/aarch64.md (sub<mode>3_compare1_imm): Do not allow zero
28064 2021-03-30 Xionghu Luo <luoxhu@linux.ibm.com>
28067 * config/rs6000/altivec.md (altivec_lvsl_reg): Change to ...
28068 (altivec_lvsl_reg_<mode>): ... this.
28069 (altivec_lvsr_reg): Change to ...
28070 (altivec_lvsr_reg_<mode>): ... this.
28071 * config/rs6000/predicates.md (vec_set_index_operand): New.
28072 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
28073 Enable 32bit variable vec_insert for all TARGET_VSX.
28074 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
28075 Enable 32bit variable vec_insert for p9 and above.
28076 (rs6000_expand_vector_set_var_p8): Rename to ...
28077 (rs6000_expand_vector_set_var_p7): ... this.
28078 (rs6000_expand_vector_set): Use TARGET_VSX and adjust assert
28080 * config/rs6000/vector.md (vec_set<mode>): Use vec_set_index_operand.
28081 * config/rs6000/vsx.md (xl_len_r): Use gen_altivec_lvsl_reg_di and
28082 gen_altivec_lvsr_reg_di.
28084 2021-03-30 H.J. Lu <hjl.tools@gmail.com>
28087 * config/i386/ia32intrin.h (__rdtsc): Defined as macro.
28088 (__rdtscp): Likewise.
28090 2021-03-30 Tamar Christina <tamar.christina@arm.com>
28092 PR tree-optimization/99825
28093 * tree-vect-slp-patterns.c (vect_check_evenodd_blend):
28094 Reject non-mult 2 lanes.
28096 2021-03-30 Richard Earnshaw <rearnsha@arm.com>
28099 * config/arm/arm.c (arm_file_start): Fix emission of
28100 Tag_ABI_VFP_args attribute.
28102 2021-03-30 Richard Biener <rguenther@suse.de>
28104 PR tree-optimization/99824
28105 * stor-layout.c (set_min_and_max_values_for_integral_type):
28106 Assert the precision is within the bounds of
28107 WIDE_INT_MAX_PRECISION.
28108 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Use
28109 the outermost component ref only to lower the access size
28110 and initialize that from the access type.
28112 2021-03-30 Richard Sandiford <richard.sandiford@arm.com>
28115 * config/aarch64/aarch64.md (mov<mode>): Pass multi-instruction
28116 CONST_INTs to aarch64_expand_mov_immediate when called after RA.
28118 2021-03-30 Mihailo Stojanovic <mihailo.stojanovic@typhoon-hil.com>
28120 * config/aarch64/aarch64.md
28121 (<optab>_trunc<fcvt_target><GPI:mode>2): Set the "arch"
28122 attribute to disambiguate between SIMD and FP variants of the
28125 2021-03-29 Jan Hubicka <hubicka@ucw.cz>
28127 * ipa-modref.c (merge_call_lhs_flags): Correct handling of deref.
28128 (analyze_ssa_name_flags): Fix typo in comment.
28130 2021-03-29 Alex Coplan <alex.coplan@arm.com>
28133 * config/aarch64/aarch64-sve-builtins.cc
28134 (function_builder::add_function): Add placeholder_p argument, use
28135 placeholder decls if this is set.
28136 (function_builder::add_unique_function): Instead of conditionally adding
28137 direct overloads, unconditionally add either a direct overload or a
28139 (function_builder::add_overloaded_function): Set placeholder_p if we're
28140 using C++ overloads. Use the obstack for string storage instead
28141 of relying on the tree nodes.
28142 (function_builder::add_overloaded_functions): Don't return early for
28143 m_direct_overloads: we need to add placeholders.
28144 * config/aarch64/aarch64-sve-builtins.h
28145 (function_builder::add_function): Add placeholder_p argument.
28147 2021-03-29 Richard Biener <rguenther@suse.de>
28149 PR tree-optimization/99807
28150 * tree-vect-slp.c (vect_slp_analyze_node_operations_1): Move
28151 assert below VEC_PERM handling.
28153 2021-03-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28156 * config/aarch64/aarch64-simd.md (move_lo_quad_internal_<mode>): Use
28157 aarch64_simd_or_scalar_imm_zero to match zeroes. Remove pattern
28158 matching const_int 0.
28159 (move_lo_quad_internal_be_<mode>): Likewise.
28160 (move_lo_quad_<mode>): Update for the above.
28161 * config/aarch64/iterators.md (VQ_2E): Delete.
28163 2021-03-29 Jakub Jelinek <jakub@redhat.com>
28165 PR tree-optimization/99777
28166 * fold-const.c (extract_muldiv_1): For conversions, punt on casts from
28167 types other than scalar integral types.
28169 2021-03-28 David Edelsohn <dje.gcc@gmail.com>
28171 * config/rs6000/rs6000.c (rs6000_output_dwarf_dtprel): Do not add
28172 XCOFF TLS reloc decorations.
28174 2021-03-28 Gerald Pfeifer <gerald@pfeifer.com>
28176 * doc/analyzer.texi (Analyzer Internals): Update link to
28177 "A Memory Model for Static Analysis of C Programs".
28179 2021-03-26 David Edelsohn <dje.gcc@gmail.com>
28181 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Call function.
28182 * config/rs6000/rs6000-protos.h (rs6000_special_adjust_field_align):
28184 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align): New.
28185 (rs6000_special_round_type_align): Recursively check innermost first
28188 2021-03-26 Jakub Jelinek <jakub@redhat.com>
28191 * dwarf2out.h (struct dw_fde_node): Add rule18 member.
28192 * dwarf2cfi.c (dwarf2out_frame_debug_expr): When handling (set hfp sp)
28193 assignment with drap_reg active, queue reg save for hfp with offset 0
28194 and flush queued reg saves. When handling a push with rule18,
28195 defer queueing reg save for hfp and just assert the offset is 0.
28196 (scan_trace): Assert that fde->rule18 is false.
28198 2021-03-26 Vladimir Makarov <vmakarov@redhat.com>
28201 * ira-costs.c (record_reg_classes): Put case with
28202 CT_RELAXED_MEMORY adjacent to one with CT_MEMORY.
28203 * ira.c (ira_setup_alts): Ditto.
28204 * lra-constraints.c (process_alt_operands): Ditto.
28205 * recog.c (asm_operand_ok): Ditto.
28206 * reload.c (find_reloads): Ditto.
28208 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28210 * config/aarch64/aarch64-protos.h
28211 (cpu_addrcost_table::post_modify_ld3_st3): New member variable.
28212 (cpu_addrcost_table::post_modify_ld4_st4): Likewise.
28213 * config/aarch64/aarch64.c (generic_addrcost_table): Update
28214 accordingly, using the same costs as for post_modify.
28215 (exynosm1_addrcost_table, xgene1_addrcost_table): Likewise.
28216 (thunderx2t99_addrcost_table, thunderx3t110_addrcost_table):
28217 (tsv110_addrcost_table, qdf24xx_addrcost_table): Likewise.
28218 (a64fx_addrcost_table): Likewise.
28219 (neoversev1_addrcost_table): New.
28220 (neoversev1_tunings): Use neoversev1_addrcost_table.
28221 (aarch64_address_cost): Use the new post_modify costs for CImode
28224 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28226 * config/aarch64/aarch64.opt
28227 (-param=aarch64-loop-vect-issue-rate-niters=): New parameter.
28228 * doc/invoke.texi: Document it.
28229 * config/aarch64/aarch64-protos.h (aarch64_base_vec_issue_info)
28230 (aarch64_scalar_vec_issue_info, aarch64_simd_vec_issue_info)
28231 (aarch64_advsimd_vec_issue_info, aarch64_sve_vec_issue_info)
28232 (aarch64_vec_issue_info): New structures.
28233 (cpu_vector_cost): Write comments above the variables rather
28235 (cpu_vector_cost::issue_info): New member variable.
28236 * config/aarch64/aarch64.c: Include gimple-pretty-print.h
28237 and tree-ssa-loop-niter.h.
28238 (generic_vector_cost, a64fx_vector_cost, qdf24xx_vector_cost)
28239 (thunderx_vector_cost, tsv110_vector_cost, cortexa57_vector_cost)
28240 (exynosm1_vector_cost, xgene1_vector_cost, thunderx2t99_vector_cost)
28241 (thunderx3t110_vector_cost): Initialize issue_info to null.
28242 (neoversev1_scalar_issue_info, neoversev1_advsimd_issue_info)
28243 (neoversev1_sve_issue_info, neoversev1_vec_issue_info): New structures.
28244 (neoversev1_vector_cost): Use them.
28245 (aarch64_vec_op_count, aarch64_sve_op_count): New structures.
28246 (aarch64_vector_costs::saw_sve_only_op): New member variable.
28247 (aarch64_vector_costs::num_vector_iterations): Likewise.
28248 (aarch64_vector_costs::scalar_ops): Likewise.
28249 (aarch64_vector_costs::advsimd_ops): Likewise.
28250 (aarch64_vector_costs::sve_ops): Likewise.
28251 (aarch64_vector_costs::seen_loads): Likewise.
28252 (aarch64_simd_vec_costs_for_flags): New function.
28253 (aarch64_analyze_loop_vinfo): Initialize num_vector_iterations.
28254 Count the number of predicate operations required by SVE WHILE
28256 (aarch64_comparison_type, aarch64_multiply_add_p): New functions.
28257 (aarch64_sve_only_stmt_p, aarch64_in_loop_reduction_latency): Likewise.
28258 (aarch64_count_ops): Likewise.
28259 (aarch64_add_stmt_cost): Record whether see an SVE operation
28260 that cannot currently be implementing using Advanced SIMD.
28261 Record issue information about the scalar, Advanced SIMD
28262 and (where relevant) SVE versions of a loop.
28263 (aarch64_vec_op_count::dump): New function.
28264 (aarch64_sve_op_count::dump): Likewise.
28265 (aarch64_estimate_min_cycles_per_iter): Likewise.
28266 (aarch64_adjust_body_cost): If issue information is available,
28267 try to compare the issue rates of the various loop implementations
28268 and increase or decrease the vector body cost accordingly.
28270 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28272 * config/aarch64/aarch64.c (aarch64_detect_vector_stmt_subtype):
28273 Assume a zero cost for induction phis.
28275 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28277 * config/aarch64/aarch64.c (aarch64_embedded_comparison_type): New
28279 (aarch64_adjust_stmt_cost): Add the costs of embedded scalar and
28280 vector comparisons.
28282 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28284 * config/aarch64/aarch64.c (aarch64_detect_scalar_stmt_subtype):
28286 (aarch64_add_stmt_cost): Call it.
28288 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28290 * config/aarch64/aarch64-tuning-flags.def (matched_vector_throughput):
28291 New tuning parameter.
28292 * config/aarch64/aarch64.c (neoversev1_tunings): Use it.
28293 (aarch64_estimated_sve_vq): New function.
28294 (aarch64_vector_costs::analyzed_vinfo): New member variable.
28295 (aarch64_vector_costs::is_loop): Likewise.
28296 (aarch64_vector_costs::unrolled_advsimd_niters): Likewise.
28297 (aarch64_vector_costs::unrolled_advsimd_stmts): Likewise.
28298 (aarch64_record_potential_advsimd_unrolling): New function.
28299 (aarch64_analyze_loop_vinfo, aarch64_analyze_bb_vinfo): Likewise.
28300 (aarch64_add_stmt_cost): Call aarch64_analyze_loop_vinfo or
28301 aarch64_analyze_bb_vinfo on the first use of a costs structure.
28302 Detect whether we're vectorizing a loop for SVE that might be
28303 completely unrolled if it used Advanced SIMD instead.
28304 (aarch64_adjust_body_cost_for_latency): New function.
28305 (aarch64_finish_cost): Call it.
28307 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28309 * config/aarch64/aarch64.c (aarch64_vector_costs): New structure.
28310 (aarch64_init_cost): New function.
28311 (aarch64_add_stmt_cost): Use aarch64_vector_costs instead of
28312 the default unsigned[3].
28313 (aarch64_finish_cost, aarch64_destroy_cost_data): New functions.
28314 (TARGET_VECTORIZE_INIT_COST): Override.
28315 (TARGET_VECTORIZE_FINISH_COST): Likewise.
28316 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
28318 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28320 * config/aarch64/aarch64.c (neoversev1_advsimd_vector_cost)
28321 (neoversev1_sve_vector_cost): New cost structures.
28322 (neoversev1_vector_cost): Likewise.
28323 (neoversev1_tunings): Use them. Enable use_new_vector_costs.
28325 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28327 * config/aarch64/aarch64-protos.h
28328 (sve_vec_cost::scatter_store_elt_cost): New member variable.
28329 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
28330 accordingly, taking the cost from the cost of a scalar_store.
28331 (a64fx_sve_vector_cost): Likewise.
28332 (aarch64_detect_vector_stmt_subtype): Detect scatter stores.
28334 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28336 * config/aarch64/aarch64-protos.h
28337 (simd_vec_cost::store_elt_extra_cost): New member variable.
28338 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
28339 accordingly, using the vec_to_scalar cost for the new field.
28340 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
28341 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
28342 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
28343 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
28344 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
28345 (thunderx3t110_advsimd_vector_cost): Likewise.
28346 (aarch64_detect_vector_stmt_subtype): Detect single-element stores.
28348 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28350 * config/aarch64/aarch64-protos.h (simd_vec_cost::ld2_st2_permute_cost)
28351 (simd_vec_cost::ld3_st3_permute_cost): New member variables.
28352 (simd_vec_cost::ld4_st4_permute_cost): Likewise.
28353 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
28354 accordingly, using zero for the new costs.
28355 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
28356 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
28357 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
28358 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
28359 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
28360 (thunderx3t110_advsimd_vector_cost): Likewise.
28361 (aarch64_ld234_st234_vectors): New function.
28362 (aarch64_adjust_stmt_cost): Likewise.
28363 (aarch64_add_stmt_cost): Call aarch64_adjust_stmt_cost if using
28364 the new vector costs.
28366 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28368 * config/aarch64/aarch64-protos.h (sve_vec_cost): Turn into a
28369 derived class of simd_vec_cost. Add information about CLAST[AB]
28370 and FADDA instructions.
28371 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
28372 accordingly, using the vec_to_scalar costs for the new fields.
28373 (a64fx_sve_vector_cost): Likewise.
28374 (aarch64_reduc_type): New function.
28375 (aarch64_sve_in_loop_reduction_latency): Likewise.
28376 (aarch64_detect_vector_stmt_subtype): Take a vinfo parameter.
28377 Use aarch64_sve_in_loop_reduction_latency to handle SVE reductions
28378 that occur in the loop body.
28379 (aarch64_add_stmt_cost): Update call accordingly.
28381 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28383 * config/aarch64/aarch64-tuning-flags.def (use_new_vector_costs):
28385 * config/aarch64/aarch64-protos.h (simd_vec_cost): Put comments
28386 above the fields rather than to the right.
28387 (simd_vec_cost::reduc_i8_cost): New member variable.
28388 (simd_vec_cost::reduc_i16_cost): Likewise.
28389 (simd_vec_cost::reduc_i32_cost): Likewise.
28390 (simd_vec_cost::reduc_i64_cost): Likewise.
28391 (simd_vec_cost::reduc_f16_cost): Likewise.
28392 (simd_vec_cost::reduc_f32_cost): Likewise.
28393 (simd_vec_cost::reduc_f64_cost): Likewise.
28394 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
28395 accordingly, using the vec_to_scalar_cost for the new fields.
28396 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
28397 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
28398 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
28399 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
28400 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
28401 (thunderx3t110_advsimd_vector_cost): Likewise.
28402 (aarch64_use_new_vector_costs_p): New function.
28403 (aarch64_simd_vec_costs): New function, split out from...
28404 (aarch64_builtin_vectorization_cost): ...here.
28405 (aarch64_is_reduction): New function.
28406 (aarch64_detect_vector_stmt_subtype): Likewise.
28407 (aarch64_add_stmt_cost): Call aarch64_detect_vector_stmt_subtype if
28408 using the new vector costs.
28410 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28413 * tree-emutls.c (get_emutls_init_templ_addr): Mark initializer of weak
28414 TLS declarations as public.
28416 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28418 * config/aarch64/aarch64-d.c (IN_TARGET_CODE): Define.
28419 * config/arm/arm-d.c (IN_TARGET_CODE): Likewise.
28420 * config/i386/i386-d.c (IN_TARGET_CODE): Likewise.
28421 * config/mips/mips-d.c (IN_TARGET_CODE): Likewise.
28422 * config/pa/pa-d.c (IN_TARGET_CODE): Likewise.
28423 * config/riscv/riscv-d.c (IN_TARGET_CODE): Likewise.
28424 * config/rs6000/rs6000-d.c (IN_TARGET_CODE): Likewise.
28425 * config/s390/s390-d.c (IN_TARGET_CODE): Likewise.
28426 * config/sparc/sparc-d.c (IN_TARGET_CODE): Likewise.
28428 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28431 * config.gcc (*-*-cygwin*): Add winnt-d.o
28432 (*-*-mingw*): Likewise.
28433 * config/i386/cygwin.h (EXTRA_TARGET_D_OS_VERSIONS): New macro.
28434 * config/i386/mingw32.h (EXTRA_TARGET_D_OS_VERSIONS): Likewise.
28435 * config/i386/t-cygming: Add winnt-d.o.
28436 * config/i386/winnt-d.c: New file.
28438 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28440 * config/freebsd-d.c: Include memmodel.h.
28442 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28445 * config.gcc (*-*-openbsd*): Add openbsd-d.o.
28446 * config/t-openbsd: Add openbsd-d.o.
28447 * config/openbsd-d.c: New file.
28449 2021-03-25 Stam Markianos-Wright <stam.markianos-wright@arm.com>
28451 PR tree-optimization/96974
28452 * tree-vect-stmts.c (vect_get_vector_types_for_stmt): Replace assert
28453 with graceful exit.
28455 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
28458 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
28462 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
28463 always_inline in system headers.
28465 2021-03-25 Kewen Lin <linkw@linux.ibm.com>
28467 * tree-vect-loop.c (vect_model_reduction_cost): Init inside_cost.
28469 2021-03-25 Jakub Jelinek <jakub@redhat.com>
28472 * tree-core.h (enum operand_equal_flag): Add OEP_ADDRESS_OF_SAME_FIELD.
28473 * fold-const.c (operand_compare::operand_equal_p): Don't compare
28474 field offsets if OEP_ADDRESS_OF_SAME_FIELD.
28476 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
28480 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
28481 always_inline in system headers.
28483 2021-03-25 Richard Biener <rguenther@suse.de>
28485 PR tree-optimization/99746
28486 * tree-vect-slp-patterns.c (complex_pattern::build): Do not mark
28487 the scalar stmt as patterned. Instead set up required things
28490 2021-03-25 Xionghu Luo <luoxhu@linux.ibm.com>
28492 * config/rs6000/rs6000.c (power8_costs): Change l2 cache
28495 2021-03-24 Martin Liska <mliska@suse.cz>
28498 * common/config/i386/i386-common.c (ARRAY_SIZE): Fix off-by-one
28500 * config/i386/i386-options.c (ix86_option_override_internal):
28501 Add run-time assert.
28503 2021-03-24 Martin Jambor <mjambor@suse.cz>
28506 * ipa-cp.c (initialize_node_lattices): Mark as bottom all
28507 parameters with unknown type.
28508 (ipacp_value_safe_for_type): New function.
28509 (propagate_vals_across_arith_jfunc): Verify that the constant type
28510 can be used for a type of the formal parameter.
28511 (propagate_vals_across_ancestor): Likewise.
28512 (propagate_scalar_across_jump_function): Likewise. Pass the type
28513 also to propagate_vals_across_ancestor.
28515 2021-03-24 Christophe Lyon <christophe.lyon@linaro.org>
28518 * config/arm/mve.md (movmisalign<mode>_mve_store): Use Ux
28520 (movmisalign<mode>_mve_load): Likewise.
28522 2021-03-24 Jakub Jelinek <jakub@redhat.com>
28525 * config/arm/vec-common.md (one_cmpl<mode>2, neg<mode>2,
28526 movmisalign<mode>): Disable expanders for TARGET_REALLY_IWMMXT.
28528 2021-03-24 Alexandre Oliva <oliva@adacore.com>
28530 * doc/sourcebuild.texi (sysconf): New effective target.
28532 2021-03-24 Alexandre Oliva <oliva@adacore.com>
28534 * config/i386/predicates.md (reg_or_const_vec_operand): New.
28535 * config/i386/sse.md (ssse3_pshufbv8qi3): Add an expander for
28536 the now *-prefixed insn_and_split, turn the splitter const vec
28537 into an input for the insn, making it an ignored immediate for
28538 non-split cases, and loaded into the scratch register
28541 2021-03-23 Vladimir N. Makarov <vmakarov@redhat.com>
28544 * config/aarch64/constraints.md (Utq, UOb, UOh, UOw, UOd, UOty):
28545 Use define_relaxed_memory_constraint for them.
28547 2021-03-23 Iain Sandoe <iain@sandoe.co.uk>
28550 * config/host-darwin.c (darwin_gt_pch_use_address): Add a
28551 colon to the diagnostic message.
28553 2021-03-23 Ilya Leoshkevich <iii@linux.ibm.com>
28555 * fwprop.c (fwprop_propagation::fwprop_propagation): Look at
28557 (try_fwprop_subst_note): Use set_info instead of insn_info.
28558 (try_fwprop_subst_pattern): Likewise.
28559 (try_fwprop_subst_notes): Likewise.
28560 (try_fwprop_subst): Likewise.
28561 (forward_propagate_subreg): Likewise.
28562 (forward_propagate_and_simplify): Likewise.
28563 (forward_propagate_into): Likewise.
28564 * rtl-ssa/accesses.h (set_info::single_nondebug_use) New
28566 (set_info::single_nondebug_insn_use): Likewise.
28567 (set_info::single_phi_use): Likewise.
28568 * rtl-ssa/member-fns.inl (set_info::single_nondebug_use) New
28570 (set_info::single_nondebug_insn_use): Likewise.
28571 (set_info::single_phi_use): Likewise.
28573 2021-03-23 Christophe Lyon <christophe.lyon@linaro.org>
28575 * doc/sourcebuild.texi (arm_dsp_ok, arm_dsp): Document.
28577 2021-03-23 Jakub Jelinek <jakub@redhat.com>
28580 * config/aarch64/aarch64.c (aarch64_add_offset): Tell
28581 expand_mult to perform an unsigned rather than a signed
28584 2021-03-23 H.J. Lu <hjl.tools@gmail.com>
28587 * config/i386/cpuid.h (__cpuid): Add __volatile__.
28588 (__cpuid_count): Likewise.
28590 2021-03-23 Richard Biener <rguenther@suse.de>
28592 PR tree-optimization/99721
28593 * tree-vect-slp.c (vect_slp_analyze_node_operations):
28594 Make sure we can schedule the node.
28596 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28598 * config/riscv/riscv.c (riscv_subword): Take endianness into
28599 account when calculating the byte offset.
28601 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28603 * config/riscv/predicates.md (subreg_lowpart_operator): New predicate
28604 * config/riscv/riscv.md (*addsi3_extended2, *subsi3_extended2)
28605 (*negsi2_extended2, *mulsi3_extended2, *<optab>si3_mask)
28606 (*<optab>si3_mask_1, *<optab>di3_mask, *<optab>di3_mask_1)
28607 (*<optab>si3_extend_mask, *<optab>si3_extend_mask_1): Use
28608 new predicate "subreg_lowpart_operator"
28610 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28612 * config/riscv/riscv.c (riscv_swap_instruction): New function
28613 to byteswap an SImode rtx containing an instruction.
28614 (riscv_trampoline_init): Byteswap the generated instructions
28617 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28619 * common/config/riscv/riscv-common.c
28620 (TARGET_DEFAULT_TARGET_FLAGS): Set default endianness.
28621 * config.gcc (riscv32be-*, riscv64be-*): Set
28622 TARGET_BIG_ENDIAN_DEFAULT to 1.
28623 * config/riscv/elf.h (LINK_SPEC): Change -melf* value
28624 depending on default endianness.
28625 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
28626 * config/riscv/linux.h (LINK_SPEC): Likewise.
28627 * config/riscv/riscv.c (TARGET_DEFAULT_TARGET_FLAGS): Set
28628 default endianness.
28629 * config/riscv/riscv.h (DEFAULT_ENDIAN_SPEC): New macro.
28631 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28633 * config/riscv/elf.h (LINK_SPEC): Pass linker endianness flag.
28634 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
28635 * config/riscv/linux.h (LINK_SPEC): Likewise.
28636 * config/riscv/riscv.h (ASM_SPEC): Pass -mbig-endian and
28638 (BYTES_BIG_ENDIAN): Handle big endian.
28639 (WORDS_BIG_ENDIAN): Define to BYTES_BIG_ENDIAN.
28640 * config/riscv/riscv.opt (-mbig-endian, -mlittle-endian): New
28642 * doc/invoke.texi (-mbig-endian, -mlittle-endian): Document.
28644 2021-03-23 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
28646 * regcprop.c (find_oldest_value_reg): Ask target whether
28647 different mode is fine for replacement register.
28649 2021-03-23 Aldy Hernandez <aldyh@redhat.com>
28651 PR tree-optimization/99296
28652 * value-range.cc (irange::irange_set_1bit_anti_range): New.
28653 (irange::irange_set_anti_range): Call irange_set_1bit_anti_range
28654 * value-range.h (irange::irange_set_1bit_anti_range): New.
28656 2021-03-22 Vladimir N. Makarov <vmakarov@redhat.com>
28659 * config/aarch64/constraints.md (UtQ): Use
28660 define_relaxed_memory_constraint for it.
28661 * doc/md.texi (define_relaxed_memory_constraint): Describe it.
28662 * genoutput.c (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
28663 * genpreds.c (constraint_data): Add bitfield is_relaxed_memory.
28664 (have_relaxed_memory_constraints): New static var.
28665 (relaxed_memory_start, relaxed_memory_end): Ditto.
28666 (add_constraint): Add arg is_relaxed_memory. Check name for
28667 relaxed memory. Set up is_relaxed_memory in constraint_data and
28668 have_relaxed_memory_constraints. Adjust calls.
28669 (choose_enum_order): Process relaxed memory.
28670 (write_tm_preds_h): Ditto.
28671 (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
28672 * gensupport.c (process_rtx): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
28673 * ira-costs.c (record_reg_classes): Process CT_RELAXED_MEMORY.
28674 * ira-lives.c (single_reg_class): Use
28675 insn_extra_relaxed_memory_constraint.
28676 * ira.c (ira_setup_alts): CT_RELAXED_MEMORY.
28677 * lra-constraints.c (valid_address_p): Use
28678 insn_extra_relaxed_memory_constraint instead of other memory
28680 (process_alt_operands): Process CT_RELAXED_MEMORY.
28681 (curr_insn_transform): Use insn_extra_relaxed_memory_constraint.
28682 * recog.c (asm_operand_ok, preprocess_constraints): Process
28684 * reload.c (find_reloads): Ditto.
28685 * rtl.def (DEFINE_RELAXED_MEMORY_CONSTRAINT): New.
28686 * stmt.c (parse_input_constraint): Use
28687 insn_extra_relaxed_memory_constraint.
28689 2021-03-22 Segher Boessenkool <segher@kernel.crashing.org>
28692 * ubsan.c (ubsan_instrument_float_cast): Don't test for unordered if
28695 2021-03-22 Alex Coplan <alex.coplan@arm.com>
28698 * config/arm/arm-protos.h (neon_make_constant): Add generate
28699 argument to guard emitting insns, default to true.
28700 * config/arm/arm.c (arm_legitimate_constant_p_1): Reject
28701 CONST_VECTORs which neon_make_constant can't handle.
28702 (neon_vdup_constant): Add generate argument, avoid emitting
28703 insns if it's not set.
28704 (neon_make_constant): Plumb new generate argument through.
28705 * config/arm/constraints.md (Ui): New. Use it...
28706 * config/arm/mve.md (*mve_mov<mode>): ... here.
28707 * config/arm/vec-common.md (movv8hf): Use neon_make_constant to
28708 synthesize constants.
28710 2021-03-22 Richard Biener <rguenther@suse.de>
28712 * debug.h: Add deprecation warning.
28714 2021-03-22 Richard Biener <rguenther@suse.de>
28716 PR tree-optimization/99694
28717 * tree-ssa-sccvn.c (visit_phi): Ignore edges with the
28720 2021-03-22 Kito Cheng <kito.cheng@sifive.com>
28723 * config/riscv/riscv.c (riscv_expand_block_move): Get RTL value
28724 after type checking.
28726 2021-03-22 Jakub Jelinek <jakub@redhat.com>
28730 * dwarf2out.c (get_full_len): Use get_precision rather than
28732 (add_const_value_attribute): Make sure add_AT_wide argument has
28733 precision prec rather than some very wide one.
28735 2021-03-22 Kewen Lin <linkw@linux.ibm.com>
28737 * config/rs6000/rs6000.md (*rotldi3_insert_sf,
28738 *mov<SFDF:mode><SFDF2:mode>cc_p9, floatsi<mode>2_lfiwax,
28739 floatsi<mode>2_lfiwax_mem, floatunssi<mode>2_lfiwzx,
28740 floatunssi<mode>2_lfiwzx_mem, *floatsidf2_internal,
28741 *floatunssidf2_internal, fix_trunc<mode>si2_stfiwx,
28742 fix_trunc<mode>si2_internal, fixuns_trunc<mode>si2_stfiwx,
28743 *round32<mode>2_fprs, *roundu32<mode>2_fprs,
28744 *fix_trunc<mode>si2_internal): Fix empty split condition.
28745 * config/rs6000/vsx.md (*vsx_le_undo_permute_<mode>,
28746 vsx_reduc_<VEC_reduc_name>_v2df, vsx_reduc_<VEC_reduc_name>_v4sf,
28747 *vsx_reduc_<VEC_reduc_name>_v2df_scalar,
28748 *vsx_reduc_<VEC_reduc_name>_v4sf_scalar): Likewise.
28750 2021-03-22 Xionghu Luo <luoxhu@linux.ibm.com>
28753 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
28754 Convert idx to DImode.
28755 (rs6000_expand_vector_set_var_p8): Likewise.
28757 2021-03-21 Jakub Jelinek <jakub@redhat.com>
28760 * dwarf2out.c (insert_float): Change return type from void to
28761 unsigned, handle GET_MODE_SIZE (mode) == 2 and return element size.
28762 (mem_loc_descriptor, loc_descriptor, add_const_value_attribute):
28765 2021-03-20 H.J. Lu <hjl.tools@gmail.com>
28768 * config/i386/i386.c (construct_container): Check cfun != NULL
28769 before accessing silent_p.
28771 2021-03-20 Ahamed Husni <ahamedhusni73@gmail.com>
28773 * asan.c: Fix typos in comments.
28775 2021-03-20 Vladimir N. Makarov <vmakarov@redhat.com>
28777 PR rtl-optimization/99680
28778 * lra-constraints.c (skip_contraint_modifiers): Rename to skip_constraint_modifiers.
28779 (process_address_1): Check empty constraint before using
28782 2021-03-19 Pat Haugen <pthaugen@linux.ibm.com>
28784 * config/rs6000/rs6000.c (power10_cost): New.
28785 (rs6000_option_override_internal): Set Power10 costs.
28786 (rs6000_issue_rate): Set Power10 issue rate.
28787 * config/rs6000/power10.md: Rewrite for Power10.
28789 2021-03-19 Vladimir N. Makarov <vmakarov@redhat.com>
28792 * lra-constraints.c (process_address_1): Don't use unknown
28793 constraint for address constraint.
28795 2021-03-19 Iain Sandoe <iain@sandoe.co.uk>
28798 * config.gcc (powerpc-*-darwin8): Delete the reference to
28799 the now removed darwin8.h.
28801 2021-03-19 Olivier Hainque <hainque@adacore.com>
28804 * config/vxworksae.h (VX_CPU_PREFIX): Define.
28806 2021-03-19 John David Anglin <danglin@gcc.gnu.org>
28808 * config/pa/pa.c (import_milli): Use memcpy instead of strncpy.
28810 2021-03-19 Tamar Christina <tamar.christina@arm.com>
28812 PR tree-optimization/99656
28813 * tree-vect-slp-patterns.c (linear_loads_p,
28814 complex_add_pattern::matches, is_eq_or_top,
28815 vect_validate_multiplication, complex_mul_pattern::matches,
28816 complex_fms_pattern::matches): Remove complex_perm_kinds_t.
28817 * tree-vectorizer.h: (complex_load_perm_t): Removed.
28818 (slp_tree_to_load_perm_map_t): Use complex_perm_kinds_t instead of
28819 complex_load_perm_t.
28821 2021-03-19 H.J. Lu <hjl.tools@gmail.com>
28824 * config/i386/i386-options.c (ix86_init_machine_status): Set
28826 * config/i386/i386.c (init_cumulative_args): Set silent_p to
28828 (construct_container): Return early for return and argument
28829 errors if silent_p is true.
28830 * config/i386/i386.h (machine_function): Add silent_p.
28832 2021-03-19 Jakub Jelinek <jakub@redhat.com>
28835 * config/arm/constraints.md (Ds): New constraint.
28836 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): Use w,Ds
28837 constraint instead of w,Dm.
28839 2021-03-19 Andrew Stubbs <ams@codesourcery.com>
28841 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Fix quotes
28844 2021-03-19 Eric Botcazou <ebotcazou@adacore.com>
28846 PR middle-end/99641
28847 * fold-const.c (native_encode_initializer) <CONSTRUCTOR>: For an
28848 array type, do the computation of the current position in sizetype.
28850 2021-03-18 Vladimir N. Makarov <vmakarov@redhat.com>
28853 * lra-constraints.c (process_address_1): Use lookup_constraint
28854 only for a single constraint.
28856 2021-03-18 Martin Sebor <msebor@redhat.com>
28858 PR middle-end/99502
28859 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): Rename...
28860 (inbounds_memaccess_p): ...to this. Check the ending offset of
28861 the accessed member.
28863 2021-03-18 Andrew Stubbs <ams@codesourcery.com>
28865 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Add %< and
28866 %> quote markers to error messages.
28867 (gcn_goacc_validate_dims): Likewise.
28868 (gcn_conditional_register_usage): Remove exclaimation mark from error
28870 (gcn_vectorize_vec_perm_const): Ensure perm is fully uninitialized.
28872 2021-03-18 Jan Hubicka <hubicka@ucw.cz>
28874 * config/i386/x86-tune-costs.h (struct processor_costs): Fix costs of
28877 2021-03-18 Sinan Lin <sinan@isrc.iscas.ac.cn>
28878 Kito Cheng <kito.cheng@sifive.com>
28880 * config/riscv/riscv.c (riscv_block_move_straight): Change type
28881 to unsigned HOST_WIDE_INT for parameter and local variable with
28882 HOST_WIDE_INT type.
28883 (riscv_adjust_block_mem): Ditto.
28884 (riscv_block_move_loop): Ditto.
28885 (riscv_expand_block_move): Ditto.
28887 2021-03-18 Nick Clifton <nickc@redhat.com>
28889 * config/v850/v850.c (construct_restore_jr): Increase static
28891 (construct_save_jarl): Likewise.
28892 * config/v850/v850.h (DWARF2_DEBUGGING_INFO): Define.
28894 2021-03-18 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28896 * config/aarch64/aarch64.c (aarch64_adjust_generic_arch_tuning): Define.
28897 (aarch64_override_options_internal): Use it.
28898 (generic_tunings): Add AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS to
28901 2021-03-17 Sandra Loosemore <sandra@codesourcery.com>
28903 * config/nios2/nios2.c (nios2_custom_check_insns): Clean up
28904 error message format issues.
28905 (nios2_option_override): Likewise.
28906 (nios2_expand_fpu_builtin): Likewise.
28907 (nios2_init_custom_builtins): Adjust to avoid bogus strncpy
28908 truncation warning.
28909 (nios2_expand_custom_builtin): More error message format fixes.
28910 (nios2_expand_rdwrctl_builtin): Likewise.
28911 (nios2_expand_rdprs_builtin): Likewise.
28912 (nios2_expand_eni_builtin): Likewise.
28913 (nios2_expand_builtin): Likewise.
28914 (nios2_register_custom_code): Likewise.
28915 (nios2_valid_target_attribute_rec): Likewise.
28916 (nios2_add_insn_asm): Fix uninitialized variable warning.
28918 2021-03-17 Jan Hubicka <jh@suse.cz>
28920 * config/i386/x86-tune-costs.h (struct processor_costs): Update costs
28921 of gather to match reality.
28922 * config/i386/x86-tune.def (X86_TUNE_USE_GATHER): Enable for znver3.
28924 2021-03-17 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28926 * config/aarch64/aarch64-builtins.c (aarch64_expand_rng_builtin): Use EQ
28927 to compare against CC_REG rather than NE.
28929 2021-03-17 H.J. Lu <hjl.tools@gmail.com>
28932 * config/i386/i386.c (ix86_force_load_from_GOT_p): Support
28933 inline assembly statements.
28934 (ix86_print_operand): Update 'P' handling for -fno-plt.
28936 2021-03-17 Tamar Christina <tamar.christina@arm.com>
28939 * config/aarch64/aarch64.c
28940 (aarch64_simd_clone_compute_vecsize_and_simdlen): Remove unused var.
28942 2021-03-16 Segher Boessenkool <segher@kernel.crashing.org>
28945 * config/rs6000/predicates.md (branch_comparison_operator): Allow
28946 ordered and unordered for CCFPmode, if flag_finite_math_only.
28948 2021-03-16 Jakub Jelinek <jakub@redhat.com>
28951 * config/i386/i386-expand.c (ix86_split_lea_for_addr): Emit a MULT
28952 rather than ASHIFT.
28953 * config/i386/i386.md (mult by 1248 into ashift): New splitter.
28955 2021-03-16 Martin Liska <mliska@suse.cz>
28958 * optc-save-gen.awk: Add flag_ipa_ra to exceptions for
28959 cl_optimization_compare function.
28961 2021-03-16 Ilya Leoshkevich <iii@linux.ibm.com>
28963 * config/s390/s390.c (f_constraint_p): Treat "fv" constraints
28966 2021-03-16 Jakub Jelinek <jakub@redhat.com>
28969 * config/i386/i386.h (struct machine_function): Add
28970 has_explicit_vzeroupper bitfield.
28971 * config/i386/i386-expand.c (ix86_expand_builtin): Set
28972 cfun->machine->has_explicit_vzeroupper when expanding
28973 IX86_BUILTIN_VZEROUPPER.
28974 * config/i386/i386-features.c (rest_of_handle_insert_vzeroupper):
28975 Do the mode switching only when TARGET_VZEROUPPER, expensive
28976 optimizations turned on and not optimizing for size.
28977 (pass_insert_vzeroupper::gate): Enable even when
28978 cfun->machine->has_explicit_vzeroupper is set.
28980 2021-03-16 Jakub Jelinek <jakub@redhat.com>
28983 * config/aarch64/aarch64.c
28984 (aarch64_simd_clone_compute_vecsize_and_simdlen): If not a function
28985 definition, walk TYPE_ARG_TYPES list if non-NULL for argument types
28986 instead of DECL_ARGUMENTS. Ignore types for uniform arguments.
28988 2021-03-15 Richard Biener <rguenther@suse.de>
28990 PR tree-optimization/98834
28991 * tree-ssa-sccvn.c (vn_reference_lookup_3): Handle missing
28992 subsetting by truncating the access size.
28994 2021-03-15 Jan Hubicka <hubicka@ucw.cz>
28996 * config/i386/i386-options.c (processor_cost_table): Add znver3_cost.
28997 * config/i386/x86-tune-costs.h (znver3_cost): New gobal variable; copy
29000 2021-03-15 Martin Liska <mliska@suse.cz>
29002 * spellcheck.c: Add missing comma in initialization.
29004 2021-03-14 Uroš Bizjak <ubizjak@gmail.com>
29006 * config/i386/sse.md (*vec_extract<mode>): Merge alternative 0 with
29007 alternative 2 and alternative 1 with alternative 3 using
29008 YW register constraint.
29009 (*vec_extract<PEXTR_MODE12:mode>_zext): Merge alternatives
29010 using YW register constraint.
29011 (*vec_extractv16qi_zext): Ditto.
29012 (*vec_extractv4si): Merge alternatives 4 and 5
29013 using Yw register constraint.
29014 (*ssse3_palignr<mode>_perm): Use Yw instead of v for alternative 3.
29016 2021-03-13 Martin Sebor <msebor@redhat.com>
29018 PR tree-optimization/99489
29019 * builtins.c (gimple_call_alloc_size): Fail gracefully when argument
29020 is not a call statement.
29022 2021-03-13 Jakub Jelinek <jakub@redhat.com>
29024 PR tree-optimization/99544
29025 * match.pd (X + (X << C) -> X * (1 + (1 << C))): Don't simplify
29026 if for vector types multiplication can't be done in type's mode.
29028 2021-03-12 Eric Botcazou <ebotcazou@adacore.com>
29031 * config/sparc/constraints.md (w): Rename to...
29032 (W): ... this and ditch previous implementation.
29033 * config/sparc/sparc.md (*movdi_insn_sp64): Replace W with m.
29034 (*movdf_insn_sp64): Likewise.
29035 (*mov<VM64:mode>_insn_sp64): Likewise.
29036 * config/sparc/sync.md (*atomic_compare_and_swap<mode>_1): Replace
29038 (atomic_compare_and_swap_leon3_1): Likewise.
29039 (*atomic_compare_and_swapdi_v8plus): Likewise.
29040 * config/sparc/sparc.c (memory_ok_for_ldd): Remove useless test on
29041 architecture and add missing address validity check during LRA.
29043 2021-03-12 Tobias Burnus <tobias@codesourcery.com>
29046 * gimplify.c (omp_add_variable): Handle NULL_TREE as size
29047 occuring for assumed-size arrays in use_device_{ptr,addr}.
29049 2021-03-12 Jakub Jelinek <jakub@redhat.com>
29052 * config/i386/constraints.md (YW): New internal constraint.
29053 * config/i386/sse.md (v_Yw): Add V4TI, V2TI, V1TI and TI cases.
29054 (*<sse2_avx2>_<insn><mode>3<mask_name>,
29055 *<sse2_avx2>_uavg<mode>3<mask_name>, *abs<mode>2,
29056 *<s>mul<mode>3_highpart<mask_name>): Use <v_Yw> instead of v in
29058 (<sse2_avx2>_psadbw): Use YW instead of v in constraints.
29059 (*avx2_pmaddwd, *sse2_pmaddwd, *<code>v8hi3, *<code>v16qi3,
29060 avx2_pmaddubsw256, ssse3_pmaddubsw128): Merge last two alternatives
29061 into one, use Yw instead of former x,v.
29062 (ashr<mode>3, <insn><mode>3): Use <v_Yw> instead of x in constraints of
29063 the last alternative.
29064 (<sse2_avx2>_packsswb<mask_name>, <sse2_avx2>_packssdw<mask_name>,
29065 <sse2_avx2>_packuswb<mask_name>, <sse4_1_avx2>_packusdw<mask_name>,
29066 *<ssse3_avx2>_pmulhrsw<mode>3<mask_name>, <ssse3_avx2>_palignr<mode>,
29067 <ssse3_avx2>_pshufb<mode>3<mask_name>): Merge last two alternatives
29068 into one, use <v_Yw> instead of former x,v.
29069 (avx2_interleave_highv32qi<mask_name>,
29070 vec_interleave_highv16qi<mask_name>): Use Yw instead of v in
29071 constraints. Add && <mask_avx512bw_condition> to condition.
29072 (avx2_interleave_lowv32qi<mask_name>,
29073 vec_interleave_lowv16qi<mask_name>,
29074 avx2_interleave_highv16hi<mask_name>,
29075 vec_interleave_highv8hi<mask_name>,
29076 avx2_interleave_lowv16hi<mask_name>, vec_interleave_lowv8hi<mask_name>,
29077 avx2_pshuflw_1<mask_name>, sse2_pshuflw_1<mask_name>,
29078 avx2_pshufhw_1<mask_name>, sse2_pshufhw_1<mask_name>,
29079 avx2_<code>v16qiv16hi2<mask_name>, sse4_1_<code>v8qiv8hi2<mask_name>,
29080 *sse4_1_<code>v8qiv8hi2<mask_name>_1, <sse2_avx2>_<insn><mode>3): Use
29081 Yw instead of v in constraints.
29082 * config/i386/mmx.md (Yv_Yw): New define_mode_attr.
29083 (*mmx_<insn><mode>3, mmx_ashr<mode>3, mmx_<insn><mode>3): Use <Yv_Yw>
29084 instead of Yv in constraints.
29085 (*mmx_<insn><mode>3, *mmx_mulv4hi3, *mmx_smulv4hi3_highpart,
29086 *mmx_umulv4hi3_highpart, *mmx_pmaddwd, *mmx_<code>v4hi3,
29087 *mmx_<code>v8qi3, mmx_pack<s_trunsuffix>swb, mmx_packssdw,
29088 mmx_punpckhbw, mmx_punpcklbw, mmx_punpckhwd, mmx_punpcklwd,
29089 *mmx_uavgv8qi3, *mmx_uavgv4hi3, mmx_psadbw): Use Yw instead of Yv in
29091 (*mmx_pinsrw, *mmx_pinsrb, *mmx_pextrw, *mmx_pextrw_zext, *mmx_pextrb,
29092 *mmx_pextrb_zext): Use YW instead of Yv in constraints.
29093 (*mmx_eq<mode>3, mmx_gt<mode>3): Use x instead of Yv in constraints.
29094 (mmx_andnot<mode>3, *mmx_<code><mode>3): Split last alternative into
29095 two, one with just x, another isa avx512vl with v.
29097 2021-03-12 Martin Liska <mliska@suse.cz>
29099 * doc/invoke.texi: Add missing param documentation.
29101 2021-03-11 David Malcolm <dmalcolm@redhat.com>
29104 * Makefile.in (ANALYZER_OBJS): Add analyzer/feasible-graph.o and
29105 analyzer/trimmed-graph.o.
29106 * doc/analyzer.texi (Analyzer Paths): Rewrite description of
29107 feasibility checking to reflect new implementation.
29108 * doc/invoke.texi (-fdump-analyzer-feasibility): Document new
29110 * shortest-paths.h (shortest_paths::get_shortest_distance): New.
29112 2021-03-11 David Malcolm <dmalcolm@redhat.com>
29114 * digraph.cc (selftest::test_shortest_paths): Update
29115 shortest_paths init for new param. Add test of
29116 SPS_TO_GIVEN_TARGET.
29117 * shortest-paths.h (enum shortest_path_sense): New.
29118 (shortest_paths::shortest_paths): Add "sense" param.
29119 Update for renamings. Generalize to use "sense" param.
29120 (shortest_paths::get_shortest_path): Rename param.
29121 (shortest_paths::m_sense): New field.
29122 (shortest_paths::m_prev): Rename...
29123 (shortest_paths::m_best_edge): ...to this.
29124 (shortest_paths::get_shortest_path): Update for renamings.
29125 Conditionalize flipping of path on sense of traversal.
29127 2021-03-11 David Malcolm <dmalcolm@redhat.com>
29129 * digraph.cc (selftest::test_shortest_paths): Add test coverage
29130 for paths from B and C.
29131 * shortest-paths.h (shortest_paths::shortest_paths): Handle
29132 unreachable nodes, rather than asserting.
29134 2021-03-11 David Edelsohn <dje.gcc@gmail.com>
29137 * config/rs6000/rs6000.c (rs6000_xcoff_file_start): Don't create
29138 xcoff_tbss_section_name.
29139 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .lcomm.
29140 * xcoffout.c (xcoff_tbss_section_name): Delete.
29141 * xcoffout.h (xcoff_tbss_section_name): Delete.
29143 2021-03-11 Richard Biener <rguenther@suse.de>
29145 PR tree-optimization/99523
29146 * tree-cfg.c (dump_function_to_file): Dump SSA names
29147 w/o identifier to the decls section as well, not only those
29148 without a VAR_DECL.
29150 2021-03-11 Jakub Jelinek <jakub@redhat.com>
29153 * ipa-icf-gimple.c (func_checker::compare_gimple_call): For internal
29154 function calls with lhs fail if the lhs don't have compatible types.
29156 2021-03-11 Hans-Peter Nilsson <hp@axis.com>
29158 * config/cris/cris.h (HARD_FRAME_POINTER_REGNUM): Define.
29159 Change FRAME_POINTER_REGNUM to correspond to a new faked
29160 register faked_fp, part of GENNONACR_REGS like faked_ap.
29161 (CRIS_FAKED_REGS_CONTENTS): New helper macro.
29162 (FIRST_PSEUDO_REGISTER, FIXED_REGISTERS, CALL_USED_REGISTERS):
29163 (REG_ALLOC_ORDER, REG_CLASS_CONTENTS, REGNO_OK_FOR_BASE_P)
29164 (ELIMINABLE_REGS, REGISTER_NAMES): Adjust accordingly.
29165 * config/cris/cris.md (CRIS_FP_REGNUM): Renumber to new faked
29167 (CRIS_REAL_FP_REGNUM): New constant.
29168 * config/cris/cris.c (cris_reg_saved_in_regsave_area): Check
29169 for HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM.
29170 (cris_initial_elimination_offset): Handle elimination changes
29171 to HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM
29172 and add one from FRAME_POINTER_REGNUM to
29173 HARD_FRAME_POINTER_REGNUM.
29174 (cris_expand_prologue, cris_expand_epilogue): Emit code for
29175 hard_frame_pointer_rtx instead of frame_pointer_rtx.
29177 2021-03-10 David Edelsohn <dje.gcc@gmail.com>
29180 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Add check for DCmode.
29181 * config/rs6000/rs6000.c (rs6000_special_round_type_align): Same.
29183 2021-03-10 Vladimir N. Makarov <vmakarov@redhat.com>
29186 * lra-constraints.c (process_address_1): Don't check unknown
29187 constraint, use X for empty constraint.
29189 2021-03-10 Alex Coplan <alex.coplan@arm.com>
29191 * config/aarch64/aarch64.c (aarch64_vfp_is_call_or_return_candidate):
29192 Fix typo in comment describing "is_ha" argument.
29194 2021-03-10 John David Anglin <danglin@gcc.gnu.org>
29196 * doc/sourcebuild.texi: Document LRA target selector.
29198 2021-03-10 David Malcolm <dmalcolm@redhat.com>
29200 * doc/ux.texi: Add subsection contrasting interactive versus
29201 batch usage of GCC.
29203 2021-03-10 Joel Hutton <joel.hutton@arm.com>
29206 * tree-vect-stmts.c (vectorizable_store): Fix scatter store mask
29208 (vectorizable_load): Fix gather load mask check condition.
29210 2021-03-10 Richard Biener <rguenther@suse.de>
29212 PR tree-optimization/99510
29213 * tree.c (check_aligned_type): Check that the candidate
29214 has TYPE_USER_ALIGN set instead of matching with the
29217 2021-03-10 Eric Botcazou <ebotcazou@adacore.com>
29219 * config/sparc/sparc.c (sparc_regmode_natural_size): Return 4 for
29220 float and vector integer modes only if the mode is not larger.
29222 2021-03-10 Hans-Peter Nilsson <hp@axis.com>
29224 * config/cris/cris.h (DWARF_FRAME_REGISTERS): Define.
29226 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
29228 * ira.c (ira_setup_alts, ira_get_dup_out_num): Process digital
29230 * ira-lives.c (single_reg_class): Ditto.
29232 2021-03-09 Sebastian Huber <sebastian.huber@embedded-brains.de>
29234 * config.gcc (aarch64-*-rtems*): Include general rtems.h after
29235 the architecture-specific rtems.h.
29236 (aarch64-*-rtems*): Likewise.
29237 (arm*-*-rtems*): Likewise.
29238 (epiphany-*-rtems*): Likewise.
29239 (riscv*-*-rtems*): Likewise.
29241 2021-03-09 Jakub Jelinek <jakub@redhat.com>
29243 PR tree-optimization/99305
29244 * tree-ssa-phiopt.c (conditional_replacement): Test integer_pow2p
29245 before integer_all_onesp instead of vice versa.
29247 2021-03-09 Richard Earnshaw <rearnsha@arm.com>
29249 * common/config/arm/arm-common.c (arm_config_default): Change type
29250 of 'i' to unsigned.
29252 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
29255 * lra-constraints.c (process_address_1): Process constraint 'g'
29256 separately and digital constraints containing more one digit.
29258 2021-03-09 Nick Clifton <nickc@redhat.com>
29260 * config/rx/rx.h (DBX_DEBUGGING_INFO): Define.
29261 (DWARF"_DEBUGGING_INFO): Define.
29263 2021-03-09 Eric Botcazou <ebotcazou@adacore.com>
29266 * calls.c (initialize_argument_information): When the argument
29267 is passed by reference, do not make a copy in a thunk only if
29268 the argument is already in memory. Remove redundant test for
29269 the case of callee copy.
29271 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
29274 * lra-constraints.c (process_address_1): Process 0..9 constraints
29275 in process_address_1.
29277 2021-03-09 Andreas Krebbel <krebbel@linux.ibm.com>
29279 * config/s390/s390.c (struct s390_processor processor_table):
29280 Binutils name string must not be empty.
29282 2021-03-09 Claudiu Zissulescu <claziss@synopsys.com>
29284 * config/arc/arc.c (arc_attr_type): Remove function.
29286 2021-03-09 Martin Liska <mliska@suse.cz>
29289 * config/i386/i386-options.c (ix86_option_override_internal):
29290 Set isa_flags for OPTS argument and not for the global
29293 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
29295 * config/rs6000/predicates.md (ds_form_mem_operand): Check
29298 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
29301 * config/rs6000/predicates.md (ds_form_mem_operand) New
29303 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10) Use
29304 ds_form_mem_operand in ld/lwa patterns.
29305 * config/rs6000/fusion.md: Regenerate file.
29307 2021-03-08 Martin Sebor <msebor@redhat.com>
29309 PR middle-end/98266
29310 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): New function.
29311 (array_bounds_checker::check_array_bounds): Call it.
29313 2021-03-08 Martin Sebor <msebor@redhat.com>
29315 PR middle-end/97631
29316 * tree-ssa-strlen.c (maybe_warn_overflow): Test rawmem.
29317 (handle_builtin_stxncpy_strncat): Rename locals. Determine
29318 destination size from allocation calls. Issue a more appropriate
29320 (handle_builtin_memcpy): Pass true as rawmem to maybe_warn_overflow.
29321 (handle_builtin_memset): Same.
29323 2021-03-08 Peter Bergner <bergner@linux.ibm.com>
29326 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_permute): Add an assert
29327 to ensure we do not have an Altivec style address.
29328 * config/rs6000/vsx.md (*vsx_le_perm_load_<mode>): Disable if passed
29329 an Altivec style address.
29330 (*vsx_le_perm_store_<mode>): Likewise.
29331 (splitters after *vsx_le_perm_store_<mode>): Likewise.
29332 (vsx_load_<mode>): Disable special expander if passed an Altivec
29334 (vsx_store_<mode>): Likewise.
29336 2021-03-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29339 * config/aarch64/predicates.md (aarch64_simd_shift_imm_vec_qi): Define.
29340 (aarch64_simd_shift_imm_vec_hi): Likewise.
29341 (aarch64_simd_shift_imm_vec_si): Likewise.
29342 (aarch64_simd_shift_imm_vec_di): Likewise.
29343 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le): Use
29344 predicate from above.
29345 (aarch64_shrn<mode>_insn_be): Likewise.
29346 (aarch64_rshrn<mode>_insn_le): Likewise.
29347 (aarch64_rshrn<mode>_insn_be): Likewise.
29348 (aarch64_shrn2<mode>_insn_le): Likewise.
29349 (aarch64_shrn2<mode>_insn_be): Likewise.
29350 (aarch64_rshrn2<mode>_insn_le): Likewise.
29351 (aarch64_rshrn2<mode>_insn_be): Likewise.
29353 2021-03-08 Vladimir N. Makarov <vmakarov@redhat.com>
29356 * lra-constraints.c (skip_contraint_modifiers): New function.
29357 (process_address_1): Use it before lookup_constraint call.
29359 2021-03-08 Martin Liska <mliska@suse.cz>
29362 * config/i386/i386-options.c (ix86_option_override_internal):
29363 Enable UINTR and HRESET for -march that supports it.
29365 2021-03-08 Ilya Leoshkevich <iii@linux.ibm.com>
29367 * config/s390/s390.c (f_constraint_p): New function.
29368 (s390_md_asm_adjust): Implement TARGET_MD_ASM_ADJUST.
29369 (TARGET_MD_ASM_ADJUST): Likewise.
29371 2021-03-08 Tobias Burnus <tobias@codesourcery.com>
29374 * tree-nested.c (convert_local_reference_stmt): Avoid calling
29375 lookup_field_for_decl for Fortran module (= namespace context).
29377 2021-03-08 Andreas Krebbel <krebbel@linux.ibm.com>
29379 * config/s390/s390.c (s390_expand_vec_compare): Implement <0
29380 comparison with arithmetic right shift.
29381 (s390_expand_vcond): No need for a force_reg anymore.
29382 s390_vec_compare will do it.
29383 * config/s390/vector.md ("vec_cmp<mode><tointvec>"): Accept also
29384 immediate operands.
29386 2021-03-07 Jakub Jelinek <jakub@redhat.com>
29389 * config/i386/constraints.md (Yw): Use SSE_REGS if TARGET_SSE
29390 but TARGET_AVX512BW or TARGET_AVX512VL is not set. Adjust description
29392 * config/i386/sse.md (v_Yw): New define_mode_attr.
29393 (*<insn><mode>3, *mul<mode>3<mask_name>, *avx2_<code><mode>3,
29394 *sse4_1_<code><mode>3<mask_name>): Use <v_Yw> instead of v
29396 * config/i386/mmx.md (mmx_pshufw_1, *vec_dupv4hi): Use Yw instead of
29397 xYw in constraints.
29399 2021-03-06 Julian Brown <julian@codesourcery.com>
29401 * tree-pretty-print.c (dump_generic_node): Emit non-generic
29402 address space info for aggregates.
29404 2021-03-06 Hans-Peter Nilsson <hp@axis.com>
29406 * config/cris/cris.h (MAX_FIXED_MODE_SIZE): Don't define.
29408 2021-03-05 Jakub Jelinek <jakub@redhat.com>
29410 PR middle-end/99322
29411 * tree-cfg.c (bb_to_omp_idx): New variable.
29412 (execute_build_cfg): Release the bb_to_omp_idx vector after
29413 cleanup_tree_cfg returns.
29414 (handle_abnormal_edges): Remove bb_to_omp_idx argument, adjust
29415 for bb_to_omp_idx being a vec<int> instead of pointer to array
29417 (make_edges): Remove bb_to_omp_idx local variable, don't pass
29418 it to handle_abnormal_edges, adjust for bb_to_omp_idx being a
29419 vec<int> instead of pointer to array of ints and don't free/release
29421 (remove_bb): When removing a bb and placing forced label somewhere
29422 else, ensure it is put into the same OpenMP region during cfg
29423 pass if possible or to entry successor as fallback. Unregister
29424 bb from bb_to_omp_idx.
29426 2021-03-05 Vladimir N. Makarov <vmakarov@redhat.com>
29429 * lra-constraints.c (process_address_1): Skip decomposing address
29430 for asm insn operand with unknown constraint.
29432 2021-03-05 Martin Jambor <mjambor@suse.cz>
29435 * cgraph.c (cgraph_edge::set_call_stmt): Do not update all
29436 corresponding speculative edges if we are about to resolve
29437 sepculation. Make edge direct (and so resolve speculations) before
29438 removing it from call_site_hash.
29439 (cgraph_edge::make_direct): Relax the initial assert to allow calling
29440 the function on speculative direct edges.
29442 2021-03-05 Eric Botcazou <ebotcazou@adacore.com>
29444 PR rtl-optimization/99376
29445 * rtlanal.c (nonzero_bits1) <arithmetic operators>: If the number
29446 of low-order zero bits is too large, set the result to 0 directly.
29448 2021-03-04 Jakub Jelinek <jakub@redhat.com>
29450 PR middle-end/93235
29451 * expmed.c (store_bit_field_using_insv): Return false of xop0 is a
29452 SUBREG and a SUBREG to op_mode can't be created.
29454 2021-03-04 Alex Coplan <alex.coplan@arm.com>
29457 * config/aarch64/aarch64-sve-builtins.cc
29458 (function_resolver::require_vector_type): Handle error_mark_node.
29460 2021-03-04 Ilya Leoshkevich <iii@linux.ibm.com>
29462 * cfgexpand.c (expand_asm_loc): Pass new parameter.
29463 (expand_asm_stmt): Likewise.
29464 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Add new
29466 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
29467 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
29468 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
29469 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
29470 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
29471 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
29472 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
29473 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
29474 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
29475 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
29476 * doc/tm.texi (md_asm_adjust): Likewise.
29477 * target.def (md_asm_adjust): Likewise.
29479 2021-03-04 Richard Biener <rguenther@suse.de>
29481 PR middle-end/97855
29482 * tree-pretty-print.c: Poison pp_printf.
29483 (dump_decl_name): Avoid use of pp_printf.
29484 (dump_block_node): Likewise.
29485 (dump_generic_node): Likewise.
29487 2021-03-04 Martin Sebor <msebor@redhat.com>
29489 PR middle-end/96963
29490 PR middle-end/94655
29491 * builtins.c (handle_array_ref): New helper.
29492 (handle_mem_ref): New helper.
29493 (compute_objsize_r): Factor out ARRAY_REF and MEM_REF handling
29494 into new helper functions. Correct a workaround for vectorized
29497 2021-03-03 Pat Haugen <pthaugen@linux.ibm.com>
29499 * config/rs6000/dfp.md (extendddtd2, trunctddd2, *cmp<mode>_internal1,
29500 floatditd2, ftrunc<mode>2, fix<mode>di2, dfp_ddedpd_<mode>,
29501 dfp_denbcd_<mode>, dfp_dxex_<mode>, dfp_diex_<mode>,
29502 *dfp_sgnfcnc_<mode>, dfp_dscli_<mode>, dfp_dscri_<mode>): Update size
29503 attribute for Power10.
29504 * config/rs6000/mma.md (*movoo): Likewise.
29505 * config/rs6000/rs6000.md (define_attr "size"): Add 256.
29506 (define_mode_attr bits): Add DD/TD modes.
29507 * config/rs6000/sync.md (load_quadpti, store_quadpti, load_lockedpti,
29508 store_conditionalpti): Update size attribute for Power10.
29510 2021-03-03 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
29513 * config/sparc/t-sparc (tree-ssanames.o-warn): Don't error for
29514 -Wuninitialized, -Wmaybe-uninitialized.
29515 (wide-int.o-warn): Likewise.
29517 2021-03-03 Richard Earnshaw <rearnsha@arm.com>
29519 * common/config/arm/arm-common.c: Include configargs.h.
29520 (arm_config_default): New function.
29521 (arm_target_mode): Renamed from arm_target_thumb_only. Handle
29522 processors that do not support Thumb. Take into account the
29523 --with-mode configuration setting for selecting the default.
29524 * config/arm/arm.h (OPTION_DEFAULT_SPECS): Remove entry for 'mode'.
29525 (TARGET_MODE_SPEC_FUNCTIONS): Update for function name change.
29527 2021-03-03 Martin Liska <mliska@suse.cz>
29529 PR gcov-profile/97461
29530 * gcov-io.h (GCOV_PREALLOCATED_KVP): Remove.
29532 2021-03-03 Eric Botcazou <ebotcazou@adacore.com>
29535 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
29536 point back the hard frame pointer to its default location when the
29537 frame is larger than SEH_MAX_FRAME_SIZE.
29539 2021-03-03 Jakub Jelinek <jakub@redhat.com>
29542 * config/i386/predicates.md (logic_operator): New define_predicate.
29543 * config/i386/i386.md (mov + mem using comm arith peephole2):
29544 Punt if operands[1] is EXT_REX_SSE_REGNO_P, AVX512BW is not enabled
29545 and the inner mode is [QH]Imode.
29547 2021-03-03 Jakub Jelinek <jakub@redhat.com>
29550 * dwarf2out.c (dw_loc_list_struct): Add end_entry member.
29551 (new_loc_list): Clear end_entry.
29552 (output_loc_list): Only use DW_LLE_startx_length for -gsplit-dwarf
29553 if HAVE_AS_LEB128, otherwise use DW_LLE_startx_endx. Fix comment
29555 (index_location_lists): For dwarf_version >= 5 without HAVE_AS_LEB128,
29556 initialize also end_entry.
29558 2021-03-03 Jakub Jelinek <jakub@redhat.com>
29561 * cfgrtl.c (fixup_partitions): When changing some bbs from hot to cold
29562 partitions, if in non-layout mode after reorder_blocks also move
29563 affected blocks to ensure a single partition transition.
29565 2021-03-03 Jason Merrill <jason@redhat.com>
29568 * cgraphunit.c (process_function_and_variable_attributes): Don't
29569 warn about flatten on an alias if the target also has it.
29570 * cgraph.h (symtab_node::get_alias_target_tree): New.
29572 2021-03-02 David Edelsohn <dje.gcc@gmail.com>
29574 * config/rs6000/rs6000.md (tls_get_tpointer_internal): Prepend
29575 period to symbol name.
29576 (tls_get_addr_internal<mode>): Same.
29578 2021-03-02 David Malcolm <dmalcolm@redhat.com>
29581 * diagnostic-show-locus.c
29582 (selftest::test_one_liner_many_fixits_2): Fix accidental usage of
29585 2021-03-02 Martin Sebor <msebor@redhat.com>
29587 PR middle-end/99276
29588 * builtins.c (warn_for_access): Remove stray warning text.
29590 2021-03-02 Martin Sebor <msebor@redhat.com>
29592 PR middle-end/99295
29593 * doc/extend.texi (attribute malloc): Reword and clarify nonaliasing
29596 2021-03-02 Jakub Jelinek <jakub@redhat.com>
29599 * dwarf2out.c (output_macinfo_op): Use DW_MACRO_*_str* even with
29600 -gdwarf-5 -gstrict-dwarf. For -gsplit-dwarf -gdwarf-5 use
29601 DW_MACRO_*_strx instead of DW_MACRO_*_strp. Handle
29602 DW_MACRO_define_strx and DW_MACRO_undef_strx.
29603 (save_macinfo_strings): Use DW_MACRO_*_str* even with
29604 -gdwarf-5 -gstrict-dwarf. Handle DW_MACRO_define_strx and
29605 DW_MACRO_undef_strx.
29607 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
29609 * config/s390/s390-builtin-types.def (BT_FN_V4SF_V8HI_UINT): New
29611 (BT_FN_V8HI_V8HI_UINT): Likewise.
29612 (BT_FN_V8HI_V4SF_V4SF_UINT): Likewise.
29613 * config/s390/s390-builtins.def (B_NNPA): New macro definition.
29614 (s390_vclfnhs, s390_vclfnls, s390_vcrnfs, s390_vcfn, s390_vcnf):
29615 New builtin definitions.
29616 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Bump
29617 vector extension version.
29618 * config/s390/s390.c (s390_expand_builtin): Check if builtins are
29619 available with current -march level.
29620 * config/s390/s390.md (UNSPEC_NNPA_VCLFNHS_V8HI)
29621 (UNSPEC_NNPA_VCLFNLS_V8HI, UNSPEC_NNPA_VCRNFS_V8HI)
29622 (UNSPEC_NNPA_VCFN_V8HI, UNSPEC_NNPA_VCNF_V8HI): New constants.
29623 * config/s390/vecintrin.h (vec_extend_to_fp32_hi): New macro.
29624 (vec_extend_to_fp32_lo): Likewise.
29625 (vec_round_from_fp32): Likewise.
29626 (vec_convert_to_fp16): Likewise.
29627 (vec_convert_from_fp16): Likewise.
29628 * config/s390/vx-builtins.md (vclfnhs_v8hi): New insn pattern.
29629 (vclfnls_v8hi): Likewise.
29630 (vcrnfs_v8hi): Likewise.
29631 (vcfn_v8hi): Likewise.
29632 (vcnf_v8hi): Likewise.
29634 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
29636 * common/config/s390/s390-common.c (processor_flags_table): New entry.
29637 * config.gcc: Enable arch14 for --with-arch and --with-tune.
29638 * config/s390/driver-native.c (s390_host_detect_local_cpu): Pick
29639 arch14 for unknown CPU models.
29640 * config/s390/s390-opts.h (enum processor_type): Add PROCESSOR_ARCH14.
29641 * config/s390/s390.c (s390_issue_rate): Add case for PROCESSOR_ARCH14.
29642 (s390_get_sched_attrmask): Likewise.
29643 (s390_get_unit_mask): Likewise.
29644 * config/s390/s390.h (enum processor_flags): Add PF_NNPA and PF_ARCH14.
29645 (TARGET_CPU_ARCH14, TARGET_CPU_ARCH14_P, TARGET_CPU_NNPA)
29646 (TARGET_CPU_NNPA_P, TARGET_ARCH14, TARGET_ARCH14_P, TARGET_NNPA)
29647 (TARGET_NNPA_P): New macro definitions.
29648 * config/s390/s390.md ("cpu_facility", "enabled"): Add arch14 and nnpa.
29649 * config/s390/s390.opt: Add PROCESSOR_ARCH14.
29651 2021-03-02 Jakub Jelinek <jakub@redhat.com>
29653 PR middle-end/95757
29654 * tree-vrp.c (register_edge_assert_for): Remove superfluous ()s around
29655 condition. Call register_edge_assert_for_1 for == 0, != 0, == 1 and
29656 != 1 comparisons if name is lhs of a comparison.
29658 2021-03-01 Iain Sandoe <iain@sandoe.co.uk>
29662 * config/darwin-protos.h (darwin_should_restore_cfa_state): New.
29663 * config/darwin.c (darwin_should_restore_cfa_state): New.
29664 * config/darwin.h (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New.
29665 * doc/tm.texi: Regenerated.
29666 * doc/tm.texi.in: Document TARGET_ASM_SHOULD_RESTORE_CFA_STATE.
29667 * dwarf2cfi.c (connect_traces): If the target requests, restore
29668 the CFA expression after a DW_CFA_restore.
29669 * target.def (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New hook.
29671 2021-03-01 Martin Liska <mliska@suse.cz>
29674 * optc-save-gen.awk: Add 4 more exceptions.
29676 2021-03-01 Nathan Sidwell <nathan@acm.org>
29679 * tree.h (TYPE_ALIGN_RAW): New accessor.
29680 (TYPE_ALIGN): Use it.
29682 2021-03-01 Jan Hubicka <jh@suse.cz>
29685 * ipa-fnsummary.c (compute_fn_summary): Fix sanity check.
29687 2021-03-01 Eric Botcazou <ebotcazou@adacore.com>
29690 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
29691 point the hard frame pointer to the SSE register save area instead
29692 of the general register save area. Perform only minimal adjustment
29693 for small frames if it is initially not correctly aligned.
29694 (ix86_expand_prologue): Remove early saves for a SEH target.
29695 * config/i386/winnt.c (struct seh_frame_state): Document constraint.
29697 2021-02-28 Jakub Jelinek <jakub@redhat.com>
29700 * ipa.c (symbol_table::remove_unreachable_nodes): Fix a comment
29701 typo - referneced -> referenced.
29702 * tree.c (component_ref_size): Fix comment typo -
29703 refernce -> reference.
29704 * tree-ssa-alias.c (access_path_may_continue_p): Fix comment typo -
29705 traling -> trailing.
29706 (aliasing_component_refs_p): Fix comment typos -
29707 refernce -> reference and refernece -> reference and
29708 traling -> trailing.
29709 (nonoverlapping_refs_since_match_p): Fix comment typo -
29710 referneces -> references.
29711 * doc/invoke.texi (--param modref-max-bases): Fix a typo -
29712 referneces -> references.
29714 2021-02-27 Iain Sandoe <iain@sandoe.co.uk>
29716 * config/host-darwin.c (darwin_gt_pch_use_address): Modify
29717 diagnostic message to avoid use of a contraction and format
29720 2021-02-27 Jakub Jelinek <jakub@redhat.com>
29723 * gcse.c (gcse_or_cprop_is_too_expensive): Use %wu instead of
29724 HOST_WIDE_INT_PRINT_UNSIGNED in warning format string.
29725 * ipa-devirt.c (ipa_odr_read_section): Use %wd instead of
29726 HOST_WIDE_INT_PRINT_DEC in inform format string. Fix comment
29729 2021-02-26 Richard Biener <rguenther@suse.de>
29731 PR middle-end/99281
29732 * expr.c (store_field): For calls with return-slot optimization
29733 and addressable return type expand the store directly.
29735 2021-02-26 Richard Biener <rguenther@suse.de>
29738 * builtins.c (warn_string_no_nul): Fix diagnostic formatting.
29740 2021-02-26 Peter Bergner <bergner@linux.ibm.com>
29743 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Replace assert
29746 2021-02-26 Aaron Sawdey <acsawdey@linux.ibm.com>
29748 * config.gcc: Add rs6000-pcrel-opt.o.
29749 * config/rs6000/rs6000-pcrel-opt.c: New file.
29750 * config/rs6000/pcrel-opt.md: New file.
29751 * config/rs6000/predicates.md: Add d_form_memory predicate.
29752 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_PCREL_OPT.
29753 * config/rs6000/rs6000-passes.def: Add pass_pcrel_opt.
29754 * config/rs6000/rs6000-protos.h: Add reg_to_non_prefixed(),
29755 pcrel_opt_valid_mem_p(), output_pcrel_opt_reloc(),
29756 and make_pass_pcrel_opt().
29757 * config/rs6000/rs6000.c (reg_to_non_prefixed): Make global.
29758 (rs6000_option_override_internal): Add pcrel-opt.
29759 (rs6000_delegitimize_address): Support pcrel-opt.
29760 (rs6000_opt_masks): Add pcrel-opt.
29761 (pcrel_opt_valid_mem_p): New function.
29762 (reg_to_non_prefixed): Make global.
29763 (rs6000_asm_output_opcode): Reset prepend_p_to_next_insn.
29764 (output_pcrel_opt_reloc): New function.
29765 * config/rs6000/rs6000.md (loads_extern_addr): New attr.
29766 (pcrel_extern_addr): Set loads_extern_addr.
29767 Add include for pcrel-opt.md.
29768 * config/rs6000/rs6000.opt: Add -mpcrel-opt.
29769 * config/rs6000/t-rs6000: Add rules for pcrel-opt.c and
29772 2021-02-26 YunQiang Su <yunqiang.su@cipunited.com>
29775 * config/mips/mips.c (mips_expand_ext_as_unaligned_load):
29776 If TARGET_64BIT and dest is SUBREG, we check the width, if it
29777 equal to SImode, we use SImode operation, just like what we are
29780 2021-02-26 Marek Polacek <polacek@redhat.com>
29782 * builtins.c (warn_for_access): Fix typos.
29784 2021-02-25 Iain Sandoe <iain@sandoe.co.uk>
29786 * config/aarch64/aarch64.md (<optab>_rol<mode>3): Add a '#'
29787 mark in front of the immediate quantity.
29788 (<optab>_rolsi3_uxtw): Likewise.
29790 2021-02-25 Richard Earnshaw <rearnsha@arm.com>
29793 * config/arm/thumb2.md (nonsecure_call_reg_thumb2_fpcxt): New pattern.
29794 (nonsecure_call_value_reg_thumb2_fpcxt): Likewise.
29795 (nonsecure_call_reg_thumb2): Restrict to using r4 for the callee
29796 address and disable when the FPCXT is not available.
29797 (nonsecure_call_value_reg_thumb2): Likewise.
29799 2021-02-25 Nathan Sidwell <nathan@acm.org>
29802 * doc/invoke.texi (flang-info-module-cmi): Renamed option.
29804 2021-02-25 Tamar Christina <tamar.christina@arm.com>
29806 * tree-vect-slp.c (optimize_load_redistribution_1): Abort on NULL nodes.
29808 2021-02-25 Richard Biener <rguenther@suse.de>
29810 PR tree-optimization/99253
29811 * tree-vect-loop.c (check_reduction_path): First compute
29812 code, then verify out-of-loop uses.
29814 2021-02-25 Jakub Jelinek <jakub@redhat.com>
29817 * match.pd ((T)(A) + CST -> (T)(A + CST)): Add :s to convert.
29819 2021-02-25 Jakub Jelinek <jakub@redhat.com>
29821 PR tree-optimization/80635
29822 * tree-vrp.c (vrp_simplify_cond_using_ranges): Also handle
29823 VIEW_CONVERT_EXPR if modes are the same, innerop is integral and
29824 has mode precision.
29826 2021-02-25 Richard Biener <rguenther@suse.de>
29828 * tree-vect-slp.c (optimize_load_redistribution_1): Delay
29829 load_map population.
29830 (vect_match_slp_patterns_2): Revert part of last change.
29831 (vect_analyze_slp): Do not interleave optimize_load_redistribution
29832 with pattern detection but do it afterwards. Dump the
29833 whole SLP graph after pattern recognition and load
29834 redistribution optimization finished.
29836 2021-02-24 Jakub Jelinek <jakub@redhat.com>
29839 * omp-low.c (struct omp_context): Add teams_nested_p and
29840 nonteams_nested_p members.
29841 (scan_omp_target): Diagnose teams nested inside of target with other
29842 directives strictly nested inside of the same target.
29843 (check_omp_nesting_restrictions): Set ctx->teams_nested_p or
29844 ctx->nonteams_nested_p as needed.
29846 2021-02-24 Vladimir N. Makarov <vmakarov@redhat.com>
29848 PR inline-asm/99123
29849 * lra-constraints.c (uses_hard_regs_p): Don't use decompose_mem_address.
29851 2021-02-24 Hans-Peter Nilsson <hp@axis.com>
29853 * config/cris/cris.c (cris_expand_prologue): Set
29854 current_function_static_stack_size, if flag_stack_usage_info.
29856 2021-02-24 Pat Haugen <pthaugen@linux.ibm.com>
29858 * config/rs6000/rs6000.c (next_insn_prefixed_p): Rename.
29859 (rs6000_final_prescan_insn): Adjust.
29860 (rs6000_asm_output_opcode): Likewise.
29862 2021-02-24 Martin Sebor <msebor@redhat.com>
29864 PR middle-end/97172
29865 * attribs.c (attr_access::free_lang_data): Clear attribute arg spec
29866 from function arguments.
29868 2021-02-24 Tamar Christina <tamar.christina@arm.com>
29870 PR tree-optimization/99220
29871 * tree-vect-slp.c (optimize_load_redistribution_1): Remove
29872 node from cache when it's about to be deleted.
29874 2021-02-24 Jakub Jelinek <jakub@redhat.com>
29876 PR tree-optimization/99225
29877 * fold-const.c (fold_binary_loc) <case NE_EXPR>: In (x & (1 << y)) != 0
29878 to ((x >> y) & 1) != 0 simplifications use build_one_cst instead of
29879 build_int_cst (..., 1). Formatting fixes.
29881 2021-02-24 Tamar Christina <tamar.christina@arm.com>
29883 PR tree-optimization/99149
29884 * tree-vect-slp-patterns.c (vect_detect_pair_op): Don't recreate the
29886 (vect_slp_reset_pattern): Remove.
29887 (complex_fma_pattern::matches): Remove call to vect_slp_reset_pattern.
29888 (complex_mul_pattern::build, complex_fma_pattern::build,
29889 complex_fms_pattern::build): Fix ref counts.
29890 * tree-vect-slp.c (vect_free_slp_tree): Undo SLP only pattern relevancy
29891 when node is being deleted.
29892 (vect_match_slp_patterns_2): Correct result of cache hit on patterns.
29893 (vect_schedule_slp): Invalidate SLP_TREE_REPRESENTATIVE of removed
29895 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize value.
29897 2021-02-24 Matthias Klose <doko@ubuntu.com>
29900 2020-12-07 Matthias Klose <doko@ubuntu.com>
29902 * genextract.c (print_header): Undefine ENABLE_RTL_CHECKING
29903 and ENABLE_RTL_FLAG_CHECKING.
29905 2021-02-24 Richard Biener <rguenther@suse.de>
29908 * builtins.c (fold_builtin_next_arg): Avoid NULL arg.
29910 2021-02-23 Peter Bergner <bergner@linux.ibm.com>
29912 * config/rs6000/mma.md (mma_assemble_pair): Rename from this...
29913 (vsx_assemble_pair): ...to this.
29914 (*mma_assemble_pair): Rename from this...
29915 (*vsx_assemble_pair): ...to this.
29916 (mma_disassemble_pair): Rename from this...
29917 (vsx_disassemble_pair): ...to this.
29918 (*mma_disassemble_pair): Rename from this...
29919 (*vsx_disassemble_pair): ...to this.
29920 * config/rs6000/rs6000-builtin.def (BU_MMA_V2, BU_MMA_V3,
29921 BU_COMPAT): New macros.
29922 (mma_assemble_pair): Rename from this...
29923 (vsx_assemble_pair): ...to this.
29924 (mma_disassemble_pair): Rename from this...
29925 (vsx_disassemble_pair): ...to this.
29926 (mma_assemble_pair): New compatibility built-in.
29927 (mma_disassemble_pair): Likewise.
29928 * config/rs6000/rs6000-call.c (struct builtin_compatibility): New.
29929 (RS6000_BUILTIN_COMPAT): Define.
29930 (bdesc_compat): New.
29931 (mma_expand_builtin): Use VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
29932 (rs6000_gimple_fold_mma_builtin): Use MMA_BUILTIN_DISASSEMBLE_PAIR
29933 and VSX_BUILTIN_ASSEMBLE_PAIR.
29934 (rs6000_init_builtins): Register compatibility built-ins.
29935 (mma_init_builtins): Use VSX_BUILTIN_ASSEMBLE_PAIR,
29936 VSX_BUILTIN_ASSEMBLE_PAIR_INTERNAL, VSX_BUILTIN_DISASSEMBLE_PAIR and
29937 VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
29938 * doc/extend.texi (__builtin_mma_assemble_pair): Rename from this...
29939 (__builtin_vsx_assemble_pair): ...to this.
29940 (__builtin_mma_disassemble_pair): Rename from this...
29941 (__builtin_vsx_disassemble_pair): ...to this.
29943 2021-02-23 Martin Liska <mliska@suse.cz>
29946 * ipa-icf.c (sem_variable::merge): Do not merge 2 variables
29947 with different alignment. That leads to an invalid red zone
29948 size allocated in runtime.
29950 2021-02-23 Jakub Jelinek <jakub@redhat.com>
29952 PR tree-optimization/99204
29953 * fold-const.c (fold_read_from_constant_string): Check that
29954 tree_fits_uhwi_p (index) rather than just that index is INTEGER_CST.
29956 2021-02-23 Segher Boessenkool <segher@kernel.crashing.org>
29957 Kewen Lin <linkw@gcc.gnu.org>
29959 * config/rs6000/rs6000.md (*rotl<mode>3_insert_3): Renamed to...
29960 (rotl<mode>3_insert_3): ...this.
29961 (plus_ior_xor): New code_iterator.
29962 (define_split for GPR rl*imi): New splitter.
29963 * config/rs6000/vsx.md (vsx_init_v4si): Use gen_rotldi3_insert_3
29964 for integer merging.
29966 2021-02-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29968 * config/aarch64/aarch64-tuning-flags.def (cse_sve_vl_constants):
29970 * config/aarch64/aarch64.md (add<mode>3): Force CONST_POLY_INT immediates
29971 into a register when the above is enabled.
29972 * config/aarch64/aarch64.c (neoversev1_tunings):
29973 AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
29974 (aarch64_rtx_costs): Use AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
29976 2021-02-22 Hans-Peter Nilsson <hp@axis.com>
29978 * config/cris/cris.c (cris_print_operand) <'T'>: Change
29979 valid operand from is now an addi mult-value to shift-value.
29980 * config/cris/cris.md (*addi): Change expression of scaled
29981 operand from mult to ashift.
29982 * config/cris/cris.md (*addi_reload): New insn_and_split.
29984 2021-02-22 John David Anglin <danglin@gcc.gnu.org>
29987 * config/pa/pa.c (TARGET_ASM_CAN_OUTPUT_MI_THUNK): Define as
29988 hook_bool_const_tree_hwi_hwi_const_tree_true.
29989 (pa_asm_output_mi_thunk): Add support for nonzero vcall_offset.
29991 2021-02-22 Andre Vieira <andre.simoesdiasvieira@arm.com>
29993 PR rtl-optimization/98791
29994 * ira-conflicts.c (process_regs_for_copy): Don't create allocno copies
29995 for unordered modes.
29997 2021-02-22 Martin Liska <mliska@suse.cz>
29999 * tree-inline.c (inline_forbidden_p): Set
30000 inline_forbidden_reason.
30002 2021-02-22 Richard Biener <rguenther@suse.de>
30004 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Dump
30007 2021-02-22 Richard Biener <rguenther@suse.de>
30009 PR tree-optimization/99165
30010 * gimple-ssa-store-merging.c (pass_store_merging::process_store):
30011 Accumulate changed to ret.
30013 2021-02-21 Uros Bizjak <ubizjak@gmail.com>
30016 2020-12-09 Uroš Bizjak <ubizjak@gmail.com>
30018 * config/i386/i386.h (REG_ALLOC_ORDER): Remove
30020 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
30023 * config/s390/vector.md (trunctf<DFP_ALL:mode>2_vr): New
30025 (trunctf<DFP_ALL:mode>2): Likewise.
30026 (trunctdtf2_vr): Likewise.
30027 (trunctdtf2): Likewise.
30028 (extend<DFP_ALL:mode>tf2_vr): Likewise.
30029 (extend<DFP_ALL:mode>tf2): Likewise.
30030 (extendtftd2_vr): Likewise.
30031 (extendtftd2): Likewise.
30033 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
30035 * config/s390/vector.md (*fprx2_to_tf): Rename to fprx2_to_tf,
30036 add memory alternative.
30037 (tf_to_fprx2): New pattern.
30039 2021-02-19 Martin Sebor <msebor@redhat.com>
30042 * attribs.c (init_attr_rdwr_indices): Guard vblist use.
30043 (attr_access::free_lang_data): Remove a spurious test.
30045 2021-02-19 Nathan Sidwell <nathan@acm.org>
30047 * doc/invoke.texi (flang-info-module-read): Document.
30049 2021-02-19 Martin Liska <mliska@suse.cz>
30051 PR translation/99167
30052 * params.opt: Fix typo.
30054 2021-02-19 Richard Biener <rguenther@suse.de>
30056 PR middle-end/99122
30057 * tree-inline.c (inline_forbidden_p): Do not inline functions
30058 with VLA arguments or return value.
30060 2021-02-19 Jakub Jelinek <jakub@redhat.com>
30063 * config/arm/arm.md (*stack_protect_combined_set_insn,
30064 *stack_protect_combined_test_insn): If force_const_mem result
30065 is not valid general operand, force its address into the destination
30068 2021-02-19 Jakub Jelinek <jakub@redhat.com>
30071 * tree-cfg.c (gimple_merge_blocks): If bb a starts with eh landing
30072 pad or non-local label, put FORCED_LABELs from bb b after that label
30073 rather than before it.
30075 2021-02-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
30078 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3): Use
30079 expand_vector_broadcast' to emit the vec_duplicate operand.
30081 2021-02-18 Vladimir N. Makarov <vmakarov@redhat.com>
30083 PR rtl-optimization/96264
30084 * lra-remat.c (reg_overlap_for_remat_p): Check also output insn
30087 2021-02-18 H.J. Lu <hjl.tools@gmail.com>
30090 * varasm.c (get_section): Replace SUPPORTS_SHF_GNU_RETAIN with
30091 looking up the retain attribute.
30092 (resolve_unique_section): Likewise.
30093 (get_variable_section): Likewise.
30094 (switch_to_section): Likewise. Warn when a symbol without the
30095 retain attribute and a symbol with the retain attribute are
30096 placed in the section with the same name, instead of the used
30098 * doc/extend.texi: Document the "retain" attribute.
30100 2021-02-18 Nathan Sidwell <nathan@acm.org>
30103 * doc/invoke.texi (flang-info-include-translate): Document header
30106 2021-02-18 Richard Biener <rguenther@suse.de>
30108 PR middle-end/99122
30109 * ipa-fnsummary.c (analyze_function_body): Set
30110 CIF_FUNCTION_NOT_INLINABLE for VLA parameter calls.
30111 * tree-inline.c (insert_init_debug_bind): Pass NULL for
30112 error_mark_node values.
30113 (force_value_to_type): Do not build V_C_Es for WITH_SIZE_EXPR
30115 (setup_one_parameter): Delay force_value_to_type until when
30118 2021-02-18 Hans-Peter Nilsson <hp@axis.com>
30120 PR tree-optimization/99142
30121 * match.pd (clz cmp 0): Gate replacement on single_use of clz result.
30123 2021-02-18 Jakub Jelinek <jakub@redhat.com>
30125 * wide-int-bitmask.h (wide_int_bitmask::wide_int_bitmask (),
30126 wide_int_bitmask::wide_int_bitmask (uint64_t),
30127 wide_int_bitmask::wide_int_bitmask (uint64_t, uint64_t),
30128 wide_int_bitmask::operator ~ () const,
30129 wide_int_bitmask::operator | (wide_int_bitmask) const,
30130 wide_int_bitmask::operator & (wide_int_bitmask) const): Use constexpr
30132 * config/i386/i386.h (PTA_3DNOW, PTA_3DNOW_A, PTA_64BIT, PTA_ABM,
30133 PTA_AES, PTA_AVX, PTA_BMI, PTA_CX16, PTA_F16C, PTA_FMA, PTA_FMA4,
30134 PTA_FSGSBASE, PTA_LWP, PTA_LZCNT, PTA_MMX, PTA_MOVBE, PTA_NO_SAHF,
30135 PTA_PCLMUL, PTA_POPCNT, PTA_PREFETCH_SSE, PTA_RDRND, PTA_SSE, PTA_SSE2,
30136 PTA_SSE3, PTA_SSE4_1, PTA_SSE4_2, PTA_SSE4A, PTA_SSSE3, PTA_TBM,
30137 PTA_XOP, PTA_AVX2, PTA_BMI2, PTA_RTM, PTA_HLE, PTA_PRFCHW, PTA_RDSEED,
30138 PTA_ADX, PTA_FXSR, PTA_XSAVE, PTA_XSAVEOPT, PTA_AVX512F, PTA_AVX512ER,
30139 PTA_AVX512PF, PTA_AVX512CD, PTA_NO_TUNE, PTA_SHA, PTA_PREFETCHWT1,
30140 PTA_CLFLUSHOPT, PTA_XSAVEC, PTA_XSAVES, PTA_AVX512DQ, PTA_AVX512BW,
30141 PTA_AVX512VL, PTA_AVX512IFMA, PTA_AVX512VBMI, PTA_CLWB, PTA_MWAITX,
30142 PTA_CLZERO, PTA_NO_80387, PTA_PKU, PTA_AVX5124VNNIW, PTA_AVX5124FMAPS,
30143 PTA_AVX512VPOPCNTDQ, PTA_SGX, PTA_AVX512VNNI, PTA_GFNI, PTA_VAES,
30144 PTA_AVX512VBMI2, PTA_VPCLMULQDQ, PTA_AVX512BITALG, PTA_RDPID,
30145 PTA_PCONFIG, PTA_WBNOINVD, PTA_AVX512VP2INTERSECT, PTA_PTWRITE,
30146 PTA_AVX512BF16, PTA_WAITPKG, PTA_MOVDIRI, PTA_MOVDIR64B, PTA_ENQCMD,
30147 PTA_CLDEMOTE, PTA_SERIALIZE, PTA_TSXLDTRK, PTA_AMX_TILE, PTA_AMX_INT8,
30148 PTA_AMX_BF16, PTA_UINTR, PTA_HRESET, PTA_KL, PTA_WIDEKL, PTA_AVXVNNI,
30149 PTA_X86_64_BASELINE, PTA_X86_64_V2, PTA_X86_64_V3, PTA_X86_64_V4,
30150 PTA_CORE2, PTA_NEHALEM, PTA_WESTMERE, PTA_SANDYBRIDGE, PTA_IVYBRIDGE,
30151 PTA_HASWELL, PTA_BROADWELL, PTA_SKYLAKE, PTA_SKYLAKE_AVX512,
30152 PTA_CASCADELAKE, PTA_COOPERLAKE, PTA_CANNONLAKE, PTA_ICELAKE_CLIENT,
30153 PTA_ICELAKE_SERVER, PTA_TIGERLAKE, PTA_SAPPHIRERAPIDS, PTA_ALDERLAKE,
30154 PTA_KNL, PTA_BONNELL, PTA_SILVERMONT, PTA_GOLDMONT, PTA_GOLDMONT_PLUS,
30155 PTA_TREMONT, PTA_KNM): Use constexpr instead of const.
30157 2021-02-18 Jakub Jelinek <jakub@redhat.com>
30159 PR middle-end/99109
30160 * gimple-array-bounds.cc (build_zero_elt_array_type): Rename to ...
30161 (build_printable_array_type): ... this. Add nelts argument. For
30162 overaligned eltype, use TYPE_MAIN_VARIANT (eltype) instead. If
30163 nelts, call build_array_type_nelts.
30164 (array_bounds_checker::check_mem_ref): Use build_printable_array_type
30165 instead of build_zero_elt_array_type and build_array_type_nelts.
30167 2021-02-18 Jakub Jelinek <jakub@redhat.com>
30170 * config/i386/i386.c (distance_non_agu_define): Don't call
30171 extract_insn_cached here.
30172 (ix86_lea_outperforms): Save and restore recog_data around call
30173 to distance_non_agu_define and distance_agu_use.
30174 (ix86_ok_to_clobber_flags): Remove.
30175 (ix86_avoid_lea_for_add): Don't call ix86_ok_to_clobber_flags.
30176 (ix86_avoid_lea_for_addr): Likewise. Adjust function comment.
30177 * config/i386/i386.md (*lea<mode>): Change from define_insn_and_split
30178 into define_insn. Move the splitting to define_peephole2 and
30179 check there using peep2_regno_dead_p if FLAGS_REG is dead.
30181 2021-02-17 Julian Brown <julian@codesourcery.com>
30183 * gimplify.c (gimplify_scan_omp_clauses): Handle ATTACH_DETACH
30186 2021-02-17 Xi Ruoyao <xry111@mengyan1223.wang>
30189 * config/mips/mips.c (mips_symbol_insns): Do not use
30190 MSA_SUPPORTED_MODE_P if mode is MAX_MACHINE_MODE.
30192 2021-02-16 Vladimir N. Makarov <vmakarov@redhat.com>
30194 PR inline-asm/98096
30195 * stmt.c (resolve_operand_name_1): Take inout operands into account
30196 for access to labels by names.
30197 * doc/extend.texi: Describe counting operands for accessing labels.
30199 2021-02-16 Richard Biener <rguenther@suse.de>
30201 PR tree-optimization/38474
30202 * tree-ssa-structalias.c (variable_info::address_taken): New.
30203 (new_var_info): Initialize address_taken.
30204 (process_constraint): Set address_taken.
30205 (solve_constraints): Use the new address_taken flag rather
30206 than is_reg_var for sorting variables.
30207 (dump_constraint): Dump the variable number if the name
30210 2021-02-16 Jakub Jelinek <jakub@redhat.com>
30213 * tree-vect-stmts.c (vectorizable_simd_clone_call): For num_calls != 1
30214 multiply by 4096 and for inbranch by 8192.
30215 * config/i386/i386.c (ix86_simd_clone_usable): For TARGET_AVX512F,
30216 return 3, 2 or 1 for mangle letters 'b', 'c' or 'd'.
30218 2021-02-15 Maya Rashish <coypu@sdf.org>
30220 * config/aarch64/aarch64.c (aarch64_init_builtins):
30221 Call SUBTARGET_INIT_BUILTINS.
30223 2021-02-15 Peter Bergner <bergner@linux.ibm.com>
30225 PR rtl-optimization/98872
30226 * init-regs.c (initialize_uninitialized_regs): Skip initialization
30227 if CONST0_RTX is NULL.
30229 2021-02-15 Richard Sandiford <richard.sandiford@arm.com>
30231 PR rtl-optimization/98863
30232 * rtl-ssa/functions.h (function_info::bb_live_out_info): Delete.
30233 (function_info::build_info): Turn into a declaration, moving the
30234 definition to internals.h.
30235 (function_info::bb_walker): Declare.
30236 (function_info::create_reg_use): Likewise.
30237 (function_info::calculate_potential_phi_regs): Take a build_info
30239 (function_info::place_phis, function_info::create_ebbs): Declare.
30240 (function_info::calculate_ebb_live_in_for_debug): Likewise.
30241 (function_info::populate_backedge_phis): Delete.
30242 (function_info::start_block, function_info::end_block): Declare.
30243 (function_info::populate_phi_inputs): Delete.
30244 (function_info::m_potential_phi_regs): Move information to build_info.
30245 * rtl-ssa/internals.h: New file.
30246 (function_info::bb_phi_info): New class.
30247 (function_info::build_info): Moved from functions.h.
30248 Add a constructor and destructor.
30249 (function_info::build_info::ebb_use): Delete.
30250 (function_info::build_info::ebb_def): Likewise.
30251 (function_info::build_info::bb_live_out): Likewise.
30252 (function_info::build_info::tmp_ebb_live_in_for_debug): New variable.
30253 (function_info::build_info::potential_phi_regs): Likewise.
30254 (function_info::build_info::potential_phi_regs_for_debug): Likewise.
30255 (function_info::build_info::ebb_def_regs): Likewise.
30256 (function_info::build_info::bb_phis): Likewise.
30257 (function_info::build_info::bb_mem_live_out): Likewise.
30258 (function_info::build_info::bb_to_rpo): Likewise.
30259 (function_info::build_info::def_stack): Likewise.
30260 (function_info::build_info::old_def_stack_limit): Likewise.
30261 * rtl-ssa/internals.inl (function_info::build_info::record_reg_def):
30262 Remove the regno argument. Push the previous definition onto the
30263 definition stack where necessary.
30264 * rtl-ssa/accesses.cc: Include internals.h.
30265 * rtl-ssa/changes.cc: Likewise.
30266 * rtl-ssa/blocks.cc: Likewise.
30267 (function_info::build_info::build_info): Define.
30268 (function_info::build_info::~build_info): Likewise.
30269 (function_info::bb_walker): New class.
30270 (function_info::bb_walker::bb_walker): Define.
30271 (function_info::add_live_out_use): Convert a logarithmic-complexity
30272 test into a linear one. Allow the same definition to be passed
30274 (function_info::calculate_potential_phi_regs): Moved from
30275 functions.cc. Take a build_info parameter and store the
30276 information there instead.
30277 (function_info::place_phis): New function.
30278 (function_info::add_entry_block_defs): Update call to record_reg_def.
30279 (function_info::calculate_ebb_live_in_for_debug): New function.
30280 (function_info::add_phi_nodes): Use bb_phis to decide which
30281 registers need phi nodes and initialize ebb_def_regs accordingly.
30282 Do not add degenerate phis here.
30283 (function_info::add_artificial_accesses): Use create_reg_use.
30284 Assert that all definitions are listed in the DF LR sets.
30285 Update call to record_reg_def.
30286 (function_info::record_block_live_out): Record live-out register
30287 values in the phis of successor blocks. Use the live-out set
30288 when processing the last block in an EBB, instead of always
30289 using the live-in sets of successor blocks. AND the live sets
30290 with the set of registers that have been defined in the EBB,
30291 rather than with all potential phi registers. Cope correctly
30292 with branches back to the start of the current EBB.
30293 (function_info::start_block): New function.
30294 (function_info::end_block): Likewise.
30295 (function_info::populate_phi_inputs): Likewise.
30296 (function_info::create_ebbs): Likewise.
30297 (function_info::process_all_blocks): Rewrite into a multi-phase
30299 * rtl-ssa/functions.cc: Include internals.h.
30300 (function_info::calculate_potential_phi_regs): Move to blocks.cc.
30301 (function_info::init_function_data): Remove caller.
30302 * rtl-ssa/insns.cc: Include internals.h
30303 (function_info::create_reg_use): New function. Lazily any
30304 degenerate phis needed by the linear RPO view.
30305 (function_info::record_use): Use create_reg_use. When processing
30306 debug uses, use potential_phi_regs and test it before checking
30307 whether the register is live on entry to the current EBB. Lazily
30308 calculate ebb_live_in_for_debug.
30309 (function_info::record_call_clobbers): Update call to record_reg_def.
30310 (function_info::record_def): Likewise.
30312 2021-02-15 Martin Liska <mliska@suse.cz>
30314 * toplev.c (init_asm_output): Free output of
30315 gen_command_line_string function.
30316 (process_options): Likewise.
30318 2021-02-15 Martin Liska <mliska@suse.cz>
30320 * params.opt: Add 2 missing Param keywords.
30322 2021-02-15 Eric Botcazou <ebotcazou@adacore.com>
30324 * df-core.c (df_worklist_dataflow_doublequeue): Use proper cast.
30326 2021-02-15 Jakub Jelinek <jakub@redhat.com>
30328 PR tree-optimization/99079
30329 * match.pd (A % (pow2pcst << N) -> A & ((pow2pcst << N) - 1)): Remove
30330 useless tree_nop_conversion_p (type, TREE_TYPE (@3)) check. Instead
30331 require both type and TREE_TYPE (@1) to be integral types and either
30332 type having smaller or equal precision, or TREE_TYPE (@1) being
30333 unsigned type, or type being signed type. If TREE_TYPE (@1)
30334 doesn't have wrapping overflow, perform the subtraction of one in
30337 2021-02-14 Jan Hubicka <hubicka@ucw.cz>
30338 Richard Biener <rguether@suse.de>
30341 * ipa-reference.c (ipa_init): Only conditinally initialize
30342 reference_vars_to_consider.
30343 (propagate): Conditionally deninitialize reference_vars_to_consider.
30344 (ipa_reference_write_optimization_summary): Sanity check that
30345 reference_vars_to_consider is not allocated.
30347 2021-02-13 Levy Hsu <admin@levyhsu.com>
30350 * config/riscv/riscv-shorten-memrefs.c (pass_shorten_memrefs): Add
30351 extend parameter to get_si_mem_base_reg declaration.
30352 (get_si_mem_base_reg): Add extend parameter. Set it.
30353 (analyze): Pass extend arg to get_si_mem_base_reg.
30354 (transform): Likewise. Use it when rewriting mems.
30355 * config/riscv/riscv.c (riscv_legitimize_move): Check for subword
30356 loads and emit sign/zero extending load followed by subreg move.
30358 2021-02-13 Jim Wilson <jimw@sifive.com>
30361 * config/riscv/riscv.c (riscv_compressed_lw_address_p): Drop early
30362 exit when !reload_completed. Only perform check for compressed reg
30363 if reload_completed.
30364 (riscv_rtx_costs): In MEM case, when optimizing for size and
30365 shorten memrefs, if not compressible, then increase cost.
30367 2021-02-13 Jakub Jelinek <jakub@redhat.com>
30369 PR rtl-optimization/98439
30370 * recog.c (pass_split_before_regstack::gate): Enable even when
30371 pass_split_before_sched2 is enabled if -fselective-scheduling2 is
30374 2021-02-13 Jakub Jelinek <jakub@redhat.com>
30377 * config/i386/mmx.md (*mmx_pshufd_1): Add a combine splitter for
30378 swap of V2SImode elements in memory into DImode memory rotate by 32.
30380 2021-02-12 Martin Sebor <msebor@redhat.com>
30382 * tree-pretty-print.c (print_generic_expr_to_str): Update comment.
30384 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
30386 * rtl-ssa/accesses.cc (function_info::make_use_available): Use
30387 m_temp_obstack rather than m_obstack to allocate the temporary use.
30389 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
30391 * df-problems.c (df_lr_bb_local_compute): Treat partial definitions
30392 as read-modify operations.
30394 2021-02-12 Richard Biener <rguenther@suse.de>
30396 PR middle-end/38474
30397 * ipa-fnsummary.c (unmodified_parm_1): Only walk when
30398 fbi->aa_walk_budget is bigger than zero. Update
30399 fbi->aa_walk_budget.
30400 (param_change_prob): Likewise.
30401 * ipa-prop.c (detect_type_change_from_memory_writes):
30402 Properly account walk_aliased_vdefs.
30403 (parm_preserved_before_stmt_p): Canonicalize updates.
30404 (parm_ref_data_preserved_p): Likewise.
30405 (parm_ref_data_pass_through_p): Likewise.
30406 (determine_known_aggregate_parts): Account own alias queries.
30408 2021-02-12 Martin Liska <mliska@suse.cz>
30410 * opts-common.c (decode_cmdline_option): Release werror_arg.
30411 * opts.c (gen_producer_string): Release output of
30412 gen_command_line_string.
30414 2021-02-12 Richard Biener <rguenther@suse.de>
30416 PR tree-optimization/38474
30417 * params.opt (-param=max-store-chains-to-track=): New param.
30418 (-param=max-stores-to-track=): Likewise.
30419 * doc/invoke.texi (max-store-chains-to-track): Document.
30420 (max-stores-to-track): Likewise.
30421 * gimple-ssa-store-merging.c (pass_store_merging::m_n_chains):
30423 (pass_store_merging::m_n_stores): Likewise.
30424 (pass_store_merging::terminate_and_process_chain): Update
30425 m_n_stores and m_n_chains.
30426 (pass_store_merging::process_store): Likewise. Terminate
30427 oldest chains if the number of stores or chains get too large.
30428 (imm_store_chain_info::terminate_and_process_chain): Dump
30431 2021-02-11 Eric Botcazou <ebotcazou@adacore.com>
30433 * config/i386/winnt.c (i386_pe_seh_unwind_emit): When switching to
30434 the cold section, emit a nop before the directive if the previous
30435 active instruction can throw.
30437 2021-02-11 Peter Bergner <bergner@linux.ibm.com>
30440 * config/rs6000/predicates.md (mma_assemble_input_operand): Restrict
30441 memory addresses that are legal for quad word accesses.
30443 2021-02-11 Andrea Corallo <andrea.corallo@arm.com>
30446 * config/arm/thumb2.md (*doloop_end_internal): Generate
30447 alternative sequence to handle long range branches.
30449 2021-02-11 Joel Hutton <joel.hutton@arm.com>
30451 PR tree-optimization/98772
30452 * optabs-tree.c (supportable_half_widening_operation): New function
30453 to check for supportable V8QI->V8HI widening patterns.
30454 * optabs-tree.h (supportable_half_widening_operation): New function.
30455 * tree-vect-stmts.c (vect_create_half_widening_stmts): New function
30456 to create promotion stmts for V8QI->V8HI widening patterns.
30457 (vectorizable_conversion): Add case for V8QI->V8HI.
30459 2021-02-11 Richard Biener <rguenther@suse.de>
30461 * sparseset.h (SPARSESET_ELT_BITS): Remove.
30462 (SPARSESET_ELT_TYPE): Use unsigned int.
30463 * fwprop.c: Do not include sparseset.h.
30465 2021-02-10 Jakub Jelinek <jakub@redhat.com>
30468 * varasm.c (declare_weak): For -fsyntax-only, allow even
30469 TREE_ASM_WRITTEN function decls.
30471 2021-02-10 Jakub Jelinek <jakub@redhat.com>
30474 * config/i386/sse.md (fix<fixunssuffix>_truncv2sfv2di2,
30475 <insn>v8qiv8hi2, <insn>v8qiv8si2, <insn>v4qiv4si2, <insn>v4hiv4si2,
30476 <insn>v8qiv8di2, <insn>v4qiv4di2, <insn>v2qiv2di2, <insn>v4hiv4di2,
30477 <insn>v2hiv2di2, <insn>v2siv2di2): Force operands[1] into REG before
30478 calling simplify_gen_subreg on it.
30480 2021-02-10 Martin Liska <mliska@suse.cz>
30482 * config/nvptx/nvptx.c (nvptx_option_override): Use
30483 flag_patchable_function_entry instead of the removed
30484 function_entry_patch_area_size.
30486 2021-02-10 Martin Liska <mliska@suse.cz>
30488 PR tree-optimization/99002
30489 PR tree-optimization/99026
30490 * gimple-if-to-switch.cc (if_chain::is_beneficial): Fix memory
30491 leak when adjacent cases are merged.
30492 * tree-switch-conversion.c (switch_decision_tree::analyze_switch_statement): Use
30494 (make_pass_lower_switch): Remove trailing whitespace.
30495 * tree-switch-conversion.h (release_clusters): New.
30497 2021-02-10 Richard Biener <rguenther@suse.de>
30499 PR rtl-optimization/99054
30500 * cfgrtl.c (rtl-optimization/99054): Return an auto_vec.
30501 (fixup_partitions): Adjust.
30502 (rtl_verify_edges): Likewise.
30504 2021-02-10 Jakub Jelinek <jakub@redhat.com>
30506 PR middle-end/99007
30507 * gimplify.c (gimplify_scan_omp_clauses): For MEM_REF on reductions,
30508 temporarily disable gimplify_ctxp->into_ssa around gimplify_expr
30511 2021-02-10 Richard Biener <rguenther@suse.de>
30514 * ipa-pure-const.c (propagate_malloc): Use an auto_vec<>
30517 2021-02-10 Richard Biener <rguenther@suse.de>
30519 PR tree-optimization/99024
30520 * tree-vect-loop.c (_loop_vec_info::~_loop_vec_info): Only
30521 clear loop->aux if it is associated with the destroyed loop_vinfo.
30523 2021-02-10 Martin Liska <mliska@suse.cz>
30525 PR tree-optimization/99002
30526 * gimple-if-to-switch.cc (find_conditions): Fix memory leak
30529 2021-02-10 Martin Liska <mliska@suse.cz>
30532 * ipa-icf.c (sem_item::add_reference): Fix memory leak when
30533 a reference exists.
30535 2021-02-10 Jakub Jelinek <jakub@redhat.com>
30538 * dwarf2out.c (prune_unused_types_walk): Mark DW_TAG_variable DIEs
30539 at class scope for DWARF5+.
30541 2021-02-09 Eric Botcazou <ebotcazou@adacore.com>
30543 PR rtl-optimization/96015
30544 * reorg.c (skip_consecutive_labels): Minor comment tweaks.
30545 (relax_delay_slots): When deleting a jump to the next active
30546 instruction over a barrier, first delete the barrier if the
30547 jump is the only way to reach the target label.
30549 2021-02-09 Andre Vieira <andre.simoesdiasvieira@arm.com>
30551 * config/aarch64/aarch64-cost-tables.h: Add entries for vect.mul.
30552 * config/aarch64/aarch64.c (aarch64_rtx_mult_cost): Use vect.mul for
30553 vector multiplies and vect.alu for SSRA.
30554 * config/arm/aarch-common-protos.h (struct vector_cost_table): Define
30555 vect.mul cost field.
30556 * config/arm/aarch-cost-tables.h: Add entries for vect.mul.
30557 * config/arm/arm.c: Likewise.
30559 2021-02-09 Richard Biener <rguenther@suse.de>
30561 PR tree-optimization/98863
30562 * tree-ssa-sccvn.h (vn_avail::next_undo): Add.
30563 * tree-ssa-sccvn.c (last_pushed_avail): New global.
30564 (rpo_elim::eliminate_push_avail): Chain pushed avails.
30565 (unwind_state::avail_top): Add.
30566 (do_unwind): Rewrite unwinding of avail entries.
30567 (do_rpo_vn): Initialize last_pushed_avail and
30568 avail_top of the undo state.
30570 2021-02-09 Jakub Jelinek <jakub@redhat.com>
30572 PR middle-end/99004
30573 * calls.c (maybe_warn_rdwr_sizes): Change s0 and s1 type from
30574 const char * to char * and free those pointers after use.
30576 2021-02-09 Richard Biener <rguenther@suse.de>
30578 PR tree-optimization/99017
30579 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Allow
30580 zero vector cost entries.
30582 2021-02-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
30584 PR middle-end/98974
30585 * tree-vect-stmts.c (vectorizable_condition): Remove shadow vec_num
30586 parameter in vectorizable_condition.
30588 2021-02-08 Richard Biener <rguenther@suse.de>
30591 * tree.c (walk_tree_1): Walk VECTOR_CST elements.
30593 2021-02-08 Martin Liska <mliska@suse.cz>
30596 * cfgexpand.c (pass_expand::execute): Parse per-function option
30597 flag_patchable_function_entry and use it.
30598 * common.opt: Remove function_entry_patch_area_size and
30599 function_entry_patch_area_start global variables.
30600 * opts.c (parse_and_check_patch_area): New function.
30601 (common_handle_option): Use it.
30602 * opts.h (parse_and_check_patch_area): New function.
30603 * toplev.c (process_options): Parse and use
30604 function_entry_patch_area_size.
30606 2021-02-08 Martin Sebor <msebor@redhat.com>
30608 * doc/extend.texi (attribute malloc): Correct typos.
30610 2021-02-05 Nathan Sidwell <nathan@acm.org>
30613 * gcc.c (driver::maybe_run_linker): Check for input file
30614 accessibility if not linking.
30616 2021-02-05 Richard Biener <rguenther@suse.de>
30618 PR tree-optimization/98855
30619 * tree-vectorizer.h (add_stmt_cost): New overload.
30620 * tree-vect-slp.c (li_cost_vec_cmp): New.
30621 (vect_bb_slp_scalar_cost): Cost individual loop regions
30622 separately. Account for the scalar instance root stmt.
30624 2021-02-05 Tom de Vries <tdevries@suse.de>
30627 * tree-switch-conversion.c (jump_table_cluster::emit): Add loc
30629 (bit_test_cluster::emit): Reuse location_t for newly created
30631 (switch_decision_tree::try_switch_expansion): Preserve
30633 * tree-switch-conversion.h: Change function signatures.
30635 2021-02-05 Jakub Jelinek <jakub@redhat.com>
30638 * config/i386/i386-options.c (m_NONE, m_ALL): Define.
30639 * config/i386/x86-tune.def (X86_TUNE_BRANCH_PREDICTION_HINTS,
30640 X86_TUNE_PROMOTE_QI_REGS): Use m_NONE instead of 0U.
30641 (X86_TUNE_QIMODE_MATH): Use m_ALL instead of ~0U.
30643 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30645 * config/aarch64/aarch64-simd-builtins.def (get_high): Define builtin.
30646 * config/aarch64/aarch64-simd.md (aarch64_get_high<mode>): Define.
30647 * config/aarch64/arm_neon.h (__GET_HIGH): Delete.
30648 (vget_high_f16): Reimplement using new builtin.
30649 (vget_high_f32): Likewise.
30650 (vget_high_f64): Likewise.
30651 (vget_high_p8): Likewise.
30652 (vget_high_p16): Likewise.
30653 (vget_high_p64): Likewise.
30654 (vget_high_s8): Likewise.
30655 (vget_high_s16): Likewise.
30656 (vget_high_s32): Likewise.
30657 (vget_high_s64): Likewise.
30658 (vget_high_u8): Likewise.
30659 (vget_high_u16): Likewise.
30660 (vget_high_u32): Likewise.
30661 (vget_high_u64): Likewise.
30663 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30665 * config/aarch64/aarch64-simd-builtins.def (get_low): Define builtin.
30666 * config/aarch64/aarch64-simd.md (aarch64_get_low<mode>): Define.
30667 * config/aarch64/arm_neon.h (__GET_LOW): Delete.
30668 (vget_low_f16): Reimplement using new builtin.
30669 (vget_low_f32): Likewise.
30670 (vget_low_f64): Likewise.
30671 (vget_low_p8): Likewise.
30672 (vget_low_p16): Likewise.
30673 (vget_low_p64): Likewise.
30674 (vget_low_s8): Likewise.
30675 (vget_low_s16): Likewise.
30676 (vget_low_s32): Likewise.
30677 (vget_low_s64): Likewise.
30678 (vget_low_u8): Likewise.
30679 (vget_low_u16): Likewise.
30680 (vget_low_u32): Likewise.
30681 (vget_low_u64): Likewise.
30683 2021-02-05 Kito Cheng <kito.cheng@sifive.com>
30685 * gcc.c (print_multilib_info): Check all required argument is provided
30688 2021-02-05 liuhongt <hongtao.liu@intel.com>
30691 * config/i386/i386-expand.c (ix86_expand_sse_cmp): Don't
30692 generate integer mask comparison for 128/256-bits vector when
30693 op_true/op_false is NULL_RTX or CONSTM1_RTX/CONST0_RTX. Also
30694 delete redundant !maskcmp condition.
30695 (ix86_expand_int_vec_cmp): Ditto but no redundant deletion
30697 (ix86_expand_sse_movcc): Delete definition of maskcmp, add the
30698 condition directly to if (maskcmp), add extra check for
30699 cmpmode, it should be MODE_INT.
30700 (ix86_expand_fp_vec_cmp): Pass NULL to ix86_expand_sse_cmp's
30701 parameters op_true/op_false.
30702 (ix86_use_mask_cmp_p): New.
30704 2021-02-05 liuhongt <hongtao.liu@intel.com>
30707 * config/i386/x86-tune.def (X86_TUNE_AVX256_UNALIGNED_LOAD_OPTIMAL):
30708 Remove m_GENERIC from ~list.
30709 (X86_TUNE_AVX256_UNALIGNED_STORE_OPTIMAL): Ditto.
30711 2021-02-04 David Malcolm <dmalcolm@redhat.com>
30714 * diagnostic-show-locus.c (compatible_locations_p): Require
30715 locations in the same macro map to be either both from the
30716 macro definition, or both from the macro arguments.
30718 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
30720 * config/aarch64/aarch64-simd-builtins.def: Add
30721 [su]mull_hi_lane[q] builtin generator macros.
30722 * config/aarch64/aarch64-simd.md
30723 (aarch64_<su>mull_hi_lane<mode>_insn): Define.
30724 (aarch64_<su>mull_hi_lane<mode>): Define.
30725 (aarch64_<su>mull_hi_laneq<mode>_insn): Define.
30726 (aarch64_<su>mull_hi_laneq<mode>): Define.
30727 * config/aarch64/arm_neon.h (vmull_high_lane_s16): Use RTL
30728 builtin instead of inline asm.
30729 (vmull_high_lane_s32): Likewise.
30730 (vmull_high_lane_u16): Likewise.
30731 (vmull_high_lane_u32): Likewise.
30732 (vmull_high_laneq_s16): Likewise.
30733 (vmull_high_laneq_s32): Likewise.
30734 (vmull_high_laneq_u16): Likewise.
30735 (vmull_high_laneq_u32): Liekwise.
30737 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
30739 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_hi_n
30740 builtin generator macros.
30741 * config/aarch64/aarch64-simd.md
30742 (aarch64_<su>mull_hi_n<mode>_insn): Define.
30743 (aarch64_<su>mull_hi_n<mode>): Define.
30744 * config/aarch64/arm_neon.h (vmull_high_n_s16): Use RTL builtin
30745 instead of inline asm.
30746 (vmull_high_n_s32): Likewise.
30747 (vmull_high_n_u16): Likewise.
30748 (vmull_high_n_u32): Likewise.
30750 2021-02-04 Richard Biener <rguenther@suse.de>
30752 PR tree-optimization/98855
30753 * tree-vect-loop.c (vectorizable_phi): Do not cost
30754 single-argument PHIs.
30755 * tree-vect-slp.c (vect_bb_slp_scalar_cost): Likewise.
30756 * tree-vect-stmts.c (vectorizable_bswap): Also perform
30757 costing for SLP operation.
30759 2021-02-04 Martin Liska <mliska@suse.cz>
30761 * doc/extend.texi: Mention -mprefer-vector-width in target
30764 2021-02-03 Martin Sebor <msebor@redhat.com>
30766 PR tree-optimization/98937
30767 * tree-ssa-strlen.c (strlen_dom_walker::~strlen_dom_walker): Define.
30768 Flush pointer_query cache.
30770 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
30772 * config/rs6000/genfusion.pl (gen_2logical): Add missing
30773 fixes based on patch review.
30774 * config/rs6000/fusion.md: Regenerate file.
30776 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
30778 * config/rs6000/t-rs6000: Comment out auto generation of
30781 2021-02-03 Andrew Stubbs <ams@codesourcery.com>
30783 * config/gcn/gcn-opts.h (enum processor_type): Add PROCESSOR_GFX908.
30784 * config/gcn/gcn.c (gcn_omp_device_kind_arch_isa): Add gfx908.
30785 (output_file_start): Add gfx908.
30786 * config/gcn/gcn.opt (gpu_type): Add gfx908.
30787 * config/gcn/t-gcn-hsa (MULTILIB_OPTIONS): Add march=gfx908.
30788 (MULTILIB_DIRNAMES): Add gfx908.
30789 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): New define.
30790 (main): Recognize gfx908.
30791 * config/gcn/t-omp-device: Add gfx908.
30793 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
30795 * config/aarch64/aarch64-simd-builtins.def: Add
30796 [su]mlsl_hi_lane[q] builtin macro generators.
30797 * config/aarch64/aarch64-simd.md
30798 (aarch64_<su>mlsl_hi_lane<mode>_insn): Define.
30799 (aarch64_<su>mlsl_hi_lane<mode>): Define.
30800 (aarch64_<su>mlsl_hi_laneq<mode>_insn): Define.
30801 (aarch64_<su>mlsl_hi_laneq<mode>): Define.
30802 * config/aarch64/arm_neon.h (vmlsl_high_lane_s16): Use RTL
30803 builtin instead of inline asm.
30804 (vmlsl_high_lane_s32): Likewise.
30805 (vmlsl_high_lane_u16): Likewise.
30806 (vmlsl_high_lane_u32): Likewise.
30807 (vmlsl_high_laneq_s16): Likewise.
30808 (vmlsl_high_laneq_s32): Likewise.
30809 (vmlsl_high_laneq_u16): Likewise.
30810 (vmlsl_high_laneq_u32): Likewise.
30811 (vmlal_high_laneq_u32): Likewise.
30813 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
30815 * config/aarch64/aarch64-simd-builtins.def: Add
30816 [su]mlal_hi_lane[q] builtin generator macros.
30817 * config/aarch64/aarch64-simd.md
30818 (aarch64_<su>mlal_hi_lane<mode>_insn): Define.
30819 (aarch64_<su>mlal_hi_lane<mode>): Define.
30820 (aarch64_<su>mlal_hi_laneq<mode>_insn): Define.
30821 (aarch64_<su>mlal_hi_laneq<mode>): Define.
30822 * config/aarch64/arm_neon.h (vmlal_high_lane_s16): Use RTL
30823 builtin instead of inline asm.
30824 (vmlal_high_lane_s32): Likewise.
30825 (vmlal_high_lane_u16): Likewise.
30826 (vmlal_high_lane_u32): Likewise.
30827 (vmlal_high_laneq_s16): Likewise.
30828 (vmlal_high_laneq_s32): Likewise.
30829 (vmlal_high_laneq_u16): Likewise.
30830 (vmlal_high_laneq_u32): Likewise.
30832 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
30834 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_hi_n
30835 builtin generator macros.
30836 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_hi_n<mode>_insn):
30838 (aarch64_<su>mlsl_hi_n<mode>): Define.
30839 * config/aarch64/arm_neon.h (vmlsl_high_n_s16): Use RTL builtin
30840 instead of inline asm.
30841 (vmlsl_high_n_s32): Likewise.
30842 (vmlsl_high_n_u16): Likewise.
30843 (vmlsl_high_n_u32): Likewise.
30845 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
30847 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_hi_n
30848 builtin generator macros.
30849 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_hi_n<mode>_insn):
30851 (aarch64_<su>mlal_hi_n<mode>): Define.
30852 * config/aarch64/arm_neon.h (vmlal_high_n_s16): Use RTL builtin
30853 instead of inline asm.
30854 (vmlal_high_n_s32): Likewise.
30855 (vmlal_high_n_u16): Likewise.
30856 (vmlal_high_n_u32): Likewise.
30858 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
30860 * config/aarch64/aarch64-simd-builtins.def: Add RTL builtin
30862 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal_hi<mode>):
30864 (aarch64_<su>mlal_hi<mode>_insn): This.
30865 (aarch64_<su>mlal_hi<mode>): Define.
30866 * config/aarch64/arm_neon.h (vmlal_high_s8): Use RTL builtin
30867 instead of inline asm.
30868 (vmlal_high_s16): Likewise.
30869 (vmlal_high_s32): Likewise.
30870 (vmlal_high_u8): Likewise.
30871 (vmlal_high_u16): Likewise.
30872 (vmlal_high_u32): Likewise.
30874 2021-02-03 Ilya Leoshkevich <iii@linux.ibm.com>
30876 * lra-spills.c (remove_pseudos): Call lra_update_insn_recog_data()
30877 after calling alter_subreg() on a (mem).
30879 2021-02-03 Martin Liska <mliska@suse.cz>
30882 * lto-streamer-out.c (produce_lto_section): Fill up missing
30884 * lto-streamer.h (struct lto_section): Add _padding field.
30886 2021-02-03 Richard Biener <rguenther@suse.de>
30888 * lto-streamer.c (lto_get_section_name): Free temporary
30890 * tree-loop-distribution.c
30891 (loop_distribution::merge_dep_scc_partitions): Free edge data.
30893 2021-02-03 Jakub Jelinek <jakub@redhat.com>
30895 PR middle-end/97487
30896 * ifcvt.c (noce_can_force_operand): New function.
30897 (noce_emit_move_insn): Use it.
30898 (noce_try_sign_mask): Likewise. Formatting fix.
30900 2021-02-03 Jakub Jelinek <jakub@redhat.com>
30902 PR middle-end/97971
30903 * lra-constraints.c (process_alt_operands): For inline asm, don't call
30904 fatal_insn, but instead return false.
30906 2021-02-03 Jakub Jelinek <jakub@redhat.com>
30908 PR tree-optimization/98287
30909 * config/i386/mmx.md (<insn><mode>3): For shifts don't enable expander
30912 2021-02-03 Tamar Christina <tamar.christina@arm.com>
30914 PR tree-optimization/98928
30915 * tree-vect-loop.c (vect_analyze_loop_2): Change
30916 STMT_VINFO_SLP_VECT_ONLY to STMT_VINFO_SLP_VECT_ONLY_PATTERN.
30917 * tree-vect-slp-patterns.c (complex_pattern::build): Likewise.
30918 * tree-vectorizer.h (STMT_VINFO_SLP_VECT_ONLY_PATTERN): New.
30919 (class _stmt_vec_info): Add slp_vect_pattern_only_p.
30921 2021-02-02 Richard Biener <rguenther@suse.de>
30923 * gimple-loop-interchange.cc (prepare_data_references):
30925 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
30926 * tree-ssa-loop-im.c (hoist_memory_references): Likewise.
30927 * tree-vect-stmts.c (vectorizable_condition): Do not
30929 (vectorizable_comparison): Likewise.
30931 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30933 * config/aarch64/aarch64-simd-builtins.def (ursqrte): Define builtin.
30934 * config/aarch64/aarch64-simd.md (aarch64_ursqrte<mode>): New pattern.
30935 * config/aarch64/arm_neon.h (vrsqrte_u32): Reimplement using builtin.
30936 (vrsqrteq_u32): Likewise.
30938 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30940 * config/aarch64/aarch64-simd-builtins.def (sqxtun2): Define builtin.
30941 * config/aarch64/aarch64-simd.md (aarch64_sqxtun2<mode>_le): Define.
30942 (aarch64_sqxtun2<mode>_be): Likewise.
30943 (aarch64_sqxtun2<mode>): Likewise.
30944 * config/aarch64/arm_neon.h (vqmovun_high_s16): Reimplement using builtin.
30945 (vqmovun_high_s32): Likewise.
30946 (vqmovun_high_s64): Likewise.
30947 * config/aarch64/iterators.md (UNSPEC_SQXTUN2): Define.
30949 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30951 * config/aarch64/aarch64-simd-builtins.def (bfdot_lane, bfdot_laneq): Use
30953 (bfmlalb_lane, bfmlalt_lane, bfmlalb_lane_q, bfmlalt_lane_q): Use FP flags.
30955 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30957 * config/aarch64/aarch64-simd-builtins.def (fcmla_lane0, fcmla_lane90,
30958 fcmla_lane180, fcmla_lane270, fcmlaq_lane0, fcmlaq_lane90, fcmlaq_lane180,
30959 fcmlaq_lane270, scvtf, ucvtf, fcvtzs, fcvtzu, scvtfsi, scvtfdi, ucvtfsi,
30960 ucvtfdi, fcvtzshf, fcvtzuhf, fmlal_lane_low, fmlsl_lane_low,
30961 fmlal_laneq_low, fmlsl_laneq_low, fmlalq_lane_low, fmlslq_lane_low,
30962 fmlalq_laneq_low, fmlslq_laneq_low, fmlal_lane_high, fmlsl_lane_high,
30963 fmlal_laneq_high, fmlsl_laneq_high, fmlalq_lane_high, fmlslq_lane_high,
30964 fmlalq_laneq_high, fmlslq_laneq_high): Use FP flags.
30966 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30968 * config/aarch64/aarch64-builtins.c (FLAG_LOAD): Define.
30969 * config/aarch64/aarch64-simd-builtins.def (ld1x2, ld2, ld3, ld4, ld2r,
30970 ld3r, ld4r, ld1, ld1x3, ld1x4): Use LOAD flags.
30972 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30974 * config/aarch64/aarch64-simd-builtins.def (combine, zip1, zip2,
30975 uzp1, uzp2, trn1, trn2, simd_bsl): Use AUTO_FP flags.
30977 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30979 * config/aarch64/aarch64-simd-builtins.def (clrsb, clz, ctz, popcount,
30980 vec_smult_lane_, vec_smlal_lane_, vec_smult_laneq_, vec_smlal_laneq_,
30981 vec_umult_lane_, vec_umlal_lane_, vec_umult_laneq_, vec_umlal_laneq_,
30982 ashl, sshl, ushl, srshl, urshl, sdot_lane, udot_lane, sdot_laneq,
30983 udot_laneq, usdot_lane, usdot_laneq, sudot_lane, sudot_laneq, ashr,
30984 ashr_simd, lshr, lshr_simd, srshr_n, urshr_n, ssra_n, usra_n, srsra_n,
30985 ursra_n, sshll_n, ushll_n, sshll2_n, ushll2_n, ssri_n, usri_n, ssli_n,
30986 ssli_n, usli_n, bswap, rbit, simd_bsl, eor3q, rax1q, xarq, bcaxq): Use
30987 NONE builtin flags.
30989 2021-02-02 Jakub Jelinek <jakub@redhat.com>
30991 PR tree-optimization/98848
30992 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Punt if
30993 STMT_VINFO_DEF_TYPE (last_stmt_info) is vect_reduction_def.
30995 2021-02-02 Kito Cheng <kito.cheng@sifive.com>
30998 * expr.c: Check mode before calling store_expr.
31000 2021-02-02 Christophe Lyon <christophe.lyon@linaro.org>
31002 * config/arm/iterators.md (supf): Remove VORNQ_S and VORNQ_U.
31004 * config/arm/mve.md (mve_vornq_s<mode>): New entry for vorn
31005 instruction using expression ior.
31006 (mve_vornq_u<mode>): New expander.
31007 (mve_vornq_f<mode>): Use ior code instead of unspec.
31008 * config/arm/unspecs.md (VORNQ_S, VORNQ_U, VORNQ_F): Remove.
31010 2021-02-02 Alexandre Oliva <oliva@adacore.com>
31012 * tree-nested.c (convert_nonlocal_reference_op): Move
31013 current_function_decl restore after re-gimplification.
31014 (convert_local_reference_op): Likewise.
31016 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31018 * config/aarch64/aarch64-simd-builtins.def (rshrn, rshrn2):
31020 * config/aarch64/aarch64-simd.md (aarch64_rshrn<mode>_insn_le):
31022 (aarch64_rshrn<mode>_insn_be): Likewise.
31023 (aarch64_rshrn<mode>): Likewise.
31024 (aarch64_rshrn2<mode>_insn_le): Likewise.
31025 (aarch64_rshrn2<mode>_insn_be): Likewise.
31026 (aarch64_rshrn2<mode>): Likewise.
31027 * config/aarch64/aarch64.md (unspec): Add UNSPEC_RSHRN.
31028 * config/aarch64/arm_neon.h (vrshrn_high_n_s16): Reimplement
31030 (vrshrn_high_n_s32): Likewise.
31031 (vrshrn_high_n_s64): Likewise.
31032 (vrshrn_high_n_u16): Likewise.
31033 (vrshrn_high_n_u32): Likewise.
31034 (vrshrn_high_n_u64): Likewise.
31035 (vrshrn_n_s16): Likewise.
31036 (vrshrn_n_s32): Likewise.
31037 (vrshrn_n_s64): Likewise.
31038 (vrshrn_n_u16): Likewise.
31039 (vrshrn_n_u32): Likewise.
31040 (vrshrn_n_u64): Likewise.
31042 2021-02-01 Sergei Trofimovich <siarheit@google.com>
31044 PR tree-optimization/98499
31045 * ipa-modref.c (analyze_ssa_name_flags): treat RVO
31046 conservatively and assume all possible side-effects.
31048 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31050 * config/aarch64/aarch64-simd-builtins.def (vec_unpacks_hi,
31051 vec_unpacku_hi_): Define builtins.
31052 * config/aarch64/arm_neon.h (vmovl_high_s8): Reimplement using
31054 (vmovl_high_s16): Likewise.
31055 (vmovl_high_s32): Likewise.
31056 (vmovl_high_u8): Likewise.
31057 (vmovl_high_u16): Likewise.
31058 (vmovl_high_u32): Likewise.
31060 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31062 * config/aarch64/aarch64-simd-builtins.def (sabdl, uabdl):
31064 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl<mode>): New
31066 * config/aarch64/aarch64.md (unspec): Define UNSPEC_SABDL,
31068 * config/aarch64/arm_neon.h (vabdl_s8): Reimplemet using
31070 (vabdl_s16): Likewise.
31071 (vabdl_s32): Likewise.
31072 (vabdl_u8): Likewise.
31073 (vabdl_u16): Likewise.
31074 (vabdl_u32): Likewise.
31075 * config/aarch64/iterators.md (ABDL): New int iterator.
31076 (sur): Handle UNSPEC_SABDL, UNSPEC_UABDL.
31078 2021-02-01 Martin Sebor <msebor@redhat.com>
31080 * tree.h (BLOCK_VARS): Add comment.
31081 (BLOCK_SUBBLOCKS): Same.
31082 (BLOCK_SUPERCONTEXT): Same.
31083 (BLOCK_ABSTRACT_ORIGIN): Same.
31084 (inlined_function_outer_scope_p): Same.
31086 2021-02-01 Martin Sebor <msebor@redhat.com>
31088 PR middle-end/97172
31089 * attribs.c (attr_access::free_lang_data): Define new function.
31090 * attribs.h (attr_access::free_lang_data): Declare new function.
31092 2021-02-01 Richard Biener <rguenther@suse.de>
31094 * vec.h (auto_vec::auto_vec): Add memory stat parameters
31096 * bitmap.h (auto_bitmap::auto_bitmap): Likewise.
31098 2021-02-01 Tamar Christina <tamar.christina@arm.com>
31100 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>,
31101 aarch64_<su>mlsl<mode>, aarch64_<su>mlsl_n<mode>): Flip mult operands.
31103 2021-02-01 Richard Biener <rguenther@suse.de>
31105 PR rtl-optimization/98863
31106 * config/i386/i386-features.c (convert_scalars_to_vector):
31107 Set DF_RD_PRUNE_DEAD_DEFS.
31109 2021-01-31 Eric Botcazou <ebotcazou@adacore.com>
31111 * system.h (SIZE_MAX): Define if not already defined.
31113 2021-01-30 Aaron Sawdey <acsawdey@linux.ibm.com>
31115 * config/rs6000/genfusion.pl (gen_2logical): New function to
31116 generate patterns for logical-logical fusion.
31117 * config/rs6000/fusion.md: Regenerated patterns.
31118 * config/rs6000/rs6000-cpus.def: Add
31119 OPTION_MASK_P10_FUSION_2LOGICAL.
31120 * config/rs6000/rs6000.c (rs6000_option_override_internal):
31121 Enable logical-logical fusion for p10.
31122 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2logical.
31124 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
31126 * config/rs6000/rs6000.opt: Add periods to new AIX options.
31128 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
31130 * config/rs6000/rs6000.opt (mabi=vec-extabi): New.
31131 (mabi=vec-default): New.
31132 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
31133 __EXTABI__ for AIX Vector extended ABI.
31134 * config/rs6000/rs6000.c (rs6000_debug_reg_global): Print AIX Vector
31136 (conditional_register_usage): If AIX vec_extabi enabled, vs20-vs31
31138 * doc/invoke.texi (PowerPC mabi): Add AIX vec-extabi and vec-default.
31140 2021-01-30 Jakub Jelinek <jakub@redhat.com>
31142 * config/i386/i386-features.c (remove_partial_avx_dependency): Clear
31143 DF_DEFER_INSN_RESCAN after calling df_process_deferred_rescans.
31145 2021-01-29 Vladimir N. Makarov <vmakarov@redhat.com>
31148 * lra-constraints.c (in_class_p): Don't narrow class only for REG
31151 2021-01-29 Will Schmidt <will_schmidt@vnet.ibm.com>
31153 * config/rs6000/rs6000-call.c (rs6000_expand_binup_builtin): Add
31154 clauses for CODE_FOR_vsx_xvcvuxddp_scale and
31155 CODE_FOR_vsx_xvcvsxddp_scale to the parameter checking code.
31157 2021-01-29 Andrew MacLeod <amacleod@redhat.com>
31159 PR tree-optimization/98866
31160 * gimple-range-gori.h (gori_compute:set_range_invariant): New.
31161 * gimple-range-gori.cc (gori_map::set_range_invariant): New.
31162 (gori_map::m_maybe_invariant): Rename from all_outgoing.
31163 (gori_map::gori_map): Rename all_outgoing to m_maybe_invariant.
31164 (gori_map::is_export_p): Ditto.
31165 (gori_map::calculate_gori): Ditto.
31166 (gori_compute::set_range_invariant): New.
31167 * gimple-range.cc (gimple_ranger::range_of_stmt): Set range
31168 invariant for pointers evaluating to [1, +INF].
31170 2021-01-29 Richard Biener <rguenther@suse.de>
31172 PR rtl-optimization/98863
31173 * config/i386/i386-features.c (remove_partial_avx_dependency):
31174 Do not perform DF analysis.
31175 (pass_data_remove_partial_avx_dependency): Remove
31178 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
31180 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_n
31181 builtin generator macros.
31182 * config/aarch64/aarch64-simd.md (aarch64_<su>mull_n<mode>):
31184 * config/aarch64/arm_neon.h (vmull_n_s16): Use RTL builtin
31185 instead of inline asm.
31186 (vmull_n_s32): Likewise.
31187 (vmull_n_u16): Likewise.
31188 (vmull_n_u32): Likewise.
31190 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31192 * config/aarch64/aarch64-simd-builtins.def (sabdl2, uabdl2):
31194 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl2<mode>_3):
31196 (aarch64_<sur>abdl2<mode>): ... This.
31197 (<sur>sadv16qi): Adjust use of above.
31198 * config/aarch64/arm_neon.h (vabdl_high_s8): Reimplement using
31200 (vabdl_high_s16): Likewise.
31201 (vabdl_high_s32): Likewise.
31202 (vabdl_high_u8): Likewise.
31203 (vabdl_high_u16): Likewise.
31204 (vabdl_high_u32): Likewise.
31206 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31208 * config/aarch64/aarch64-simd-builtins.def (sabal2): Define
31210 (uabal2): Likewise.
31211 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal2<mode>): New
31213 * config/aarch64/aarch64.md (unspec): Add UNSPEC_SABAL2 and
31215 * config/aarch64/arm_neon.h (vabal_high_s8): Reimplement using
31217 (vabal_high_s16): Likewise.
31218 (vabal_high_s32): Likewise.
31219 (vabal_high_u8): Likewise.
31220 (vabal_high_u16): Likewise.
31221 (vabal_high_u32): Likewise.
31222 * config/aarch64/iterators.md (ABAL2): New mode iterator.
31223 (sur): Handle UNSPEC_SABAL2, UNSPEC_UABAL2.
31225 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31227 * config/aarch64/aarch64-simd-builtins.def (sabal): Define
31230 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal<mode>_4):
31232 (aarch64_<sur>abal<mode>): ... This
31233 (<sur>sadv16qi): Adust use of the above.
31234 * config/aarch64/arm_neon.h (vabal_s8): Reimplement using
31236 (vabal_s16): Likewise.
31237 (vabal_s32): Likewise.
31238 (vabal_u8): Likewise.
31239 (vabal_u16): Likewise.
31240 (vabal_u32): Likewise.
31242 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31244 * config/aarch64/aarch64-simd-builtins.def (saddlv, uaddlv):
31246 * config/aarch64/aarch64-simd.md (aarch64_<su>addlv<mode>):
31248 * config/aarch64/arm_neon.h (vaddlv_s8): Reimplement using
31250 (vaddlv_s16): Likewise.
31251 (vaddlv_u8): Likewise.
31252 (vaddlv_u16): Likewise.
31253 (vaddlvq_s8): Likewise.
31254 (vaddlvq_s16): Likewise.
31255 (vaddlvq_s32): Likewise.
31256 (vaddlvq_u8): Likewise.
31257 (vaddlvq_u16): Likewise.
31258 (vaddlvq_u32): Likewise.
31259 (vaddlv_s32): Likewise.
31260 (vaddlv_u32): Likewise.
31261 * config/aarch64/iterators.md (VDQV_L): New mode iterator.
31262 (unspec): Add UNSPEC_SADDLV, UNSPEC_UADDLV.
31263 (Vwstype): New mode attribute.
31265 (VWIDE_S): Likewise.
31266 (USADDLV): New int iterator.
31267 (su): Handle UNSPEC_SADDLV, UNSPEC_UADDLV.
31269 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
31271 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_lane[q]
31272 builtin generator macros.
31273 * config/aarch64/aarch64-simd.md (aarch64_vec_<su>mlsl_lane<Qlane>):
31275 * config/aarch64/arm_neon.h (vmlsl_lane_s16): Use RTL builtin
31276 instead of inline asm.
31277 (vmlsl_lane_s32): Likewise.
31278 (vmlsl_lane_u16): Likewise.
31279 (vmlsl_lane_u32): Likewise.
31280 (vmlsl_laneq_s16): Likewise.
31281 (vmlsl_laneq_s32): Likewise.
31282 (vmlsl_laneq_u16): Likewise.
31283 (vmlsl_laneq_u32): Likewise.
31285 2021-01-29 Richard Biener <rguenther@suse.de>
31287 * doc/invoke.texi (--param max-gcse-memory): Document unit
31289 * gcse.c (gcse_or_cprop_is_too_expensive): Adjust.
31290 * params.opt (--param max-gcse-memory): Adjust default and
31291 document unit of size.
31293 2021-01-29 Richard Biener <rguenther@suse.de>
31295 PR rtl-optimization/98863
31296 * gcse.c (gcse_or_cprop_is_too_expensive): Use unsigned
31297 HOST_WIDE_INT for the memory estimate.
31299 2021-01-29 Bin Cheng <bin.cheng@linux.alibaba.com>
31300 Richard Biener <rguenther@suse.de>
31302 PR tree-optimization/97627
31303 * tree-ssa-loop-niter.c (number_of_iterations_exit_assumptions):
31304 Do not analyze fake edges.
31306 2021-01-29 Richard Biener <rguenther@suse.de>
31308 PR rtl-optimization/98144
31309 * df.h (df_mir_bb_info): Add con_visited member.
31310 * df-problems.c (df_mir_alloc): Initialize con_visited,
31311 do not fully populate IN and OUT.
31312 (df_mir_reset): Likewise.
31313 (df_mir_confluence_0): Set con_visited.
31314 (df_mir_confluence_n): Properly handle implicitely
31315 fully populated IN and OUT as designated by con_visited
31316 and update con_visited accordingly.
31318 2021-01-29 Jakub Jelinek <jakub@redhat.com>
31321 * config/arm/vec-common.md (mve_vshlq_<supf><mode>,
31322 vashl<mode>3, vashr<mode>3, vlshr<mode>3): Add
31323 && !TARGET_REALLY_IWMMXT to conditions.
31325 2021-01-29 Jakub Jelinek <jakub@redhat.com>
31328 * cfgbuild.c (find_bb_boundaries): Reset debug_insn when seeing
31331 2021-01-28 Marek Polacek <polacek@redhat.com>
31334 * stor-layout.c (finalize_type_size): If we reset TYPE_USER_ALIGN in
31335 the main variant, maybe reset it in its variants too.
31336 * tree.c (check_base_type): Return true only if TYPE_USER_ALIGN match.
31337 (check_aligned_type): Check if TYPE_USER_ALIGN match.
31339 2021-01-28 Christophe Lyon <christophe.lyon@linaro.org>
31342 * config/arm/arm.c (arm_rtx_costs_internal): Adjust cost of vector
31343 of constant zero for comparisons.
31345 2021-01-28 Michael Meissner <meissner@linux.ibm.com>
31347 * config/rs6000/rs6000.c (rs6000_mangle_decl_assembler_name): Add
31348 support for mapping built-in function names for long double
31349 built-in functions if long double is IEEE 128-bit.
31351 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
31353 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_n
31354 builtin generator macros.
31355 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_n<mode>):
31357 * config/aarch64/arm_neon.h (vmlsl_n_s16): Use RTL builtin
31358 instead of inline asm.
31359 (vmlsl_n_s32): Likewise.
31360 (vmlsl_n_u16): Likewise.
31361 (vmlsl_n_u32): Likewise.
31363 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
31365 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_n
31366 builtin generator macros.
31367 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>):
31369 * config/aarch64/arm_neon.h (vmlal_n_s16): Use RTL builtin
31370 instead of inline asm.
31371 (vmlal_n_s32): Likewise.
31372 (vmlal_n_u16): Likewise.
31373 (vmlal_n_u32): Likewise.
31375 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31377 * config/aarch64/aarch64-simd-builtins.def (shrn2): Define
31379 * config/aarch64/aarch64-simd.md (aarch64_shrn2<mode>_insn_le):
31381 (aarch64_shrn2<mode>_insn_be): Likewise.
31382 (aarch64_shrn2<mode>): Likewise.
31383 * config/aarch64/arm_neon.h (vshrn_high_n_s16): Reimlplement
31385 (vshrn_high_n_s32): Likewise.
31386 (vshrn_high_n_s64): Likewise.
31387 (vshrn_high_n_u16): Likewise.
31388 (vshrn_high_n_u32): Likewise.
31389 (vshrn_high_n_u64): Likewise.
31391 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31393 * config/aarch64/aarch64-simd-builtins.def (shrn): Define
31395 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le):
31397 (aarch64_shrn<mode>_insn_be): Likewise.
31398 (aarch64_shrn<mode>): Likewise.
31399 * config/aarch64/arm_neon.h (vshrn_n_s16): Reimplement using
31401 (vshrn_n_s32): Likewise.
31402 (vshrn_n_s64): Likewise.
31403 (vshrn_n_u16): Likewise.
31404 (vshrn_n_u32): Likewise.
31405 (vshrn_n_u64): Likewise.
31406 * config/aarch64/iterators.md (vn_mode): New mode attribute.
31408 2021-01-28 Richard Biener <rguenther@suse.de>
31410 PR rtl-optimization/80960
31411 * dse.c (check_mem_read_rtx): Call get_addr on the
31414 2021-01-28 Xionghu Luo <luoxhu@linux.ibm.com>
31415 David Edelsohn <dje.gcc@gmail.com>
31418 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
31419 Don't generate VIEW_CONVERT_EXPR for fcode ALTIVEC_BUILTIN_VEC_INSERT
31421 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
31423 * config/rs6000/rs6000.c (rs6000_expand_vector_set): Remove the
31424 wrapper call rs6000_expand_vector_set_var for cleanup. Call
31425 rs6000_expand_vector_set_var_p9 and rs6000_expand_vector_set_var_p8
31427 (rs6000_expand_vector_set_var): Delete.
31428 (rs6000_expand_vector_set_var_p9): Make static.
31429 (rs6000_expand_vector_set_var_p8): Make static.
31431 2021-01-28 Xing GUO <higuoxing@gmail.com>
31433 * common/config/riscv/riscv-common.c
31434 (riscv_subset_list::parsing_subset_version): Fix -march option parsing
31435 when `p` extension exists.
31437 2021-01-27 Vladimir N. Makarov <vmakarov@redhat.com>
31439 PR rtl-optimization/97684
31440 * ira.c (ira): Call ira_set_pseudo_classes before
31441 update_equiv_regs when it is necessary.
31443 2021-01-27 Jakub Jelinek <jakub@redhat.com>
31446 * config/aarch64/aarch64.md (*aarch64_bfxilsi_uxtw): Use
31447 %w0, %w1 and %2 instead of %0, %1 and %2.
31449 2021-01-27 Aaron Sawdey <acsawdey@linux.ibm.com>
31451 * config/rs6000/genfusion.pl: New script to generate
31452 define_insn_and_split patterns so combine can arrange fused
31453 instructions next to each other.
31454 * config/rs6000/fusion.md: New file, generated fused instruction
31455 patterns for combine.
31456 * config/rs6000/predicates.md (const_m1_to_1_operand): New predicate.
31457 (non_update_memory_operand): New predicate.
31458 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_P10_FUSION and
31459 OPTION_MASK_P10_FUSION_LD_CMPI to ISA_3_1_MASKS_SERVER and
31461 * config/rs6000/rs6000-protos.h (address_is_non_pfx_d_or_x): Add
31463 * config/rs6000/rs6000.c (rs6000_option_override_internal):
31464 Automatically set OPTION_MASK_P10_FUSION and
31465 OPTION_MASK_P10_FUSION_LD_CMPI if target is power10.
31466 (rs600_opt_masks): Allow -mpower10-fusion
31467 in function attributes.
31468 (address_is_non_pfx_d_or_x): New function.
31469 * config/rs6000/rs6000.h: Add MASK_P10_FUSION.
31470 * config/rs6000/rs6000.md: Include fusion.md.
31471 * config/rs6000/rs6000.opt: Add -mpower10-fusion
31472 and -mpower10-fusion-ld-cmpi.
31473 * config/rs6000/t-rs6000: Add dependencies involving fusion.md.
31475 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
31477 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal
31478 builtin generator macros.
31479 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal<mode>):
31481 (aarch64_<su>mlal<mode>): This.
31482 * config/aarch64/arm_neon.h (vmlal_s8): Use RTL builtin
31483 instead of inline asm.
31484 (vmlal_s16): Likewise.
31485 (vmlal_s32): Likewise.
31486 (vmlal_u8): Likewise.
31487 (vmlal_u16): Likewise.
31488 (vmlal_u32): Likewise.
31490 2021-01-27 Richard Biener <rguenther@suse.de>
31492 PR tree-optimization/98854
31493 * tree-vect-slp.c (vect_build_slp_tree_2): Also build
31494 PHIs from scalars when the number of CTORs matches the
31495 number of children.
31497 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
31499 * config/aarch64/aarch64-simd-builtins.def: Add mls_n builtin
31501 * config/aarch64/aarch64-simd.md (*aarch64_mls_elt_merge<mode>):
31503 (aarch64_mls_n<mode>): This.
31504 * config/aarch64/arm_neon.h (vmls_n_s16): Use RTL builtin
31506 (vmls_n_s32): Likewise.
31507 (vmls_n_u16): Likewise.
31508 (vmls_n_u32): Likewise.
31509 (vmlsq_n_s16): Likewise.
31510 (vmlsq_n_s32): Likewise.
31511 (vmlsq_n_u16): Likewise.
31512 (vmlsq_n_u32): Likewise.
31514 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
31516 * config/aarch64/aarch64-simd-builtins.def: Add mls builtin
31518 * config/aarch64/arm_neon.h (vmls_s8): Use RTL builtin rather
31520 (vmls_s16): Likewise.
31521 (vmls_s32): Likewise.
31522 (vmls_u8): Likewise.
31523 (vmls_u16): Likewise.
31524 (vmls_u32): Likewise.
31525 (vmlsq_s8): Likewise.
31526 (vmlsq_s16): Likewise.
31527 (vmlsq_s32): Likewise.
31528 (vmlsq_u8): Likewise.
31529 (vmlsq_u16): Likewise.
31530 (vmlsq_u32): Likewise.
31532 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
31534 * config/aarch64/aarch64-simd-builtins.def: Add mla_n builtin
31536 * config/aarch64/aarch64-simd.md (*aarch64_mla_elt_merge<mode>):
31538 (aarch64_mla_n<mode>): This.
31539 * config/aarch64/arm_neon.h (vmla_n_s16): Use RTL builtin
31541 (vmla_n_s32): Likewise.
31542 (vmla_n_u16): Likewise.
31543 (vmla_n_u32): Likewise.
31544 (vmlaq_n_s16): Likewise.
31545 (vmlaq_n_s32): Likewise.
31546 (vmlaq_n_u16): Likewise.
31547 (vmlaq_n_u32): Likewise.
31549 2021-01-27 liuhongt <hongtao.liu@intel.com>
31552 * config/i386/sse.md (sse2_gt<mode>3): Drop !TARGET_XOP in condition.
31553 (*sse2_eq<mode>3): Ditto.
31555 2021-01-27 Jakub Jelinek <jakub@redhat.com>
31557 * tree-pass.h (PROP_trees): Rename to ...
31558 (PROP_gimple): ... this.
31559 * cfgexpand.c (pass_data_expand): Replace PROP_trees with PROP_gimple.
31560 * passes.c (execute_function_dump, execute_function_todo,
31561 execute_one_ipa_transform_pass, execute_one_pass): Likewise.
31562 * varpool.c (ctor_for_folding): Likewise.
31564 2021-01-27 Jakub Jelinek <jakub@redhat.com>
31566 PR tree-optimization/97260
31567 * varpool.c: Include tree-pass.h.
31568 (ctor_for_folding): In GENERIC return DECL_INITIAL for TREE_READONLY
31569 non-TREE_SIDE_EFFECTS automatic variables.
31571 2021-01-26 Paul Fee <paul.f.fee@gmail.com>
31573 * doc/cpp.texi (__cplusplus): Document value for -std=c++23
31575 * doc/invoke.texi: Document -std=c++23 and -std=gnu++23.
31576 * dwarf2out.c (highest_c_language): Recognise C++20 and C++23.
31577 (gen_compile_unit_die): Recognise C++23.
31579 2021-01-26 Jakub Jelinek <jakub@redhat.com>
31582 * dwarf2asm.c (dw2_assemble_integer): Cast DWARF2_ADDR_SIZE to int
31585 2021-01-26 Jakub Jelinek <jakub@redhat.com>
31588 * config/aarch64/aarch64.c (aarch64_mask_and_shift_for_ubfiz_p):
31589 Use UINTVAL (shft_amnt) and UINTVAL (mask) instead of INTVAL (shft_amnt)
31590 and INTVAL (mask). Add && INTVAL (mask) > 0 condition.
31592 2021-01-26 Richard Biener <rguenther@suse.de>
31594 * gimple-pretty-print.c (dump_binary_rhs): Handle
31595 VEC_WIDEN_{PLUS,MINUS}_{LO,HI}_EXPR.
31597 2021-01-26 Richard Biener <rguenther@suse.de>
31599 PR middle-end/98726
31600 * tree.h (vector_cst_int_elt): Remove.
31601 * tree.c (vector_cst_int_elt): Use poly_wide_int for computations,
31604 2021-01-26 Andrew Stubbs <ams@codesourcery.com>
31606 * config/gcn/gcn.c (gcn_expand_reduc_scalar): Use move instructions
31607 for V64DFmode min/max reductions.
31609 2021-01-26 Jakub Jelinek <jakub@redhat.com>
31611 * dwarf2asm.c (dw2_assemble_integer): Handle size twice as large
31612 as DWARF2_ADDR_SIZE if x is not a scalar int by emitting it as
31613 two halves, one with x and the other with const0_rtx, ordered
31614 depending on endianity.
31616 2021-01-26 Alexandre Oliva <oliva@adacore.com>
31618 * gimplify.c (gimplify_decl_expr): Skip asan marking calls for
31619 temporaries not seen in binding block, and not about to be
31620 added as gimple variables.
31622 2021-01-25 Martin Sebor <msebor@redhat.com>
31625 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Adjust warning text.
31627 2021-01-25 Martin Liska <mliska@suse.cz>
31629 * value-prof.c (get_nth_most_common_value): Use %s instead
31632 2021-01-25 Jakub Jelinek <jakub@redhat.com>
31635 * configure.ac (HAVE_AS_GDWARF_5_DEBUG_FLAG): Only define if
31636 readelf -wi is able to read the emitted .debug_info back.
31637 * configure: Regenerated.
31639 2021-01-25 Martin Liska <mliska@suse.cz>
31641 PR gcov-profile/98739
31642 * common.opt: Add missing sign symbol.
31643 * value-prof.c (get_nth_most_common_value): Restore handling
31644 of PROFILE_REPRODUCIBILITY_PARALLEL_RUNS and
31645 PROFILE_REPRODUCIBILITY_MULTITHREADED.
31647 2021-01-25 Richard Biener <rguenther@suse.de>
31649 PR middle-end/98807
31650 * tree.c (vector_element_bits): Always use precision of
31651 the element type for boolean vectors.
31653 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
31655 * config/rtems.h (STARTFILE_SPEC): Remove qnolinkcmds.
31656 (ENDFILE_SPEC): Evaluate qnolinkcmds.
31658 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
31660 * config/rtems.h (STARTFILE_SPEC): Remove nostdlib and
31661 nostartfiles handling since this is already done by
31662 LINK_COMMAND_SPEC. Evaluate qnolinkcmds.
31663 (ENDFILE_SPEC): Remove nostdlib and nostartfiles handling since this
31664 is already done by LINK_COMMAND_SPEC.
31665 (LIB_SPECS): Remove nostdlib and nodefaultlibs handling since
31666 this is already done by LINK_COMMAND_SPEC. Remove qnolinkcmds
31669 2021-01-25 Jakub Jelinek <jakub@redhat.com>
31672 * fold-const-call.c (host_size_t_cst_p): Renamed to ...
31673 (size_t_cst_p): ... this. Check and store unsigned HOST_WIDE_INT
31674 value rather than host size_t.
31675 (fold_const_call): Change type of s2 from size_t to
31676 unsigned HOST_WIDE_INT. Use size_t_cst_p instead of
31677 host_size_t_cst_p. For strncmp calls, pass MIN (s2, SIZE_MAX)
31678 instead of s2 as last argument.
31680 2021-01-25 Tamar Christina <tamar.christina@arm.com>
31682 * config/arm/iterators.md (rotsplit1, rotsplit2, conj_op, fcmac1,
31683 VCMLA_OP, VCMUL_OP): New.
31684 * config/arm/mve.md (mve_vcmlaq<mve_rot><mode>): Support vec_dup 0.
31685 * config/arm/neon.md (cmul<conj_op><mode>3): New.
31686 * config/arm/unspecs.md (UNSPEC_VCMLA_CONJ, UNSPEC_VCMLA180_CONJ,
31687 UNSPEC_VCMUL_CONJ): New.
31688 * config/arm/vec-common.md (cmul<conj_op><mode>3, arm_vcmla<rot><mode>,
31689 cml<fcmac1><conj_op><mode>4): New.
31691 2021-01-23 Jakub Jelinek <jakub@redhat.com>
31694 * config/rs6000/mmintrin.h (__m64): Add __may_alias__ attribute.
31696 2021-01-22 Jonathan Wright <jonathan.wright@arm.com>
31698 * config/aarch64/aarch64-simd-builtins.def: Add mla builtin
31700 * config/aarch64/arm_neon.h (vmla_s8): Use RTL builtin rather
31702 (vmla_s16): Likewise.
31703 (vmla_s32): Likewise.
31704 (vmla_u8): Likewise.
31705 (vmla_u16): Likewise.
31706 (vmla_u32): Likewise.
31707 (vmlaq_s8): Likewise.
31708 (vmlaq_s16): Likewise.
31709 (vmlaq_s32): Likewise.
31710 (vmlaq_u8): Likewise.
31711 (vmlaq_u16): Likewise.
31712 (vmlaq_u32): Likewise.
31714 2021-01-22 David Malcolm <dmalcolm@redhat.com>
31716 * doc/invoke.texi (GCC_EXTRA_DIAGNOSTIC_OUTPUT): Add @findex
31719 2021-01-22 Jakub Jelinek <jakub@redhat.com>
31722 * dwarf2out.c (output_file_names): For -gdwarf-5, if there are no
31723 filenames to emit, still emit the required 0 index directory and
31724 filename entries that match DW_AT_comp_dir and DW_AT_name of the
31727 2021-01-22 Marek Polacek <polacek@redhat.com>
31730 * doc/invoke.texi: Update C++ ABI Version 15 description.
31732 2021-01-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31734 PR tree-optimization/98766
31735 * tree-ssa-math-opts.c (convert_mult_to_fma): Use maybe_le when
31736 comparing against type size with param_avoid_fma_max_bits.
31738 2021-01-22 Richard Biener <rguenther@suse.de>
31740 PR middle-end/98793
31741 * tree.c (vector_element_bits): Key single-bit bool vector on
31742 integer mode rather than not vector mode.
31744 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
31747 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
31748 Generate ARRAY_REF(VIEW_CONVERT_EXPR) for P8 and later
31750 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): Update
31751 to call different path for P8 and P9.
31752 (rs6000_expand_vector_set_var_p9): New function.
31753 (rs6000_expand_vector_set_var_p8): New function.
31755 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
31759 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
31760 Ajdust variable index vec_insert from address dereference to
31761 ARRAY_REF(VIEW_CONVERT_EXPR) tree expression.
31762 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
31764 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): New function.
31766 2021-01-22 Martin Liska <mliska@suse.cz>
31768 PR gcov-profile/98739
31769 * profile.c (compute_value_histograms): Drop time profile for
31770 -fprofile-reproducible=multithreaded.
31772 2021-01-22 Nathan Sidwell <nathan@acm.org>
31774 * gcc.c (process_command): Don't check OPT_SPECIAL_input_file
31777 2021-01-22 Richard Biener <rguenther@suse.de>
31779 PR middle-end/98773
31780 * tree-data-ref.c (initalize_matrix_A): Revert previous
31781 change, retaining failing on HOST_WIDE_INT_MIN CHREC_RIGHT.
31783 2021-01-22 Jakub Jelinek <jakub@redhat.com>
31785 PR tree-optimization/90248
31786 * match.pd (X cmp 0.0 ? 1.0 : -1.0 -> copysign(1, +-X),
31787 X cmp 0.0 ? -1.0 : +1.0 -> copysign(1, -+X)): Remove
31789 (X * (X cmp 0.0 ? 1.0 : -1.0) -> +-abs(X),
31790 X * (X cmp 0.0 ? -1.0 : 1.0) -> +-abs(X)): New simplifications.
31792 2021-01-22 Jakub Jelinek <jakub@redhat.com>
31794 PR tree-optimization/98255
31795 * tree-dfa.c (get_ref_base_and_extent): For ARRAY_REFs, sign
31796 extend index - low_bound from sizetype's precision rather than index
31798 (get_addr_base_and_unit_offset_1): Likewise.
31799 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Likewise.
31800 * gimple-fold.c (fold_const_aggregate_ref_1): Likewise.
31802 2021-01-22 Richard Biener <rguenther@suse.de>
31804 PR tree-optimization/98786
31805 * tree-ssa-phiopt.c (factor_out_conditional_conversion): Avoid
31806 adding new uses of abnormals. Verify we deal with a conditional
31809 2021-01-22 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
31812 * optc-save-gen.awk: Add arm_fp16_format to checked_options.
31814 2021-01-22 liuhongt <hongtao.liu@intel.com>
31818 * config/i386/sse.md (VI_128_256): New mode iterator.
31819 (*avx_cmp<mode>3_1, *avx_cmp<mode>3_2, *avx_cmp<mode>3_3,
31820 *avx_cmp<mode>3_4, *avx2_eq<mode>3, *avx2_pcmp<mode>3_1,
31821 *avx2_pcmp<mode>3_2, *avx2_gt<mode>3): New
31822 define_insn_and_split to lower avx512 vector comparison to avx
31823 version when dest is vector.
31824 (*<avx512>_cmp<mode>3,*<avx512>_cmp<mode>3,*<avx512>_ucmp<mode>3):
31825 define_insn_and_split for negating the comparison result.
31826 * config/i386/predicates.md (float_vector_all_ones_operand):
31828 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Use
31829 general NOT operator without UNSPEC_MASKOP.
31831 2021-01-21 Vladimir N. Makarov <vmakarov@redhat.com>
31833 PR rtl-optimization/98777
31834 * lra-int.h (lra_pmode_pseudo): New extern.
31835 * lra.c (lra_pmode_pseudo): New global.
31837 * lra-eliminations.c (eliminate_regs_in_insn): Use it.
31839 2021-01-21 Ilya Leoshkevich <iii@linux.ibm.com>
31841 * fwprop.c (fwprop_propagation::classify_result): Allow
31842 (subreg (mem)) simplifications.
31844 2021-01-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31846 * config/aarch64/aarch64-simd.md (aarch64_sqdml<SBINQOPS:as>l<mode>):
31848 (aarch64_sqdmlal<mode>): ... This...
31849 (aarch64_sqdmlsl<mode>): ... And this.
31850 (aarch64_sqdml<SBINQOPS:as>l_lane<mode>): Split into...
31851 (aarch64_sqdmlal_lane<mode>): ... This...
31852 (aarch64_sqdmlsl_lane<mode>): ... And this.
31853 (aarch64_sqdml<SBINQOPS:as>l_laneq<mode>): Split into...
31854 (aarch64_sqdmlsl_laneq<mode>): ... This...
31855 (aarch64_sqdmlal_laneq<mode>): ... And this.
31856 (aarch64_sqdml<SBINQOPS:as>l_n<mode>): Split into...
31857 (aarch64_sqdmlsl_n<mode>): ... This...
31858 (aarch64_sqdmlal_n<mode>): ... And this.
31859 (aarch64_sqdml<SBINQOPS:as>l2<mode>_internal): Split into...
31860 (aarch64_sqdmlal2<mode>_internal): ... This...
31861 (aarch64_sqdmlsl2<mode>_internal): ... And this.
31863 2021-01-21 Christophe Lyon <christophe.lyon@linaro.org>
31865 * config/arm/arm_mve.h (__arm_vcmpneq_s8): Fix return type.
31867 2021-01-21 Andrea Corallo <andrea.corallo@arm.com>
31870 * doc/sourcebuild.texi (arm_thumb2_no_arm_v8_1_lob): Document.
31872 2021-01-21 liuhongt <hongtao.liu@intel.com>
31874 PR rtl-optimization/98694
31875 * regcprop.c (copy_value): If SRC had been assigned a mode
31876 narrower than the copy, we can't link DEST into the chain even
31877 they have same hard_regno_nregs(i.e. HImode/SImode in i386
31880 2021-01-20 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31882 * config/aarch64/aarch64-simd.md (aarch64_get_lane<mode>):
31883 Convert to define_insn_and_split. Split into simple move when moving
31886 2021-01-20 Segher Boessenkool <segher@kernel.crashing.org>
31888 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_store): Change assert.
31889 Adjust comment. Simplify code.
31891 2021-01-20 Jakub Jelinek <jakub@redhat.com>
31894 * dwarf2out.c (reset_indirect_string): Also reset indirect strings
31895 with DW_FORM_line_strp form.
31896 (prune_unused_types_update_strings): Don't add into debug_str_hash
31897 indirect strings with DW_FORM_line_strp form.
31898 (adjust_name_comp_dir): New function.
31899 (dwarf2out_finish): Call it on CU DIEs after resetting
31900 debug_line_str_hash.
31902 2021-01-20 Vladimir N. Makarov <vmakarov@redhat.com>
31904 PR rtl-optimization/98722
31905 * lra-eliminations.c (eliminate_regs_in_insn): Check that target
31906 has no 3-op add insn to transform insns containing two pluses.
31908 2021-01-20 Richard Biener <rguenther@suse.de>
31910 * hwint.h (add_hwi): New function.
31911 (mul_hwi): Likewise.
31912 * tree-data-ref.c (initialize_matrix_A): Properly translate
31913 tree constants and avoid HOST_WIDE_INT_MIN.
31914 (lambda_matrix_row_add): Avoid undefined integer overflow
31915 and return true on such overflow.
31916 (lambda_matrix_right_hermite): Handle overflow from
31917 lambda_matrix_row_add gracefully. Simplify previous fix.
31918 (analyze_subscript_affine_affine): Likewise.
31920 2021-01-20 Eugene Rozenfeld <erozen@microsoft.com>
31922 PR tree-optimization/96674
31923 * match.pd: New patterns: x < y || y == XXX_MIN --> x <= y - 1
31924 x >= y && y != XXX_MIN --> x > y - 1
31926 2021-01-20 Richard Sandiford <richard.sandiford@arm.com>
31928 PR tree-optimization/98535
31929 * tree-vect-slp.c (duplicate_and_interleave): Use quick_grow_cleared.
31930 If the high and low permutes are the same, remove the high permutes
31931 from the working set and only continue with the low ones.
31933 2021-01-20 Jakub Jelinek <jakub@redhat.com>
31935 PR tree-optimization/98721
31936 * builtins.c (access_ref::inform_access): Don't assume
31937 SSA_NAME_IDENTIFIER must be non-NULL. Print messages about
31938 object whenever allocfn is NULL, rather than only when DECL_P
31939 is true. Use %qE instead of %qD for that. Formatting fixes.
31941 2021-01-20 Richard Biener <rguenther@suse.de>
31943 PR tree-optimization/98758
31944 * tree-data-ref.c (int_divides_p): Use lambda_int arguments.
31945 (lambda_matrix_right_hermite): Avoid undefinedness with
31946 signed integer abs and multiplication.
31947 (analyze_subscript_affine_affine): Use lambda_int.
31949 2021-01-20 David Malcolm <dmalcolm@redhat.com>
31952 * dwarf2out.c (output_line_info): Rename static variable
31953 "generation", moving it out of the function to...
31954 (output_line_info_generation): New.
31955 (init_sections_and_labels): Likewise, renaming the variable to...
31956 (init_sections_and_labels_generation): New.
31957 (dwarf2out_c_finalize): Reset the new variables.
31959 2021-01-19 Martin Sebor <msebor@redhat.com>
31961 PR middle-end/98664
31962 * tree-ssa-live.c (remove_unused_scope_block_p): Keep scopes for
31963 all functions, even if they're not declared artificial or inline.
31964 * tree.c (tree_inlined_location): Use macro expansion location
31965 only if scope traversal fails to expose one.
31967 2021-01-19 Richard Sandiford <richard.sandiford@arm.com>
31969 PR rtl-optimization/92294
31970 * alias.c (compare_base_symbol_refs): Take an extra parameter
31971 and add the distance between two symbols to it. Enshrine in
31972 comments that -1 means "either 0 or 1, but we can't tell
31973 which at compile time".
31974 (memrefs_conflict_p): Update call accordingly.
31975 (rtx_equal_for_memref_p): Likewise. Take the distance between symbols
31978 2021-01-19 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31980 * config/aarch64/aarch64-simd-builtins.def (sqshl, uqshl,
31981 sqrshl, uqrshl, sqadd, uqadd, sqsub, uqsub, suqadd, usqadd, sqmovn,
31982 uqmovn, sqxtn2, uqxtn2, sqabs, sqneg, sqdmlal, sqdmlsl, sqdmlal_lane,
31983 sqdmlsl_lane, sqdmlal_laneq, sqdmlsl_laneq, sqdmlal_n, sqdmlsl_n,
31984 sqdmlal2, sqdmlsl2, sqdmlal2_lane, sqdmlsl2_lane, sqdmlal2_laneq,
31985 sqdmlsl2_laneq, sqdmlal2_n, sqdmlsl2_n, sqdmull, sqdmull_lane,
31986 sqdmull_laneq, sqdmull_n, sqdmull2, sqdmull2_lane, sqdmull2_laneq,
31987 sqdmull2_n, sqdmulh, sqrdmulh, sqdmulh_lane, sqdmulh_laneq,
31988 sqrdmulh_lane, sqrdmulh_laneq, sqshrun_n, sqrshrun_n, sqshrn_n,
31989 uqshrn_n, sqrshrn_n, uqrshrn_n, sqshlu_n, sqshl_n, uqshl_n, sqrdmlah,
31990 sqrdmlsh, sqrdmlah_lane, sqrdmlsh_lane, sqrdmlah_laneq, sqrdmlsh_laneq,
31991 sqmovun): Use NONE flags.
31993 2021-01-19 Richard Biener <rguenther@suse.de>
31996 * ipa-modref.c (analyze_stmt): Only record a summary for a
31999 2021-01-19 Richard Biener <rguenther@suse.de>
32001 PR middle-end/98638
32002 * tree-ssanames.c (fini_ssanames): Zero SSA_NAME_DEF_STMT.
32004 2021-01-19 Daniel Hellstrom <daniel@gaisler.com>
32006 * config/sparc/rtemself.h (TARGET_OS_CPP_BUILTINS): Add
32007 built-in define __FIX_LEON3FT_TN0018.
32009 2021-01-19 Richard Biener <rguenther@suse.de>
32012 * tree-inline.c (tree_function_versioning): Set input_location
32013 to UNKNOWN_LOCATION throughout the function.
32015 2021-01-19 Tobias Burnus <tobias@codesourcery.com>
32018 * omp-low.c (lower_omp_target): Handle nonpointer is_device_ptr.
32020 2021-01-19 Martin Jambor <mjambor@suse.cz>
32023 * ipa-sra.c (ssa_name_only_returned_p): New parameter fun. Check
32024 whether non-call exceptions allow removal of a statement.
32025 (isra_analyze_call): Pass the appropriate function to
32026 ssa_name_only_returned_p.
32028 2021-01-19 Geng Qi <gengqi@linux.alibaba.com>
32030 * config/riscv/arch-canonicalize (longext_sort): New function for
32031 sorting 'multi-letter'.
32032 * config/riscv/multilib-generator: Adjusting the loop of 'alt' in
32033 'alts'. The 'arch' may not be the first of 'alts'.
32034 (_expand_combination): Add underline for the 'ext' without '*'.
32035 This is because, a single-letter extension can always be treated well
32036 with a '_' prefix, but it cannot be separated out if it is appended
32039 2021-01-18 Vladimir N. Makarov <vmakarov@redhat.com>
32042 * ira.c (ira): Skip abnormal critical edge splitting.
32044 2021-01-18 Jakub Jelinek <jakub@redhat.com>
32046 PR tree-optimization/98727
32047 * tree-ssa-math-opts.c (match_arith_overflow): Fix up computation of
32048 second .MUL_OVERFLOW operand for signed multiplication with overflow
32049 checking if the second operand of multiplication is not constant.
32051 2021-01-18 David Edelsohn <dje.gcc@gmail.com>
32053 * doc/invoke.texi (-gdwarf): TPF defaults to version 2 and AIX
32054 defaults to version 4.
32056 2021-01-18 David Malcolm <dmalcolm@redhat.com>
32058 * attribs.h (fndecl_dealloc_argno): New decl.
32059 * builtins.c (call_dealloc_argno): Split out second half of
32061 (fndecl_dealloc_argno): New.
32062 * doc/extend.texi (Common Function Attributes): Document the
32063 interaction between the analyzer and the malloc attribute.
32064 * doc/invoke.texi (Static Analyzer Options): Likewise.
32066 2021-01-17 David Edelsohn <dje.gcc@gmail.com>
32068 * config/rs6000/aix71.h (SUBTARGET_OVERRIDE_OPTIONS): Override
32069 dwarf_version to 4.
32070 * config/rs6000/aix72.h (SUBTARGET_OVERRIDE_OPTIONS): Same.
32072 2021-01-17 Martin Jambor <mjambor@suse.cz>
32075 * cgraph.c (clone_of_p): Check also former_clone_of as we climb
32078 2021-01-17 Mark Wielaard <mark@klomp.org>
32080 * common.opt (gdwarf-): Init(5).
32081 * doc/invoke.texi (-gdwarf): Document default to 5.
32083 2021-01-16 Kwok Cheung Yeung <kcy@codesourcery.com>
32085 * builtin-types.def
32086 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT): Rename
32088 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR):
32089 ...this. Add extra argument.
32090 * gimplify.c (omp_default_clause): Ensure that event handle is
32091 firstprivate in a task region.
32092 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_DETACH.
32093 (gimplify_adjust_omp_clauses): Likewise.
32094 * omp-builtins.def (BUILT_IN_GOMP_TASK): Change function type to
32095 BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR.
32096 * omp-expand.c (expand_task_call): Add GOMP_TASK_FLAG_DETACH to flags
32097 if detach clause specified. Add detach argument when generating
32099 * omp-low.c (scan_sharing_clauses): Setup data environment for detach
32101 (finish_taskreg_scan): Move field for variable containing the event
32102 handle to the front of the struct.
32103 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_DETACH. Fix
32105 * tree-nested.c (convert_nonlocal_omp_clauses): Handle
32106 OMP_CLAUSE_DETACH clause.
32107 (convert_local_omp_clauses): Handle OMP_CLAUSE_DETACH clause.
32108 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_DETACH.
32109 * tree.c (omp_clause_num_ops): Add entry for OMP_CLAUSE_DETACH.
32111 (omp_clause_code_name): Add entry for OMP_CLAUSE_DETACH. Fix
32113 (walk_tree_1): Handle OMP_CLAUSE_DETACH.
32115 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
32117 * config/nios2/t-rtems: Reset all MULTILIB_* variables. Shorten
32118 multilib directory names. Use MULTILIB_REQUIRED instead of
32119 MULTILIB_EXCEPTIONS. Add -mhw-mul -mhw-mulx -mhw-div
32120 -mcustom-fpu-cfg=fph2 multilib.
32122 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
32124 * config/nios2/nios2.c (NIOS2_FPU_CONFIG_NUM): Adjust value.
32125 (nios2_init_fpu_configs): Provide register values for new
32126 -mcustom-fpu-cfg=fph2 option variant.
32127 * doc/invoke.texi (-mcustom-fpu-cfg=fph2): Document new option
32130 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
32132 * config/nios2/nios2.c (nios2_custom_check_insns): Remove
32133 custom instruction warnings.
32135 2021-01-16 Jakub Jelinek <jakub@redhat.com>
32137 PR tree-optimization/96669
32138 * match.pd ((CST << x) & 1 -> x == 0): New simplification.
32140 2021-01-16 Jakub Jelinek <jakub@redhat.com>
32142 PR tree-optimization/96271
32143 * passes.def: Pass false argument to first two pass_cd_dce
32144 instances and true to last instance. Add comment that
32145 last instance rewrites no longer addressed locals.
32146 * tree-ssa-dce.c (pass_cd_dce): Add update_address_taken_p member and
32148 (pass_cd_dce::set_pass_param): New method.
32149 (pass_cd_dce::execute): Return TODO_update_address_taken from
32150 last cd_dce instance.
32152 2021-01-15 Carl Love <cel@us.ibm.com>
32154 * config/rs6000/altivec.h (vec_mulh, vec_div, vec_dive, vec_mod):
32156 * config/rs6000/altivec.md (VIlong): Move define to file vsx.md.
32157 * config/rs6000/rs6000-builtin.def (DIVES_V4SI, DIVES_V2DI,
32158 DIVEU_V4SI, DIVEU_V2DI, DIVS_V4SI, DIVS_V2DI, DIVU_V4SI,
32159 DIVU_V2DI, MODS_V2DI, MODS_V4SI, MODU_V2DI, MODU_V4SI,
32160 MULHS_V2DI, MULHS_V4SI, MULHU_V2DI, MULHU_V4SI, MULLD_V2DI):
32161 Add builtin define.
32162 (MULH, DIVE, MOD): Add new BU_P10_OVERLOAD_2 definitions.
32163 * config/rs6000/rs6000-call.c (VSX_BUILTIN_VEC_DIV,
32164 VSX_BUILTIN_VEC_DIVE, P10_BUILTIN_VEC_MOD, P10_BUILTIN_VEC_MULH):
32165 New overloaded definitions.
32166 (builtin_function_type) [P10V_BUILTIN_DIVEU_V4SI,
32167 P10V_BUILTIN_DIVEU_V2DI, P10V_BUILTIN_DIVU_V4SI,
32168 P10V_BUILTIN_DIVU_V2DI, P10V_BUILTIN_MODU_V2DI,
32169 P10V_BUILTIN_MODU_V4SI, P10V_BUILTIN_MULHU_V2DI,
32170 P10V_BUILTIN_MULHU_V4SI]: Add case
32171 statement for builtins.
32172 * config/rs6000/rs6000.md (bits): Add new attribute sizes V4SI, V2DI.
32173 * config/rs6000/vsx.md (VIlong): Moved from config/rs6000/altivec.md.
32174 (UNSPEC_VDIVES, UNSPEC_VDIVEU): New unspec definitions.
32175 (vsx_mul_v2di): Add if TARGET_POWER10 statement.
32176 (vsx_udiv_v2di): Add if TARGET_POWER10 statement.
32177 (dives_<mode>, diveu_<mode>, div<mode>3, uvdiv<mode>3,
32178 mods_<mode>, modu_<mode>, mulhs_<mode>, mulhu_<mode>, mulv2di3):
32179 Add define_insn, mode is VIlong.
32180 * doc/extend.texi (vec_mulh, vec_mul, vec_div, vec_dive, vec_mod):
32181 Add builtin descriptions.
32183 2021-01-15 Eric Botcazou <ebotcazou@adacore.com>
32185 * final.c (final_start_function_1): Reset force_source_line.
32187 2021-01-15 Jakub Jelinek <jakub@redhat.com>
32189 PR tree-optimization/96669
32190 * match.pd (((1 << A) & 1) != 0 -> A == 0,
32191 ((1 << A) & 1) == 0 -> A != 0): Generalize for 1s replaced by
32192 possibly different power of two constants and to right shift too.
32194 2021-01-15 Jakub Jelinek <jakub@redhat.com>
32196 PR tree-optimization/96681
32197 * match.pd ((x < 0) ^ (y < 0) to (x ^ y) < 0): New simplification.
32198 ((x >= 0) ^ (y >= 0) to (x ^ y) < 0): Likewise.
32199 ((x < 0) ^ (y >= 0) to (x ^ y) >= 0): Likewise.
32200 ((x >= 0) ^ (y < 0) to (x ^ y) >= 0): Likewise.
32202 2021-01-15 Alexandre Oliva <oliva@adacore.com>
32204 * opts.c (gen_command_line_string): Exclude -dumpbase-ext.
32206 2021-01-15 Tamar Christina <tamar.christina@arm.com>
32208 * config/aarch64/aarch64-simd.md (cml<fcmac1><conj_op><mode>4,
32209 cmul<conj_op><mode>3): New.
32210 * config/aarch64/iterators.md (UNSPEC_FCMUL,
32211 UNSPEC_FCMUL180, UNSPEC_FCMLA_CONJ, UNSPEC_FCMLA180_CONJ,
32212 UNSPEC_CMLA_CONJ, UNSPEC_CMLA180_CONJ, UNSPEC_CMUL, UNSPEC_CMUL180,
32213 FCMLA_OP, FCMUL_OP, conj_op, rotsplit1, rotsplit2, fcmac1, sve_rot1,
32214 sve_rot2, SVE2_INT_CMLA_OP, SVE2_INT_CMUL_OP, SVE2_INT_CADD_OP): New.
32215 (rot): Add UNSPEC_FCMUL, UNSPEC_FCMUL180.
32216 (rot_op): Renamed to conj_op.
32217 * config/aarch64/aarch64-sve.md (cml<fcmac1><conj_op><mode>4,
32218 cmul<conj_op><mode>3): New.
32219 * config/aarch64/aarch64-sve2.md (cml<fcmac1><conj_op><mode>4,
32220 cmul<conj_op><mode>3): New.
32222 2021-01-15 David Malcolm <dmalcolm@redhat.com>
32226 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
32227 Escape the tempfile name when constructing the expected output.
32229 2021-01-15 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32231 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlsl_hi<mode>):
32233 (aarch64_<su>mlsl_hi<mode>): ... This.
32234 (aarch64_<su>mlsl_hi<mode>): Define.
32235 (*aarch64_<su>mlsl<mode): Rename to...
32236 (aarch64_<su>mlsl<mode): ... This.
32237 * config/aarch64/aarch64-simd-builtins.def (smlsl, umlsl,
32238 smlsl_hi, umlsl_hi): Define builtins.
32239 * config/aarch64/arm_neon.h (vmlsl_high_s8, vmlsl_high_s16,
32240 vmlsl_high_s32, vmlsl_high_u8, vmlsl_high_u16, vmlsl_high_u32,
32241 vmlsl_s8, vmlsl_s16, vmlsl_s32, vmlsl_u8,
32242 vmlsl_u16, vmlsl_u32): Reimplement with builtins.
32244 2021-01-15 Uroš Bizjak <ubizjak@gmail.com>
32246 * config/i386/i386-c.c (ix86_target_macros):
32247 Use cpp_define_formatted for __SIZEOF_FLOAT80__ definition.
32249 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
32252 * config.gcc (aarch64*-*-*): Add aarch64-cc-fusion.o to extra_objs.
32253 * Makefile.in (RTL_SSA_H): New variable.
32254 * config/aarch64/t-aarch64 (aarch64-cc-fusion.o): New rule.
32255 * config/aarch64/aarch64-protos.h (make_pass_cc_fusion): Declare.
32256 * config/aarch64/aarch64-passes.def: Add pass_cc_fusion after
32258 * config/aarch64/aarch64-cc-fusion.cc: New file.
32260 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
32262 * recog.h (insn_change_watermark::~insn_change_watermark): Avoid
32263 calling cancel_changes for changes that no longer exist.
32265 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
32267 * rtl-ssa/functions.h (function_info::ref_defs): Rename to...
32268 (function_info::reg_defs): ...this.
32269 * rtl-ssa/member-fns.inl (function_info::ref_defs): Rename to...
32270 (function_info::reg_defs): ...this.
32272 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32275 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
32277 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32280 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32283 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
32285 2021-01-15 Richard Biener <rguenther@suse.de>
32287 PR tree-optimization/96376
32288 * tree-vect-stmts.c (get_load_store_type): Disregard alignment
32289 for VMAT_INVARIANT.
32291 2021-01-15 Martin Liska <mliska@suse.cz>
32293 * doc/install.texi: Document that some tests need pytest module.
32294 * doc/sourcebuild.texi: Likewise.
32296 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32299 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
32301 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32303 * config/arm/mve.md (mve_vshrq_n_s<mode>_imm): New entry.
32304 (mve_vshrq_n_u<mode>_imm): Likewise.
32305 * config/arm/neon.md (vashr<mode>3, vlshr<mode>3): Move to ...
32306 * config/arm/vec-common.md: ... here.
32308 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32310 * config/arm/mve.md (mve_vshlq_<supf><mode>): Move to
32312 * config/arm/neon.md (vashl<mode>3): Delete.
32313 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): New.
32314 (vasl<mode>3): New expander.
32316 2021-01-15 Richard Biener <rguenther@suse.de>
32318 PR tree-optimization/98685
32319 * tree-vect-slp.c (vect_schedule_slp_node): Refactor handling
32320 of vector extern defs.
32322 2021-01-14 David Malcolm <dmalcolm@redhat.com>
32325 * diagnostic.c (diagnostic_kind_text): Break out this array
32327 (diagnostic_build_prefix): ...here.
32328 (fancy_abort): Detect when diagnostic_initialize has not yet been
32329 called and fall back to a minimal implementation of printing the
32330 ICE, rather than segfaulting in internal_error.
32332 2021-01-14 David Malcolm <dmalcolm@redhat.com>
32334 * diagnostic.c (diagnostic_initialize): Eliminate
32335 parseable_fixits_p in favor of initializing extra_output_kind from
32336 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
32337 (convert_column_unit): New function, split out from...
32338 (diagnostic_converted_column): ...this.
32339 (print_parseable_fixits): Add "column_unit" and "tabstop" params.
32340 Use them to call convert_column_unit on the column values.
32341 (diagnostic_report_diagnostic): Eliminate conditional on
32342 parseable_fixits_p in favor of a switch statement on
32343 extra_output_kind, passing the appropriate values to the new
32344 params of print_parseable_fixits.
32345 (selftest::test_print_parseable_fixits_none): Update for new
32346 params of print_parseable_fixits.
32347 (selftest::test_print_parseable_fixits_insert): Likewise.
32348 (selftest::test_print_parseable_fixits_remove): Likewise.
32349 (selftest::test_print_parseable_fixits_replace): Likewise.
32350 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
32352 (selftest::diagnostic_c_tests): Call it.
32353 * diagnostic.h (enum diagnostics_extra_output_kind): New.
32354 (diagnostic_context::parseable_fixits_p): Delete field in favor
32356 (diagnostic_context::extra_output_kind): ...this new field.
32357 * doc/invoke.texi (Environment Variables): Add
32358 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
32359 * opts.c (common_handle_option): Update handling of
32360 OPT_fdiagnostics_parseable_fixits for change to diagnostic_context
32363 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32365 * tree-vect-slp-patterns.c (class complex_operations_pattern,
32366 complex_operations_pattern::matches,
32367 complex_operations_pattern::recognize,
32368 complex_operations_pattern::build): New.
32369 (slp_patterns): Use it.
32371 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32373 * internal-fn.def (COMPLEX_FMS, COMPLEX_FMS_CONJ): New.
32374 * optabs.def (cmls_optab, cmls_conj_optab): New.
32375 * doc/md.texi: Document them.
32376 * tree-vect-slp-patterns.c (class complex_fms_pattern,
32377 complex_fms_pattern::matches, complex_fms_pattern::recognize,
32378 complex_fms_pattern::build): New.
32380 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32382 * internal-fn.def (COMPLEX_FMA, COMPLEX_FMA_CONJ): New.
32383 * optabs.def (cmla_optab, cmla_conj_optab): New.
32384 * doc/md.texi: Document them.
32385 * tree-vect-slp-patterns.c (vect_match_call_p,
32386 class complex_fma_pattern, vect_slp_reset_pattern,
32387 complex_fma_pattern::matches, complex_fma_pattern::recognize,
32388 complex_fma_pattern::build): New.
32390 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32392 * internal-fn.def (COMPLEX_MUL, COMPLEX_MUL_CONJ): New.
32393 * optabs.def (cmul_optab, cmul_conj_optab): New.
32394 * doc/md.texi: Document them.
32395 * tree-vect-slp-patterns.c (vect_match_call_complex_mla,
32396 vect_normalize_conj_loc, is_eq_or_top, vect_validate_multiplication,
32397 vect_build_combine_node, class complex_mul_pattern,
32398 complex_mul_pattern::matches, complex_mul_pattern::recognize,
32399 complex_mul_pattern::build): New.
32401 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32403 * tree-vect-slp.c (optimize_load_redistribution_1): New.
32404 (optimize_load_redistribution, vect_is_slp_load_node): New.
32405 (vect_match_slp_patterns): Use it.
32407 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32409 * tree-vect-slp-patterns.c (complex_add_pattern::build):
32412 2021-01-14 Thomas Schwinge <thomas@codesourcery.com>
32414 * config/gcn/mkoffload.c (main): Create an offload image only in
32415 64-bit configurations.
32417 2021-01-14 H.J. Lu <hjl.tools@gmail.com>
32420 * config/i386/i386-options.c (ix86_option_override_internal):
32421 Issue an error for -fcf-protection with CF_BRANCH when compiling
32422 for 32-bit non-TARGET_CMOV targets.
32424 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
32427 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p):
32428 Remove declaration and initialization of shadow variable "ret".
32429 (ix86_option_override_internal): Remove delcaration of
32430 shadow variable "i". Redeclare shadowed variable to unsigned.
32431 * common/config/i386/i386-common.c (pta_size): Redeclare to unsigned.
32432 * config/i386/i386-builtins.c (get_builtin_code_for_version):
32433 Update for redeclaration.
32434 * config/i386/i386.h (pta_size): Ditto.
32436 2021-01-14 Richard Biener <rguenther@suse.de>
32438 PR tree-optimization/98674
32439 * tree-data-ref.c (base_supports_access_fn_components_p): New.
32440 (initialize_data_dependence_relation): For two bases without
32441 possible access fns resort to type size equality when determining
32442 shape compatibility.
32444 2021-01-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
32447 * config/arm/arm_neon.h: Replace calls to __builtin_vcge* by
32448 <=, >= operators in vcle and vcge intrinsics respectively.
32449 * config/arm/arm_neon_builtins.def: Remove entry for
32452 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
32455 * config/i386/i386-options.c (ix86_function_specific_save):
32456 Remove redundant assignment to opts->x_ix86_branch_cost.
32457 * config/i386/i386.c (ix86_prefetch_sse):
32458 Rename from x86_prefetch_sse. Update all uses.
32459 * config/i386/i386.h: Update for rename.
32460 * config/i386/i386-options.h: Ditto.
32462 2021-01-14 Jakub Jelinek <jakub@redhat.com>
32465 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3,
32466 *sse4_1_zero_extendv4hiv4si2_3, *sse4_1_zero_extendv2siv2di2_3):
32467 Use Bm instead of m for non-avx. Add isa attribute.
32469 2021-01-14 Jakub Jelinek <jakub@redhat.com>
32471 PR tree-optimization/96688
32472 * match.pd (~(X >> Y) -> ~X >> Y): New simplification if
32473 ~X can be simplified.
32475 2021-01-14 Richard Sandiford <richard.sandiford@arm.com>
32477 * tree-vect-stmts.c (vect_model_load_cost): Account for unused
32478 IFN_LOAD_LANES results.
32480 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32482 * config/aarch64/aarch64-simd.md (aarch64_<su>xtl<mode>):
32484 (aarch64_xtn<mode>): Likewise.
32485 * config/aarch64/aarch64-simd-builtins.def (sxtl, uxtl, xtn):
32488 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
32490 (vmovl_s16): Likewise.
32491 (vmovl_s32): Likewise.
32492 (vmovl_u8): Likewise.
32493 (vmovl_u16): Likewise.
32494 (vmovl_u32): Likewise.
32495 (vmovn_s16): Likewise.
32496 (vmovn_s32): Likewise.
32497 (vmovn_s64): Likewise.
32498 (vmovn_u16): Likewise.
32499 (vmovn_u32): Likewise.
32500 (vmovn_u64): Likewise.
32502 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32504 * config/aarch64/aarch64-simd.md (aarch64_<su>qxtn2<mode>_le):
32506 (aarch64_<su>qxtn2<mode>_be): Likewise.
32507 (aarch64_<su>qxtn2<mode>): Likewise.
32508 * config/aarch64/aarch64-simd-builtins.def (sqxtn2, uqxtn2):
32510 * config/aarch64/iterators.md (SAT_TRUNC): Define code_iterator.
32511 (su): Handle ss_truncate and us_truncate.
32512 * config/aarch64/arm_neon.h (vqmovn_high_s16): Reimplement using
32514 (vqmovn_high_s32): Likewise.
32515 (vqmovn_high_s64): Likewise.
32516 (vqmovn_high_u16): Likewise.
32517 (vqmovn_high_u32): Likewise.
32518 (vqmovn_high_u64): Likewise.
32520 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32522 * config/aarch64/aarch64-simd.md (aarch64_xtn2<mode>_le):
32524 (aarch64_xtn2<mode>_be): Likewise.
32525 (aarch64_xtn2<mode>): Likewise.
32526 * config/aarch64/aarch64-simd-builtins.def (xtn2): Define
32528 * config/aarch64/arm_neon.h (vmovn_high_s16): Reimplement using
32530 (vmovn_high_s32): Likewise.
32531 (vmovn_high_s64): Likewise.
32532 (vmovn_high_u16): Likewise.
32533 (vmovn_high_u32): Likewise.
32534 (vmovn_high_u64): Likewise.
32536 2021-01-13 Stafford Horne <shorne@gmail.com>
32538 * config/or1k/or1k.h (ASM_PREFERRED_EH_DATA_FORMAT): New macro.
32540 2021-01-13 Stafford Horne <shorne@gmail.com>
32542 * config/or1k/linux.h (TARGET_ASM_FILE_END): Define macro.
32544 2021-01-13 Stafford Horne <shorne@gmail.com>
32546 * config/or1k/or1k.h (TARGET_CPU_CPP_BUILTINS): Add builtin
32547 define for __or1k_hard_float__.
32549 2021-01-13 Stafford Horne <shorne@gmail.com>
32551 * config/or1k/or1k.h (NO_PROFILE_COUNTERS): Define as 1.
32552 (PROFILE_HOOK): Define to call _mcount.
32553 (FUNCTION_PROFILER): Change from abort to no-op.
32555 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32557 PR tree-optimization/96691
32558 * match.pd ((~X | C) ^ D -> (X | C) ^ (~D ^ C),
32559 (~X & C) ^ D -> (X & C) ^ (D ^ C)): New simplifications if
32560 (~D ^ C) or (D ^ C) can be simplified.
32562 2021-01-13 Richard Biener <rguenther@suse.de>
32564 PR tree-optimization/92645
32565 * match.pd (BIT_FIELD_REF to conversion): Delay canonicalization
32566 until after vector lowering.
32568 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
32570 * config/aarch64/aarch64-sve.md (fnma<mode>4): Extend from SVE_FULL_I
32572 (@aarch64_pred_fnma<mode>, cond_fnma<mode>, *cond_fnma<mode>_2)
32573 (*cond_fnma<mode>_4, *cond_fnma<mode>_any): Likewise.
32575 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
32577 * config/aarch64/aarch64-sve.md (fma<mode>4): Extend from SVE_FULL_I
32579 (@aarch64_pred_fma<mode>, cond_fma<mode>, *cond_fma<mode>_2)
32580 (*cond_fma<mode>_4, *cond_fma<mode>_any): Likewise.
32582 2021-01-13 Richard Biener <rguenther@suse.de>
32584 PR tree-optimization/92645
32585 * tree-vect-slp.c (vect_build_slp_tree_1): Relax supported
32586 BIT_FIELD_REF argument.
32587 (vect_build_slp_tree_2): Record the desired vector type
32588 on the external vector def.
32589 (vectorizable_slp_permutation): Handle required punning
32590 of existing vector defs.
32592 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
32594 * rtl-ssa/accesses.h (def_lookup): Fix order of comparison results.
32596 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
32598 * config/sh/sh.md (movsf_ie): Remove operands[2] test.
32600 2021-01-13 Samuel Thibault <samuel.thibault@ens-lyon.org>
32602 * config.gcc [$target == *-*-gnu*]: Enable
32603 'default_gnu_indirect_function'.
32605 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32608 * optabs.c (expand_vec_perm_const): Don't force v0 and v1 into
32609 registers before calling targetm.vectorize.vec_perm_const, only after
32611 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const): Handle
32612 two argument permutation when one operand is zero vector and only
32613 after that force operands into registers.
32614 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_1): New
32615 define_insn_and_split pattern.
32616 (*avx512bw_zero_extendv32qiv32hi2_1): Likewise.
32617 (*avx512f_zero_extendv16hiv16si2_1): Likewise.
32618 (*avx2_zero_extendv8hiv8si2_1): Likewise.
32619 (*avx512f_zero_extendv8siv8di2_1): Likewise.
32620 (*avx2_zero_extendv4siv4di2_1): Likewise.
32621 * config/mips/mips.c (mips_vectorize_vec_perm_const): Force operands
32623 * config/arm/arm.c (arm_vectorize_vec_perm_const): Likewise.
32624 * config/sparc/sparc.c (sparc_vectorize_vec_perm_const): Likewise.
32625 * config/ia64/ia64.c (ia64_vectorize_vec_perm_const): Likewise.
32626 * config/aarch64/aarch64.c (aarch64_vectorize_vec_perm_const): Likewise.
32627 * config/rs6000/rs6000.c (rs6000_vectorize_vec_perm_const): Likewise.
32628 * config/gcn/gcn.c (gcn_vectorize_vec_perm_const): Likewise. Use std::swap.
32630 2021-01-13 Martin Liska <mliska@suse.cz>
32632 PR tree-optimization/98455
32633 * gimple-if-to-switch.cc (condition_info::record_phi_mapping):
32634 Record also virtual PHIs.
32635 (pass_if_to_switch::execute): Return TODO_cleanup_cfg only
32638 2021-01-13 Jonathan Wakely <jwakely@redhat.com>
32640 * doc/invoke.texi (C++ Modules): Fix typos.
32642 2021-01-13 Richard Biener <rguenther@suse.de>
32644 PR tree-optimization/98640
32645 * tree-ssa-sccvn.c (visit_nary_op): Do not try to
32646 handle plus or minus from a truncated operand to be
32649 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32652 * config/i386/i386.md (*btr<mode>_1, *btr<mode>_2): New
32653 define_insn_and_split patterns.
32654 (splitter after *btr<mode>_2): New splitter.
32656 2021-01-13 Martin Liska <mliska@suse.cz>
32659 * cgraphunit.c (analyze_functions): Remove dead code.
32661 2021-01-13 Qian Jianhua <qianjh@cn.fujitsu.com>
32663 * config/aarch64/aarch64-cost-tables.h (a64fx_extra_costs): New.
32664 * config/aarch64/aarch64.c (a64fx_addrcost_table): New.
32665 (a64fx_regmove_cost, a64fx_vector_cost): New.
32666 (a64fx_tunings): Use the new added cost tables.
32668 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32671 * config/i386/predicates.md (pmovzx_parallel): New predicate.
32672 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): New
32673 define_insn_and_split pattern.
32674 (*sse4_1_zero_extendv4hiv4si2_3): Likewise.
32675 (*sse4_1_zero_extendv2siv2di2_3): Likewise.
32677 2021-01-13 Julian Brown <julian@codesourcery.com>
32679 * config/gcn/gcn.c (gcn_conditional_register_usage): Remove dead code
32680 to fix v0 register.
32682 2021-01-13 Julian Brown <julian@codesourcery.com>
32684 * config/gcn/gcn.c (gcn_md_reorg): Fix case where EXEC reg is live
32687 2021-01-13 Julian Brown <julian@codesourcery.com>
32689 * config/gcn/gcn-valu.md (recip<mode>2<exec>, recip<mode>2): Use unspec
32690 for reciprocal-approximation instructions.
32691 (div<mode>3): Use fused multiply-accumulate operations for reciprocal
32692 refinement and division result.
32693 * config/gcn/gcn.md (UNSPEC_RCP): New unspec constant.
32695 2021-01-13 Julian Brown <julian@codesourcery.com>
32697 * config/gcn/gcn-valu.md (subdf): Rename to...
32700 2021-01-12 Martin Liska <mliska@suse.cz>
32702 * gcov.c (source_info::debug): Fix printf format for 32-bit hosts.
32704 2021-01-12 Andrea Corallo <andrea.corallo@arm.com>
32706 * function-abi.h: Fix typo.
32708 2021-01-12 Christophe Lyon <christophe.lyon@linaro.org>
32712 * config/arm/arm.h (ARM_HAVE_NEON_V8QI_LDST): New macro.
32713 (ARM_HAVE_NEON_V16QI_LDST, ARM_HAVE_NEON_V4HI_LDST): Likewise.
32714 (ARM_HAVE_NEON_V8HI_LDST, ARM_HAVE_NEON_V2SI_LDST): Likewise.
32715 (ARM_HAVE_NEON_V4SI_LDST, ARM_HAVE_NEON_V4HF_LDST): Likewise.
32716 (ARM_HAVE_NEON_V8HF_LDST, ARM_HAVE_NEON_V4BF_LDST): Likewise.
32717 (ARM_HAVE_NEON_V8BF_LDST, ARM_HAVE_NEON_V2SF_LDST): Likewise.
32718 (ARM_HAVE_NEON_V4SF_LDST, ARM_HAVE_NEON_DI_LDST): Likewise.
32719 (ARM_HAVE_NEON_V2DI_LDST): Likewise.
32720 (ARM_HAVE_V8QI_LDST, ARM_HAVE_V16QI_LDST): Likewise.
32721 (ARM_HAVE_V4HI_LDST, ARM_HAVE_V8HI_LDST): Likewise.
32722 (ARM_HAVE_V2SI_LDST, ARM_HAVE_V4SI_LDST, ARM_HAVE_V4HF_LDST): Likewise.
32723 (ARM_HAVE_V8HF_LDST, ARM_HAVE_V4BF_LDST, ARM_HAVE_V8BF_LDST): Likewise.
32724 (ARM_HAVE_V2SF_LDST, ARM_HAVE_V4SF_LDST, ARM_HAVE_DI_LDST): Likewise.
32725 (ARM_HAVE_V2DI_LDST): Likewise.
32726 * config/arm/mve.md (*movmisalign<mode>_mve_store): New pattern.
32727 (*movmisalign<mode>_mve_load): New pattern.
32728 * config/arm/neon.md (movmisalign<mode>): Move to ...
32729 * config/arm/vec-common.md: ... here.
32731 2021-01-12 Vladimir N. Makarov <vmakarov@redhat.com>
32734 * lra-eliminations.c (eliminate_regs_in_insn): Add transformation
32735 of pattern 'plus (plus (hard reg, const), pseudo)'.
32737 2021-01-12 Richard Biener <rguenther@suse.de>
32739 PR tree-optimization/98550
32740 * tree-vect-slp.c (vect_record_max_nunits): Check whether
32741 the group size is a multiple of the vector element count.
32742 (vect_build_slp_tree_1): When we need to fail because
32743 the vector type choosen causes unrolling do so lazily
32744 without affecting matches only at the end to guide group splitting.
32746 2021-01-12 Martin Liska <mliska@suse.cz>
32749 * optc-save-gen.awk: Compare also n_target_save vars with
32752 2021-01-12 Martin Liska <mliska@suse.cz>
32754 * gcov.c (source_info::debug): New.
32755 (print_usage): Add --debug (-D) option.
32756 (process_args): Likewise.
32757 (generate_results): Call src->debug after
32758 accumulate_line_counts.
32759 (read_graph_file): Properly assign id for EXIT_BLOCK.
32760 * profile.c (branch_prob): Dump function body before it is
32763 2021-01-12 Jakub Jelinek <jakub@redhat.com>
32765 PR tree-optimization/98629
32766 * tree-ssa-math-opts.c (arith_overflow_check_p): Don't update use_stmt
32767 unless returning non-zero.
32769 2021-01-12 Jakub Jelinek <jakub@redhat.com>
32771 PR tree-optimization/95731
32772 * tree-ssa-reassoc.c (optimize_range_tests_cmp_bitwise): Also optimize
32773 x < 0 && y < 0 && z < 0 into (x | y | z) < 0 for signed x, y, z.
32774 (optimize_range_tests): Call optimize_range_tests_cmp_bitwise
32775 only after optimize_range_tests_var_bound.
32777 2021-01-12 Jakub Jelinek <jakub@redhat.com>
32779 * configure.ac: Ensure c/Make-lang.in comes first in @all_lang_makefrags@.
32780 * configure: Regenerated.
32782 2021-01-12 liuhongt <hongtao.liu@intel.com>
32785 * config/i386/i386-builtins.h (BUILTIN_DESC_SWAP_OPERANDS):
32787 * config/i386/i386-expand.c (ix86_expand_sse_comi): Delete
32790 2021-01-12 Alexandre Oliva <oliva@adacore.com>
32792 * ssa-iterators.h (end_imm_use_stmt_traverse): Forward
32794 (auto_end_imm_use_stmt_traverse): New struct.
32795 (FOR_EACH_IMM_USE_STMT): Use it.
32796 (BREAK_FROM_IMM_USE_STMT, RETURN_FROM_IMM_USE_STMT): Remove,
32798 * gimple-ssa-strength-reduction.c: ... here, ...
32799 * graphite-scop-detection.c: ... here, ...
32800 * ipa-modref.c, ipa-pure-const.c, ipa-sra.c: ... here, ...
32801 * tree-predcom.c, tree-ssa-ccp.c: ... here, ...
32802 * tree-ssa-dce.c, tree-ssa-dse.c: ... here, ...
32803 * tree-ssa-loop-ivopts.c, tree-ssa-math-opts.c: ... here, ...
32804 * tree-ssa-phiprop.c, tree-ssa.c: ... here, ...
32805 * tree-vect-slp.c: ... and here, ...
32806 * doc/tree-ssa.texi: ... and the example here.
32808 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32810 * config/aarch64/aarch64-sve.md (sdiv_pow2<mode>3): Extend from
32811 SVE_FULL_I to SVE_I. Generate an UNSPEC_PRED_X.
32812 (*sdiv_pow2<mode>3): New pattern.
32813 (@cond_<sve_int_op><mode>): Extend from SVE_FULL_I to SVE_I.
32814 Wrap the ASRD in an UNSPEC_PRED_X.
32815 (*cond_<sve_int_op><mode>_2): Likewise. Replace the UNSPEC_PRED_X
32816 predicate with a constant PTRUE, if it isn't already.
32817 (*cond_<sve_int_op><mode>_z): Replace with...
32818 (*cond_<sve_int_op><mode>_any): ...this new pattern.
32820 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32822 * config/aarch64/aarch64-sve.md (*cond_bic<mode>_2): Extend from
32823 SVE_FULL_I to SVE_I.
32824 (*cond_bic<mode>_any): Likewise.
32826 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32828 * config/aarch64/aarch64-sve.md (<su>mul<mode>3_highpart)
32829 (@aarch64_pred_<MUL_HIGHPART:optab><mode>): Extend from SVE_FULL_I
32832 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32834 * config/aarch64/aarch64-sve.md (<su>abd<mode>_3): Extend from
32835 SVE_FULL_I to SVE_I.
32836 (*aarch64_cond_<su>abd<mode>_2): Likewise.
32837 (*aarch64_cond_<su>abd<mode>_any): Likewise.
32838 (@aarch64_pred_<su>abd<mode>): Likewise. Use UNSPEC_PRED_X
32839 for the max and min but not for the minus.
32840 (*aarch64_cond_<su>abd<mode>_3): New pattern.
32842 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32844 * config/aarch64/iterators.md (SVE_24I): New iterator.
32845 * config/aarch64/aarch64-sve.md (*aarch64_adr<mode>_shift): Extend from
32846 SVE_FULL_SDI to SVE_24I. Use containers rather than elements.
32848 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32850 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_BINARY:optab><mode>)
32851 (*cond_<SVE_INT_BINARY:optab><mode>_2): Extend from SVE_FULL_I
32853 (*cond_<SVE_INT_BINARY:optab><mode>_3): Likewise.
32854 (*cond_<SVE_INT_BINARY:optab><mode>_any): Likewise.
32855 (*cond_<SVE_INT_BINARY:optab><mode>_2_const): Likewise.
32856 (*cond_<SVE_INT_BINARY:optab><mode>_any_const): Likewise.
32858 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32860 * config/aarch64/aarch64-sve.md (<SVE_INT_BINARY_IMM:optab><mode>3)
32861 (@aarch64_pred_<SVE_INT_BINARY_IMM:optab><mode>)
32862 (*post_ra_<SVE_INT_BINARY_IMM:optab><mode>3): Extend from SVE_FULL_I
32865 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32867 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3)
32868 (v<ASHIFT:optab><mode>3, @aarch64_pred_<optab><mode>)
32869 (*post_ra_v<ASHIFT:optab><mode>3): Extend from SVE_FULL_I to SVE_I.
32871 2021-01-11 Martin Liska <mliska@suse.cz>
32874 * symtab-clones.h (clone_info::release): Release
32875 symtab::m_clones with ggc_delete as it's a GGC memory.
32877 2021-01-11 Matthias Klose <doko@ubuntu.com>
32879 * Makefile.in (LINK_PROGRESS): Show the link target.
32881 2021-01-11 Richard Biener <rguenther@suse.de>
32883 PR tree-optimization/91403
32884 * tree-vect-data-refs.c (vect_analyze_group_access_1): Cap
32885 single-element interleaving group size at 4096 elements.
32887 2021-01-11 Richard Biener <rguenther@suse.de>
32889 PR tree-optimization/98526
32890 * tree-vect-loop.c (vect_model_reduction_cost): Remove costing
32891 of the actual reduction op for the regular case.
32892 (vectorizable_reduction): Cost the stmts
32893 vect_transform_reduction produces here.
32895 2021-01-11 Andreas Krebbel <krebbel@linux.ibm.com>
32897 * tree-ssa-forwprop.c (simplify_vector_constructor): For
32898 big-endian, use UNPACK[_FLOAT]_HI.
32900 2021-01-11 Tamar Christina <tamar.christina@arm.com>
32902 * tree-vect-slp-patterns.c (class complex_pattern,
32903 class complex_add_pattern): Add parameters to matches.
32904 (complex_add_pattern::build): Free memory.
32905 (complex_add_pattern::matches): Move validation end of match.
32906 (complex_add_pattern::recognize): Likewise.
32908 2021-01-11 Tamar Christina <tamar.christina@arm.com>
32910 * tree-vect-slp-patterns.c (linear_loads_p): Fix externals.
32912 2021-01-11 Tamar Christina <tamar.christina@arm.com>
32914 * tree-vect-slp-patterns.c (is_linear_load_p): Fix ambiguity.
32916 2021-01-11 Jakub Jelinek <jakub@redhat.com>
32918 PR tree-optimization/95867
32919 * tree-ssa-math-opts.h: New header.
32920 * tree-ssa-math-opts.c: Include tree-ssa-math-opts.h.
32921 (powi_as_mults): No longer static. Use build_one_cst instead of
32922 build_real. Formatting fix.
32923 * tree-ssa-reassoc.c: Include tree-ssa-math-opts.h.
32924 (attempt_builtin_powi): Handle multiplication reassociation without
32925 powi_fndecl using powi_as_mults.
32926 (reassociate_bb): For integral types don't require
32927 -funsafe-math-optimizations to call attempt_builtin_powi.
32929 2021-01-11 Jakub Jelinek <jakub@redhat.com>
32931 PR tree-optimization/95852
32932 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): Change
32933 mul_stmts parameter type to vec<gimple *> &. Before cond_stmt
32934 allow in the bb any of the stmts in that vector, div_stmt and
32935 up to 3 cast stmts.
32936 (arith_cast_equal_p): New function.
32937 (arith_overflow_check_p): Add cast_stmt argument, handle signed
32938 multiply overflow checks.
32939 (match_arith_overflow): Adjust caller. Handle signed multiply
32942 2021-01-11 Jakub Jelinek <jakub@redhat.com>
32944 PR tree-optimization/95852
32945 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): New function.
32946 (uaddsub_overflow_check_p): Renamed to ...
32947 (arith_overflow_check_p): ... this. Handle also multiplication
32948 with overflow check.
32949 (match_uaddsub_overflow): Renamed to ...
32950 (match_arith_overflow): ... this. Add cfg_changed argument. Handle
32951 also multiplication with overflow check. Adjust function comment.
32952 (math_opts_dom_walker::after_dom_children): Adjust callers. Call
32953 match_arith_overflow also for MULT_EXPR.
32955 2021-01-11 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32957 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
32958 __builtin_convertvector.
32959 (vmovl_s16): Likewise.
32960 (vmovl_s32): Likewise.
32961 (vmovl_u8): Likewise.
32962 (vmovl_u16): Likewise.
32963 (vmovl_u32): Likewise.
32964 (vmovn_s16): Likewise.
32965 (vmovn_s32): Likewise.
32966 (vmovn_s64): Likewise.
32967 (vmovn_u16): Likewise.
32968 (vmovn_u32): Likewise.
32969 (vmovn_u64): Likewise.
32971 2021-01-11 Martin Liska <mliska@suse.cz>
32973 * gimple-if-to-switch.cc (struct condition_info): Use auto_var.
32974 (if_chain::is_beneficial): Delete clusters
32975 (find_conditions): Make second argument of conditions_in_bbs a
32976 pointer so that we control over it's lifetime.
32977 (pass_if_to_switch::execute): Delete them.
32979 2021-01-11 Kewen Lin <linkw@linux.ibm.com>
32981 * ira.c (move_unallocated_pseudos): Check other_reg and skip if
32984 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
32986 * config/vax/vax.md (cc): Remove mode attribute.
32987 (subst_<cc>, subst_f<cc>): Rename to...
32988 (subst_<mode>, subst_f<VAXccnz:mode>): ... these respectively.
32989 (*cbranch<VAXint:mode>4_<VAXcc:mode>): Update for `cc' removal.
32990 (*cbranch<VAXfp:mode>4_<VAXccnz:mode>): Likewise.
32991 (*branch_<mode>, *branch_<mode>_reversed): Likewise.
32993 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
32995 * config/vax/vax.md (subst_f<cc>): Add mode to operands and
32996 `const_double_zero'.
32998 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33000 * config/pdp11/pdp11.md (PDPfp): New mode iterator.
33001 (fcc_cc, fcc_ccnz): Use it. Add mode to `const_double_zero' and
33004 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
33006 * genemit.c (gen_exp) <CONST_DOUBLE>: Handle `const_double_zero'
33008 * read-rtl.c (rtx_reader::read_rtx_code): Handle machine mode
33009 with `const_double_zero'.
33010 * doc/rtl.texi (Constant Expression Types): Document it.
33012 2021-01-09 Jakub Jelinek <jakub@redhat.com>
33015 * tree-cfg.c (verify_gimple_assign_binary): Allow lhs of
33016 POINTER_DIFF_EXPR to be any integral type.
33018 2021-01-09 Jakub Jelinek <jakub@redhat.com>
33020 PR rtl-optimization/98603
33021 * function.c (instantiate_virtual_regs_in_insn): For asm goto
33022 with impossible constraints, drop all SETs, CLOBBERs, drop PARALLEL
33023 if any, set ASM_OPERANDS mode to VOIDmode and change
33024 ASM_OPERANDS_OUTPUT_CONSTRAINT and ASM_OPERANDS_OUTPUT_IDX.
33026 2021-01-09 Alexandre Oliva <oliva@gnu.org>
33029 * final.c (notice_source_line): Narrow down the condition to
33030 skip a line-0 marker.
33032 2021-01-08 Sergei Trofimovich <siarheit@google.com>
33034 * ipa-modref.c (merge_call_side_effects): Fix
33035 linebreak split by reordering two print calls.
33037 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
33039 * config/s390/vector.md (*tf_to_fprx2_0): Rename from
33040 "*mov_tf_to_fprx2_0" for consistency, fix constraint.
33041 (*tf_to_fprx2_1): Rename from "*mov_tf_to_fprx2_1" for
33042 consistency, fix constraint.
33044 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
33046 * config/s390/s390-c.c (s390_def_or_undef_macro): Accept
33047 callables instead of mask values.
33048 (struct target_flag_set_p): New predicate.
33049 (s390_cpu_cpp_builtins_internal): Define or undefine
33050 __LONG_DOUBLE_VX__ macro.
33052 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
33055 * config/i386/i386.c (x86_function_profiler): Use R10 and R11
33056 to call mcount in large model with PIC for NO_PROFILE_COUNTERS
33059 2021-01-08 Richard Biener <rguenther@suse.de>
33061 * tree-ssa-sccvn.c (pass_fre::execute): Reset the SCEV hash table.
33063 2021-01-08 Richard Biener <rguenther@suse.de>
33065 * tree-vect-slp.c (scalar_stmts_to_slp_tree_map_t): Fix.
33066 (vect_build_slp_tree): On cache hit release the matched
33067 scalar stmts vector.
33068 * tree-vect-stmts.c (vectorizable_store): Properly free
33069 vec_oprnds before possibly gathering them again.
33071 2021-01-08 Richard Biener <rguenther@suse.de>
33073 PR tree-optimization/98544
33074 * tree-vect-slp.c (vect_optimize_slp): Always materialize
33075 permutes at a permute node.
33077 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
33080 * config/i386/i386.c (x86_function_profiler): Use R10 to call
33081 mcount in large model. Sorry for large model with PIC.
33083 2021-01-08 Jakub Jelinek <jakub@redhat.com>
33086 * config/i386/i386.opt (ix86_cmodel, ix86_incoming_stack_boundary_arg,
33087 ix86_pmode, ix86_preferred_stack_boundary_arg, ix86_regparm,
33088 ix86_veclibabi_type): Remove x_ prefix, use TargetVariable instead of
33089 TargetSave and initialize for variables with enum types.
33090 (mfentry, mstack-protector-guard-reg=, mstack-protector-guard-offset=,
33091 mstack-protector-guard-symbol=): Add Save.
33092 * config/i386/i386-options.c (ix86_function_specific_save,
33093 ix86_function_specific_restore): Don't save or restore x_ix86_cmodel,
33094 x_ix86_incoming_stack_boundary_arg, x_ix86_pmode,
33095 x_ix86_preferred_stack_boundary_arg, x_ix86_regparm,
33096 x_ix86_veclibabi_type.
33098 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
33100 * config/aarch64/aarch64-sve.md (*cnot<mode>): Extend from
33101 SVE_FULL_I to SVE_I.
33102 (*cond_cnot<mode>_2, *cond_cnot<mode>_any): Likewise.
33104 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
33106 * config/aarch64/aarch64-sve.md (*cond_uxt<mode>_2): Extend from
33107 SVE_FULL_I to SVE_I.
33108 (*cond_uxt<mode>_any): Likewise.
33110 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33112 * config/aarch64/iterators.md (Vwhalf): New iterator.
33113 * config/aarch64/aarch64-simd.md (aarch64_<sur>adalp<mode>_3):
33115 (aarch64_<sur>adalp<mode>): ... This. Make more
33117 (<sur>sadv16qi): Adjust callsite of the above.
33118 * config/aarch64/aarch64-simd-builtins.def (sadalp, uadalp): New
33120 * config/aarch64/arm_neon.h (vpadal_s8): Reimplement using
33122 (vpadal_s16): Likewise.
33123 (vpadal_u8): Likewise.
33124 (vpadal_u16): Likewise.
33125 (vpadalq_s8): Likewise.
33126 (vpadalq_s16): Likewise.
33127 (vpadalq_s32): Likewise.
33128 (vpadalq_u8): Likewise.
33129 (vpadalq_u16): Likewise.
33130 (vpadalq_u32): Likewise.
33132 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33134 * config/aarch64/aarch64-simd.md (aarch64_<su>abd<mode>_3):
33136 (aarch64_<su>abd<mode>): ... This.
33137 (<sur>sadv16qi): Adjust callsite of the above.
33138 * config/aarch64/aarch64-simd-builtins.def (sabd, uabd): Define
33140 * config/aarch64/arm_neon.h (vabd_s8): Reimplement using
33142 (vabd_s16): Likewise.
33143 (vabd_s32): Likewise.
33144 (vabd_u8): Likewise.
33145 (vabd_u16): Likewise.
33146 (vabd_u32): Likewise.
33147 (vabdq_s8): Likewise.
33148 (vabdq_s16): Likewise.
33149 (vabdq_s32): Likewise.
33150 (vabdq_u8): Likewise.
33151 (vabdq_u16): Likewise.
33152 (vabdq_u32): Likewise.
33154 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33156 * config/aarch64/aarch64-simd-builtins.def (saba, uaba): Define
33158 * config/aarch64/arm_neon.h (vaba_s8): Implement using builtin.
33159 (vaba_s16): Likewise.
33160 (vaba_s32): Likewise.
33161 (vaba_u8): Likewise.
33162 (vaba_u16): Likewise.
33163 (vaba_u32): Likewise.
33164 (vabaq_s8): Likewise.
33165 (vabaq_s16): Likewise.
33166 (vabaq_s32): Likewise.
33167 (vabaq_u8): Likewise.
33168 (vabaq_u16): Likewise.
33169 (vabaq_u32): Likewise.
33171 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
33173 * config/aarch64/aarch64-simd.md (aba<mode>_3): Rename to...
33174 (aarch64_<su>aba<mode>): ... This. Handle uaba as well.
33175 Change RTL pattern to match.
33177 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
33179 * common/config/riscv/riscv-common.c (riscv_current_subset_list): New.
33180 * config/riscv/riscv-c.c (riscv-subset.h): New.
33181 (INCLUDE_STRING): Define.
33182 (riscv_cpu_cpp_builtins): Add new style architecture extension
33184 * config/riscv/riscv-subset.h (riscv_subset_list::begin): New.
33185 (riscv_subset_list::end): New.
33186 (riscv_current_subset_list): New.
33188 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
33190 * common/config/riscv/riscv-common.c (RISCV_DONT_CARE_VERSION):
33191 Move to riscv-subset.h.
33192 (struct riscv_subset_t): Ditto.
33193 (class riscv_subset_list): Ditto.
33194 * config/riscv/riscv-subset.h (RISCV_DONT_CARE_VERSION): Move
33195 from riscv-common.c.
33196 (struct riscv_subset_t): Ditto.
33197 (class riscv_subset_list): Ditto.
33198 * config/riscv/t-riscv ($(common_out_file)): Add file
33201 2021-01-07 Jakub Jelinek <jakub@redhat.com>
33204 * config/i386/i386.md (*bmi_blsi_<mode>_cmp, *bmi_blsi_<mode>_ccno):
33205 New define_insn patterns.
33207 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
33209 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_UNARY:optab><mode>)
33210 (*cond_<SVE_INT_UNARY:optab><mode>_2): Extend from SVE_FULL_I to SVE_I.
33211 (*cond_<SVE_INT_UNARY:optab><mode>_any): Likewise.
33213 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
33215 PR tree-optimization/98560
33216 * internal-fn.def (IFN_VCONDU, IFN_VCONDEQ): Use type vec_cond.
33217 * internal-fn.c (vec_cond_mask_direct): Get the data mode from
33219 (vec_cond_direct): Likewise argument 2.
33220 (vec_condu_direct, vec_condeq_direct): Delete.
33221 (expand_vect_cond_optab_fn): Rename to...
33222 (expand_vec_cond_optab_fn): ...this, replacing old macro.
33223 (expand_vec_condu_optab_fn, expand_vec_condeq_optab_fn): Delete.
33224 (expand_vect_cond_mask_optab_fn): Rename to...
33225 (expand_vec_cond_mask_optab_fn): ...this, replacing old macro.
33226 (direct_vec_cond_mask_optab_supported_p): Treat the optab as a
33228 (direct_vec_cond_optab_supported_p): Likewise.
33229 (direct_vec_condu_optab_supported_p): Delete.
33230 (direct_vec_condeq_optab_supported_p): Delete.
33231 * gimple-isel.cc: Include internal-fn.h.
33232 (gimple_expand_vec_cond_expr): Check that IFN_VCONDEQ is supported
33235 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
33237 PR tree-optimization/98560
33238 * gimple-isel.cc (gimple_expand_vec_cond_expr): If we fail to use
33239 IFN_VCOND{,U,EQ}, fall back on IFN_VCOND_MASK.
33241 2021-01-07 Uroš Bizjak <ubizjak@gmail.com>
33243 * config/i386/i386.md (insn): Merge from plusminus_insn, shift_insn,
33244 rotate_insn and optab code attributes.
33245 Update all uses to merged code attribute.
33246 * config/i386/sse.md: Update all uses to merged code attribute.
33247 * config/i386/mmx.md: Update all uses to merged code attribute.
33249 2021-01-07 Jakub Jelinek <jakub@redhat.com>
33251 PR tree-optimization/98568
33252 * gimple-ssa-store-merging.c (bswap_view_convert): New function.
33253 (bswap_replace): Use it.
33255 2021-01-06 Vladimir N. Makarov <vmakarov@redhat.com>
33257 PR rtl-optimization/97978
33258 * lra-int.h (lra_hard_reg_split_p): New external.
33259 * lra.c (lra_hard_reg_split_p): New global.
33260 (lra): Set up lra_hard_reg_split_p after splitting a hard reg.
33261 * lra-assigns.c (lra_assign): Don't check allocation correctness
33262 after hard reg splitting.
33264 2021-01-06 Martin Sebor <msebor@redhat.com>
33267 * builtins.c (new_delete_mismatch_p): New overload.
33268 (new_delete_mismatch_p (tree, tree)): Call it.
33270 2021-01-06 Alexandre Oliva <oliva@adacore.com>
33272 * Makefile.in (T_GLIMITS_H): New.
33273 (stmp-int-hdrs): Depend on it, use it.
33274 * config/t-vxworks (T_GLIMITS_H): Override it.
33275 (vxw-glimits.h): New.
33277 2021-01-06 Richard Biener <rguenther@suse.de>
33279 PR tree-optimization/98513
33280 * value-range.cc (intersect_ranges): Compare the upper bounds
33281 for the expected relation.
33283 2021-01-06 Gerald Pfeifer <gerald@pfeifer.com>
33286 2020-12-28 Gerald Pfeifer <gerald@pfeifer.com>
33288 * doc/standards.texi (HSAIL): Remove section.
33290 2021-01-05 Samuel Thibault <samuel.thibault@ens-lyon.org>
33292 * configure: Re-generate.
33294 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33296 * doc/invoke.texi (-std=c++20): Adjust for the publication of
33297 ISO 14882:2020 standard.
33298 * doc/standards.texi: Likewise.
33300 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33302 PR tree-optimization/94802
33303 * expr.h (maybe_optimize_sub_cmp_0): Declare.
33304 * expr.c: Include tree-pretty-print.h and flags.h.
33305 (maybe_optimize_sub_cmp_0): New function.
33306 (do_store_flag): Use it.
33307 * cfgexpand.c (expand_gimple_cond): Likewise.
33309 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33311 * mux-utils.h (pointer_mux::m_ptr): Tweak description of contents.
33312 * rtlanal.c (simple_regno_set): Tweak description to clarify the
33315 2021-01-05 Richard Biener <rguenther@suse.de>
33317 PR tree-optimization/98516
33318 * tree-vect-slp.c (vect_optimize_slp): Permute the incoming
33319 lanes when materializing on a VEC_PERM node.
33320 (vectorizable_slp_permutation): Dump the permute properly.
33322 2021-01-05 Richard Biener <rguenther@suse.de>
33324 * tree-vect-slp.c (vect_slp_region): Move debug counter
33325 to cover individual subgraphs.
33327 2021-01-05 Richard Biener <rguenther@suse.de>
33329 PR tree-optimization/98428
33330 * tree-vect-slp.c (vect_build_slp_tree_1): Properly reject
33331 vector lane extracts for loop vectorization.
33333 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33335 PR tree-optimization/98514
33336 * tree-ssa-reassoc.c (bb_rank): Change type from long * to
33338 (operand_rank): Change type from hash_map<tree, long> to
33339 hash_map<tree, int64_t>.
33340 (phi_rank): Change return type from long to int64_t.
33341 (loop_carried_phi): Change block_rank variable type from long to
33343 (propagate_rank): Change return type, rank parameter type and
33344 op_rank variable type from long to int64_t.
33345 (find_operand_rank): Change return type from long to int64_t
33346 and change slot variable type from long * to int64_t *.
33347 (insert_operand_rank): Change rank parameter type from long to
33349 (get_rank): Change return type and rank variable type from long to
33350 int64_t. Use PRId64 instead of ld to print the rank.
33351 (init_reassoc): Change rank variable type from long to int64_t
33352 and adjust correspondingly bb_rank and operand_rank initialization.
33354 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33356 PR tree-optimization/96928
33357 * tree-ssa-phiopt.c (xor_replacement): New function.
33358 (tree_ssa_phiopt_worker): Call it.
33360 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33362 PR tree-optimization/96930
33363 * match.pd ((A / (1 << B)) -> (A >> B)): If A is extended
33364 from narrower value which has the same type as 1 << B, perform
33365 the right shift on the narrower value followed by extension.
33367 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33369 PR tree-optimization/96239
33370 * gimple-ssa-store-merging.c (maybe_optimize_vector_constructor): New
33372 (get_status_for_store_merging): Don't return BB_INVALID for blocks
33373 with potential bswap optimizable CONSTRUCTORs.
33374 (pass_store_merging::execute): Optimize vector CONSTRUCTORs with bswap
33377 2021-01-05 Richard Biener <rguenther@suse.de>
33379 PR tree-optimization/98381
33380 * tree.c (vector_element_bits): Properly compute bool vector
33382 * tree-vect-loop.c (vectorizable_live_operation): Properly
33383 compute the last lane bit offset.
33385 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
33388 * config/i386/sse.md (sse_cvtps2pi): Redefine as define_insn_and_split.
33389 Clear the top 64 bytes of the input XMM register.
33390 (sse_cvttps2pi): Ditto.
33392 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
33395 * config/i386/xopintrin.h (_mm256_cmov_si256): New.
33397 2021-01-05 H.J. Lu <hjl.tools@gmail.com>
33400 * config/i386/xmmintrin.h (_mm_extract_pi16): Cast to unsigned
33403 2021-01-05 Claudiu Zissulescu <claziss@synopsys.com>
33405 * config/arc/arc.md (maddsidi4_split): Use ACC_REG_FIRST.
33406 (umaddsidi4_split): Likewise.
33408 2021-01-05 liuhongt <hongtao.liu@intel.com>
33411 * config/i386/sse.md (*sse2_pmovskb_zexthisi): New
33412 define_insn_and_split for zero_extend of subreg HI of pmovskb
33414 (*sse2_pmovskb_zexthisi): Add new combine splitters for
33415 zero_extend of not of subreg HI of pmovskb result.
33417 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33420 * explow.c (convert_memory_address_addr_space_1): Handle UNSPECs
33422 * config/aarch64/aarch64.c (aarch64_expand_mov_immediate): Use
33423 convert_memory_address to convert symbolic immediates to ptr_mode
33424 before forcing them to memory.
33426 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33428 PR rtl-optimization/97144
33429 * recog.c (constrain_operands): Initialize matching_operand
33430 for each alternative, rather than only doing it once.
33432 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33434 PR rtl-optimization/98403
33435 * rtl-ssa/changes.cc (function_info::finalize_new_accesses): Explain
33436 why we don't remove call clobbers.
33437 (function_info::apply_changes_to_insn): Don't attempt to add
33438 call clobbers here.
33440 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33442 PR tree-optimization/98371
33443 * tree-vect-loop.c (vect_reanalyze_as_main_loop): New function.
33444 (vect_analyze_loop): If an epilogue loop appears to be cheaper
33445 than the main loop, re-analyze it as a main loop before adopting
33448 2021-01-05 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
33451 * configure.ac (NETLIBS): Determine using AX_LIB_SOCKET_NSL.
33452 * aclocal.m4, configure: Regenerate.
33453 * Makefile.in (NETLIBS): Define.
33454 (BACKEND): Remove $(CODYLIB).
33456 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33458 PR rtl-optimization/98334
33459 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
33460 Optimize (X - 1) * Y + Y to X * Y or (X + 1) * Y - Y to X * Y.
33462 2021-01-05 Bernd Edlinger <bernd.edlinger@hotmail.de>
33464 * tree-inline.c (expand_call_inline): Restore input_location.
33465 Return result from recursive call.
33467 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
33469 PR tree-optimization/95401
33470 * config/aarch64/aarch64-sve-builtins.cc
33471 (gimple_folder::load_store_cookie): Use bits rather than bytes
33472 for the alignment argument to IFN_MASK_LOAD and IFN_MASK_STORE.
33473 * gimple-fold.c (gimple_fold_mask_load_store_mem_ref): Likewise.
33474 * tree-vect-stmts.c (vectorizable_store): Likewise.
33475 (vectorizable_load): Likewise.
33477 2021-01-04 Richard Biener <rguenther@suse.de>
33479 PR tree-optimization/98308
33480 * tree-vect-stmts.c (vectorizable_load): Set invariant mask
33483 2021-01-04 Jakub Jelinek <jakub@redhat.com>
33485 PR tree-optimization/95771
33486 * tree-ssa-loop-niter.c (number_of_iterations_popcount): Handle types
33487 with precision smaller than int's precision and types with precision
33488 twice as large as long long. Formatting fixes.
33490 2021-01-04 Richard Biener <rguenther@suse.de>
33492 PR tree-optimization/98464
33493 * tree-ssa-sccvn.c (vn_valueize_for_srt): Rename from ...
33494 (vn_valueize_wrapper): ... this. Temporarily adjust vn_context_bb.
33495 (process_bb): Adjust.
33497 2021-01-04 Matthew Malcomson <matthew.malcomson@arm.com>
33500 * doc/invoke.texi (-fsanitize=address): Fix wording describing
33501 clash with -fsanitize=hwaddress.
33503 2021-01-04 Richard Biener <rguenther@suse.de>
33505 PR tree-optimization/98282
33506 * tree-ssa-sccvn.c (vn_get_stmt_kind): Classify tcc_reference on
33507 invariants as VN_NARY.
33509 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
33512 * config/aarch64/aarch64-simd.md (aarch64_combine<mode>): Accept
33513 aarch64_simd_reg_or_zero for operand 2. Use the combinez patterns
33514 to handle zero operands.
33516 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
33518 * config/aarch64/aarch64.c (offset_6bit_signed_scaled_p): New function.
33519 (offset_6bit_unsigned_scaled_p): Fix typo in comment.
33520 (aarch64_sve_prefetch_operand_p): Accept MUL VLs in the range
33523 2021-01-04 Richard Biener <rguenther@suse.de>
33525 PR tree-optimization/98393
33526 * tree-vect-slp.c (vect_build_slp_tree): Properly zero matches
33527 when hitting the limit.
33529 2021-01-04 Richard Biener <rguenther@suse.de>
33531 PR tree-optimization/98291
33532 * tree-vect-loop.c (vectorizable_reduction): Bypass
33533 associativity check for SLP reductions with VF 1.
33535 2021-01-04 Jakub Jelinek <jakub@redhat.com>
33537 PR tree-optimization/96782
33538 * match.pd (x == ~x -> false, x != ~x -> true): New simplifications.
33540 2021-01-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
33542 * collect-utils.c (collect_execute): Check dumppfx.
33543 * collect2.c (maybe_run_lto_and_relink, do_link): Pass atsuffix
33544 to collect_execute.
33545 (do_link): Add new parameter atsuffix.
33546 (main): Handle -dumpdir option. Skip one argument for
33547 -o, -isystem and -B options.
33548 * gcc.c (make_at_file): New helper function.
33549 (close_at_file): Use it.
33551 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33553 * config/darwin.h (MIN_LD64_NO_COAL_SECTS): Adjust.
33554 Amend handling for LD64_VERSION fallback defaults.
33556 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33558 * config.gcc: Compute default version information
33559 from the configured target. Likewise defaults for
33561 * config/darwin10.h: Removed.
33562 * config/darwin12.h: Removed.
33563 * config/darwin9.h: Removed.
33564 * config/rs6000/darwin8.h: Removed.
33566 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33568 * config/darwin9.h (ASM_OUTPUT_ALIGNED_COMMON): Delete.
33570 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33572 * config/darwin9.h (STACK_CHECK_STATIC_BUILTIN): Move from here..
33573 * config/darwin.h (STACK_CHECK_STATIC_BUILTIN): .. to here.
33575 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33577 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move from
33579 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): ... to here.
33581 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33583 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move the spec
33584 for the Darwin10 unwinder stub from here ...
33585 * config/darwin.h (LINK_COMMAND_SPEC_A): ... to here.
33587 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33589 * config/darwin.h (DSYMUTIL_SPEC): Default to DWARF
33590 (ASM_DEBUG_SPEC):Only define if the assembler supports
33592 (PREFERRED_DEBUGGING_TYPE): Default to DWARF.
33593 (DARWIN_PREFER_DWARF): Define.
33594 * config/darwin9.h (PREFERRED_DEBUGGING_TYPE): Remove.
33595 (DARWIN_PREFER_DWARF): Likewise
33596 (DSYMUTIL_SPEC): Likewise.
33597 (COLLECT_RUN_DSYMUTIL): Likewise.
33598 (ASM_DEBUG_SPEC): Likewise.
33599 (ASM_DEBUG_OPTION_SPEC): Likewise.
33601 2021-01-02 Jan Hubicka <jh@suse.cz>
33603 * cfg.c (free_block): ggc_free bb.
33605 2021-01-01 Jakub Jelinek <jakub@redhat.com>
33607 * gcc.c (process_command): Update copyright notice dates.
33608 * gcov-dump.c (print_version): Ditto.
33609 * gcov.c (print_version): Ditto.
33610 * gcov-tool.c (print_version): Ditto.
33611 * gengtype.c (create_file): Ditto.
33612 * doc/cpp.texi: Bump @copying's copyright year.
33613 * doc/cppinternals.texi: Ditto.
33614 * doc/gcc.texi: Ditto.
33615 * doc/gccint.texi: Ditto.
33616 * doc/gcov.texi: Ditto.
33617 * doc/install.texi: Ditto.
33618 * doc/invoke.texi: Ditto.
33620 2021-01-01 Jakub Jelinek <jakub@redhat.com>
33622 * ChangeLog-2020: Rotate ChangeLog. New file.
33625 Copyright (C) 2021 Free Software Foundation, Inc.
33627 Copying and distribution of this file, with or without modification,
33628 are permitted in any medium without royalty provided the copyright
33629 notice and this notice are preserved.