Igor Tsimbalist [Fri, 17 Nov 2017 13:34:39 +0000 (14:34 +0100)]
Enable building libgcc with CET options.
Enable building libgcc with CET options by default on Linux/x86 if
binutils supports CET v2.0. It can be disabled with --disable-cet.
It is an error to configure GCC with --enable-cet if bintuiils
doesn't support CET v2.0.
ENDBR instruction is added to __morestack_large_model since it is
called indirectly.
2017-11-17 Igor Tsimbalist <igor.v.tsimbalist@intel.com>
config/
* cet.m4: New file.
gcc/
* config.gcc (extra_headers): Add cet.h for x86 targets.
* config/i386/cet.h: New file.
* doc/install.texi: Add --enable-cet/--disable-cet.
libgcc/
* Makefile.in (configure_deps): Add $(srcdir)/../config/cet.m4.
(CET_FLAGS): New.
* config/i386/morestack.S: Include <cet.h>.
(__morestack_large_model): Add _CET_ENDBR at function entrance.
* config/i386/resms64.h: Include <cet.h>.
* config/i386/resms64f.h: Likewise.
* config/i386/resms64fx.h: Likewise.
* config/i386/resms64x.h: Likewise.
* config/i386/savms64.h: Likewise.
* config/i386/savms64f.h: Likewise.
* config/i386/t-linux (HOST_LIBGCC2_CFLAGS): Add $(CET_FLAGS).
(CRTSTUFF_T_CFLAGS): Likewise.
* configure.ac: Include ../config/cet.m4.
Set and substitute CET_FLAGS.
* configure: Regenerated.
[ARM] Rework expectation for call to Armv8-M nonsecure function
Testcase gcc.target/arm/cmse/cmse-14.c checks whether bar is called via
__gnu_cmse_nonsecure_call libcall and not via a direct call. However the
pattern is a bit surprising in that it needs to explicitely allow "by"
due to allowing anything before the 'b'.
This patch rewrites the logic to look for b as a first non-whitespace
letter followed iby anything (to match bl and conditional branches)
followed by some spaces and then bar.
2017-11-17 Thomas Preud'homme <thomas.preudhomme@arm.com>
gcc/testsuite/
* gcc.target/arm/cmse/cmse-14.c: Change logic to match branch
instruction to bar.
[ARM] Fix selection of effective target for cmse tests
Some of the tests in the gcc.target/arm/cmse directory (eg.
gcc.target/arm/cmse/mainline/bitfield-4.c) are failing when run without
an architecture specified in RUNTESTFLAGS due to them not adding the
option to select an Armv8-M architecture.
This patch fixes the issue by adding the right option from the exp file
so that no architecture fiddling is necessary in the individual tests.
2017-11-17 Thomas Preud'homme <thomas.preudhomme@arm.com>
Commit r253825 which introduced some sanity checks for sbitmap revealed
a bug in the conversion of cmse_nonsecure_entry_clear_before_return ()
to using bitmap structure. bitmap_and expects that the two bitmaps have
the same length, yet the code in
cmse_nonsecure_entry_clear_before_return () have different size for
to_clear_bitmap and to_clear_arg_regs_bitmap, with the assumption that
bitmap_and would behave has if the bits not allocated were in fact zero.
This commit makes sure both bitmap are equally sized.
2017-11-17 Thomas Preud'homme <thomas.preudhomme@arm.com>
gcc/
* config/arm/arm.c (cmse_nonsecure_entry_clear_before_return): Allocate
to_clear_arg_regs_bitmap to the same size as to_clear_bitmap.
Qing Zhao [Fri, 17 Nov 2017 05:32:05 +0000 (05:32 +0000)]
re PR middle-end/78809 (Inline strcmp with small constant strings)
2017-11-15 Qing Zhao <qing.zhao@oracle.com>
PR middle-end/78809
* gimple-fold.c (gimple_fold_builtin_string_compare): Add handling
of replacing call to strncmp with corresponding call to strcmp when
meeting conditions.
PR middle-end/78809
* gcc.dg/strcmpopt_1.c: New test.
Steven Munroe [Thu, 16 Nov 2017 22:50:16 +0000 (22:50 +0000)]
Had a small thinko in the implementation of mmintrin.h _mm_add_pi32 that only shows when compiling for power9.
Had a small thinko in the implementation of mmintrin.h _mm_add_pi32 that only shows
when compiling for power9. A trivial and obvious 2 line patch to fix it.
Joseph Myers [Thu, 16 Nov 2017 22:23:30 +0000 (22:23 +0000)]
Add -std=c18 etc. option aliases.
ISO C17 won't go to ballot until December, meaning publication of the
standard won't be until 2018, leaving ambiguity as to whether people
will end up referring to the standard as C17, as it's currently known
and which corresponds to the __STDC_VERSION__ value, or C18 based on
the publication date.
In case people end up referring to this standard as C18, this patch
adds corresponding option aliases -std=c18, -std=iso9899:2018,
-std=gnu18 so people can use those names based on publication date if
they wish. The "expected to be" explanations in help texts and the
manual can be removed as and when the standard is published, hopefully
before GCC 8 is out.
Bootstrapped with no regressions on x86_64-pc-linux-gnu.
gcc:
* doc/invoke.texi (-std=c17): Refer to 2018 expected publication
date of C17.
(-std=c18, -std=iso9899:2018, -std=gnu18): Document option
aliases.
gcc/c-family:
* c.opt (-std=c17, std=gnu17, -std=iso9899:2017): Refer to 2018
expected publication date of C17.
(-std=c18, -std=gnu18, -std=iso9899:2018): New option aliases.
gcc/testsuite:
* gcc.dg/c18-version-1.c, gcc.dg/c18-version-2.c: New tests.
Jason Merrill [Thu, 16 Nov 2017 20:13:48 +0000 (15:13 -0500)]
PR c++/79092 - non-type args of different types are different
* tree.c (cp_tree_equal): Check the type of constants.
* pt.c (unify) [TEMPLATE_PARM_INDEX]: Handle UNIFY_ALLOW_INTEGER
when comparing to previously deduced argument.
(maybe_convert_nontype_argument): New.
(convert_nontype_argument): Call it.
(tsubst_copy_and_build): Handle partial instantiation of
IMPLICIT_CONV_EXPR.
(unify): Ignore type when deducing from array bound.
(dependent_type_p_r): Handle DEFERRED_NOEXCEPT.
(value_dependent_expression_p): Any type-dependent expression is
value-dependent. Handle IMPLICIT_CONV_EXPR.
* cp-tree.h (IMPLICIT_CONV_EXPR_NONTYPE_ARG): New.
* mangle.c (write_template_arg): Strip IMPLICIT_CONV_EXPR.
Michael Meissner [Thu, 16 Nov 2017 18:35:42 +0000 (18:35 +0000)]
rs6000.c (rs6000_expand_builtin): Do not do the switch statement mapping KF built-ins to TF built-ins if...
[gcc]
2017-11-15 Michael Meissner <meissner@linux.vnet.ibm.com>
* config/rs6000/rs6000.c (rs6000_expand_builtin): Do not do the
switch statement mapping KF built-ins to TF built-ins if we don't
have the proper ISA 3.0 assembler support.
[gcc/testsuite]
2017-11-15 Michael Meissner <meissner@linux.vnet.ibm.com>
* gcc.target/powerpc/bfp/bfp.exp: Look for *.c files, not *.c*
files to prevent ~ files from getting recognized.
* gcc.target/powerpc/dfp/dfp.exp: Likewise.
* gcc.target/powerpc/vsu/vsu.exp: Likewise.
Makefile.am: Remove references to c-compiler-name.h and cp-compiler-name.h
* Makefile.am: Remove references to c-compiler-name.h and
cp-compiler-name.h
* Makefile.in: Regenerate.
* compiler-name.hh: New file.
* libcc1.cc: Don't include c-compiler-name.h. Include
compiler-name.hh.
* libcp1.cc: Don't include cp-compiler-name.h. Include
compiler-name.hh.
Co-Authored-By: Pedro Alves <palves@redhat.com>
From-SVN: r254838
Henry Linjamäki [Thu, 16 Nov 2017 17:20:45 +0000 (17:20 +0000)]
[BRIGFE] Reduce the number of type conversions due to
the untyped HSAIL regs.
Instead of always representing the HSAIL's untyped registers as
unsigned int, the gccbrig now pre-analyzes the BRIG code and
builds the register variables as a type used the most when storing
or reading data to/from each register. This reduces the total
conversions which cannot be always optimized away.
Jan Hubicka [Thu, 16 Nov 2017 16:52:20 +0000 (17:52 +0100)]
ipa-split.c (split_bb_info): Turn time to sreal.
* ipa-split.c (split_bb_info): Turn time to sreal.
(split_point): Likewise.
(dump_split_point): Likewise.
(fine_split_points): Likewise.
(execute_split_functions): Only zero split_bbs; turn time to sreals.
Martin Sebor [Thu, 16 Nov 2017 16:12:36 +0000 (16:12 +0000)]
PR tree-optimization/82588 - missing -Warray-bounds on a excessively large index
PR tree-optimization/82588 - missing -Warray-bounds on a excessively large index
PR tree-optimization/82583 - missing -Warray-bounds on out-of-bounds inner indic
* opt.ads: Elaboration warnings are now on by default. Add a comment
explaining why this is needed.
* sem_ch9.adb (Analyze_Requeue): Preserve the status of elaboration
warnings.
* sem_ch12.adb (Analyze_Package_Instantiation): Preserve the status of
elaboration warnings.
(Analyze_Subprogram_Instantiation): Preserve the status of elaboration
warnings.
* sem_elab.adb: Update the structure of Call_Attributes and
Instantiation_Attributes.
(Build_Call_Marker): Propagate the status of elaboration warnings from
the call to the marker.
(Extract_Call_Attributes): Extract the status of elaboration warnings.
(Extract_Instantiation_Attributes): Extract the status of elaboration
warnings.
(Process_Conditional_ABE_Activation_Impl): Elaboration diagnostics are
now dependent on the status of elaboration warnings.
(Process_Conditional_ABE_Call_Ada): Elaboration diagnostics are now
dependent on the status of elaboration warnings.
(Process_Conditional_ABE_Instantiation_Ada): Elaboration diagnostics
are now dependent on the status of elaboration warnings.
(Process_Guaranteed_ABE_Activation_Impl): Remove pragma Unreferenced
for formal Call_Attrs. Elaboration diagnostics are now dependent on the
status of elaboration warnings.
(Process_Guaranteed_ABE_Call): Elaboration diagnostics are now
dependent on the status of elaboration warnings.
(Process_Guaranteed_ABE_Instantiation): Elaboration diagnostics are now
dependent on the status of elaboration warnings.
* sem_prag.adb (Analyze_Pragma): Remove the unjustified warning
concerning pragma Elaborate.
* sem_res.adb (Resolve_Call): Preserve the status of elaboration
warnings.
(Resolve_Entry_Call): Propagate flag Is_Elaboration_Warnings_OK_Node
from the procedure call to the entry call.
* sem_util.adb (Mark_Elaboration_Attributes): Add formal parameter
Warnings.
(Mark_Elaboration_Attributes_Node): Preserve the status of elaboration
warnings
* sem_util.ads (Mark_Elaboration_Attributes): Add formal parameter
Warnings. Update the comment on usage.
* sinfo.adb (Is_Dispatching_Call): Update to use Flag6.
(Is_Elaboration_Warnings_OK_Node): New routine.
(Set_Is_Dispatching_Call): Update to use Flag6.
(Set_Is_Elaboration_Warnings_OK_Node): New routine.
* sinfo.ads: Attribute Is_Dispatching_Call now uses Flag6. Add new
attribute Is_Elaboration_Warnings_OK_Node along with occurrences
in nodes.
(Is_Elaboration_Warnings_OK_Node): New routine along with pragma
Inline.
(Set_Is_Elaboration_Warnings_OK_Node): New routine along with pragma
Inline.
* doc/gnat_ugn/elaboration_order_handling_in_gnat.rst: Update various
sections to indicate how to suppress elaboration warnings. Document
switches -gnatwl and -gnatwL.
* gnat_ugn.texi: Regenerate.
Wilco Dijkstra [Thu, 16 Nov 2017 11:54:49 +0000 (11:54 +0000)]
Canonicalize constant multiplies in division
This patch implements some of the optimizations discussed in
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=71026.
Canonicalize x / (C1 * y) into (x * C2) / y.
This moves constant multiplies out of the RHS of a division in order
to allow further simplifications (such as (C1 * x) / (C2 * y) ->
(C3 * x) / y) and to enable more reciprocal CSEs.
2017-11-16 Wilco Dijkstra <wdijkstr@arm.com>
Jackson Woodruff <jackson.woodruff@arm.com>
gcc/
PR tree-optimization/71026
* match.pd: Canonicalize constant multiplies in division.
gcc/testsuite/
PR tree-optimization/71026
* gcc.dg/cse_recip.c: New test.
Co-Authored-By: Jackson Woodruff <jackson.woodruff@arm.com>
From-SVN: r254816
Wilco Dijkstra [Thu, 16 Nov 2017 11:44:30 +0000 (11:44 +0000)]
Set default to -fomit-frame-pointer
Almost all targets add an explict -fomit-frame-pointer in the target specific
options. Rather than doing this in a target-specific way, do this in the
generic options so it works identically across all targets. In many cases the
target no longer needs to define TARGET_OPTION_OPTIMIZATION_TABLE, reducing
the amount of target code.
* sem_ch8.adb (Check_SPARK_Primitive_Operation): Enable the check in
instantiations.
2017-11-16 Doug Rupp <rupp@adacore.com>
Initial gnat port to aarch64-wrs-vxworks7
* libgnarl/s-vxwork__aarch64.ads,
libgnat/system-vxworks7-aarch64-rtp-smp.ads,
libgnat/system-vxworks7-aarch64.ads: New files
* sigtramp-vxworks-target.inc (aarch64): New section.
* tracebak.c (USE_ARM_UNWINDING): Disable for ARMARCH8A.
* sem_ch8.adb (Analyze_Subprogram_Renaming): Ensure that a renaming
declaration does not define a primitive operation of a tagged type for
SPARK.
(Check_SPARK_Primitive_Operation): New routine.
2017-11-16 Arnaud Charlet <charlet@adacore.com>
* libgnat/a-elchha.adb (Last_Chance_Handler): Display Argv (0) in
message when using -E binder switch.
* sem_ch3.adb (Process_Subtype): If the subtype indication does not
syntactically denote a type, return Any_Type to prevent subsequent
compiler crashes or infinite loops.
2017-11-16 Steve Baird <baird@adacore.com>
* lib-writ.adb: Fix bug which causes Program_Error to be raised in some
cases when writing out a .ali file when a Rename_Pragma pragma is in
effect.
* lib-writ.adb (Write_Unit_Information): Replace call to
Pragma_Name_Unmapped with call to Pragma_Name.
2017-11-16 Gary Dismukes <dismukes@adacore.com>
* sem_elab.adb: Minor typo fixes.
2017-11-16 Justin Squirek <squirek@adacore.com>
* sem_res.adb (Resolve_Allocator): Correct warning messages and make
them more explicit.
* atree.ads (Nkind_In): Add 10 and 11 parameter versions.
* checks.adb (Install_Primitive_Elaboration_Check): Mark the setting of
the elaboration flag as elaboration code.
* einfo.adb (Contract): Update the comments.
(Ignore_SPARK_Mode_Pragmas): Update the comments.
(SPARK_Aux_Pragma): Update the comments.
(SPARK_Aux_Pragma_Inherited): Update the comments.
(SPARK_Pragma): Update the comments. The attribute now applies
to all types and abstract states.
(SPARK_Pragma_Inherited): Update the comments. The attribute now
applies to all types and abstract states.
(Set_Contract): Update the comments.
(Set_Ignore_SPARK_Mode_Pragmas): Update the comments.
(Set_SPARK_Aux_Pragma): Update the comments.
(Set_SPARK_Aux_Pragma_Inherited): Update the comments.
(Set_SPARK_Pragma): Update the comments. The attribute now applies to
all types and abstract states.
(Set_SPARK_Pragma_Inherited): Update the comments. The attribute now
applies to all types and abstract states.
(Write_Field40_Name): Add output for SPARK_Pragma when it appears on a
type and abstract states.
* einfo.ads: Update the documentation of attributes SPARK_Pragma and
SPARK_Pragma_Inherited. Both of them now apply to all types and
abstract states.
* exp_util.adb (Set_Elaboration_Flag): Mark the setting of the
elaboration flag as elaboration code.
* sem_ch3.adb: Add with and use clauses for Sem_Elab.
(Analyze_Full_Type_Declaration): Set the SPARK_Mode of the type. Record
a derived type for later processing by the ABE mechanism.
(Analyze_Incomplete_Type_Decl): Set the SPARK_Mode of the type.
(Analyze_Private_Extension_Declaration): Set the SPARK_Mode of the
type.
* sem_ch7.adb (Analyze_Private_Type_Declaration): Set the SPARK_Mode of
the type.
* sem_elab.adb: Define the term "early call region". Update the
terminology for "scenario" and "target". Update the architecture of
the ABE mechanism. Update the steps which must be taken when adding a
new scenario. Update the section on debugging ABE issues. Add new
hash tables Early_Call_Regions and Recorded_SPARK_Scenarios. Add new
table SPARK_Scenarios. Hash table Elaboration_Context is now
Elaboration_Statuses. The majority of Process_xxx routines have been
updated to better reflect their role.
(Add_Unit): Reimplemented.
(Check_Elaboration_Constituent): New routine.
(Check_Elaboration_Scenarios): Verify previously recorded scenarios for
conditional ABE issues. Verify previously recorded SPARK scenarios.
(Check_SPARK_Derived_Type): New routine.
(Check_SPARK_Instantiation): New routine.
(Check_SPARK_Scenario): New routine.
(Check_SPARK_Refined_State_Pragma): New routine.
(Early_Call_Region): New routine.
(Elaboration_Status): New routine.
(Ensure_Prior_Elaboration): Add new formal parameter Prag_Nam. The
implicit Elabotate[_All] pragma is now specified via Prag_Nam.
(Find_Early_Call_Region): New routine.
(Info_Scenario): Add output for refinement constituents.
(Is_Recorded_SPARK_Scenario): New routine.
(Is_Suitable_SPARK_Derived_Type): New routine.
(Is_Suitable_SPARK_Instantiation): New routine.
(Is_Suitable_SPARK_Refined_State_Pragma): New routine.
(Is_Visited_Body): New routine.
(Kill_Elaboration_Scenario): Reimplemented.
(Output_Active_Scenarios): Add output for pragma Refined_State.
(Output_SPARK_Refined_State_Pragma): New routine.
(Process_Conditional_ABE_Call): Remove the use of -gnatd.v. The effect
is now achieved by different means.
(Process_Conditional_ABE_Call_SPARK): Verify that a call which precedes
the subprogram body appears within the early call region of the body.
Either ensure the prior elaboration of external subprograms or verify
that the context meets the suitable elaboration requirement.
(Process_Conditional_ABE_Instantiation_SPARK): New routine.
(Record_Elaboration_Scenario): Reimplement the portion which enforces
the level restrictions of the static model. Add support for SPARK
scenarios.
(Record_SPARK_Elaboration_Scenario): New routine.
(Reset_Visited_Bodies): New routine.
(Set_Early_Call_Region): New routine.
(Set_Elaboration_Status): New routine.
(Set_Is_Recorded_SPARK_Scenario): New routine.
(Update_Elaboration_Scenario): Reimplemented.
* sem_elab.ads: Add new subtype Library_Or_Instantiation_Level.
* sem_prag.adb (Analyze_Refined_State_In_Decl_Part): Save the pragma
for examination by the ABE Processing phase.
(Create_Abstract_State): Save the SPARK_Mode from the context.
* sem_util.adb (Is_Non_Preelaborable_Construct): New routine.
* sem_util.ads (Is_Non_Preelaborable_Construct): New routine.
* sinfo.adb (Is_Elaboration_Code): New routine.
(Set_Is_Elaboration_Code): New routine.
(Nkind_In): Add 10 and 11 parameter versions.
* sinfo.ads: Add new attribute Is_Elaboration_Code along with
occurrences in nodes.
(Is_Elaboration_Code): New routine along with pragma Inline.
(Set_Is_Elaboration_Code): New routine along with pragma Inline.
(Nkind_In): Add 10 and 11 parameter versions.
2017-11-16 Justin Squirek <squirek@adacore.com>
* sem.adb (Analyze): Remove requirement that the original node of N be
an operator in the case that analysis on the node yields the relevant
operator - so prefer it instead.
Bob Duff [Thu, 16 Nov 2017 09:43:24 +0000 (09:43 +0000)]
sem_ch6.adb (Create_Extra_Formals): The type of the BIP_Object_Access formal must not have a designated type that...
2017-11-16 Bob Duff <duff@adacore.com>
* sem_ch6.adb (Create_Extra_Formals): The type of the BIP_Object_Access
formal must not have a designated type that is the full view coming
from a limited-with'ed package.
* sem_util.adb,sem_util.ads (Incomplete_View_From_Limited_With): New
function called from sem_ch6.
* sem_ch5.adb (Analyze_Assignment): Treat user-defined concatenation
specially for b-i-p cases.
Chung-Ju Wu [Thu, 16 Nov 2017 09:38:11 +0000 (09:38 +0000)]
Provide more constraints for future use.
gcc/
* config/nds32/constraints.md: Provide more constraints.
* config/nds32/nds32.h (enum reg_class, REG_CLASS_NAMES,
REG_CLASS_CONTENTS): Define R5_REG, R8_REG, and FRAME_POINTER_REG to
support constraints usage.
Co-Authored-By: Kito Cheng <kito.cheng@gmail.com>
From-SVN: r254800
Bill Schmidt [Wed, 15 Nov 2017 21:21:32 +0000 (21:21 +0000)]
altivec.h (vec_xst_be): New #define.
[gcc]
2017-11-15 Bill Schmidt <wschmidt@linux.vnet.ibm.com>
* config/rs6000/altivec.h (vec_xst_be): New #define.
* config/rs6000/altivec.md (altivec_vperm_<mode>_direct): Rename
and externalize from *altivec_vperm_<mode>_internal.
* config/rs6000/rs6000-builtin.def (XL_BE_V16QI): Remove macro
instantiation.
(XL_BE_V8HI): Likewise.
(XL_BE_V4SI): Likewise.
(XL_BE_V4SI): Likewise.
(XL_BE_V2DI): Likewise.
(XL_BE_V4SF): Likewise.
(XL_BE_V2DF): Likewise.
(XST_BE): Add BU_VSX_OVERLOAD_X macro instantiation.
* config/rs6000/rs6000-c.c (altivec_overloaded_builtins): Correct
all array entries with these keys: VSX_BUILTIN_VEC_XL,
VSX_BUILTIN_VEC_XL_BE, VSX_BUILTIN_VEC_XST. Add entries for key
VSX_BUILTIN_VEC_XST_BE.
* config/rs6000/rs6000.c (altivec_expand_xl_be_builtin): Remove.
(altivec_expand_builtin): Remove handling for VSX_BUILTIN_XL_BE_*
built-ins.
(altivec_init_builtins): Replace conditional calls to def_builtin
for __builtin_vsx_ld_elemrev_{v8hi,v16qi} and
__builtin_vsx_st_elemrev_{v8hi,v16qi} based on TARGET_P9_VECTOR
with unconditional calls. Remove calls to def_builtin for
__builtin_vsx_le_be_<mode>. Add a call to def_builtin for
__builtin_vec_xst_be.
* config/rs6000/vsx.md (vsx_ld_elemrev_v8hi): Convert define_insn
to define_expand, and add alternate RTL generation for P8.
(*vsx_ld_elemrev_v8hi_internal): New define_insn based on
vsx_ld_elemrev_v8hi.
(vsx_ld_elemrev_v16qi): Convert define_insn to define_expand, and
add alternate RTL generation for P8.
(*vsx_ld_elemrev_v16qi_internal): New define_insn based on
vsx_ld_elemrev_v16qi.
(vsx_st_elemrev_v8hi): Convert define_insn
to define_expand, and add alternate RTL generation for P8.
(*vsx_st_elemrev_v8hi_internal): New define_insn based on
vsx_st_elemrev_v8hi.
(vsx_st_elemrev_v16qi): Convert define_insn to define_expand, and
add alternate RTL generation for P8.
(*vsx_st_elemrev_v16qi_internal): New define_insn based on
vsx_st_elemrev_v16qi.
[gcc/testsuite]
2017-11-15 Bill Schmidt <wschmidt@linux.vnet.ibm.com>
H.J. Lu [Wed, 15 Nov 2017 19:30:58 +0000 (19:30 +0000)]
i386: Add X86_TUNE_EMIT_VZEROUPPER
Add X86_TUNE_EMIT_VZEROUPPER to indicate if vzeroupper instruction should
be inserted before a transfer of control flow out of the function. It is
turned on by default unless we are tuning for KNL. Users can always use
-mzeroupper or -mno-zeroupper to override X86_TUNE_EMIT_VZEROUPPER.
gcc/
PR target/82990
* config/i386/i386.c (pass_insert_vzeroupper::gate): Remove
TARGET_AVX512ER check.
(ix86_option_override_internal): Set MASK_VZEROUPPER if
neither -mzeroupper nor -mno-zeroupper is used and
TARGET_EMIT_VZEROUPPER is set.
* config/i386/i386.h (TARGET_EMIT_VZEROUPPER): New.
* config/i386/x86-tune.def: Add X86_TUNE_EMIT_VZEROUPPER.
Will Schmidt [Wed, 15 Nov 2017 19:26:21 +0000 (19:26 +0000)]
rs6000.c (rs6000_gimple_fold_builtin): Add support for folding of vector compares.
2017-11-15 Will Schmidt <will_schmidt@vnet.ibm.com>
[gcc]
* config/rs6000/rs6000.c (rs6000_gimple_fold_builtin): Add support for
folding of vector compares.
(fold_build_vec_cmp): New helper function.
(fold_compare_helper): New helper function.
(builtin_function_type): Add compare builtins to the list of functions
having unsigned arguments. Cosmetic updates to comment indentation.
* config/rs6000/vsx.md (vcmpneb, vcmpneh, vcmpnew): Update to specify
the not+eq combination.
Bin Cheng [Wed, 15 Nov 2017 16:16:49 +0000 (16:16 +0000)]
re PR tree-optimization/82726 (ICE in verify_ssa during GIMPLE pass: pcom)
PR tree-optimization/82726
Revert
2017-01-23 Bin Cheng <bin.cheng@arm.com>
PR tree-optimization/70754
* tree-predcom.c (stmt_combining_refs): New parameter INSERT_BEFORE.
(reassociate_to_the_same_stmt): New parameter INSERT_BEFORE. Insert
combined stmt before it if not NULL.
(combine_chains): Process refs reversely and compute dominance point
for root ref.
Revert
2017-02-23 Bin Cheng <bin.cheng@arm.com>
PR tree-optimization/79663
* tree-predcom.c (combine_chains): Process refs in reverse order
only for ZERO length chains, and add explaining comment.
Palmer Dabbelt [Wed, 15 Nov 2017 15:58:52 +0000 (15:58 +0000)]
RISC-V: Add Jim Wilson as a maintainer
Jim has recently started working at SiFive, where he'll be contributing
to our GCC port. Andrew, Kito and I would like him to be a mainatiner.
My understand is that this is the right place to ask.
I've also changed my address to my SiFive one, where I also work -- it
seems cleaner to have everyone use our proper email address.
ChangeLog
2017-11-15 Palmer Dabbelt <palmer@sifive.com>
* MAINTAINERS (RISC-V): Add Jim Wilson as a maintainer.
Use my SiFive email address.
* testsuite/libgomp.oacc-c-c++-common/f-asyncwait-1.c: New test, copied
from asyncwait-1.f90. Rewrite into C. Rewrite from float to int.
* testsuite/libgomp.oacc-c-c++-common/f-asyncwait-2.c: New test, copied
from asyncwait-2.f90. Rewrite into C. Rewrite from float to int.
* testsuite/libgomp.oacc-c-c++-common/f-asyncwait-3.c: New test, copied
from asyncwait-3.f90. Rewrite into C. Rewrite from float to int.
Nathan Sidwell [Wed, 15 Nov 2017 13:18:09 +0000 (13:18 +0000)]
[PR c++/81574] lambda capture of function reference
https://gcc.gnu.org/ml/gcc-patches/2017-11/msg01200.html
PR c++/81574
* lambda.c (lambda_capture_field_type): Function references are
always catured by reference.
Martin Liska [Wed, 15 Nov 2017 12:39:06 +0000 (13:39 +0100)]
Disable -Wreturn-type by default in all languages other from C++.
2017-11-15 Martin Liska <mliska@suse.cz>
* tree-cfg.c (pass_warn_function_return::execute):
Compare warn_return_type for greater than zero.
2017-11-15 Martin Liska <mliska@suse.cz>
* options.c (gfc_post_options):
Do not set default value of warn_return_type.
* trans-decl.c (gfc_trans_deferred_vars):
Compare warn_return_type for greater than zero.
(generate_local_decl): Likewise
(gfc_generate_function_code): Likewise.
Sebastian Peryt [Wed, 15 Nov 2017 12:27:31 +0000 (13:27 +0100)]
Fix PR82941 and PR82942 by adding proper vzeroupper generation on SKX.
2017-11-15 Sebastian Peryt <sebastian.peryt@intel.com>
gcc/
PR target/82941
PR target/82942
* config/i386/i386.c (pass_insert_vzeroupper): Modify gate condition
to return true on Xeon and not on Xeon Phi.
(ix86_check_avx256_register): Changed to ...
(ix86_check_avx_upper_register): ... this. Add extra check for
VALID_AVX512F_REG_OR_XI_MODE.
(ix86_avx_u128_mode_needed): Changed
ix86_check_avx256_register to ix86_check_avx_upper_register.
(ix86_check_avx256_stores): Changed to ...
(ix86_check_avx_upper_stores): ... this. Changed
ix86_check_avx256_register to ix86_check_avx_upper_register.
(ix86_avx_u128_mode_after): Changed
avx_reg256_found to avx_upper_reg_found. Changed
ix86_check_avx256_stores to ix86_check_avx_upper_stores.
(ix86_avx_u128_mode_entry): Changed
ix86_check_avx256_register to ix86_check_avx_upper_register.
(ix86_avx_u128_mode_exit): Ditto.
* config/i386/i386.h: (host_detect_local_cpu): New define.
2017-11-15 Sebastian Peryt <sebastian.peryt@intel.com>
gcc/testsuite/
PR target/82941
PR target/82942
* gcc.target/i386/pr82941-1.c: New test.
* gcc.target/i386/pr82941-2.c: New test.
* gcc.target/i386/pr82942-1.c: New test.
* gcc.target/i386/pr82942-2.c: New test.
Jakub Jelinek [Wed, 15 Nov 2017 09:01:42 +0000 (10:01 +0100)]
re PR target/82981 (unnecessary __multi3 call for mips64r6 linux kernel)
PR target/82981
* internal-fn.c: Include gimple-ssa.h, tree-phinodes.h and
ssa-iterators.h.
(can_widen_mult_without_libcall): New function.
(expand_mul_overflow): If only checking unsigned mul overflow,
not result, and can do efficiently MULT_HIGHPART_EXPR, emit that.
Don't use WIDEN_MULT_EXPR if it would involve a libcall, unless
no other way works. Add MULT_HIGHPART_EXPR + MULT_EXPR support.
(expand_DIVMOD): Formatting fix.
* expmed.h (expand_mult): Add NO_LIBCALL argument.
* expmed.c (expand_mult): Likewise. Use OPTAB_WIDEN rather
than OPTAB_LIB_WIDEN if NO_LIBCALL is true, and allow it to fail.
Martin Liska [Wed, 15 Nov 2017 07:01:01 +0000 (08:01 +0100)]
Zero vptr in dtor for -fsanitize=vptr.
2017-11-15 Martin Liska <mliska@suse.cz>
* decl.c (begin_destructor_body): In case of VPTR sanitization
(with disabled recovery), zero vptr in order to catch virtual calls
after lifetime of an object.
2017-11-15 Martin Liska <mliska@suse.cz>
Jeff Law [Wed, 15 Nov 2017 06:30:31 +0000 (23:30 -0700)]
explow.c (anti_adjust_stack_and_probe_stack_clash): Avoid probing the red zone for stack_clash_protection_final_dynamic_probe...
* explow.c (anti_adjust_stack_and_probe_stack_clash): Avoid probing
the red zone for stack_clash_protection_final_dynamic_probe targets
when the total dynamic stack size is zero bytes.