git.ipfire.org Git - thirdparty/gcc.git/commit

author	Julian Brown <julian@codesourcery.com>
	Mon, 30 Nov 2020 19:10:04 +0000 (11:10 -0800)
committer	Julian Brown <julian@codesourcery.com>
	Wed, 13 Jan 2021 01:05:24 +0000 (17:05 -0800)
commit	4cdacf5193be792cb0e73f973247da7c97182bb3
tree	b2508999be35ce2bdd18fe42e3ff41001673835d
parent	d4e8393670ea05eadd0ac2308a99666b69362822

amdgcn: Improve FP division accuracy

GCN has a reciprocal-approximation instruction but no
hardware divide. This patch adjusts the open-coded reciprocal
approximation/Newton-Raphson refinement steps to use fused multiply-add
instructions, as is necessary to obtain a properly rounded result, and
adds further refinement steps to correctly round the full division result.

The patterns in question are still guarded by a flag_reciprocal_math
condition, and do not yet support denormals.

Backport from mainline:

2021-01-13 Julian Brown <julian@codesourcery.com>

gcc/
* config/gcn/gcn-valu.md (recip<mode>2<exec>, recip<mode>2): Use unspec
for reciprocal-approximation instructions.
(div<mode>3): Use fused multiply-accumulate operations for reciprocal
refinement and division result.
* config/gcn/gcn.md (UNSPEC_RCP): New unspec constant.

gcc/testsuite/
* gcc.target/gcn/fpdiv.c: New test.

(cherry picked from commit c8812bac8ee39f73ea881e4f6260acf5590b4190)

gcc/config/gcn/gcn-valu.md
gcc/config/gcn/gcn.md
gcc/testsuite/gcc.target/gcn/fpdiv.c	[new file with mode: 0644]