scaling or division by *max* should not be skipped even if not
otherwise needed to prevent overflow or loss of precision.
-The assertion that hi*hi >= 1.0 is a bit subtle. Each vector element
+The assertion that hi*hi <= 1.0 is a bit subtle. Each vector element
gets scaled to a magnitude below 1.0. The Veltkamp-Dekker splitting
algorithm gives a *hi* value that is correctly rounded to half
precision. When a value at or below 1.0 is correctly rounded, it
expansion of sqrt(h**2 + x) == h + x/(2*h) + O(x**2).
Essentially, this differential correction is equivalent to one
-refinement step in the Newton divide-and-average square root
+refinement step in Newton's divide-and-average square root
algorithm, effectively doubling the number of accurate bits.
This technique is used in Dekker's SQRT2 algorithm and again in
Borges' ALGORITHM 4 and 5.