gh-100210: Correct the comment link for unescaping HTML (GH-100212)
(cherry picked from commit
9a07eff628c1cd88b7cdda88a8fd0db3fe7ea552)
gh-100210: correct the comment link for unescaping HTML
Co-authored-by: Jean-Christophe Amiel <jeanchristophe.amiel@gmail.com>
Co-authored-by: Ezio Melotti <ezio.melotti@gmail.com>
return s
-# see http://www.w3.org/TR/html5/syntax.html#tokenizing-character-references
+# see https://html.spec.whatwg.org/multipage/parsing.html#numeric-character-reference-end-state
_invalid_charrefs = {
0x00: '\ufffd', # REPLACEMENT CHARACTER