From: Paul Eggert
Date: Sat, 23 Sep 2023 07:23:26 +0000 (-0700)
Subject: doc: mention Unicode exceptions for wc
X-Git-Tag: v9.5~143
X-Git-Url: http://git.ipfire.org/cgi-bin/gitweb.cgi?a=commitdiff_plain;h=9ecc4f4e44ef8797d1fcd01574ebb71999744d73;p=thirdparty%2Fcoreutils.git

doc: mention Unicode exceptions for wc
---

diff --git a/doc/coreutils.texi b/doc/coreutils.texi
index 4167660a7c..ee3b1ce11a 100644
--- a/doc/coreutils.texi
+++ b/doc/coreutils.texi
@@ -3859,6 +3859,13 @@ space delimited by white space characters or by start or end of input.
 The current locale determines which characters are white space.
 GNU @command{wc} treats encoding errors as non white space.
 
+@vindex POSIXLY_CORRECT
+Unless the environment variable @env{POSIXLY_CORRECT} is set,
+GNU @command{wc} treats the following Unicode characters as white
+space even if the current locale does not: U+00A0 NO-BREAK SPACE,
+U+2007 FIGURE SPACE, U+202F NARROW NO-BREAK SPACE, and U+2060 WORD
+JOINER.
+
 @item -l
 @itemx --lines
 @opindex -l