From: Raymond Hettinger Date: Sat, 5 Oct 2002 15:25:33 +0000 (+0000) Subject: Backport 1.88 and 1.87: X-Git-Tag: v2.2.2b1~59 X-Git-Url: http://git.ipfire.org/gitweb.cgi?a=commitdiff_plain;h=216c977b4ae303fbe64b3d3b1f74ca6574bfda8d;p=thirdparty%2FPython%2Fcpython.git Backport 1.88 and 1.87: Update docs to reflect that \b considers underscore to be part of a word. --- diff --git a/Doc/lib/libre.tex b/Doc/lib/libre.tex index 52330daf4a4b..ab94c48ed3a4 100644 --- a/Doc/lib/libre.tex +++ b/Doc/lib/libre.tex @@ -331,13 +331,17 @@ escapes are treated as characters. \item[\code{\e b}] Matches the empty string, but only at the beginning or end of a word. A word is defined as a sequence of -alphanumeric characters, so the end of a word is indicated by -whitespace or a non-alphanumeric character. Inside a character range, -\regexp{\e b} represents the backspace character, for compatibility with -Python's string literals. - -\item[\code{\e B}] Matches the empty string, but only when it is -\emph{not} at the beginning or end of a word. +alphanumeric or underscore characters, so the end of a word is indicated by +whitespace or a non-alphanumeric, non-underscore character. Note that +{}\code{\e b} is defined as the boundary between \code{\e w} and \code{\e +W}, so the precise set of characters deemed to be alphanumeric depends on the +values of the \code{UNICODE} and \code{LOCALE} flags. Inside a character +range, \regexp{\e b} represents the backspace character, for compatibility +with Python's string literals. + +\item[\code{\e B}] Matches the empty string, but only when it is \emph{not} +at the beginning or end of a word. This is just the opposite of {}\code{\e +b}, so is also subject to the settings of \code{LOCALE} and \code{UNICODE}. \item[\code{\e d}]Matches any decimal digit; this is equivalent to the set \regexp{[0-9]}.