From: Raymond Hettinger <python@rcn.com>
Date: Sat, 5 Oct 2002 15:25:33 +0000 (+0000)
Subject: Backport 1.88 and 1.87:
X-Git-Tag: v2.2.2b1~59
X-Git-Url: http://git.ipfire.org/gitweb.cgi?a=commitdiff_plain;h=216c977b4ae303fbe64b3d3b1f74ca6574bfda8d;p=thirdparty%2FPython%2Fcpython.git

Backport 1.88 and 1.87:

Update docs to reflect that \b considers underscore to be part of a word.
---

diff --git a/Doc/lib/libre.tex b/Doc/lib/libre.tex
index 52330daf4a4b..ab94c48ed3a4 100644
--- a/Doc/lib/libre.tex
+++ b/Doc/lib/libre.tex
@@ -331,13 +331,17 @@ escapes are treated as characters.
 
 \item[\code{\e b}] Matches the empty string, but only at the
 beginning or end of a word.  A word is defined as a sequence of
-alphanumeric characters, so the end of a word is indicated by
-whitespace or a non-alphanumeric character.  Inside a character range,
-\regexp{\e b} represents the backspace character, for compatibility with
-Python's string literals.
-
-\item[\code{\e B}] Matches the empty string, but only when it is
-\emph{not} at the beginning or end of a word.
+alphanumeric or underscore characters, so the end of a word is indicated by
+whitespace or a non-alphanumeric, non-underscore character.  Note that 
+{}\code{\e b} is defined as the boundary between \code{\e w} and \code{\e
+W}, so the precise set of characters deemed to be alphanumeric depends on the
+values of the \code{UNICODE} and \code{LOCALE} flags.  Inside a character
+range, \regexp{\e b} represents the backspace character, for compatibility
+with Python's string literals.
+
+\item[\code{\e B}] Matches the empty string, but only when it is \emph{not}
+at the beginning or end of a word.  This is just the opposite of {}\code{\e
+b}, so is also subject to the settings of \code{LOCALE} and \code{UNICODE}.
 
 \item[\code{\e d}]Matches any decimal digit; this is
 equivalent to the set \regexp{[0-9]}.