]> git.ipfire.org Git - thirdparty/Python/cpython.git/commit
gh-94526: getpath_dirname() no longer encodes the path (GH-97645)
authorMiss Islington (bot) <31488909+miss-islington@users.noreply.github.com>
Fri, 30 Sep 2022 13:38:41 +0000 (06:38 -0700)
committerGitHub <noreply@github.com>
Fri, 30 Sep 2022 13:38:41 +0000 (06:38 -0700)
commit6537bc9a49d31a3dc7f5df1284285bafdd0f56ef
treea62bf9d9587b5c3ceebff37b0da326bf87be6764
parent0fbee30f7166a5b75c738e27e506e2f125bad479
gh-94526: getpath_dirname() no longer encodes the path (GH-97645)

Fix the Python path configuration used to initialized sys.path at
Python startup. Paths are no longer encoded to UTF-8/strict to avoid
encoding errors if it contains surrogate characters (bytes paths are
decoded with the surrogateescape error handler).

getpath_basename() and getpath_dirname() functions no longer encode
the path to UTF-8/strict, but work directly on Unicode strings. These
functions now use PyUnicode_FindChar() and PyUnicode_Substring() on
the Unicode path, rather than strrchr() on the encoded bytes string.
(cherry picked from commit 9f2f1dd131b912e224cd0269adde8879799686c4)

Co-authored-by: Victor Stinner <vstinner@python.org>
Misc/NEWS.d/next/Core and Builtins/2022-09-29-15-19-29.gh-issue-94526.wq5m6T.rst [new file with mode: 0644]
Modules/getpath.c