git.ipfire.org Git - thirdparty/Python/cpython.git/commit

]> git.ipfire.org Git - thirdparty/Python/cpython.git/commit

projects / thirdparty / Python / cpython.git / commit

author	Marta Gómez Macías <mgmacias@google.com>
	Sun, 21 May 2023 00:03:02 +0000 (02:03 +0200)
committer	GitHub <noreply@github.com>
	Sun, 21 May 2023 00:03:02 +0000 (01:03 +0100)
commit	6715f91edcf6f379f666e18f57b8a0dcb724bf79
tree	25724d6eb5b8ff5e713f7bfd8f6c33e5a6d87f62	tree \| snapshot
parent	3ed57e4995d9f8583083483f397ddc3131720953	commit \| diff

gh-102856: Python tokenizer implementation for PEP 701 (#104323)

This commit replaces the Python implementation of the tokenize module with an implementation
that reuses the real C tokenizer via a private extension module. The tokenize module now implements
a compatibility layer that transforms tokens from the C tokenizer into Python tokenize tokens for backward
compatibility.

As the C tokenizer does not emit some tokens that the Python tokenizer provides (such as comments and non-semantic newlines), a new special mode has been added to the C tokenizer mode that currently is only used via
the extension module that exposes it to the Python layer. This new mode forces the C tokenizer to emit these new extra tokens and add the appropriate metadata that is needed to match the old Python implementation.

Co-authored-by: Pablo Galindo <pablogsal@gmail.com>

22 files changed:

Doc/library/token-list.inc		diff \| blob \| blame \| history
Doc/library/token.rst		diff \| blob \| blame \| history
Grammar/Tokens		diff \| blob \| blame \| history
Include/internal/pycore_global_objects_fini_generated.h		diff \| blob \| blame \| history
Include/internal/pycore_global_strings.h		diff \| blob \| blame \| history
Include/internal/pycore_runtime_init_generated.h		diff \| blob \| blame \| history
Include/internal/pycore_token.h		diff \| blob \| blame \| history
Include/internal/pycore_unicodeobject_generated.h		diff \| blob \| blame \| history
Lib/inspect.py		diff \| blob \| blame \| history
Lib/tabnanny.py		diff \| blob \| blame \| history
Lib/test/test_tabnanny.py		diff \| blob \| blame \| history
Lib/test/test_tokenize.py		diff \| blob \| blame \| history
Lib/token.py		diff \| blob \| blame \| history
Lib/tokenize.py		diff \| blob \| blame \| history
Misc/NEWS.d/next/Core and Builtins/2023-05-20-23-08-48.gh-issue-102856.Knv9WT.rst	[new file with mode: 0644]	blob
Parser/pegen.c		diff \| blob \| blame \| history
Parser/pegen_errors.c		diff \| blob \| blame \| history
Parser/token.c		diff \| blob \| blame \| history
Parser/tokenizer.c		diff \| blob \| blame \| history
Parser/tokenizer.h		diff \| blob \| blame \| history
Python/Python-tokenize.c		diff \| blob \| blame \| history
Python/clinic/Python-tokenize.c.h		diff \| blob \| blame \| history

Mirror of https://github.com/python/cpython.git

RSS Atom