git.ipfire.org Git - thirdparty/Python/cpython.git/commit
[3.13] gh-119118: Fix performance regression in tokenize module (GH-119615) (#119682)
author Miss Islington (bot) <31488909+miss-islington@users.noreply.github.com>
Tue, 28 May 2024 20:47:45 +0000 (22:47 +0200)
committer GitHub <noreply@github.com>
Tue, 28 May 2024 20:47:45 +0000 (22:47 +0200)
commit 0d0be6b3efeace4743329f81c08f9720cc221207
tree afeec3b81a48161eef6c675658a9e7aac55aaf36
parent c0e99617985d64e6134964f758ae0a1a20f9f433

- Cache line object to avoid creating a Unicode object
  for all of the tokens in the same line.
- Speed up byte offset to column offset conversion by using the
  smallest buffer possible to measure the difference.
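
The two bullets above can be illustrated with a minimal sketch. It is written in Python for brevity and is not the C code from Parser/pegen.c or Python/Python-tokenize.c; the names `col_from_line_start` and `LineOffsetCache` are hypothetical. It assumes the tokenizer reports UTF-8 byte offsets that fall on character boundaries, and that tokens on a line are visited left to right. The baseline decodes from the start of the line for every token; the cached variant keeps the current line and decodes only the bytes between the previous offset and the new one.

```python
# Illustrative sketch only: hypothetical names, not CPython's C implementation.

def col_from_line_start(line: bytes, byte_offset: int) -> int:
    """Baseline: decode every byte from the start of the line."""
    return len(line[:byte_offset].decode("utf-8"))


class LineOffsetCache:
    """Remember the current line and the last converted position."""

    def __init__(self) -> None:
        self.line = None      # line currently cached
        self.last_byte = 0    # last byte offset converted on that line
        self.last_col = 0     # column offset matching last_byte

    def column(self, line: bytes, byte_offset: int) -> int:
        if line is not self.line or byte_offset < self.last_byte:
            # Different line, or moving backwards: start over.
            self.line = line
            self.last_byte = 0
            self.last_col = 0
        # Measure only the bytes between the previous offset and this one.
        delta = len(line[self.last_byte:byte_offset].decode("utf-8"))
        self.last_byte = byte_offset
        self.last_col += delta
        return self.last_col


line = "naïve = 'café' + x".encode("utf-8")
# Byte offsets of token starts and ends, all on character boundaries.
byte_offsets = [0, 6, 7, 8, 9, 16, 17, 18, 19, 20]

cache = LineOffsetCache()
cached_cols = [cache.column(line, b) for b in byte_offsets]
baseline_cols = [col_from_line_start(line, b) for b in byte_offsets]
print(cached_cols == baseline_cols)
```

With many tokens on a long line, the baseline re-decodes the same prefix repeatedly, so its cost grows roughly quadratically with line length, while the cached variant touches each byte about once.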

(cherry picked from commit d87b0151062e36e67f9e42e1595fba5bf23a485c)

Co-authored-by: Lysandros Nikolaou <lisandrosnik@gmail.com>
Co-authored-by: Pablo Galindo <pablogsal@gmail.com>
Misc/NEWS.d/next/Library/2024-05-28-12-15-03.gh-issue-119118.FMKz1F.rst [new file with mode: 0644]
Parser/pegen.c
Parser/pegen.h
Python/Python-tokenize.c