git.ipfire.org Git - thirdparty/postgresql.git/commit
Fix performance bug in regexp's citerdissect/creviterdissect.
author Tom Lane <tgl@sss.pgh.pa.us>
Fri, 20 Aug 2021 18:19:04 +0000 (14:19 -0400)
committer Tom Lane <tgl@sss.pgh.pa.us>
Fri, 20 Aug 2021 18:19:04 +0000 (14:19 -0400)
commit cafebd6638d026a9340847e875148e5026c3ff45
tree b4e16919e469212ea6e942797dd9255388f683ac
parent cc7fae5c2d9effad8a907ffacb4190aca136f87b

After detecting a sub-match "dissect" failure (i.e., a backref match
failure) in the i'th sub-match of an iteration node, we should proceed
by adjusting the attempted length of the i'th sub-match.  As coded,
though, these functions changed the attempted length of the *last*
sub-match, and only after exhausting all possibilities for that would
they back up to adjust the next-to-last sub-match, and then the
second-from-last, etc.; all of which is wasted effort, since only
changing the start or length of the i'th sub-match can possibly make
it succeed.  This oversight creates the possibility for exponentially
bad performance.  Fortunately the problem is masked in most cases by
optimizations or constraints applied elsewhere; which explains why
we'd not noticed it before.  But it is possible to reach the problem
with fairly simple, if contrived, regexps.
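
The difference between the two backtracking orders can be sketched with
a toy model.  This is not the code in regexec.c: every name below
(piece_ok, naive, targeted, run, required_len0) is hypothetical, and the
"dissect" check is simulated by a rule that piece 0 verifies only at one
particular length.  The sketch assumes a string of n characters split
into k non-empty pieces, and it only counts how many simulated dissect
attempts each strategy makes.

```c
#include <assert.h>

#define MAX_PIECES 16

/* Toy model only: none of these names come from regexec.c. */
static long checks;          /* simulated dissect attempts so far */
static int  required_len0;   /* piece 0 verifies only at this length */
static int  lens[MAX_PIECES];

/* Simulated "dissect" of piece i covering [start, start + len). */
static int piece_ok(int i, int start, int len)
{
    (void) start;            /* unused in this toy rule */
    checks++;
    return i != 0 || len == required_len0;
}

/* Verify a complete split in order, stopping at the first failure. */
static int verify_all(int k)
{
    int start = 0;

    for (int i = 0; i < k; i++) {
        if (!piece_ok(i, start, lens[i]))
            return 0;
        start += lens[i];
    }
    return 1;
}

/*
 * Strategy A: choose a complete split, then verify it.  After a failure
 * in any piece, the next split tried differs in the LAST pieces first.
 */
static int naive(int i, int k, int remaining)
{
    if (i == k - 1) {        /* last piece takes whatever is left */
        lens[i] = remaining;
        return verify_all(k);
    }
    for (int len = 1; len <= remaining - (k - 1 - i); len++) {
        lens[i] = len;
        if (naive(i + 1, k, remaining - len))
            return 1;
    }
    return 0;
}

/*
 * Strategy B: verify piece i as soon as its length is chosen; on
 * failure, adjust the length of piece i itself.
 */
static int targeted(int i, int k, int start, int remaining)
{
    if (i == k - 1)
        return piece_ok(i, start, remaining);
    for (int len = 1; len <= remaining - (k - 1 - i); len++) {
        if (piece_ok(i, start, len) &&
            targeted(i + 1, k, start + len, remaining - len))
            return 1;
    }
    return 0;
}

typedef struct { int found; long checks; } Result;

/* Run one strategy on n characters split into k pieces. */
static Result run(int use_targeted, int n, int k, int req_len0)
{
    Result r;

    assert(k >= 1 && k <= MAX_PIECES && n >= k);
    checks = 0;
    required_len0 = req_len0;
    r.found = use_targeted ? targeted(0, k, 0, n) : naive(0, k, n);
    r.checks = checks;
    return r;
}
```

In this toy model, run(0, 12, 4, 0) makes 165 checks before giving up
(one per way of splitting 12 characters into 4 non-empty pieces), while
run(1, 12, 4, 0) makes 9 (one per possible length of piece 0).  The gap
widens as k grows, which is the kind of wasted effort described above.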

Oversight in my commit 173e29aa5.  That's pretty ancient now,
so back-patch to all supported branches.

Discussion: https://postgr.es/m/1808998.1629412269@sss.pgh.pa.us
src/backend/regex/regexec.c