]> git.ipfire.org Git - thirdparty/postgresql.git/commit
Fix performance bug in regexp's citerdissect/creviterdissect.
authorTom Lane <tgl@sss.pgh.pa.us>
Fri, 20 Aug 2021 18:19:04 +0000 (14:19 -0400)
committerTom Lane <tgl@sss.pgh.pa.us>
Fri, 20 Aug 2021 18:19:04 +0000 (14:19 -0400)
commit9610852ab3a4b4309d8b0d0d3616b83033871a41
tree1297d34f5c50c1f0ffa86da14ed0afa6cf36bf2c
parentfbc1eed8a8dd3178de70de53a0e95786c80f9dbc
Fix performance bug in regexp's citerdissect/creviterdissect.

After detecting a sub-match "dissect" failure (i.e., a backref match
failure) in the i'th sub-match of an iteration node, we should proceed
by adjusting the attempted length of the i'th submatch.  As coded,
though, these functions changed the attempted length of the *last*
sub-match, and only after exhausting all possibilities for that would
they back up to adjust the next-to-last sub-match, and then the
second-from-last, etc; all of which is wasted effort, since only
changing the start or length of the i'th sub-match can possibly make
it succeed.  This oversight creates the possibility for exponentially
bad performance.  Fortunately the problem is masked in most cases by
optimizations or constraints applied elsewhere; which explains why
we'd not noticed it before.  But it is possible to reach the problem
with fairly simple, if contrived, regexps.

Oversight in my commit 173e29aa5.  That's pretty ancient now,
so back-patch to all supported branches.

Discussion: https://postgr.es/m/1808998.1629412269@sss.pgh.pa.us
src/backend/regex/regexec.c