]> git.ipfire.org Git - thirdparty/sqlite.git/commit
Add an experimental tokenizer to fts4 - "unicode". This tokenizer works in the same...
authordan <dan@noemail.net>
Fri, 25 May 2012 17:50:19 +0000 (17:50 +0000)
committerdan <dan@noemail.net>
Fri, 25 May 2012 17:50:19 +0000 (17:50 +0000)
commit3d403c71a8c6610b40b0b34192afb3a244dfc484
tree33891638b6e6b62c4227b23f319f928fe72e407a
parent3773b29167a13af8c8cb1cef14dda5772f0a2232
Add an experimental tokenizer to fts4 - "unicode". This tokenizer works in the same way except that it understands unicode "simple case folding" and recognizes all characters not classified as "Letters" or "Numbers" by unicode as token separators.

FossilOrigin-Name: 0c13570ec78c6887103dc99b81b470829fa28385
13 files changed:
ext/fts3/fts3.c
ext/fts3/fts3Int.h
ext/fts3/fts3_unicode.c [new file with mode: 0644]
ext/fts3/fts3_unicode2.c [new file with mode: 0644]
ext/fts3/unicode/CaseFolding.txt [new file with mode: 0644]
ext/fts3/unicode/UnicodeData.txt [new file with mode: 0644]
ext/fts3/unicode/mkunicode.tcl [new file with mode: 0644]
main.mk
manifest
manifest.uuid
test/fts4unicode.test [new file with mode: 0644]
test/permutations.test
tool/mksqlite3c.tcl