git.ipfire.org Git - thirdparty/dovecot/core.git/commitdiff
lib-fts: Update comment on tr29 rules.
author Teemu Huovila <teemu.huovila@dovecot.fi>
Mon, 17 Aug 2015 10:14:44 +0000 (13:14 +0300)
committer Teemu Huovila <teemu.huovila@dovecot.fi>
Mon, 17 Aug 2015 10:14:44 +0000 (13:14 +0300)
src/lib-fts/fts-tokenizer-generic.c

index e30a9b1cda2967b0687a44527a08e8244716bc78..835413f0ae4a571e7537fca1e0a6111ec06764d4 100644 (file)
@@ -594,6 +594,10 @@ static struct letter_fn letter_fns[] = {
   #29, but tailored for FTS purposes.
   http://www.unicode.org/reports/tr29/
 
+  Note: The text of tr29 is a living standard, so it keeps
+  changing. In newer specs some characters are combined, like AHLetter
+  (ALetter | Hebrew_Letter) and MidNumLetQ (MidNumLet | Single_Quote).
+
   Adaptions:
   * No word boundary at Start-Of-Text or End-of-Text (Wb1 and WB2).
   * Break just once, not before and after.