[TYPO3-core] Supporting Thai in the index search
Martin Kutschker
Martin.Kutschker at n0spam-blackbox.net
Thu Oct 19 09:42:57 CEST 2006
Martin Kutschker wrote:
> Hi!
>
> To support Thai letters it is necessary to add the character ranges for letters (alpha, not CJK!) and digits into class.lexer.php:
>
> ($cp >= 0xE01 && $cp <= 0xE4E && $cp != 0xE3F) // letters
> ($cp >= 0xE50 && $cp <= 0xE59) // digits
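The two range checks quoted above could be wrapped in a small helper along these lines. This is only a sketch: the function name isThaiWordChar is hypothetical and is not part of class.lexer.php, and how the lexer would actually call it is an assumption.

```php
<?php
// Hypothetical helper (not part of class.lexer.php): reports whether a
// Unicode code point falls into the Thai letter or digit ranges quoted above.
function isThaiWordChar(int $cp): bool
{
    // Letters and marks U+0E01..U+0E4E, excluding U+0E3F (the baht currency sign)
    $isLetter = ($cp >= 0xE01 && $cp <= 0xE4E && $cp != 0xE3F);
    // Thai digits U+0E50..U+0E59
    $isDigit = ($cp >= 0xE50 && $cp <= 0xE59);

    return $isLetter || $isDigit;
}
```

For example, isThaiWordChar(0xE01) is true for the first Thai letter, while isThaiWordChar(0xE3F) is false because the currency sign is excluded from the letter range.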
Just for the record: Thai does not use spaces to separate words and has no
punctuation. Thai text is just one long string of letters, so you have
to know where one word ends and the next one starts. Sentences, however, are
separated - by spaces!
As a result, I currently have no idea how to index Thai without a dictionary.
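To illustrate why a dictionary comes into play, here is a minimal sketch of greedy longest-match segmentation. It is not something the indexer provides: the function names utf8Chars and segmentLongestMatch and the word list passed in are all hypothetical, and real Thai segmentation would need a far larger dictionary and a better strategy.

```php
<?php
// Hypothetical helper: split a UTF-8 string into an array of characters.
function utf8Chars(string $s): array
{
    $chars = preg_split('//u', $s, -1, PREG_SPLIT_NO_EMPTY);
    return $chars === false ? [] : $chars;
}

// Hypothetical sketch: greedy longest-match segmentation against a word list.
// $dictionary is an assumed array of known words supplied by the caller.
function segmentLongestMatch(string $text, array $dictionary): array
{
    $known = array_flip($dictionary);   // word => index, for fast lookup
    $maxLen = 0;
    foreach ($dictionary as $word) {
        $maxLen = max($maxLen, count(utf8Chars($word)));
    }

    $chars = utf8Chars($text);
    $length = count($chars);
    $words = [];
    $pos = 0;
    while ($pos < $length) {
        $matchLen = 1;                  // fallback: emit a single character
        // Try the longest candidate first, then shrink.
        for ($len = min($maxLen, $length - $pos); $len >= 1; $len--) {
            $candidate = implode('', array_slice($chars, $pos, $len));
            if (isset($known[$candidate])) {
                $matchLen = $len;
                break;
            }
        }
        $words[] = implode('', array_slice($chars, $pos, $matchLen));
        $pos += $matchLen;
    }
    return $words;
}
```

With the word list ['the', 'cat', 'cats', 'sat', 'at'], the unspaced input 'thecatsat' comes out as ['the', 'cats', 'at'] rather than 'the cat sat'. The same happens in Thai: with the words ตา, กลม, ตาก and ลม, the string ตากลม is split as ตาก + ลม even though ตา + กลม is an equally valid reading. So even with a dictionary, word boundaries remain ambiguous.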
Masi