[TYPO3-core] Supporting Thai in the index searach

Martin Kutschker Martin.Kutschker at n0spam-blackbox.net
Thu Oct 19 09:42:57 CEST 2006


Martin Kutschker schrieb:
> Hi!
> 
> To support Thai letters it is necessary to add the character ranges for letters (alpha, not CJK!) and digits into class.lexer.php:
> 
> ($cp >= 0xE01 && $cp <= 0xE4E && $cp != 0xE3F) // letters
> ($cp >= 0xE50 && $cp <= 0xE59) // digits

Just for the records: Thai does not use spaces to separate words and has no 
punctuation. In Thai you have just a long strings of letters, so you have 
to know where a word ends (and another one starts). But they do separate 
sentences - by spaces!

As a result I have currently no idea how to index Thai without a dictionary.

Masi



More information about the TYPO3-team-core mailing list