[Typo3-dev] help needed: charset tables (Asian languages)
Kasper Skårhøj
kasper at typo3.com
Wed Mar 31 12:39:21 CEST 2004
TYPO3 only needs simple string functions to support multibyte charsets.
I think the greatest problem is strtoupper and strtolower. All regex
usage is mostly on tokens in text, nothing else.
Then of course supprot for UTF-8 in the database is needed if you want
search results and ordering to work right. And extensions like
indexed_search would need modification as well.
YOu work in CVS looks good! THanks a lot!
- kasper
On Wed, 2004-03-31 at 11:22, Martin T. Kutschker wrote:
> Kasper Skårhøj wrote:
> > Maybe it would be fair to require mbstring et all for those charsets?
> > After all people using these sets are more likely to have conversion
> > support compiled in than western europe.
> >
> > At least it would be a fair tradeoff with a very limited support.
>
> I'd reckon that there is no Japanese server that has not mbstring
> support (mbstring has it's roots in Japan).
>
> And for Typo3 supporting it: it depends on what the Core needs. What
> set of string fucntions is a must, what functions are would be nice to
> have? Without these infos it is hard to determine if it is worth the
> hassle. Basic functions (lenght, substr) are fairly straightforward,
> search functions are more tricky and regular expressions *) are hopeless.
>
> Masi
>
> *) Depends on the expression, some will work, some not.
>
> Eg for character classes you need a workaraound:
>
> [aäsß] => [as]|ä|ß
>
> "." must be written to match a multibyte sequence. But that works only
> for utf-8!
>
> This should be done by the programmer. I'm not sure if a regexp
> "preparser" is an easy task (a simple replace won't do).
>
> _______________________________________________
> Typo3-dev mailing list
> Typo3-dev at lists.netfielders.de
> http://lists.netfielders.de/cgi-bin/mailman/listinfo/typo3-dev
--
Best regards
- kasper
---------------
"Zero MS-gravity" - finally migrated to Linux!
More information about the TYPO3-dev
mailing list