[Typo3-dev] character set handling in Typo 3.6: storage and processing
Martin Kutschker
martin.t.kutschker at blackbox.net
Thu Oct 16 15:18:34 CEST 2003
Addendum:
After some more research and thinking I have come to the conclusion that SYS[internalCharset] might not be worth much, because it is far too much work to get PHP and Typo3 to use two- or four-byte Unicode internally.
In fact, for internal processing it is vital that any single-byte character (of a multi-byte encoding) uses only 7 bits and that every byte of a multi-byte sequence has the eighth bit set. This is true for UTF-8 and for EUC-based character sets like gb2312.
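To make the byte-level property concrete, here is a minimal sketch in Python (not PHP, and not Typo3 code) that tests it for a given sample string. The function name is_ascii_safe is made up for this illustration, and the sample strings are arbitrary. It passes for UTF-8 and gb2312, and fails for Shift_JIS, where the second byte of a two-byte character may fall in the 7-bit range.

```python
def is_ascii_safe(text, encoding):
    """Return True if, for every character in `text`, the encoded form is
    either a single 7-bit byte or a sequence whose bytes all have the
    eighth bit set. (Hypothetical helper, for illustration only.)"""
    for ch in text:
        encoded = ch.encode(encoding)
        if len(encoded) == 1:
            # A single-byte character must stay within 7 bits.
            if encoded[0] >= 0x80:
                return False
        else:
            # Every byte of a multi-byte sequence must have bit 8 set.
            if any(b < 0x80 for b in encoded):
                return False
    return True


if __name__ == "__main__":
    print(is_ascii_safe("a\u4e2db", "utf-8"))     # UTF-8
    print(is_ascii_safe("a\u4e2db", "gb2312"))    # EUC-based
    print(is_ascii_safe("\u8868", "shift_jis"))   # trail byte is 0x5C
```

When the property holds, a byte-wise search for an ASCII character such as a backslash or a quote can never match inside a multi-byte character, which is why plain byte-oriented string functions stay usable on such data.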
OTOH, it might be of interest to keep track of whether the "current" character set is single-byte, EUC-based or UTF-8. But this information can be retrieved from the charset name via a lookup table.
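A rough sketch of such a lookup, again in Python for illustration only. The table contents, the class labels and the function name charset_class are assumptions for this example, not an existing Typo3 structure; a real table would list every charset the installation supports.

```python
# Hypothetical lookup table: charset name -> encoding class.
# The entries and the three class labels are illustrative assumptions.
CHARSET_CLASS = {
    "iso-8859-1": "single-byte",
    "iso-8859-2": "single-byte",
    "windows-1252": "single-byte",
    "gb2312": "euc",
    "euc-jp": "euc",
    "euc-kr": "euc",
    "utf-8": "utf-8",
}


def charset_class(name):
    """Return the encoding class for a charset name, or None if the
    name is not in the table. Matching ignores case and outer spaces."""
    return CHARSET_CLASS.get(name.strip().lower())


if __name__ == "__main__":
    for name in ("ISO-8859-1", "GB2312", "UTF-8", "shift_jis"):
        print(name, "->", charset_class(name))
```

Returning None for an unknown name leaves it to the caller to decide whether to fall back to single-byte handling or to reject the charset.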
Masi