[Typo3-dev] character set handling in Typo 3.6: storage and processing

Martin Kutschker martin.t.kutschker at blackbox.net
Thu Oct 16 15:18:34 CEST 2003


Addendum:

After some more research and thinking I have come to the conclusion that SYS[internalCharset] might no be of much worth, because it is way too much work to get PHP and Typo3 into using two- or four-byte Unicode.

In fact for internal processing it is vital that any single-byte character (of a multi-byt encoding) uses only 7-bit and that every character of a mult-byte sequence has the eight bit set. This is true for UTF-8 and EUC-based character sets like gb2312.

OTOH, it might be of interes to keep track wether the "current" character set is single-byte, EUC-base or UTF-8. But this information can be retrieved via the name and a lookup table.

Masi






More information about the TYPO3-dev mailing list