[TYPO3-dev] UTF8 problem when parsing XML data...

Dmitry Dulepov typo3 at accio.lv
Thu Jul 13 09:25:48 CEST 2006


Hi!

Jigal van Hemert wrote:
> I think the problem lies in the original XML. The "special-chars" as
> mentioned are not encoded using the encoding that was indicated in the
> XML-header, but represented by (numerical) entities.

And they are always Unicode characters (not utf-8 encoded, but plain 
Unicode value!). I checked character from original message, it is 
correct Unicode symbol.

> At some moment in the process chain the entities are converted to
> characters (can be anywhere between XML transformation and displaying
> the content in the browser). It seems that the entities are converted to
> UTF-9 encoded characters, but that the output is interpreted as
> ISO-xxxx-x (ISO-8859-1?).

XML should have correct character set in prologue. Otherwise it is 
treated as iso-8859-1.

Dmitry.
-- 
"It is our choices, that show what we truly are,
far more than our abilities." (A.P.W.B.D.)




More information about the TYPO3-dev mailing list