[TYPO3-dev] UTF8 problem when parsing XML data...
    Dmitry Dulepov 
    typo3 at accio.lv
       
    Thu Jul 13 09:25:48 CEST 2006
    
    
  
Hi!
Jigal van Hemert wrote:
> I think the problem lies in the original XML. The "special-chars" as
> mentioned are not encoded using the encoding that was indicated in the
> XML-header, but represented by (numerical) entities.
And they are always Unicode code points (not UTF-8 encoded bytes, but the
plain Unicode value!). I checked the character from the original message,
and it is a correct Unicode symbol.
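
To illustrate the point about numeric entities, here is a minimal Python sketch. The sample document and the character U+0101 are made up for illustration and are not taken from the original message. It suggests that a numeric reference resolves to the same code point whatever encoding the prolog declares, and that the UTF-8 bytes only appear once the text is encoded for output.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document: the prolog declares ISO-8859-1, yet the
# character is written as the numeric reference &#257; (U+0101).
doc = b'<?xml version="1.0" encoding="iso-8859-1"?><t>&#257;</t>'

# The parser resolves the reference to a Unicode code point,
# independent of the declared document encoding.
text = ET.fromstring(doc).text
code_point = ord(text)

# UTF-8 bytes exist only after an explicit encoding step.
utf8_bytes = text.encode("utf-8")

print(hex(code_point))  # 0x101
print(utf8_bytes)       # b'\xc4\x81'
```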
> At some moment in the process chain the entities are converted to
> characters (can be anywhere between XML transformation and displaying
> the content in the browser). It seems that the entities are converted to
> UTF-8 encoded characters, but that the output is interpreted as
> ISO-xxxx-x (ISO-8859-1?).
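
The mismatch described in the quoted paragraph can be reproduced in a few lines of Python. This is only a sketch with an assumed sample character (U+0101), not the actual data from the thread: the text is encoded as UTF-8 and the resulting bytes are then read as ISO-8859-1.

```python
# Assumed sample character for illustration: U+0101.
text = "\u0101"

# Step 1: the entity has been converted to UTF-8 bytes somewhere in the chain.
utf8_bytes = text.encode("utf-8")

# Step 2: a later stage wrongly interprets those bytes as ISO-8859-1,
# so one character turns into two unrelated ones.
misread = utf8_bytes.decode("iso-8859-1")

# The damage is reversible as long as no bytes were lost on the way.
repaired = misread.encode("iso-8859-1").decode("utf-8")

print(len(text), len(misread))  # 1 2
print(repaired == text)         # True
```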
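The XML should declare the correct character set in its prolog. Otherwise
the parser falls back to its default: UTF-8 per the XML specification,
though some parsers (PHP's XML extension with its default source encoding,
for example) assume ISO-8859-1.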
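
As a rough illustration of why the declaration in the prolog matters, the Python sketch below parses the same raw byte with and without an encoding declaration. The byte 0xE9 and the element name are assumptions chosen for the example. Python's expat-based parser assumes UTF-8 when nothing is declared; other toolchains may pick a different default.

```python
import xml.etree.ElementTree as ET

# The same raw byte 0xE9 (e with acute accent in ISO-8859-1) in both documents.
with_decl = b'<?xml version="1.0" encoding="iso-8859-1"?><t>\xe9</t>'
without_decl = b'<t>\xe9</t>'

# With the declaration, the byte is decoded as ISO-8859-1.
declared = ET.fromstring(with_decl).text

# Without it, this parser assumes UTF-8, where a lone 0xE9 is invalid.
try:
    ET.fromstring(without_decl)
    undeclared_failed = False
except ET.ParseError:
    undeclared_failed = True

print(declared == "\u00e9")  # True
print(undeclared_failed)     # True
```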
Dmitry.
-- 
"It is our choices, that show what we truly are,
far more than our abilities." (A.P.W.B.D.)
    
    