[TYPO3-dev] UTF8 problem when parsing XML data...
    Dmitry Dulepov 
    typo3 at accio.lv
       
    Thu Jul 13 14:12:33 CEST 2006
    
    
  
Hi!
Jigal van Hemert wrote:
> Wat is "plain Unicode value"? Unicode is a collection of characters
> divided into groups ("planes" IIRC). These characters can be
> represented in a number of ways. In the example there was an 'entity'
> é. This is not a unicode value, but a numerical representation
> of a character. 233 is in Latin-1 (often encoded in ISO-8859-1) an 'e
> with accent aigu'.
You answered yourself :) There are Unicode characters and there are
representations for them. é refers to the Unicode character with
code 233. And it is represented in utf-8 as 0xC3 0xA9. Entities always 
show character number, they are independent of representation.
Dmitry.
-- 
"It is our choices, that show what we truly are,
far more than our abilities." (A.P.W.B.D.)
    
    
More information about the TYPO3-dev
mailing list