[TYPO3-dev] problem:: report & analysis: 4.1.2 is coming with codepage inconsistancy

Martin Bless m.bless at gmx.de
Tue Aug 21 09:44:36 CEST 2007


Michael Stucki wrote on Mon, 20 Aug 2007 22:16:54 +0200:

>thanks for your report.

I'm happy it's appreciated.

>> [file_002] is UTF-8 encoded and since I'm using an all UTF-8
>> installation works fine.
>
>Are there no errors, or did you verify that it contains UTF8 data all over
>the place?

First of all I verified that it does containt only UTF8 data.
Afterwards I did check what characters are of ordinal value > 127.
This is what I found:

cnt, hex   ,   dec , unicodename
 33, 0xa0  ,    160, NO-BREAK SPACE
  2, 0xe5  ,    229, LATIN SMALL LETTER A WITH RING ABOVE
  1, 0xe4  ,    228, LATIN SMALL LETTER A WITH DIAERESIS
  1, 0xe0  ,    224, LATIN SMALL LETTER A WITH GRAVE
  1, 0xe6  ,    230, LATIN SMALL LETTER AE
  4, 0x2013,   8211, EN DASH
  2, 0xf8  ,    248, LATIN SMALL LETTER O WITH STROKE
  1, 0xd8  ,    216, LATIN CAPITAL LETTER O WITH STROKE
  1, 0xfc  ,    252, LATIN SMALL LETTER U WITH DIAERESIS

Since we know its static template data and there are no other bug
reports and so on I think we can be very sure it's a formally and
semantically "good" UTF8 file.

>> [file_001] is the problem. It is - in general - Latin-1 encoded except
>> for six times where the byte sequence ï, ¿, ½ occurs.
>> These 3 bytes are UTF-8 for Unicode � which is the 'unkown
>> value' symbol.

[...]

>Can you please make a diff of your changes and post them in the bugtracker?
>Thanks in advance!

I did already yesterday: http://bugs.typo3.org/view.php?id=6177
Currently I'm not using diff very much, so i uploaded the complete
file.

have a nice day

Martin




More information about the TYPO3-dev mailing list