[TYPO3-core] RFC: utf8 in log files

Martin Kutschker martin.kutschker-n0spam at no5pam-blackbox.net
Fri Nov 24 18:31:41 CET 2006


Michael Stucki schrieb:
> 
>> If you set config.stat_apache_niceTitle to 'utf-8' it will not do a
>> transliteration (which is quite pointeless for eg Chinese) but will store
>> the path and page title in UTF8.
> 
> What would be a reason for not using this while forceCharset is set to utf-8
> already? I mean: Would it make sense to set this by default?

Depends on the charset support of your analyzing software. I tried a few 
simple ones and they did not unescape the URLs. Of course I could make 
the proper escaping optional writing real UTF-8 into the logs.

Anyway, making it default does not really depend on forceCharset. If I 
have a Chinese site I may use the popular big5 and still want (nay need) 
  the setting.

> About the enable string "utf-8": I often need to check if I should specify
> "utf8" (MySQL) or "utf-8" (mail encoding). Sometimes even the case
> sensitivity needs to be correct. So, wouldn't it be ok to allow any of
> these values:
> 
> preg_match('/utf\-?8/i')

Ok, but I'd do this in a "conversion" at the beginning to avoid multiple 
regexps.

Masi



More information about the TYPO3-team-core mailing list