[TYPO3-core] RFC: utf8 in log files

Martin Kutschker Martin.Kutschker at n0spam-blackbox.net
Fri Nov 24 13:41:14 CET 2006


Hi!

This is a SVN patch request.

To get meaningful results in web analyzers for non-latin charsets the URL 
should be encoded in UTF8. The attached patch allows this.

If you set config.stat_apache_niceTitle to 'utf-8' it will not do a 
transliteration (which is quite pointeless for eg Chinese) but will store 
the path and page title in UTF8.

Additionally I have added config.stat_pageLen. Works like 
config.stat_titleLen, but affects lenght of the actual (leaf) page title, 
not the length of a (node) page title in the path. Before this change the 
page title length was fixed to 30, while the path titles could be 
configured up to 100 chars.

Property: stat_apache_niceTitle
Data type: boolean / string
Description:
If set, the URL will be transliterated from the renderCharset to ASCII (eg 
ä => ae, à => a, α => a), which yields nice and readable page titles in the 
log. All non-ASCII characters that cannot be converted will be changed to 
underscores.
If set to 'utf-8', the page title will be converted to UTF-8 which results 
in even more readable titles, if your log analyzing software supports it.

Property: stat_pageLen
Data type: int 1-100
Description:
The length of the page name (at the end of the path) written to the 
logfile/database.

If you're quick with a review, Ingmar can add this to 4.1 beta1

Masi
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: tslib_fe-utf8.diff
Url: http://lists.netfielders.de/pipermail/typo3-team-core/attachments/20061124/7733f6fc/attachment.diff 


More information about the TYPO3-team-core mailing list