[TYPO3-english] contagged Extension has error while parsing joined words (words joined with dashes)

Prakash spabhat at chandanweb.com
Wed Mar 11 16:43:15 CET 2009


Jochen Rau wrote:
> Hi Parakash,
> 
>> In the Content parser and tagger (Glossary) contagged extension there 
>> seems to be some sort of error while parsing joined words (word joined 
>> using dashes).
>> This is clearly noticeable especially when second word contains 
>> special characters such as ( ê, à, u', é, etc...)
>>
>> For example consider the word " elle-même " the term is defined as 
>> "elle" with a link to example.com then the link is getting rendered as 
>> follows:
>>
>> <dfn><a target="_top" href="http://www.example.com">Elle-m</a></dfn>ême
>>
>> I doubt this could have something related with the preg_match() used 
>> in getPositions() function of class.tx_contagged.php.
>>
>> What could be the problem? Anyone?
> 
> I have uploaded contagged v0.2.1 to the TER (should be availablew in a 
> few hours). It improves the handling of UTF-8 in combined words.
> 
> Don't forget to activate UTF-8 support by adding "u" to the Regular 
> Expression Modifier in the TS constants:
> 
> contagged.modifier = Uisu <--
> 
> UTF-8 handling was deactivated by default because some old versions of 
> PHP used on shared hosting do not have the necessary libraries activated.
> 
> Cheers
> Jochen

Hi Jochen,

That was pretty fast, and the fix works like a charm. Perfect.
Thanks a lot.

Regards,
Prakash A Bhat


More information about the TYPO3-english mailing list