[TYPO3-RTE] Cleaning pasted content

Robert Markula robert.markula at gmx.net
Mon Jan 9 16:28:34 CET 2006


Stanislas Rolland wrote:
> Hi Robert,
> 
>>
>> There are many settings for rtehtmlarea or the RTE API in general 
>> which enable the admin to control the input to a certain degree for 
>> the sake of consistent output in the FE (like removing attributes 
>> from certain tags or removing certain tags in general). This is very 
>> good, since consistent output is very important.
>>
>> However, when pasting content from other sources (websites, Word, 
>> OpenOffice.org etc.), the current input control may not be sufficient, 
>> especially when the source is not well-formed from the perspective of 
>> the RTE (even more so when tables are disabled with 'removeTags = 
>> table, tbody, td, th, thead, tr').
>>
> In the freshly released version 1.1.0 of htmlArea RTE (although at the 
> time of writing the documentation is not yet updated), there is an 
> option to configure the Page TSConfig property "enableWordClean" with a 
> TYPO3 htmlparser: the pasted text is sent to the server for parsing 
> using the TYPO3 htmlparser, and the parsed text is sent back into the 
> editing area. The cleaning is done on the server: there may be a slight 
> delay depending on the connection speed and on the size of the pasted 
> text.
> 
> This will not solve all the problems you report. However, it sets a base 
> to work from. For example, it would be possible to insert a hook on this 
> parser-invoking script in order to do some further processing. The 
> transformations you propose could be performed by such a hook.
> 
> What do you think of this approach?

Hi Stanislas,
I was initially thinking about new RTE transformation options, but this 
might also be a way.
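
To make the idea a bit more concrete, here is a rough sketch of the kind 
of cleaning I have in mind for pasted content. It is plain Python using 
only the standard library, not the TYPO3 htmlparser and not an actual 
hook; the names PasteCleaner and clean_pasted_html are made up for 
illustration. It drops the table tags from the removeTags example above 
(keeping their text), strips all attributes from the remaining tags, and 
discards comments such as the ones Word leaves behind.

```python
from html import escape
from html.parser import HTMLParser

# Tags to drop while keeping their text content. This mirrors the
# 'removeTags = table, tbody, td, th, thead, tr' example from the thread.
REMOVE_TAGS = {"table", "tbody", "td", "th", "thead", "tr"}


class PasteCleaner(HTMLParser):
    """Hypothetical cleaner: removes listed tags and strips all attributes."""

    def __init__(self, remove_tags=REMOVE_TAGS):
        super().__init__(convert_charrefs=True)
        self.remove_tags = set(remove_tags)
        self.out = []

    def handle_starttag(self, tag, attrs):
        # Attributes are deliberately ignored, so style/class debris is lost.
        # A real setup would probably whitelist some (e.g. href on links).
        if tag not in self.remove_tags:
            self.out.append(f"<{tag}>")

    def handle_endtag(self, tag):
        if tag not in self.remove_tags:
            self.out.append(f"</{tag}>")

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/>.
        if tag not in self.remove_tags:
            self.out.append(f"<{tag} />")

    def handle_data(self, data):
        # Text is kept; it is re-escaped because character references
        # were decoded by the parser.
        self.out.append(escape(data, quote=False))

    # Comments and declarations are not handled, so they are dropped.


def clean_pasted_html(html, remove_tags=REMOVE_TAGS):
    """Return the pasted HTML with unwanted tags and all attributes removed."""
    parser = PasteCleaner(remove_tags)
    parser.feed(html)
    parser.close()
    return "".join(parser.out)
```

Feeding it something like '<td class="MsoNormal">Hello <b 
style="color:red">world</b></td>' leaves only the text and the plain <b> 
element. Whether this belongs in a hook or in a transformation option is 
exactly the open question.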

Anyway, is there a general need for such options? What do other users 
think about this?

Regards,
Ro


