[TYPO3-english] Crawler and external documents

Dmitry Dulepov dmitry at typo3.org
Wed Jan 28 09:49:20 CET 2009


Hi!

Claudio Strizzolo wrote:
> I'm trying to set up the crawler extension in order to index all the pages
> in the site and the external documents (/fileadmin/...) linked by anchors 
> in the pages.
> I read some documentation, included http://wiki.typo3.org/index.php/
> Ext_crawler and almost everything works: the pages are correctly indexed, 
> and the external documents are recognized. In the Crawler Log they are 
> listed in separate rows under the page which points to them.
> However, their status is ".." and their contents are not indexed. If I 
> click on the "Read" icon (it looks more like a reload icon, imho) the 
> content is correctly indexed and the status becomes "OK", but I could not 
> find a way to get this automatically through the crawler.

We lost lots of time while trying to index external documents with crawler/indexed search. In the end we just switched to mnogosearch, which does it much better. With crawler/indexed search you never know if something will be indexed or not...

-- 
Dmitry Dulepov
TYPO3 core team
"Sometimes they go bad. No one knows why" (Cameron, TSCC, "Dungeons&Dragons")


More information about the TYPO3-english mailing list