[TYPO3] crawler questions

Lars Haendler typo3 at larshaendler.com
Wed Sep 3 15:21:02 CEST 2008


Hi Steffen,

Steffen Kamper wrote:
> Hi,
> 
> i try to configure the crawler proper on a server and have some problems.
> 
> I read documentations of indexed_search and crawler, looked to the 
> podcast of Kasper and used the wiki page for configuration examples.
> 
> In my local installation all works like in Kasper's video, but on the 
> life server it doesn't.
> Looking to the crawler log all seems to work fine, but it seems that the 
> urls are not crawled proper as search results are not show all existing 
> results. Does anyone has good expieriences with crawler?

I had to fight with the re-indexing while developing. All jobs ran fine, 
but I forgot that the re-indexing was set for the next hour, so the 
crawler did its all 5 minute job but ignored new records. If you reindex 
all 24hours you could wait a long time. Just make sure reindexing is 
left blank each time you test.

Database Fields which are indexed were also something I forgot at first. 
In the FAQ extension they are q and a - not title and text.

Last is how many records are indexed. May it is just not finished ?

> So some questions for the first:
> 
> 1) Where should the record for crawling pages should be placed? Kasper 
> put it in a storage folder, i also tried on root page

I thought they had to be stored on the pages where the output records 
are rendered.

> 2) What's about the maximum level "3" configured in record, as my page 
> tree has more levels?

You are so right :-(
Nice to have more levels in a furture indexed_search release.


> 3) record has time setting for queuing the record, but it crawls the 
> pages in each run, does it have no influence?

As far as I see in my installations it only reindexes if something new 
is on the page or normal re-caching is needed. As for the load, I dont 
know because there aren't that many pages on my installations.

> 4) i tried to configure crawler to use mm_forum as data table. now the 
> table with the posts is kind of mm-table so i can't build the Getvars 
> without including the thread table, is there any workaround for such 
> situation?

sorry ... no advice here :-(

> vg Steffen

Regards
Lars


More information about the TYPO3-english mailing list