[TYPO3] crawler_cli.phpsh not working?

Ries van Twisk typo3 at rvt.dds.nl
Tue Jun 13 13:33:52 CEST 2006


Hey,

I had trouble getting it to run at all.
First of all, on this server I couldn't 'execute' the crawler in CLI
mode at all for some reason,
so I wrote a little helper script. Something along the lines of
php -q path/to/the/crawler.cli.php
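
A slightly fuller version of such a helper might look like the sketch
below. It is not the script I actually used: the names CRAWLER_SCRIPT,
PHP_BIN and run_crawler are made up for illustration, and the path is
the same placeholder as above, so adjust it to your installation.

```shell
#!/bin/sh
# Hypothetical wrapper around the crawler CLI script (a sketch only).

# Path to the crawler CLI script; placeholder, override via the environment.
CRAWLER_SCRIPT="${CRAWLER_SCRIPT:-path/to/the/crawler.cli.php}"
# PHP command-line binary; override if it is not on PATH.
PHP_BIN="${PHP_BIN:-php}"

run_crawler() {
    # Fail loudly if the script file is missing, so a wrong path is not silent.
    if [ ! -f "$CRAWLER_SCRIPT" ]; then
        echo "crawler script not found: $CRAWLER_SCRIPT" >&2
        return 1
    fi
    # -q suppresses the HTTP headers that CGI builds of PHP would print.
    "$PHP_BIN" -q "$CRAWLER_SCRIPT" "$@"
}
```

Calling run_crawler at the end of the script (or from cron) then starts
the crawler. The explicit file check is there because a wrong path was
exactly the kind of problem that otherwise fails without any message.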

Second, in CLI mode the crawler could not figure out its own path
name, so the config file wasn't found
and I had to hard-code the path.

After that was solved I had to go into the backend and prepare
'something' for crawler
mode. It wasn't really clear to me what, but following the crawler manual
did the trick.

At some point when I started the crawler it took about 2-3 minutes
before my site was indexed.
When I started it again right afterwards it was fast and returned
immediately. I think nothing was scheduled yet.
However, I couldn't figure out clearly when it was scheduled for the next
time. I need to test this, I think.
The next schedule time stayed empty and I didn't understand why.
These were my first baby crawler steps... I need to play around
with it more.

I also badly need to tune indexed-search. For example, if I search for the
word 'downloads' on my site now,
it gives me back all pages with the word 'download' on the page, which
means all pages, since it
picks it up from the menu! When I go to typo3.org and do the same I
basically get the same results,
meaning a lot of the same pages, so I am not sure if I can solve it... Or
TYPO3 is misconfigured... I don't know...
I need to play much more with that search engine, I think, or I will
install mnogosearch for this client.


That's it so far...

Ries



> Still no answer...
> Doesn't anybody know the answer or does nobody understand
> the problem?
>
> When running crawler_cli.phpsh no error messages are
> shown and nothing happens (crawler queue doesn't get processed).
>
> When adding some debug echo statements to the script I see
> that it doesn't seem to get past the 'init.php' line:
>
> --- crawler_cli.phpsh ---
> [...]
> echo "Debug 1\n";
> // Include init file:
> require(dirname(PATH_thisScript).'/'.$BACK_PATH.'init.php');
> echo "Debug 2\n";
> [...]
>
> This will result in "Debug 1" getting printed in the console
> but "Debug 2" doesn't. What could be wrong?
>
> Thanks
>
>   
>> I can't get the crawler to run from the command line.
>> Everything else seems to work fine. Entries are added to the
>> crawlers queue and when I press the "Run now" button in the
>> backend they get processed. However when I run the crawler
>> from the command line nothing happens. The "Last seen" time
>> does not get updated.
>>
>> I'm calling the script with the full path as shown in the
>> backend:
>> '/data/www/intraweb/typo3conf/ext/crawler/cli/crawler_cli.phpsh'.
>> No error messages are shown. I did create a backend user
>> named '_cli_crawler'. Does that user need any special
>> options set? What about the password for that user?
>>
>> I tried setting the PATH_thisScript variable manually to the
>> correct value, but that didn't help either.
>>
>>
>>
>> define('PATH_thisScript','/data/www/intraweb/typo3conf/ext/crawler/cli/crawler_cli.phpsh');
>>
>> Any advice?
>>
>> Thanks.
>>     
>
>
> jwittema at yahoo.com
>


-- 
Ries van Twisk
Freelance Typo3 Developer
=== Private:
email: ries at vantwisk.nl
web:   http://www.rvantwisk.nl/freelance-typo3.html
skype: callto://r.vantwisk
=== Work:
email: ries at livetravelguides.com
web:   http://www.livetravelguides.com



