[Typo3] Reindex entire site

Nate Russell nate at chillmost.com
Tue Jul 5 19:27:27 CEST 2005


Arnaud goudsmit wrote:
> Hi list
> 
> I have a question that needs answering; it's about Indexed Search.
> 
> Hope someone can help me out.
> 
> As far as I understand, Indexed Search indexes a page when it gets visited in the frontend,
> so only through user interaction with the site.
> 
> What I want is to (re)index the <<entire>> site at a given time, say as a cronjob, or whenever the administrator of the site wants it (re)indexed. I am talking about all the pages, not just a couple.
> 
> I'm not sure how to trigger the (re)indexing function.
> 
> At the moment I'm looking at the class.indexer.php file, but where to start?
> 
> Is this the correct file to start with? 
> Is there a function call I can make, or is there a far easier solution?
> 
> Awaiting your response,
> 
> Regards,
> 
> Arnaud goudsmit 
> 


Hello Arnaud,

I use wget to index my site. It's a command-line program that you can use to 
download and/or mirror whole sites. You can tell it which 
files/directories to include or exclude. Here is a good page with lots of 
info: http://www.xyzzy.claranet.de/Wget.htm

It's probably already installed on your server, so you can use screen and 
wget (if you have ssh access) and let it run on the server itself without 
using up your own bandwidth. Just change to a temporary directory (for example tmp/wget) and 
type: wget -nc -r http://domain.com/

and it will download a copy of the site into the directory you are in and, in 
the process, index your pages, since each request renders the page in the 
frontend just like a normal visit would.

It's probably possible to set it up in a cronjob.
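
Something along these lines might work as a cron-friendly wrapper. It is 
only a rough sketch: the function name reindex_site, the script path and 
the schedule in the comment are made up for illustration, and the -q 
(quiet) flag and the temporary directory cleanup are my additions to the 
command above.

```shell
#!/bin/sh
# Sketch: crawl the whole site with wget so that every page gets rendered
# in the frontend (and therefore indexed), then throw the download away.
#
# Hypothetical crontab entry (path and schedule are placeholders):
#   0 3 * * * /path/to/reindex.sh http://domain.com/

reindex_site() {
    site_url="$1"

    # Work in a throwaway directory so the mirrored copy does not pile up.
    workdir=$(mktemp -d) || return 1

    # -r  follow links recursively
    # -nc do not re-download files that already exist locally
    # -q  stay quiet, so cron does not mail the full crawl log
    ( cd "$workdir" && wget -q -r -nc "$site_url" )
    status=$?

    # The downloaded files are only a side effect; remove them.
    rm -rf "$workdir"
    return "$status"
}

# Only crawl when a URL is passed on the command line.
if [ "$#" -gt 0 ]; then
    reindex_site "$1"
fi
```

Note that wget returns a non-zero exit status if any request fails (a 
single broken link is enough), so a non-zero result from the script does 
not necessarily mean the crawl as a whole went wrong.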

Hope this helps,

Nate


