[TYPO3-dev] indexed_search kills server

Ernesto Baschny [cron IT] ernst at cron-it.de
Wed May 21 11:58:55 CEST 2008


Oliver Rowlands wrote: on 21.05.2008 11:45:

> The 21,263,272 Rows_examined is where your problems are coming from.
> 
> The bottom line is 'indexed_search' does not scale. It's great for a 100 
> page site but beyond that you will be better off using an external crawler.
> 
> You might want to give Dimitry's 'mnogosearch' extension a try, it is 
> far more scalable and efficient than 'indexed_search' will ever be.
> 
> If you are stuck with 'indexed_search' you could try enabling 
> '$join_pages' in the tx_indexedsearch class if you are not bothered by 
> 'extendToSubpages':
> 
> http://typo3.org/documentation/document-library/extension-manuals/doc_indexed_search/current/view/1/6/#id3520912 

btw this can be configured from TypoScript since 4.2:

search.skipExtendToSubpagesChecking
  If set to false (default), on each search the complete page tree will
  be transversed to check which pages are accessible, so that the
  extendToSubpages can be considered. This will work with a limited
  number of page-ids (which means most sites), but will result in slow
  performance on huge page trees.

  If set to true, then the final result rows are joined with the pages
  table to select pages that are currently accessible. This will speed up
  searching in very huge page trees, but on the other hand
  extendToSubpages will NOT be taken into account!

Default: 0

That won't help on that particular query, as it is a single query for 
word occurrences in a huge data-set (searching with "like"). I guess 
either the content amount is really huge, in which case you are probably 
right about indexed_search scalability (will be better in 4.3 with 
FULLTEXT support), but I guess that the problem has another cause: The 
same content being cached multiple times and thus being indexed multiple 
times: e.g. same page, multiple cHash'es because some plugin generating 
"endless" links to the same page (e.g. might happen with the "cal"). Try 
to find out which page has multiple indexed version (use Web>Info module 
for that) and figure out what might cause it.

Cheers,
Ernesto





More information about the TYPO3-dev mailing list