[TYPO3-Solr] Code for Apache Nutch for TYPO3 CMS has been released

Olivier Dobberkau olivier.dobberkau at dkd.de
Wed Apr 16 20:49:09 CEST 2014


Am 16.04.14 17:14, schrieb Jan Slusarczyk:

Hi Jan,

> Olivier,
> this information caught me a little off-guard :-) Is there any place I
> could get more info on this project's objectives? Is this a replacement
> for typo3-solr? What are the benefits of Nutch integration in comparison
> to Solr?

Apache Nutch for TYPO3 CMS is not a replacement, but an additional way 
to add documemts to you TYPO3 CMS Solr based search.

> One suggestion I have is something that could solve my problem - namely
> integrating search results of typo3 pages and other indexes. In my case
> it's a large forum that is separate from typo3. Having the ability to
> search for content from both the forum and typo3 pages in one interface
> would be very useful. I can imagine that the same solution could cover
> other integrations like ecommerce, wiki etc. Can Nutch use xml sitemaps
> for indexing contents? Maybe a way to define indexes using a set of
> sitemap urls, including typo3?

If you need to crawl HTML pages coming from other systems then this is 
the option. Please have a look at the nutch documentation on how to add 
documents to crawl to the nutch urls list.

Best greetings,

Olivier

PS: We have added some more info in the readme on github.


More information about the TYPO3-project-solr mailing list