[TYPO3-Solr] Code for Apache Nutch for TYPO3 CMS has been released
Olivier Dobberkau
olivier.dobberkau at dkd.de
Wed Apr 16 20:49:09 CEST 2014
Am 16.04.14 17:14, schrieb Jan Slusarczyk:
Hi Jan,
> Olivier,
> this information caught me a little off-guard :-) Is there any place I
> could get more info on this project's objectives? Is this a replacement
> for typo3-solr? What are the benefits of Nutch integration in comparison
> to Solr?
Apache Nutch for TYPO3 CMS is not a replacement, but an additional way
to add documemts to you TYPO3 CMS Solr based search.
> One suggestion I have is something that could solve my problem - namely
> integrating search results of typo3 pages and other indexes. In my case
> it's a large forum that is separate from typo3. Having the ability to
> search for content from both the forum and typo3 pages in one interface
> would be very useful. I can imagine that the same solution could cover
> other integrations like ecommerce, wiki etc. Can Nutch use xml sitemaps
> for indexing contents? Maybe a way to define indexes using a set of
> sitemap urls, including typo3?
If you need to crawl HTML pages coming from other systems then this is
the option. Please have a look at the nutch documentation on how to add
documents to crawl to the nutch urls list.
Best greetings,
Olivier
PS: We have added some more info in the readme on github.
More information about the TYPO3-project-solr
mailing list