[TYPO3-Solr] Indexing external documents

Pim Snel pim at lingewoud.nl
Thu Mar 18 22:30:12 CET 2010


Op 03-03-10 10:48, Thomas Hempel schreef:

>
> Anyway, after indexing the first few pages I asked myself if indexing of
> external documents (files like PDF etc.) is possible right now.
> I didn't find any information on that. All I see is, that the page get
> indexed by solr but not the included referenced pdf documents.

Hi Thomas,

I have created a helper extension which now also can work together with 
Solr Cell to index binary documents. I did not read this thread earlier 
else I would have uploaded this extension sooner. I finished it about 
when you wrote this.

The Solr Custom Index extension uses the base solr extension from Ingo 
to generate it's results. Its purpose is to index custom database 
queries but since version 1.1.0 is can also index binary documents. I 
hope the documentation is enough to get you started. Else don't hesitate 
to contact me.

Version 1.1.0 is attached to this mail but also uploaded in TER.

All credits still go to Ingo for his very cool Solr extension. Probably 
my extension will soon be obsolete. I still hope it can be useful for 
someone. Or my be parts of it.

If you read the manual of Solr Custom Index you wil notice there's a 
fully working example of Solr Cell distributed with the nightly builds 
of Apache Solr.

Regards,
Pim Snel






More information about the TYPO3-project-solr mailing list