[TYPO3-Solr] Indexing external documents
Pim Snel
pim at lingewoud.nl
Thu Mar 18 22:30:12 CET 2010
Op 03-03-10 10:48, Thomas Hempel schreef:
>
> Anyway, after indexing the first few pages I asked myself if indexing of
> external documents (files like PDF etc.) is possible right now.
> I didn't find any information on that. All I see is, that the page get
> indexed by solr but not the included referenced pdf documents.
Hi Thomas,
I have created a helper extension which now also can work together with
Solr Cell to index binary documents. I did not read this thread earlier
else I would have uploaded this extension sooner. I finished it about
when you wrote this.
The Solr Custom Index extension uses the base solr extension from Ingo
to generate it's results. Its purpose is to index custom database
queries but since version 1.1.0 is can also index binary documents. I
hope the documentation is enough to get you started. Else don't hesitate
to contact me.
Version 1.1.0 is attached to this mail but also uploaded in TER.
All credits still go to Ingo for his very cool Solr extension. Probably
my extension will soon be obsolete. I still hope it can be useful for
someone. Or my be parts of it.
If you read the manual of Solr Custom Index you wil notice there's a
fully working example of Solr Cell distributed with the nightly builds
of Apache Solr.
Regards,
Pim Snel
More information about the TYPO3-project-solr
mailing list