[TYPO3-Solr] Solr und Tika

johannes weber|medienlounge info at medienlounge.com
Mon Mar 29 15:14:55 CEST 2010


Hi Ingo,
that was the statement i was looking for :)

so, only one further question:

with that command:
java -jar tika.jar Feb03.pdf
i get files indexed.

i tried your tika extension with:

- local jar
Tika Path
- /var/www/solr/fileadmin/tika.jar
Host
http://localhost
Port
8080
Path
/solr/

but get config errors in the reports.

so any advice?

thanks
johannes

On Mar 29, 2010, at 2:57 PM, Ingo Renner wrote:

> Hi all,
>
> to bring some light into the situation:
>
> * the current version of EXT:solr in TER (1.0.1) does not support  
> file indexing
> * the soon to be released version of EXT:solr (1.1.0) will not  
> support file indexing either because it's not implemented yet (not  
> even in 2.0-dev), and because it will need some testing
> * version 2.0, released somewhen later this year (end of the year  
> tentatively) will support file indexing
> * if you need file indexing now, you can join the 2.0 dev program  
> with dkd to get early access and influence what we're going to work  
> on next (file indexing f.e.)
>
>
> EXT:tika as it is on forge will be used by EXT:solr 2.0 to extract  
> content from files. So EXT:solr hands over the files to EXT:tika.
> EXT:tika again, can use either a local Tika jar or a remote Solr  
> server which is handled by EXT:solr 2.0. Both scenarios, local and  
> remote extraction, have pros and cons...
>
> For local extraction you need Java on your host. This might not be  
> available, like on a shared host or you may have a distributed setup  
> where you don't want to install Java on each host.
>
> For remote extraction you need to send the files over the network  
> though, that might not be the most clever thing either... consider  
> sending many MB of audio or video files over the network only to get  
> a few bytes of information back.
>
>
> However, it should be possible to use EXT:tika with local extraction  
> for EXT:dam already. "Should" means that I implemented it the way so  
> that it "should" work, but I didn't have the time to actually test  
> it yet...
>
>
> Hope that helps :)
> Ingo
>
> -- 
> Ingo Renner
> TYPO3 Core Developer, Release Manager TYPO3 4.2
>
> _______________________________________________
> TYPO3-project-solr mailing list
> TYPO3-project-solr at lists.typo3.org
> http://lists.typo3.org/cgi-bin/mailman/listinfo/typo3-project-solr
>





More information about the TYPO3-project-solr mailing list