[TYPO3-Solr] MetaDataExtraction with Tika and solrfal

Jigal van Hemert jigal.van.hemert at typo3.org
Tue Jul 7 22:57:34 CEST 2015


Hi,

On 07/07/2015 18:03, Sebastian Schreiber wrote:
> I'm using the metadata extraction with Tika (remote via Solr).
> The scheduler task is running well and some metadata gets updated
> (author, publisher).
> But I don't understand where the full extracted content, e.g. for a PDF,
> is written?

During indexing the extracted content is temporarily written to the
tika_content field; right after the record has been indexed, the field is
emptied again, so the full text is not kept there permanently.
solrfal and tika use a signal-slot mechanism to trigger these actions.
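
To illustrate the idea, here is a simplified Python sketch. It is not the
actual solrfal/tika code: the Dispatcher class, the signal names
before_index and after_index, and the fill/clear functions are all made up
for this example, and the "extraction" is just a placeholder string.

```python
from collections import defaultdict


class Dispatcher:
    """Minimal signal-slot registry (hypothetical, not the TYPO3 class)."""

    def __init__(self):
        self._slots = defaultdict(list)

    def connect(self, signal, slot):
        # Register a callable to run whenever `signal` is dispatched.
        self._slots[signal].append(slot)

    def dispatch(self, signal, record):
        # Call every slot connected to `signal`, in registration order.
        for slot in self._slots[signal]:
            slot(record)


def fill_content(record):
    # Placeholder for the real Tika extraction (assumption for this sketch).
    record["tika_content"] = "extracted text of " + record["file"]


def clear_content(record):
    # Empty the temporary field once the record has been indexed.
    record["tika_content"] = ""


def index_record(dispatcher, record, index):
    dispatcher.dispatch("before_index", record)
    index.append(dict(record))  # snapshot of what the indexer receives
    dispatcher.dispatch("after_index", record)


dispatcher = Dispatcher()
dispatcher.connect("before_index", fill_content)
dispatcher.connect("after_index", clear_content)

index = []
record = {"file": "manual.pdf", "tika_content": ""}
index_record(dispatcher, record, index)
```

After index_record() has run, the snapshot in index still holds the
extracted text, while record["tika_content"] is empty again. That matches
what you see: the content reaches the index, but the field itself does not
keep it.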

-- 
Jigal van Hemert
TYPO3 CMS Active Contributor

TYPO3 .... inspiring people to share!
Get involved: typo3.org

