[TYPO3-Solr] Metadata from file or FAL DB?

Dennis Luemkemann dennis.luemkemann at gmx.de
Fri Jul 3 17:04:49 CEST 2015


Thank you Jigal,

I can confirm this behavior. This is good. And thanks for the pointer to EXT:extractor.

Best regards
Dennis

Am 02.07.2015 um 21:38 schrieb Jigal van Hemert <jigal.van.hemert at typo3.org>:

> Hi,
> 
> On 02/07/2015 15:19, Olivier Dobberkau wrote:
>> Am 01.07.15 um 13:52 schrieb Dennis Luemkemann:
>> 
>>> Dear all,
>>> 
>>> I’m trying to better understand how typo3solr / solrfal works with
>>> regard to metadata indexing.
>>> 
>>> Let’s assume I have a PDF file, which has no metadata defined. Then an
>>> editor adds the file to FAL and writes some metadata information in
>>> the FAL backend. For Typo3, the metadata is now available, even though
>>> it’s not contained within the file itself. So far so good.
>>> 
>>> Now comes solrfal with tika to add the file to solr. Where does it
>>> look for metadata, in the file itself or in the FAL record for the file?
>>> If it looks in both, which has precedence over the other and gets sent
>>> to solr?
>>> 
>>> Thanks
>>> Dennis
>> 
>> Hi Dennis.
>> 
>> I think you touched a valid point. I am unsure if solrfal should take
>> care of this. I would expect FAL to offer such a "do not" overwrite file
>> meta data here.
> 
> As far as I have understood (and noticed after indexing) solrfal just takes what FAL supplies. You can have a "best of both worlds" situation with the extension 'extractor' [1]. This allows FAL to use Tika as a metadata extraction service.
> Now Tika can automatically extract metadata from files and let solrfal use this for indexing.
> 
> [1] http://typo3.org/extensions/repository/view/extractor
> 
> 
> -- 
> Jigal van Hemert
> TYPO3 CMS Active Contributor
> 
> TYPO3 .... inspiring people to share!
> Get involved: typo3.org
> _______________________________________________
> TYPO3-project-solr mailing list
> TYPO3-project-solr at lists.typo3.org
> http://lists.typo3.org/cgi-bin/mailman/listinfo/typo3-project-solr



More information about the TYPO3-project-solr mailing list