[TYPO3-Solr] Exclude Columns from indexing

Stephan Schuler Stephan.Schuler at netlogix.de
Tue Jul 16 14:33:04 CEST 2013


Hey there.


I'm not completely aware of all of its implications in reference to tx_solrs inner mechanics, but you could try to use Solrs Result Grouping and Field Collapsing feature.


Add those query parameters to the search query:
&group=true&group.field=id&group.limit=1&group.format=simple&group.main=true

This should do something like "SELECT * FROM solr GROUP BY id", and the "id" property contains both, the records database table name and UID.

Unfortunately the grouping feature only exists in Solr 3.3 and higher. Currently the tx_solrs extension is based on Solr 2.5 where no grouping is available. Feel free to give Solr 4 a try and see if it works.


Having duplicates stored to the Solr core should not be a huge problem since Solr stores data very efficiently. Multiple rows that differ only very little should not use much more disk space then a single of those rows.


Regards,


Stephan Schuler
Web-Entwickler

Telefon: +49 (911) 539909 - 0
E-Mail: Stephan.Schuler at netlogix.de
Website: media.netlogix.de



--
netlogix GmbH & Co. KG
IT-Services | IT-Training | Media
Neuwieder Straße 10 | 90411 Nürnberg
Telefon: +49 (911) 539909 - 0 | Fax: +49 (911) 539909 - 99
E-Mail: info at netlogix.de | Internet: http://www.netlogix.de

netlogix GmbH & Co. KG ist eingetragen am Amtsgericht Nürnberg (HRA 13338)
Persönlich haftende Gesellschafterin: netlogix Verwaltungs GmbH (HRB 20634)
Umsatzsteuer-Identifikationsnummer: DE 233472254
Geschäftsführer: Stefan Buchta, Matthias Schmidt



-----Ursprüngliche Nachricht-----
Von: typo3-project-solr-bounces at lists.typo3.org [mailto:typo3-project-solr-bounces at lists.typo3.org] Im Auftrag von Olivier Dobberkau
Gesendet: Dienstag, 16. Juli 2013 13:47
An: typo3-project-solr at lists.typo3.org
Betreff: Re: [TYPO3-Solr] Exclude Columns from indexing

Am 16.07.13 11:56, schrieb Alexander Wende:

> I'm looking for a solution to remove these duplicates...

there is currently noone as this is designed behaviour to prevent to expose content to people not entitled to see it.

maybe with apache solr server 4.x there might be a way to remove those duplicates.

until then you will need to live with this cosmetic issue.

Olivier
_______________________________________________
TYPO3-project-solr mailing list
TYPO3-project-solr at lists.typo3.org
http://lists.typo3.org/cgi-bin/mailman/listinfo/typo3-project-solr


More information about the TYPO3-project-solr mailing list