[TYPO3-Solr] The way spelling corrections in the synonyms.txt file work...
Jigal van Hemert
jigal.van.hemert at typo3.org
Wed Aug 14 11:28:18 CEST 2013
Hi,
In the synonyms.txt file you can define synonyms and spelling
corrections. The description [1] of the SynonymFilterFactory advices to
use them only at index time because of two potential issues and a
counter intuitive issue with idf differences.
This is how it's configured in EXT:solr too.
The problem which I see "in the wild" is that if visitors search for a
word that is on the left side in the spelling correction list there will
be no results at all. Only if a misspelled word exists in the indexed
documents it will be included in the results when you search for the
correctly spelled word.
Could it be a solution to enable the SynonymFilterFactory at query time
with the following restrictions? :
- a separate synonyms file with only spelling corrections
- no multiple word spelling corrections
Of course these restrictions can't be automated, but at least there is a
possibility to improve the search experience for people with bad
spelling habits.
1) is it correct that spelling corrections do not work on search terms
entered by visitors?
2) is the extra synonyms file a usable solution for custom cases?
3) is this useful to be included in EXT:solr?
[1]
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.SynonymFilterFactory
--
Jigal van Hemert
TYPO3 CMS Active Contributor
TYPO3 .... inspiring people to share!
Get involved: typo3.org
More information about the TYPO3-project-solr
mailing list