The text_ws should use more of the properties off of the standardized string represented by the text_general field and text_en_split field.
The whole_string should not use the lower case filter.
This also automatically performs multilingual matching of sorts.
Searching for the English word "Reserved" for example may find German matches.
Attempts to use solr.WordDelimiterGraphFilterFactory (including with preserveOriginal="1") have solved the immediate problem but introduced a regression where the quote strings never match anymore. This has graph has been removed from this change set.
Fixes #515
Type of change
Please delete options that are not relevant.
[x] Bug fix (non-breaking change which fixes an issue)
How Has This Been Tested?
[X] Manually through the UI.
Checklist:
[x] My code follows the style guidelines of this project
[x] I have performed a self-review of my code
[x] My changes generate no new warnings
[x] New and existing unit tests pass locally with my changes
Coverage: 45.856% (+0.02%) from 45.833% when pulling 5c43c41e1996cdd491d03cea5865c22dc96f8a67 on 515-solr_schema-redesign into 92ac6e05c9f03e742ad78dba6e8c4e6c092bf2d8 on staging.
Description
The text_ws should use more of the properties off of the standardized string represented by the text_general field and text_en_split field.
The whole_string should not use the lower case filter.
This also automatically performs multilingual matching of sorts. Searching for the English word "Reserved" for example may find German matches.
Attempts to use
solr.WordDelimiterGraphFilterFactory
(including withpreserveOriginal="1"
) have solved the immediate problem but introduced a regression where the quote strings never match anymore. This has graph has been removed from this change set.Fixes #515
Type of change
Please delete options that are not relevant.
How Has This Been Tested?
Checklist: