TAMULib / SAGE

Search Aggregation Engine
MIT License

Issue 515: Improve text_ws and whole_string query searching. #527

Closed (kaladay closed 1 year ago)

kaladay commented 1 year ago

Description

The text_ws field should pick up more of the properties of the standardized string handling found in the text_general and text_en_split fields.

The whole_string should not use the lower case filter.
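
As a rough illustration of the two points above, the field types might look something like the following. This is a hypothetical sketch, not the schema from this change set: the tokenizer choices and the single lower case filter on text_ws are assumptions picked only to show the shape of the idea.

```xml
<!-- Hypothetical sketch only; not the schema from this change set. -->
<fieldType name="whole_string" class="solr.TextField">
  <analyzer>
    <!-- KeywordTokenizerFactory emits the entire input as a single token. -->
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <!-- Deliberately no solr.LowerCaseFilterFactory here, so matching stays case sensitive. -->
  </analyzer>
</fieldType>

<fieldType name="text_ws" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- Split on whitespace only. -->
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <!-- Assumed example of a property borrowed from a text_general style analyzer. -->
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

With a definition along these lines, a whole_string query would have to match the stored value exactly, including case, while text_ws would match individual whitespace-separated terms regardless of case.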

These changes also provide a degree of multilingual matching as a side effect. Searching for the English word "Reserved", for example, may find German matches.

Attempts to use solr.WordDelimiterGraphFilterFactory (including with preserveOriginal="1") solved the immediate problem but introduced a regression where quoted strings never match anymore. The graph filter has therefore been removed from this change set.
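
For reference, an attempt of the kind described above would typically be configured roughly as follows. This is a sketch under assumptions: the field type name text_ws_wdgf and every attribute other than preserveOriginal="1" are hypothetical and were not taken from the change set.

```xml
<!-- Hypothetical sketch of the abandoned approach; not part of this change set. -->
<fieldType name="text_ws_wdgf" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <!-- Splits tokens on intra-word delimiters; preserveOriginal keeps the unsplit token too. -->
    <filter class="solr.WordDelimiterGraphFilterFactory"
            generateWordParts="1"
            generateNumberParts="1"
            preserveOriginal="1"/>
    <!-- Graph filters used at index time must be followed by a flatten step. -->
    <filter class="solr.FlattenGraphFilterFactory"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.WordDelimiterGraphFilterFactory"
            generateWordParts="1"
            generateNumberParts="1"
            preserveOriginal="1"/>
  </analyzer>
</fieldType>
```

Because the filter emits several tokens for one input token, phrase queries against such a field are a natural place to look when investigating the quoted string regression.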

Fixes #515


coveralls commented 1 year ago


Coverage: 45.856% (+0.02%) from 45.833% when pulling 5c43c41e1996cdd491d03cea5865c22dc96f8a67 on 515-solr_schema-redesign into 92ac6e05c9f03e742ad78dba6e8c4e6c092bf2d8 on staging.