okfn / ckanext-lacounts

CKAN extension for the LA Counts project
GNU Affero General Public License v3.0

Search query with a partial word does not return valid results #203

Open joshuazrobins opened 5 years ago

joshuazrobins commented 5 years ago

For example, when a user searches for "vet" (no quotes), we WILL NOT return datasets containing the term "veteran". Oddly, when a user searches for "veteran" (no quotes), we WILL return datasets containing the term "veterans".

From a user's perspective the word veteran is often shortened to vet, so this is an instance where "vet" SHOULD return datasets with terms like "veteran" or "veterans".

amercader commented 5 years ago

@joshuazrobins The search engine (Solr) uses stemming to match grammatically related words (such as plurals, as in the veterans case). Stemming does not cover social conventions like vet -> veteran; for those we would need to provide a custom list of synonyms to be taken into account when matching.

I'll check if the Solr backend provider allows this in our case.
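The difference between stemming and synonym matching described above can be illustrated with a small Python sketch. This is not Solr's implementation or the project's configuration: `crude_stem` is a deliberately simplistic stand-in for a real stemmer, and `SYNONYMS`, `expand_query`, `matches`, and the sample dataset title are all hypothetical names and data chosen for illustration.

```python
import re

# Hypothetical synonym list; a real one would be gathered by the project team.
SYNONYMS = {"vet": {"veteran"}}


def crude_stem(word):
    """Rough stand-in for a stemmer: lowercase and strip a plural "s"."""
    word = word.lower()
    if len(word) > 3 and word.endswith("s") and not word.endswith("ss"):
        return word[:-1]
    return word


def tokenize(text):
    """Split text into lowercase alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def index_terms(text):
    """Stemmed terms that would be stored for a dataset title."""
    return {crude_stem(token) for token in tokenize(text)}


def expand_query(query, synonyms=None):
    """Stem each query token and add the stems of any listed synonyms."""
    terms = set()
    for token in tokenize(query):
        terms.add(crude_stem(token))
        for synonym in (synonyms or {}).get(token, ()):
            terms.add(crude_stem(synonym))
    return terms


def matches(query, title, synonyms=None):
    """True if any expanded query term appears among the title's terms."""
    return bool(expand_query(query, synonyms) & index_terms(title))


TITLE = "Services for Veterans"  # hypothetical dataset title

print(matches("veteran", TITLE))        # True: "veterans" stems to "veteran"
print(matches("vet", TITLE))            # False: "vet" is not a stem of "veteran"
print(matches("vet", TITLE, SYNONYMS))  # True: the synonym list bridges the gap
```

In this sketch the synonym list is applied only to the query, so adding an entry would not require reindexing the toy data. Whether the same holds for the hosted Solr backend depends on how its analyzers are configured.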

joshuazrobins commented 5 years ago

OK, great. I'll wait to hear back from you on whether this is doable. If it is, I'll work with Carla and Emily to gather a list of common social conventions. Thanks.