AtlasOfLivingAustralia / ala-name-matching

Atlas name matching API and index generation

Language sensitive matching #209

Open javier-molina opened 1 year ago

javier-molina commented 1 year ago

As a user, I would like name matching to be smarter and to match word variants, such as Latin singular vs. plural forms, or noun, adjective and adverb forms of the same word.

Background

See this question from a user:

The legitimate species name is Endoxyla cinereus. Searching for Endoxyla cinerea instead does not retrieve the species, so the fuzzy matching seems less flexible than it could be with respect to gender endings and orthographic variants (whether these are legitimate alternative endings or simply an error on the user's end, as in this case).

A Google search for "Endoxyla cinerea" returns results for "Endoxyla cinereus".

This will require some investigation and discussion, as it might not be trivial to implement.
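As a possible starting point for that discussion, here is a minimal sketch of one approach: strip a common Latin gender ending (-us, -a, -um) from the specific epithet before comparing names. This is not how ala-name-matching works today. The function names `stem_epithet` and `same_name_ignoring_gender` are hypothetical, and the ending list is an assumption that covers only the case reported above (other adjective endings such as -is/-e are not handled).

```python
# Hypothetical sketch only; not part of ala-name-matching.
# Assumption: treating -us / -a / -um as interchangeable gender endings.
GENDER_ENDINGS = ("us", "um", "a")
MIN_STEM_LENGTH = 3  # assumed guard against stripping very short epithets


def stem_epithet(epithet: str) -> str:
    """Return the epithet in lower case with one gender ending removed."""
    word = epithet.lower()
    for ending in GENDER_ENDINGS:
        if word.endswith(ending) and len(word) - len(ending) >= MIN_STEM_LENGTH:
            return word[: -len(ending)]
    return word


def same_name_ignoring_gender(a: str, b: str) -> bool:
    """Compare two names; for binomials, ignore the epithet's gender ending."""
    parts_a = a.lower().split()
    parts_b = b.lower().split()
    if len(parts_a) != 2 or len(parts_b) != 2:
        # Not a plain "Genus epithet" pair: fall back to exact comparison.
        return parts_a == parts_b
    genus_a, epithet_a = parts_a
    genus_b, epithet_b = parts_b
    return genus_a == genus_b and stem_epithet(epithet_a) == stem_epithet(epithet_b)


if __name__ == "__main__":
    print(same_name_ignoring_gender("Endoxyla cinereus", "Endoxyla cinerea"))
```

With this sketch, "Endoxyla cinerea" and "Endoxyla cinereus" both reduce to the epithet stem "cinere" and compare as equal. A real implementation would need to check that stemming does not conflate distinct accepted names in the index, which is part of why this needs investigation.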