gnames / gnparser

GNparser normalises scientific names and extracts their semantic elements.
MIT License

Handle epithets containing `non` #214

Closed: tobymarsden closed 2 years ago

tobymarsden commented 2 years ago

This tiny PR removes the special cases for names with epithets containing `non` and updates the stopword regex so that it matches `non` only when it is not followed by `[a-zA-Z-]`.

It doesn't solve the Xiphipops fisheri (non Snyder, 1904) issue, though (i.e. that name should perhaps be parsed). I can open a new issue just for that if you think it's worth handling.
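
To illustrate the stopword rule described above, here is a minimal sketch in Python. It is not the gnparser code: the pattern, the `NON_STOPWORD` name, and the `has_non_stopword` helper are hypothetical, and the leading `\b` is an added assumption so that epithets merely ending in `non` are left alone. The name-strings are only examples.

```python
import re

# Hypothetical stopword pattern (an assumption, not gnparser's actual rule):
# match `non` at the start of a word, but only when the next character is
# not a letter or a hyphen.
NON_STOPWORD = re.compile(r"\bnon(?![a-zA-Z-])")


def has_non_stopword(name: str) -> bool:
    """Return True if `non` appears in the name as a standalone stopword."""
    return NON_STOPWORD.search(name) is not None


# `non` followed by a space is treated as a stopword.
print(has_non_stopword("Xiphipops fisheri (non Snyder, 1904)"))  # True

# `non` followed by a hyphen or a letter is part of an epithet.
print(has_non_stopword("Hyacinthoides non-scripta"))  # False
print(has_non_stopword("Scilla nonscripta"))  # False
```

With this rule, epithets that start with `non` no longer need to be listed as special cases, because the lookahead rejects them.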

dimus commented 2 years ago

Thanks @tobymarsden, merging. And yes, Xiphipops fisheri (non Snyder, 1904) and similar name-strings are rather common, and it will be helpful to deal with them at some point. It is a bigger, more complex issue that requires a bit of investigation into what exists in the wild.

dimus commented 2 years ago

On second thought, constructs like Xiphipops fisheri (non Snyder, 1904) can easily be annotations (a reference to a non-nomenclatural name usage, a ref to a chresonym). I will talk to our taxonomists more about it.