GlobalNamesArchitecture / gnparser

Split scientific names to meaningful elements with meta information
https://parser.globalnames.org/
MIT License
20 stars 2 forks source link

Fix #402: don’t parse name with ‘of’ word inside #417

Closed alexander-myltsev closed 7 years ago

dimus commented 7 years ago

I was thinking about this one. In our database it is OK not to parse these names at all, as the noise significantly higher than the signal. However when we use parser for name finding it will remove a lot of good names. Lets consider a compromise and just cut everything like ' of ....' in preprocessing.

Interesting case in this case is Neritina chemmoi Reeve var of cornea Linn, but looks like parser does work with it correctly already