bionomia / dwc_agent

Ruby gem to cleanse Darwin Core terms containing people names prior to passing to its dependent parser. Comes with a command-line utility.
MIT License
5 stars 1 forks source link

Do better handling of names like Frère León #9

Closed dshorthouse closed 4 years ago

dshorthouse commented 4 years ago

The parser strips out Frère before it has a chance to be recognized as a title. This should be retained where as John Smith, frère would need to be stripped out cc @kcopas.

kcopas commented 4 years ago

That happened while I was attributing some specimens—it was weird. He'd hung around as Frère León for a reasonable amount of time before that.

dshorthouse commented 4 years ago

There was a time when I did not parse the canonical name from wikidata prior to entry into Bloodhound, but reverted that decision some time back, likely to accommodate things like particles. The decision broke handling of titles like Frère. Working on it now. Good news is once repaired, I won't have to reharvest all of GBIF as it's (mostly) a display issue.