GlobalNamesArchitecture / gnparser

Split scientific names to meaningful elements with meta information
https://parser.globalnames.org/
MIT License

Parse "apud" ("with") as a possible delimiter between authors #436

Closed: dimus closed this issue 6 years ago

dimus commented 6 years ago

"Pseudocercospora dendrobii Goh apud W.H. Hsieh 1990" means that Goh published the name with Hsieh.

Other examples:

Pseudocercospora dendrobii Goh & W.H. Hsieh 1990
Pseudocercospora dendrobii Goh and W.H. Hsieh 1990
Pseudocercospora dendrobii Goh apud W.H. Hsieh 1990
Lysimachia montana (Reinw.) Bakh. fil. apud Bentvelzen

alexander-myltsev commented 6 years ago

@dimus do you think it could be embedded in the authorEx rule?

alexander-myltsev commented 6 years ago

ppr.parse("Aus bus Allaaa apud Gaad") =>
  {:scientificName=>
    {:id=>"84507b62-4c85-5e10-8840-a2cc3fef131a",
     :parsed=>true,
     :parser_version=>"3.4.6",
     :verbatim=>"Aus bus Allaaa apud Gaad",
     :normalized=>"Aus bus Allaaa apud Gaad",
     :canonical=>"Aus bus",
     :hybrid=>false,
     :details=>
      [{:genus=>{:string=>"Aus"},
        :species=>
         {:string=>"bus",
          :authorship=>"Allaaa apud Gaad",
          :basionymAuthorTeam=>
           {:authorTeam=>"Allaaa apud Gaad", :author=>["Allaaa", "Gaad"]}}}],
     :parser_run=>1,
     :positions=>
      {0=>["genus", 3],
       4=>["species", 7],
       8=>["author_word", 14],
       20=>["author_word", 24]}}}

Add "apud" alongside "&" and "and" as an author delimiter, but keep it as "apud" in the normalized output.
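
The idea can be sketched roughly as follows. This is a minimal illustration and not gnparser's actual grammar: `AUTHOR_DELIMITER` and `split_authorship` are hypothetical names, and rewriting "and" to "&" in the normalized form is an assumption made only for this example. The one behaviour taken from the thread is that "apud" separates two authors and stays "apud" after normalization.

```ruby
# Hypothetical sketch, not the gnparser implementation.
# Words that may separate two authors; the capture group makes
# String#split return the delimiters along with the author names.
AUTHOR_DELIMITER = /\s+(&|and|apud)\s+/

# Splits an authorship string into authors, an optional trailing year,
# and a normalized authorship string.
def split_authorship(authorship)
  text = authorship.strip
  year = text[/\s(\d{4})\z/, 1]            # trailing 4-digit year, if any
  text = text.sub(/\s\d{4}\z/, "") if year

  tokens = text.split(AUTHOR_DELIMITER)    # e.g. ["Goh", "apud", "W.H. Hsieh"]
  pairs = tokens.each_slice(2).to_a        # [[author, delimiter], ..., [author]]
  authors = pairs.map(&:first)
  delimiters = pairs.map { |pair| pair[1] }.compact

  normalized = authors.first.dup
  delimiters.each_with_index do |delimiter, i|
    # Assumption: "and" becomes "&"; "apud" is preserved as "apud".
    separator = delimiter == "apud" ? "apud" : "&"
    normalized << " #{separator} #{authors[i + 1]}"
  end

  { authors: authors, year: year, normalized: normalized }
end

result = split_authorship("Goh apud W.H. Hsieh 1990")
puts result[:authors].inspect
puts result[:normalized]
```

With this sketch, "Goh & W.H. Hsieh 1990", "Goh and W.H. Hsieh 1990" and "Goh apud W.H. Hsieh 1990" all produce the same two authors; only the "apud" form keeps its delimiter word in the normalized string.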