globalbioticinteractions / nomer

maps identifiers and names to other identifiers and names
GNU General Public License v3.0
19 stars 3 forks source link

Duplicate rows as seen by https://github.com/GlobalNamesArchitecture/dwca_hunter/issues/53 #72

Closed seltmann closed 2 years ago

seltmann commented 2 years ago

While working on indexing the Discoverlife data, duplicate rows were observed per this issue .

jhpoelen commented 2 years ago

@seltmann In the upcoming Nomer release, an attempt is made to no longer list self-referential synonym name relations.

In previous version, a self-referential synonym of Pseudapis neumayeri was generated via nomer ls discoverlife.

Now, only the accepted name relation is provided:

$ nomer ls discoverlife | grep "Pseudapis neumayeri"
[main] INFO org.globalbioticinteractions.nomer.match.TermMatcherRegistry - using matcher [discoverlife-taxon]
https://www.discoverlife.org/mp/20q?search=Pseudapis+neumayeri  Pseudapis neumayeri HAS_ACCEPTED_NAME   https://www.discoverlife.org/mp/20q?search=Pseudapis+neumayeri  Pseudapis neumayeri species     Animalia | Arthropoda | Insecta | Hymenoptera | Halictidae | Pseudapis neumayeri    https://www.discoverlife.org/mp/20q?search=Animalia | https://www.discoverlife.org/mp/20q?search=Arthropoda | https://www.discoverlife.org/mp/20q?search=Insecta | https://www.discoverlife.org/mp/20q?search=Hymenoptera | https://www.discoverlife.org/mp/20q?search=Halictidae | https://www.discoverlife.org/mp/20q?search=Pseudapis+neumayeri  kingdom | phylum | class | order | family | species https://www.discoverlife.org/mp/20q?search=Pseudapis+neumayeri  
seltmann commented 2 years ago

@jhpoelen I have reviewed and the self-referential synonym is removed and the zendo publication is updated: Seltmann, Katja, & Poelen, Jorrit. (2022). Tab and comma delimited versions of Discover Life bee species guide and world checklist (Hymenoptera: Apoidea: Anthophila) (v55.2) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.6147345

jhpoelen commented 2 years ago

@seltmann thanks for checking my work. Glad to hear that @dimus was able to index the DiscoverLife bee checklist now https://github.com/GlobalNamesArchitecture/dwca_hunter/issues/53 .

from https://resolver.globalnames.org/ searching by Apis mellifera -

Screenshot from 2022-02-22 11-38-53