gbif / checklistbank

GBIF Checklist Bank
Apache License 2.0
31 stars 14 forks source link

Regression for Pieris rapae (Linnaeus, 1758) #242

Closed tobiasgf closed 2 years ago

tobiasgf commented 2 years ago

Is it OK that the genus key changes as the only thing?

77K out of 1.6M records (of that species) affected

{
  "count": 77823,
  "verbatim_kingdom": "Animalia",
  "verbatim_phylum": "null",
  "verbatim_class": "Insecta",
  "verbatim_order": "null",
  "verbatim_family": "Pieridae",
  "verbatim_genus": "null",
  "verbatim_species": "null",
  "verbatim_infra": "null",
  "verbatim_rank": "Species",
  "verbatim_verbatimRank": "null",
  "verbatim_scientificName": "Pieris rapae (Linnaeus, 1758)",
  "verbatim_generic": "null",
  "verbatim_author": "null",
  "current_kingdom": "Animalia",
  "current_phylum": "Arthropoda",
  "current_class": "Insecta",
  "current_order": "Lepidoptera",
  "current_family": "Pieridae",
  "current_genus": "Pieris",
  "current_subGenus": "null",
  "current_species": "Pieris rapae",
  "current_scientificName": "Pieris rapae (Linnaeus, 1758)",
  "current_acceptedScientificName": "Pieris rapae (Linnaeus, 1758)",
  "current_kingdomKey": 1,
  "current_phylumKey": 54,
  "current_classKey": 216,
  "current_orderKey": 797,
  "current_familyKey": 5481,
  "current_genusKey": 7236092,
  "current_subGenusKey": "null",
  "current_speciesKey": 1920496,
  "current_taxonKey": 1920496,
  "current_acceptedTaxonKey": 1920496,
  "proposed_kingdom": "Animalia",
  "proposed_phylum": "Arthropoda",
  "proposed_class": "Insecta",
  "proposed_order": "Lepidoptera",
  "proposed_family": "Pieridae",
  "proposed_genus": "Pieris",
  "proposed_subGenus": "null",
  "proposed_species": "Pieris rapae",
  "proposed_scientificName": "Pieris rapae (Linnaeus, 1758)",
  "proposed_acceptedScientificName": "Pieris rapae (Linnaeus, 1758)",
  "proposed_kingdomKey": 1,
  "proposed_phylumKey": 54,
  "proposed_classKey": 216,
  "proposed_orderKey": 797,
  "proposed_familyKey": 5481,
  "proposed_genusKey": 1920481,
  "proposed_subGenusKey": "null",
  "proposed_speciesKey": 1920496,
  "proposed_taxonKey": 1920496,
  "proposed_acceptedTaxonKey12727": 1920496,
  "_key": 2117,
  "changes": {
    "genusKey": true
  },
  "reviewed": false
}
mdoering commented 2 years ago

At least there are 2 Pieris genera existing:

   id    |           constituent_key            | source_taxon_key |  status  | rank  |   scientific_name    
---------+--------------------------------------+------------------+----------+-------+----------------------
 7236092 | d8fb1600-d636-4b35-aa0d-d4f292c1b424 |        137278086 | ACCEPTED | GENUS | Pieris Schrank, 1801
 1920481 | 2d59e5db-57ad-41ff-97d6-11f5fb264527 |        173217851 | DOUBTFUL | GENUS | Pieris Hübner, 1819

Both exist like that already: https://www.gbif.org/species/7236092 https://www.gbif.org/species/1920481

But I would have expected the species to be included in the accepted genus 7236092, not the doubtful one! This is exciting. The 2 genera above are both Lepidoptera, but there is also a plant genus: 2d59e5db

COL does not include any authorship for the butterfly genus, so the first dataset with an accepted genus and author defines the authorship used for the butterfly genus Pieris. This happens to be WoRMS: https://www.gbif.org/species/181182919

And WoRMS appears to accept the newer author Hübner. IRMNG, which was our previous source for this genus, lists all genera as accepted so our previous build probably picked one of them randomly: https://www.gbif.org/species/search?q=Pieris&rank=GENUS&dataset_key=0938172b-2086-439c-a1dd-c21cb0109ed5&qField=SCIENTIFIC&advanced=1

I would think it is fine to have the genus change from Schrank to Hübner, but cannot really judge it all.