NHMDenmark / DanSpecify

Important files regarding the Danish instance of the Specify database system for collections digitisation and management, plus placeholder for issue tracking. Guidelines, manuals and other kinds of documentations will be gathered on the wiki.
3 stars 2 forks source link

Retirement of Botany type specimen GBIF dataset #269

Closed Sosannah closed 2 months ago

Sosannah commented 5 months ago

A huge part of old static dataset https://www.gbif.org/dataset/84d3829c-f762-11e1-a439-00145eb45e9a should be swapped out with a dynamic version originating from Specify https://www.gbif.org/dataset/0d1f9cee-7cb7-4d3a-a8c4-d2ca6edcd23b .

The dynamic version contains objects from only from phylum Tracheophyta, while the static one has several records from other phyla and a bunch of records with undetermined higher taxonomy.

We can provide relationship between the old occurrenceIDs and the new occurrenceIDs in case of the Tracheophyta ones (18.917 records), but we have no matching objects in the dynamic dataset for the remaining ~5.000 records.

The plan is to

  1. Move the non-tracheophyte occurrences to the respective datasets
  2. leave the tracheophyte occurrences in the dataset and
  3. change the endpoint for the tracheophytes afterwards.

Also a list pairing the new occurrenceIDs (UUIDs) with the old ones (URNs) should be provided to GBIF in case of Tracheophyte specimen to keep record level linkage wherever it's possible.

Sosannah commented 2 months ago

The records of this dataset of types (https://www.gbif.org/dataset/84d3829c-f762-11e1-a439-00145eb45e9a) has been distributed into the Plantae datasets from NHMD where these types belong taxonomically (i.e. Vascular plants, Algae, and Mosses). Since the majority of the records were vascular plants, we chose that for the dataset to be a duplicate of. Static dataset was retired.