peggynewman closed 4 months ago
Progress update: the data is in production, but we're waiting
Still working on this - new data is in play but old DRs need to be removed.
To do for @cha801p
Metadata:
Name: NatureMapr
Short description: A citizen science platform to upload plant and animal sightings to contribute to real world outcomes across Australia.
Long description: NatureMapr seeks to ensure every important plant and animal is known to the people charged in positions of power that can directly influence its protection, management or eradication. Anybody can report a plant or animal sighting in under a minute anywhere across Australia and:
- Promptly receive an expert identification of their record
- Be assured that the information will be received by the government organisations and research institutions that need to know about it
- Develop increased awareness and knowledge of important species through the sharing of knowledge within a thriving community
- dr14081 - 724 Records - Albury Wodonga Nature Map
- dr702 - 31970 Records - Atlas of Life in the Coastal Wilderness
- dr14021 - 9125 Records - Budawang Coast Nature Map
- dr1947 - 90014 Records - Canberra Nature Map
- dr736 - 4575 Records - Frogwatch ACT and Region
- dr15273 - 1191 Records - Southern Highlands Nature Map
The DR (or "dataset") record can't be totally deleted in GBIF because it has a DOI associated with it. GBIF (helpdesk@gbif.org) says:

> We could link the old datasets to the new one before deleting the associated occurrences. The pages, DOI and citations will be preserved and they will link to the new dataset, see this example: https://www.gbif.org/dataset/84aa5ee4-f762-11e1-a439-00145eb45e9a
>
> If so, you will need to publish the new dataset and send us its link. We will then make the changes necessary.
Find the corresponding datasets in GBIF (click on the DOI link on the DR collectory page) and compile an email to GBIF saying that the datasets are to be deleted and replaced by the main one: https://www.gbif.org/dataset/7ebef267-9d72-4c21-a276-cc84281a8590
Datasets have been deleted from GBIF.
Patricia is writing a DAG to delete dr from prod after which additional drs will be removed from prod. https://github.com/AtlasOfLivingAustralia/data-management/issues/890
The naturemapr website reports this at the bottom:
2,091,367 sightings of 18,686 species in 5,482 locations from 9,618 contributors
so it looks like the data feeds are only serving a fraction of the data. Is this right?
When we first brought the data in, they sent 350K records through their API, which included a bunch of eBird and BioNet records, and likely records from other museums etc. They've since removed them from the API. I'm guessing that in their front end they've included a bunch of other datasets. I can't confirm this.
Datasets have been deleted from biocache. dr702 dr14021 dr1947 dr736 dr15273 dr14081
Consolidate all of the NatureMapr data resources. Create a recurring job that pulls a full refresh from the API into a Darwin Core archive.
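As a rough illustration of the "full refresh into a Darwin Core archive" step, here is a minimal Python sketch. It is not the actual ALA job: `fetch_all` is a hypothetical callable standing in for whatever pages through the NatureMapr API, the five Darwin Core terms in `FIELDS` are an assumed subset, and the record keys are assumed to already match those term names. A real archive would normally also carry an `eml.xml` metadata file, which is left out here. Scheduling is left to whatever runs the job.

```python
import zipfile
from typing import Callable, Iterable, Mapping

DWC = "http://rs.tdwg.org/dwc/terms/"
# Assumed subset of Darwin Core terms, in occurrence.txt column order.
FIELDS = ["occurrenceID", "scientificName", "eventDate",
          "decimalLatitude", "decimalLongitude"]


def build_meta_xml(fields: list) -> str:
    """Build meta.xml, which maps occurrence.txt columns to Darwin Core terms."""
    lines = [
        '<?xml version="1.0" encoding="UTF-8"?>',
        '<archive xmlns="http://rs.tdwg.org/dwc/text/">',
        '  <core encoding="UTF-8" fieldsTerminatedBy="\\t" '
        'linesTerminatedBy="\\n" fieldsEnclosedBy="" ignoreHeaderLines="1" '
        f'rowType="{DWC}Occurrence">',
        '    <files><location>occurrence.txt</location></files>',
        '    <id index="0"/>',
    ]
    for i, name in enumerate(fields):
        lines.append(f'    <field index="{i}" term="{DWC}{name}"/>')
    lines += ['  </core>', '</archive>']
    return "\n".join(lines) + "\n"


def _clean(value) -> str:
    """Render a value as text, collapsing tabs/newlines so rows stay intact."""
    return "" if value is None else " ".join(str(value).split())


def write_archive(records: Iterable[Mapping], path: str) -> int:
    """Write records to a new archive at path; returns the number of rows."""
    rows = ["\t".join(FIELDS)]  # header row, skipped via ignoreHeaderLines="1"
    count = 0
    for rec in records:
        rows.append("\t".join(_clean(rec.get(f)) for f in FIELDS))
        count += 1
    # Mode "w" replaces any existing file, so each run is a full refresh.
    with zipfile.ZipFile(path, "w", zipfile.ZIP_DEFLATED) as zf:
        zf.writestr("occurrence.txt", "\n".join(rows) + "\n")
        zf.writestr("meta.xml", build_meta_xml(FIELDS))
    return count


def full_refresh(fetch_all: Callable[[], Iterable[Mapping]], path: str) -> int:
    """Pull everything from the source (hypothetical fetch_all) and rebuild."""
    return write_archive(fetch_all(), path)
```

Rebuilding the whole archive on every run, rather than appending deltas, means records removed upstream (like the eBird and BioNet records mentioned above) drop out of the consolidated data resource on the next load.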