AtlasOfLivingAustralia / data-management

Data management issue tracking

Remove Weedscan test data #980

Closed: peggynewman closed 3 months ago

peggynewman commented 1 year ago

  1. Soft delete the Weedscan data ASAP
  2. Disable the Weedscan schedule

Data resource: https://collections.ala.org.au/public/show/dr21567

Further discussions are needed on filtering and excluding private data

cha801p commented 1 year ago

Ticket Update: September 25, 2023 (5 PM)

Issue Resolved:

  1. Soft delete the Weedscan data
  2. Disable the Weedscan schedule

Solution:

  1. We have successfully soft-deleted the Avro files for dataset dr21567.
  2. Regarding disabling the Weedscan schedule: https://github.com/AtlasOfLivingAustralia/preingestion/issues/195 implies that the job was not scheduled because the data provider has not supplied any data updates. There is therefore currently no need to deactivate the Weedscan schedule.

Actions Taken: Executed the "Delete_dataset_dag" with the following parameters: { "datasetIds": "dr21567", "remove_records_in_solr": "true", "remove_records_in_es": "false", "delete_avro_files": "true", "retain_dwca": "true", "retain_uuid": "true" }
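For anyone repeating this, here is a rough sketch of how the same parameters could be packaged for a DAG trigger. It assumes "Delete_dataset_dag" runs on Apache Airflow 2 and is triggered through the stable REST API, which this ticket does not confirm. The base URL and the `build_dag_run_request` helper are hypothetical, and the sketch only builds the request without sending it.

```python
import json

# Parameters copied from the comment above; values are strings, as posted.
DELETE_PARAMS = {
    "datasetIds": "dr21567",
    "remove_records_in_solr": "true",
    "remove_records_in_es": "false",
    "delete_avro_files": "true",
    "retain_dwca": "true",
    "retain_uuid": "true",
}


def build_dag_run_request(base_url, dag_id, conf):
    """Return (url, body) for a DAG-run trigger request.

    Hypothetical helper: assumes the Airflow 2 stable REST API layout
    (POST /api/v1/dags/{dag_id}/dagRuns with a JSON body holding "conf").
    """
    url = f"{base_url.rstrip('/')}/api/v1/dags/{dag_id}/dagRuns"
    body = json.dumps({"conf": conf}, indent=2)
    return url, body


# "https://airflow.example.org" is a placeholder, not the real ALA host.
url, body = build_dag_run_request(
    "https://airflow.example.org", "Delete_dataset_dag", DELETE_PARAMS
)
print(url)
print(body)
```

Note that `retain_dwca` and `retain_uuid` are both "true" here, which matches a soft delete: the Avro files and Solr records go, while the archive and record identifiers are kept.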

Validation: I have confirmed the following:

peggynewman commented 1 year ago

Thanks Raj, we'll need to go through the steps to remove it from GBIF as well, as per this issue

Done.