AtlasOfLivingAustralia / data-management

Data management issue tracking
7 stars 0 forks source link

Merge Data Load : APC platypus & rakali records for ALA June-Aug 2023 #975

Closed cha801p closed 3 weeks ago

cha801p commented 9 months ago

Metadata

Data Prep

Data Load

cha801p commented 9 months ago

Ticket Update: September 12, 2023 (2 PM)

Issue: Refresh of APC data for June to August

Resolution: The new dataset has been successfully loaded into biocache. (Verified by loading on databox)

Actions Taken: The data has been loaded into databox and prod. I have verified the UUID count and will ensure it reflects on prod tomorrow after indexing.

Log Records: newUuids: 203.0, preservedUuids: 6835.0, orphanedUniqueKeys: 108.0 INFO [2023-09-12 02:38:25,037+0000] [main] au.org.ala.pipelines.beam.ALAUUIDMintingPipeline: Percentage UUID change: 2, allowed percentage: 50, override percentage check: false

Useful Links: Databox: https://collections-test.ala.org.au/public/show/dr8128 Prod: https://collections.ala.org.au/public/show/dr8128

Challenges Encountered: Modified "catalogueNumber" to "catalogNumber". Fixed date format; This problem may have arisen due to the xlsx file format.

cha801p commented 9 months ago

Prod count: 7,038 records Link: https://collections.ala.org.au/public/show/dr8128

Data refreshed done successfully.