AtlasOfLivingAustralia / data-management

Data management issue tracking
7 stars 0 forks source link

New Data Load : SREBA invertebrate data #1028

Closed cha801p closed 2 weeks ago

cha801p commented 4 months ago

https://collections-test.ala.org.au/dataResource/show/dr22377

cha801p commented 4 months ago

Ticket Update: February 16, 2024 (12:30 PM)

Issue: New dataset to load.

Solution: Successfully load the new dataset into biocache

Actions Taken: Successfully loaded the data on the test environment

Issues Encountered: The issue with the column name was identified This caused ingest_small_dataset to fail twice The column name was fixed and DwcA was created locally Loaded the data on biocache-test

Loaded data for review: https://biocache-test.ala.org.au/occurrences/search?q=data_resource_uid:dr22377

Status: Waiting for the customer review

cha801p commented 4 months ago

Issue with "Individual count" column flagged by data provider - I think the column is formatted as float, it should be plain text or int

cha801p commented 4 months ago

Ticket Update: February 28, 2024 (1 PM)

Issue: New dataset to load.

Solution: Successfully load the new dataset into biocache

Actions Taken:

Logs: INFO [2024-02-28 00:54:47,422+0000] [main] au.org.ala.pipelines.beam.ALAUUIDMintingPipeline: Running the pipeline INFO [2024-02-28 00:54:48,147+0000] [main] au.org.ala.pipelines.beam.ALAUUIDMintingPipeline: As this is an initial load for this dataset, UUID change checks are not ran. INFO [2024-02-28 00:54:48,147+0000] [main] au.org.ala.pipelines.beam.ALAUUIDMintingPipeline: Pipeline complete. INFO [2024-02-28 00:54:48,147+0000] [main] au.org.ala.pipelines.beam.ALAUUIDMintingPipeline: Checking for backups to prune..../data/pipelines-data/dr25133/1/identifiers INFO [2024-02-28 00:54:48,148+0000] [main] au.org.ala.pipelines.beam.ALAUUIDMintingPipeline: Writing metrics..... INFO [2024-02-28 00:54:48,149+0000] [main] org.gbif.pipelines.common.beam.metrics.MetricsHandler: Trying to write pipeline's metadata to a file - /data/pipelines-data/dr25133/1/uuid-metrics.yml INFO [2024-02-28 00:54:48,150+0000] [main] org.gbif.pipelines.common.beam.metrics.MetricsHandler: Added pipeline metadata - newUuidsAttempted: 1845,

Loaded data for review: https://collections.ala.org.au/public/show/dr25133

Status: Waiting for indexing

cha801p commented 4 months ago

Ticket Update: February 29, 2024 (12:30 PM)

Issue: The "individual count" column issue continues to persist after the indexing

Solution: Reload the data

Actions Taken:

Loaded data for review: https://collections.ala.org.au/public/show/dr25133

Status: Waiting for indexing

cha801p commented 4 months ago

Status: Data loaded on Prod and links sent to the data provider for review