AtlasOfLivingAustralia / data-management

Data management issue tracking
7 stars 0 forks source link

Data load : Australian Museum #652

Open peggynewman opened 3 years ago

peggynewman commented 3 years ago

AM has migrated onto EmU version 6.3. Previously a job has run at their end which was written by us, which created extracts and SFTPd them to the upload server.

A new job needs to be written. Investigate the current Jenkins job and files (eg look in the raw files on the upload server) and we will work with AM to build a new extract and deliver/ETL.

sadeghim commented 6 months ago

Server inaccessible, AM working on it.

sadeghim commented 5 months ago

IMu driver is implemented. Reading other related tables to make up the final DwCA

rosemaryjoconnor commented 4 months ago

Update 29/02/2024

Databox load:

Waiting on:

rosemaryjoconnor commented 4 months ago

Mtg: 01/03/2024

Outstanding for AM

Outstanding for ALA

rosemaryjoconnor commented 3 months ago

27/03/2024

New IP address whitelisted by AM To do:

rosemaryjoconnor commented 3 months ago

28/03/2024 - Meeting

ALA

AM

peggynewman commented 2 months ago

In AM in test, there are 5k-ish records that don't have a AM as an institution: In this sample record it's clear that there is no provider map for the collectionCode value Malacology, Evolutionary Biology Unit I think this isn't actually a collection, but Malacology is, so we probably need to either update the collectionCode in the data, or the collection name on collectory (collections list here), and maybe the provider map if there is a new collection.

rosemaryjoconnor commented 2 months ago

12/04/2024

Status Update

rosemaryjoconnor commented 2 months ago

17/04/2024

Images Image URLs can be valid when no image with the width specified exists. This is due to the URL being for PHP code which takes the record key and retrieves the image if it exists. This will be an ongoing problem as an empty 'text' image file is created in image service. Need to be able to delete these.

Databox

To Do

sadeghim commented 1 week ago

18/06/2024 Ingested all the records (without images) on databox and notified the data provider.