ACED-IDP / aced_etl_pod

etl worker pod
MIT License
1 stars 1 forks source link

etl

etl worker job/pod

use case

As an ACED devops engineer, in order to maintain the ACED datasets,I need to be able to run the ETL process on a regular basis, leveraging the environment provided to the etl k8s pod.

Implementation: Docker k8s job/pod image

image

See etl-job/README

As an ACED analyst, in order to make the data available to researchers, I need to be able to upload files and associate them with a study, patient, specimen or observation

Dependencies

image

image

Metadata generation

image

# optionally edit the metadata
# files created from the previous step
ls -1 <DIR>
DocumentReference.ndjson
ResearchStudy.ndjson

Metadata publication

Copy the metadata to the bucket and publish the metadata to the portal

image