etl worker job/pod
As an ACED devops engineer, in order to maintain the ACED datasets,I need to be able to run the ETL process on a regular basis, leveraging the environment provided to the etl k8s pod.
See etl-job/README
The root home directory will have virtual environment with all dependencies loaded
The Helm chart will mount the following directories into the ETL pod:
As an ACED analyst, in order to make the data available to researchers, I need to be able to upload files and associate them with a study, patient, specimen or observation
# optionally edit the metadata
# files created from the previous step
ls -1 <DIR>
DocumentReference.ndjson
ResearchStudy.ndjson
Copy the metadata to the bucket and publish the metadata to the portal