Closed kramesh1 closed 1 year ago
Edit: Separating out the upload pipe for NCBI into its own issue since that infrastructure has not been built at all vs. the locations listed here all have existing pipelines that could just use increased streamlining
Specific todo list here of upload features
Laying out the structure here to be built in a branch:
From any location, a user initiates a bjorn_utils command to "upload". This upload command does 3 things:
/alfheim/hcov-19-genomics/
alfheim
using the following located in /code
, with the save out locations being in hcov-19-genomics
as well./gisaid_uploader CoV upload --fasta /asgard/2021-10-25_release/msa/consensus_sequences/2021-10-25_release_unaligned_combined.fa --csv /asgard/2021-10-25_release/gisaid_metadata.csv --failedout /asgard/2021-10-25_release/gisaid_failed_metadata.csv
gsutils
HCoV-19-Genomics
repo, updates the readme, and uploads them
Need to automate the upload of files to these different sources
GISAID
Will review each of these portions with Mark and Karthik G. before taking out branch against them.