ksumngs / yavsap

Yet Another Viral Subspecies Analysis Pipeline
https://ksumngs.github.io/yavsap
MIT License

[Feature]: Add workflow for downloading reference genomes #47

Open MillironX opened 2 years ago

MillironX commented 2 years ago

Summary

There should be a way to pre-download all of the preset genomes from NCBI so that YAVSAP can be run offline.

Added Features

Additional parameters

Propose any new parameters that would be needed to implement this feature, including its name, type, and default value, and explain how it would be consumed by the new/existing processes.

Additional output

  1. Reference genome fasta/GenBank file corresponding to --genome
  2. Strain reference genome fasta/GenBank file containing all of the sequences from --genome_list
  3. Strain reference genome TSV file that points to the file from item 2 in a format YAVSAP can recognize (e.g., from #44, /data/yavsap/results/genomes.fasta:NC_01234.1).

These files should probably be output even when the pipeline downloads the genomes directly at run time (the way it's done now).
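
As a rough illustration of outputs 2 and 3, here is a minimal Python sketch (not part of YAVSAP). It builds an NCBI EFetch URL for a set of accessions and formats TSV rows that point back into the combined FASTA file using the `path:accession` form mentioned above. The helper names, the strain name `strainA`, and the two-column layout (strain name, then `path:accession`) are assumptions for illustration only; the real column layout is whatever #44 defines.

```python
from urllib.parse import urlencode

# Public NCBI E-utilities EFetch endpoint
EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def efetch_url(accessions, rettype="fasta"):
    """Build an EFetch URL returning all accessions in one text response.

    rettype may be "fasta" or "gb" (GenBank flat file).
    """
    query = urlencode(
        {
            "db": "nucleotide",
            "id": ",".join(accessions),
            "rettype": rettype,
            "retmode": "text",
        }
    )
    return f"{EFETCH_URL}?{query}"


def strain_tsv(strains, fasta_path):
    """Format (strain_name, accession) pairs as TSV rows.

    Assumed layout (hypothetical): strain name, a tab, then
    "<fasta_path>:<accession>".
    """
    rows = [f"{name}\t{fasta_path}:{accession}" for name, accession in strains]
    return "\n".join(rows) + "\n"


if __name__ == "__main__":
    strains = [("strainA", "NC_01234.1")]  # hypothetical strain name
    print(efetch_url([acc for _, acc in strains]))
    print(strain_tsv(strains, "/data/yavsap/results/genomes.fasta"), end="")
```

The URL could be fetched once ahead of time (for example with `curl` or `urllib.request.urlopen`) and the response saved to the path used in the TSV, so that later runs need no network access.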