broadinstitute / viral-pipelines

viral-ngs: complete pipelines
Other
51 stars 28 forks source link

add new workflow scaffold_and_refine_multitaxa #506

Closed dpark01 closed 7 months ago

dpark01 commented 8 months ago

This PR adds a new workflow called scaffold_and_refine_multitaxa which runs scaffold_and_refine on one input sample (contigs + reads) against many reference genomes from different taxa of interest. This is designed to attempt to assemble all taxa of interest for every sample, and will produce partial and empty outputs for all unsuccessful sample x taxon combinations. It is intended for high throughput metagenomic analyses.

This includes a few updates to tasks to make them more resilient to empty fasta inputs/outputs: