CDPHE-bioinformatics / CDPHE-SARS-CoV-2

Workflows and scripts for the assembly and analysis of SARS-CoV-2 whole genome tiled amplicon sequencing.
https://cdphe-bioinformatics.github.io/CDPHE-SARS-CoV-2/
GNU General Public License v3.0
5 stars 0 forks source link

[REQUEST] Version Capture Files for Sample Level Workflows #36

Closed molly-hetheringtonrauth closed 2 months ago

molly-hetheringtonrauth commented 3 months ago

Feature Request

The version capture output file lists the versions and docker containers for each program used in the workflow. The way we are handling version capture files for the sample level workflow causes an issue. Currently, a version capture file for each sample is created and transferred to the bucket; however because each file is named the same, each sample's file overwrites the previous sample's file that was transferred to the bucket. This causes an issue, if the last file that is transferred is incomplete (ie. the sample failed).

Solution

Upstream effects

None

Downstream effects

danpolanco commented 3 months ago

We currently do not import the version capture files into BigQuery so we have some flexibility here. We might want to try a few files to start, but I wouldn't make that a hard requirement since they csv. Instead, since we might not ever need the version capture files in BigQuery, I'd say we cross that bridge when we get there. Open to other opinions though!

danpolanco commented 2 months ago

This may be obsolete if we change to set level workflows. See #38.