broadinstitute / gatk-sv

A structural variation pipeline for short-read sequencing
BSD 3-Clause "New" or "Revised" License
160 stars 71 forks source link

Refactor workflow inputs/output for consistency #668

Open VJalili opened 2 months ago

VJalili commented 2 months ago

Workflows may have different input variable names for the same inputs, or the output variable name of one workflow may not match the input variable name of its downstream workflow where the output variable is piped to the input variable. For instance, both GatherSampleEvidence and EvidenceQC take a reference .fasta as input, but the variable names are different: GatherSampleEvidence.reference_fasta vs. EvidenceQC.genome_file.

While such inconsistencies do not have any functional impact, they can be confusing when preparing inputs to workflows (e.g., it is not clear from the variable name that reference_fasta and genome_file both expect the same input unless consulting with the configured Terra workspace or its corresponding templates), or setting up a new Terra workspace.

This PR makes the following refactoring: