Closed evelyn-schmidt closed 2 weeks ago
Just to document the rationale here. The optional localization (streaming the input BAMs from a cloud bucket) works almost all the time. But we have hit a case where it fails consistently on a single shard causing the whole workflow to fail. This is similar to the behavior we saw with VarScan where we were not able to understand why.
For our current use cases the inconvenience of a workflow run failure is high. We have a small number of cases but all require clinically compatible turnaround times. The increased robustness of localizing the inputs outweighs the cost savings for our application.
Attempting to formalize documentation of pipeline releases, we have chosen some sample datasets to re-run through releases of the pipeline. Currently, I am testing v1.1.4 using the pancreas trail case 5120-17
. This is the case that originally inspired this PR because of its strange failure on shard-3 of Mutect. My re-run also resulted in the same failure (since this PR was not merged).
I am rerunning case 5120-17
with these fixes to confirm that turning off localization_optional resolves this issue. If it does, then we should consider turning off this setting in other parts of the pipeline, for example, VarScan.
Confirmed that this did fix the problem. Include in next formal release.
Files where localization_optional:true
. @malachig maybe we should changed these all to false?
@evelyn-schmidt it looks like these additional places do not involve scattered tasks? So we are probably less likely to encounter problems with those. If so, I think we can leave them for now as we haven't noticed any problems to date.
Just adding a comment here that I also encountered an error on shard-3 of a completely different case. Exit code 247
.
Required file output '/cromwell_root/mutect.filtered.vcf.gz' does not exist.
And once again, applying this fix seems to have resolved the issue.
Change
localization_optional:true
to false.https://github.com/wustl-oncology/analysis-wdls/blob/c89037d4164215c3d73f8fd766f63c92933dcfc2/definitions/tools/mutect.wdl#L19-L22