mskcc / tempo

CCS research pipeline to process WES and WGS TN pairs
12 stars 5 forks source link

add custom read support filters in Delly and Manta Somatic calling #968

Closed anoronh4 closed 2 years ago

anoronh4 commented 2 years ago

Based on an analysis of precision and recall compared to other datasets, we are incorporating a few filters into Delly and Manta somatic structural variant outputs:

In this PR the variants will not be removed from the file entirely, instead the FILTER value will be updated with normal_read_supp and tumor_read_supp, respectively. Based on a prior analysis where we compared to pcawg calls, the estimate accuracy of the pipeline including the aforementioned filters:

Also the containers for strelka and manta were updated to cmopipeline/strelka2-manta-bcftools-vt:2.0.0. The previous container cmopipeline/strelka2_manta:latest is much older and does not contain bcftools or vt, even though it seems to have the same version of manta and both have python2. The container folder for cmopipeline/strelka2_manta:latest was removed as it is no longer necessary.

anoronh4 commented 2 years ago

Refining Filters I had a few findings when comparing our calls to pcawg using the filters we have been trying out: