bcbio / bcbio-nextgen

Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis
https://bcbio-nextgen.readthedocs.io
MIT License
986 stars 354 forks source link

GATK haplotype caller and UnifiedGenotyper for InDels only #1086

Closed apastore closed 8 years ago

apastore commented 8 years ago

I am wondering if is possible to keep only the indel call from GATK caller. I have realized that otherwise I get a large number of SNPs gremline events.

variantcaller: [gatk,gatk-haplotype,varscan,mutect,freebayes] ensemble: numpass: 2

I am also wondering why the -nWayOut parameter in not set for IndelRealigner Tumor Normal pair?

Thanks a lot ! Alessandro

chapmanb commented 8 years ago

Alessandro; We don't have an option to only call indels in bcbio. The best approach is to subset your calls after they come out of bcbio with bcftools:

bcftools view -v indels

We haven't implemented pooled realignment as part of bcbio, as our validations haven't found much value in doing this (http://bcb.io/2013/10/21/updated-comparison-of-variant-detection-methods-ensemble-freebayes-and-minimal-bam-preparation-pipelines/). In the next release we plan to default to not doing recalibration and realignment.

Practically if you're doing somatic calling, you'll likely want to stick with callers that handle lower frequency mutations. I'd suggest:

variantcaller: [vardict, freebayes, mutect]

if you want multiple callers, or only vardict if you'd like a single caller. It does local rearrangement so especially avoids the need for seperate realignment. Here is an in-progress writeup with the latest validations of callers (https://github.com/bcbio/bcbio.github.io/blob/master/_posts/2015-10-05-vardict-filtering.md). Hope this helps.

apastore commented 8 years ago

Thanks as usual for your exhaustive replay!

On Oct 29, 2015, at 8:37 PM, Brad Chapman notifications@github.com wrote:

Alessandro; We don't have an option to only call indels in bcbio. The best approach is to subset your calls after they come out of bcbio with bcftools:

bcftools view -v indels We haven't implemented pooled realignment as part of bcbio, as our validations haven't found much value in doing this (http://bcb.io/2013/10/21/updated-comparison-of-variant-detection-methods-ensemble-freebayes-and-minimal-bam-preparation-pipelines/ http://bcb.io/2013/10/21/updated-comparison-of-variant-detection-methods-ensemble-freebayes-and-minimal-bam-preparation-pipelines/). In the next release we plan to default to not doing recalibration and realignment.

Practically if you're doing somatic calling, you'll likely want to stick with callers that handle lower frequency mutations. I'd suggest:

variantcaller: [vardict, freebayes, mutect] if you want multiple callers, or only vardict if you'd like a single caller. It does local rearrangement so especially avoids the need for seperate realignment. Here is an in-progress writeup with the latest validations of callers (https://github.com/bcbio/bcbio.github.io/blob/master/_posts/2015-10-05-vardict-filtering.md https://github.com/bcbio/bcbio.github.io/blob/master/_posts/2015-10-05-vardict-filtering.md). Hope this helps.

— Reply to this email directly or view it on GitHub https://github.com/chapmanb/bcbio-nextgen/issues/1086#issuecomment-152368088.