broadinstitute / gatk-protected

Obsolete/Legacy GATK repository -- go to https://github.com/broadinstitute/gatk instead
BSD 3-Clause "New" or "Revised" License
33 stars 20 forks source link

Use bigger kmers in assembly #1114

Closed davidbenjamin closed 7 years ago

davidbenjamin commented 7 years ago

HaplotypeCaller and Mutect by default assemble reads with kmer sizes of 10 and 25. 10 seems extremely small given the low error rates of Illumina sequencing. It's worth investigating how the Mutect validations are affected by increasing these values.

ldgauthier commented 7 years ago

Investigate away, but keep in mind bigger kmers introduce more "dangling tails", which may end up dropping evidence at the ends of reads.

If you end up diving into the assembly graphs, I'm happy to consult. It's a deep, dark rabbit hole, but I've been there before and I know the way. ;)

droazen commented 7 years ago

Issue moved to broadinstitute/gatk #3024 via ZenHub