nf-core / modules

Repository to host tool-specific module files for the Nextflow DSL2 community!
https://nf-co.re/modules
MIT License
280 stars 714 forks source link

new subworkflow: gatk_joint_germline_variant_calling #1126

Open GCJMackenzie opened 2 years ago

GCJMackenzie commented 2 years ago

This subworkflow will cover the gatk4 best practices workflow: https://gatk.broadinstitute.org/hc/en-us/articles/360035535932-Germline-short-variant-discovery-SNPs-Indels- starting with the analysis ready reads of multiple individual samples*, haplotypecaller is then run in gvcf mode for each sample, the output gvcfs are then combined into a single genomicsdb workspace, and then run through genotypeGVCFs to perform joint germline genotyping and produce a joint vcf file. VQSR is then run to recalibrate the results file using variantrecalibrator and applyvqsr**.

* setting run_haplotc to false will skip haplotypecaller and use individual gvcf files as input instead, which go directly to genomicsdbimport. ** setting run_vqsr to false will skip VQSR recalibration, meaning the run ends with the output of genotypeGVCFs

GCJMackenzie commented 2 years ago

To Do list: