grbot / varcall

Nextflow scripts for doing variant calling on human samples
MIT License
8 stars 13 forks source link

Intro

The repos contains individual wrokflows to process cohorts of human genome samples through alignment, calling, joint calling and variant quality score recalibration. Some additional workflows for pre and post BAM/gVCF manipulation/checking are also included.

To run

Each folder contains a workflow which is part of the main workflow or a separate workflow to check/prepare the data for a specific part of the main workflow.

Main workflow

Note: Can use combine-gvcfs or genomics-db-import for combining the gVCFs. genomics-db-import is however the latest method and allows for updating of an existing database / combined set.

Separate workflows

Note

Example

The GiaB dataset can be downloaded and used for testing

wget -O NA12878_R1.fastq.gz ftp://ftp-trace.ncbi.nih.gov/giab/ftp/data/NA12878/Garvan_NA12878_HG001_HiSeq_Exome/NIST7035_TAAGGCGA_L001_R1_001.fastq.gz
wget -O NA12878_R2.fastq.gz ftp://ftp-trace.ncbi.nih.gov/giab/ftp/data/NA12878/Garvan_NA12878_HG001_HiSeq_Exome/NIST7035_TAAGGCGA_L001_R2_001.fastq.gz