Snakemake workflow for variant calling from raw sample sequences, with lots of bells and whistles.
Advantages:
See --> the Wiki pages <-- for setup and documentation.
For questions, bug reports, and feature requests, please open an issue. Please do not send emails with questions or requests, as others might be having them as well, and so it is better to discuss them where they can be found.
Minimal input:
fasta
filefastq
filesvcf
file of known variants to restrict the variant calling processProcess and available tools:
Typical output:
vcf
, raw and filtered, and potentially with annotationsvcf
)Intermediate output files such as bam
files are also kept by default,
and mpileup
files can optionally be created if needed.
In addition to the above tools, there are some tools used as glue between the steps.
If you are interested in the details, have a look at the snakemake rules for each step.
When using grenepipe, please cite:
grenepipe: A flexible, scalable, and reproducible pipeline
to automate variant calling from sequence reads.
Lucas Czech and Moises Exposito-Alonso. Bioinformatics. 2022.
doi:10.1093/bioinformatics/btac600 [pdf]
Furthermore, please do not forget to cite all tools that you selected to be run for your analysis. See our Wiki for their references.