You should use https://github.com/KCCG/mity/
mity
is a bioinformatic analysis pipeline designed to call mitochondrial SNV and INDEL variants from Whole Genome Sequencing (WGS) data. mity
can:
mity -h
Installation instructions via Docker, pip, or manually are available in INSTALL.md
This is an example of calling variants in the Ashkenazim Trio.
First run mity call
on three MT BAMs provided in mity/test_in.
We recommend always using --normalise
, or mity report
won't work:
mity call \
--prefix ashkenazim \
--out-folder-path test_out \
--region MT:1-500 \
--normalise \
test_in/HG002.hs37d5.2x250.small.MT.RG.bam \
test_in/HG003.hs37d5.2x250.small.MT.RG.bam \
test_in/HG004.hs37d5.2x250.small.MT.RG.bam
This will create test_out/normalised/ashkenazim.mity.vcf.gz
(and tbi file).
We can create a mity report
on the normalised VCF:
mity report \
--prefix ashkenazim \
--min_vaf 0.01 \
--out-folder-path test_out \
test_out/ashkenazim.mity.vcf.gz
This will create: test_out/ashkenazim.annotated_variants.csv
and test_out/ashkenazim.annotated_variants.xlsx
.
High-depth sequencing and sensitive variant calling can create many variants with more than 2 alleles, and in some
cases, joins two nearby variants separated by shared REF
sequence into a multi-nucleotide polymorphism
as discussed in the manuscript. Here, variant normalisation relates to decomposing the multi-allelic variants and
where possible, splitting multi-nucleotide polymorphisms into their cognate smaller variants. At the time of writing,
all variant decomposition tools we used failed to propagate the metadata in a multi-allelic variant to the split
variants which caused problems when reporting the quality scores associated with each variant.
Technically you can run mity call
and mity normalise
separately, but since mity report
requires a normalised
vcf file, we recommend running mity call --normalise
.
You can merge a nuclear vcf.gz file and a mity.vcf.gz file thereby replacing the MT calls from the nuclear VCF (
presumably from a caller like HaplotypeCaller which is not able to sensitively call mitochondrial variants) with
the calls from mity
.
mity merge \
--prefix ashkenazim \
--mity_vcf test_out/ashkenazim.mity.vcf.gz \
--nuclear_vcf todo-create-example-nuclear.vcf.gz
Assuming that you are looking for a pathogenic variant underlying a patient with a rare genetic disorder potentially caused by a Mitochondrial mutation, then we recommend the following strategy:
We would like to thank:
mity
.mity
as a packageFreeBayes
and his early feedback in optimising FreeBayes
for sensitive variant detection.gsort