cumc / xqtl-protocol

Molecular QTL analysis protocol developed by ADSP Functional Genomics Consortium
https://cumc.github.io/xqtl-protocol/
MIT License
38 stars 42 forks source link

VCF_QC raise error when lacking certain tags #428

Open gaow opened 2 years ago

gaow commented 2 years ago

Report from Travyse: in the VCF_QC step the VCF file that I was provided lacks the DP tag, so the pipeline quits on error.

hsun3163 commented 2 years ago

This is one of the required field for vcf, as documented here:

The target vcf.gz file

If its chromosome name does not have the chr prefix and you need it to match with reference fasta file, please run rename_chrs workflow to add chr.
The vcf.gz file needs to be compressed by bgzip, instead of simple gzip
It should have a index file accompanying it. The index file can be generated by tabix
It must be a valid vcf.gz file that can pass bcftools sanity check: i.e. all tags are defined properly
It must contains following fields:
##FORMAT=<ID=DP,Number=1,Type=Integer,Description="Approximate read depth (reads with MQ=255 or with bad mates are filtered)">
##FORMAT=<ID=GQ,Number=1,Type=Integer,Description="Genotype Quality">
##FORMAT=<ID=AB,Number=1,Type=Float,Description="Allele balance for each het genotype">