Closed tedyun closed 3 years ago
It turns out my binary must have had some issues - once I recompiled SHAPEIT4 and htslib with a clean source, I can get past the initialization step. Phasing still fails in the HMM computation step, but since that is a different issue I'll open a new bug after more investigation. Thank you for taking a look.
THIS ISSUE IS NOW OBSOLETE. PLEASE SEE ISSUE #52 INSTEAD.
=====
Hello,
I am trying to use SHAPEIT4 for phasing a large cohort (~2,000 samples) of whole genome sequencing data and I'm running into issues when trying to phase chromosome X VCF. As far I know, the input VCF doesn't use any special representation for chromosome X, but SHAPEIT4 only fails on chrX with this message
ERROR: No variants to be phased in [...vcf.gz]
while it works well for other chromosomes.This issues is reproducible with a public 30x WGS release of 1000 Genomes Project phase 3 by New York Genome Center. You can either download the variant calls by DeepVariant+GLnexus at this link in Google Cloud (+ .tbi), or the calls by GATK at this link in Google Cloud (+ .tbi) or official FTP, and run the following command (with the genetic map file included in SHAPEIT4):
Full output log can be found here. I'm running this in a Debian/Ubuntu machine with intel Xeon CPU. SHAPEIT4 binary was compiled with htslib v1.9.
I have tried the following to fix this issue but all failed:
--pbwt-mdr
value to a higher value.chrX
instead ofX
to match the chromosome name in VCF.chrX
tochr1
in both VCF and the genetic map file../.
) to0/0
.--map
flag to use the default flat linkage structure.I'm using SHAPEIT v4.1.2 for this run, but I also tried v4.2.0 and it failed with the same error message. Just as a reference, I also tried phasing the same VCF with Eagle v2.4.1 and it did work without an error.
I'm not sure if I missed anything obvious but I ran out of ideas to try - it'd be great if I can get some advice on where else I can look to investigate this issue.
Thank you very much for your work in developing this awesome software.
Best, Ted