Open gsc74 opened 1 week ago
Hi @gsc74, in principle, I think that this approach makes sense. The only thing I'm not sure about is how well the genotyping would work for a haploid sample, as the model underlying PanGenie assumes a diploid genome and we have never specifically evaluated how well it performs for haploid data. But I think it is worth a try. Probably the rate of heterozygous genotypes PanGenie predicts would be an indication of how good the genotyping works.
What is your question? @eblerjana I am working on reconstructing a haploid haplotype using the imputed genotypes from PanGenie. Currently, I am using the following commands:
In the above commands, I am using haploid reads to obtain genotypes, then filtering the heterozygous variants, and finally using the filtered genotypes to reconstruct the haploid haplotype from the imputed filtered genotypes.
My question is: Is this the correct way to use PanGenie to reconstruct haplotypes? The input VCF is a phased diploid VCF generated by the minigraph-cactus pipeline and preprocessed with the "prepare-mc-vcf" pipeline.
If applicable: which version of PanGenie are you using? v3.1.0
If applicable: how did you run PanGenie? Please provide the command lines used. Did you run it using Singularity? I've used conda to install PanGenie
If applicable: what data are you running PanGenie on? Which species are you analyzing? Which input reads are used? How does the input VCF look like (number of input samples, how was it produced etc.)? MHC VCF file generated using Minigraph-Cactus pipeline and preprocessed using "prepare-mc-vcf" pipeline.