Closed Yijun-Tian closed 3 weeks ago
Hi, @Yijun-Tian
I performed some tests and did not encounter this error message. Could you please provide the version of LongPhase you are using, and the SHAPEIT4 VCF header you used?
Thanks
Thanks @twolinin ! I am using longphase 1.5.1 for the haplotagging:
shapeit4]$ longphase --version
Version: 1.5.1
Usage: main <command> [options]
phase run phasing algorithm.
haplotag tag reads by haplotype.
modcall convert bam file to modification vcf file.
Here is the header of SHAPEIT4 VCF:
##fileformat=VCFv4.2
##FILTER=<ID=PASS,Description="All filters passed">
##fileDate=16/08/2024 - 15:36:24
##source=shapeit4.1.3
##contig=<ID=chr22>
##INFO=<ID=AF,Number=A,Type=Float,Description="Allele Frequency">
##INFO=<ID=AC,Number=1,Type=Integer,Description="Allele count">
##INFO=<ID=CM,Number=A,Type=Float,Description="Interpolated cM position">
##FORMAT=<ID=GT,Number=1,Type=String,Description="Phased genotypes">
##bcftools_viewVersion=1.17+htslib-1.17
##bcftools_viewCommand=view -h s246.chr22.snv.vcf.gz; Date=Fri Aug 23 09:13:21 2024
#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT Sample
Hi @Yijun-Tian
Please modify your header
before
##contig=<ID=chr22>
after
##contig=<ID=chr22,length=242193529>
Thanks
Thanks, @twolinin. Haptag now works with the modification you noted.
Hi, To further improve the read level phasing performance, we use the population based (genotype based) haplotype reference to adjusting the longphase phase vcf in SHAPEIT4. Here is the output VCF.
However, when I use the VCF from SHAPEIT4 for haplotagging, error happens like below:
Since the SHAPEIT4 output only 1 phase set ideally, could you clarify what INFO/FORMAT fields are needed for the haplotag step? I was thinking to define these field by myself so that can use haplotag to label each read for downstream analysis.
Thanks,