Closed YPGG1234 closed 3 years ago
In your VCF, the "#CHROM..." line has 10 fields but your VCF lines only have 8 fields.
If SURVIVOR doesn't require population genotypes (or other population metrics) for merging your records, then merge first before doing genotyping. If it does require population information...it depends on your computational resource. I may still recommend merging first, because, with many different records for one SV, you'll need to perform genotyping on each sample multiple times. This can be computationally expensive.
Thank you for your prompt reply!
Hello,
Thanks for your contribution to Paragraph, it's a nice software for population analysis. Recently I am using Paragraph to genotyping some animals Illumina data, but when I run the
multigrmpy.py
, I got the mistake:My VCF file like this, it was generated from Sniffles and processed by custom script in order to be better used by Paragraph:
I have no idea about this case cause I can't find any information about the "Number of samples" either in VCF body or header, could you please help me?
I have one other question: Suppose I have ten samples PacBio HiFi data, should I first use Sniffles to call SVs of each of them following by the "Population-scale genotyping" pipeline, after all of them are finished, merge all VCF files into one big VCF file; Or first merge all Sniffles VCFs into one (Using SURVIVOR), then follow the "Population-scale genotyping" pipeline. Which one is better?
Looking froward to your kind reply.