Open Wong718 opened 3 weeks ago
To further illustrate the problem, I attach the haplotype contacts map of scNanoHiC below. We could observe that a clear diagonal between the PAT and MAT of the same chromosome, which is not consisted with our knowledge about contacts between alleles.
Hello Professor Li. It's a wonderful tool for 3D genome analysis. Recently, I am dealing with the scNanoHiC data with bwa-sw and hickit, and I found that the haplotype phasing results were confusing. And I want to ask if hickit could deal with the nanopore long sequencing data as properly as NGS. The sam file generated by bwa-sw is looked like as follows, where a single read may map to multiple positions and generated several records with the same id.
Then, I applyed the sam2seg function to generated the .seg file as follows, by providing the corresponding .vcf file with -v parameter.
However, after I generated the .pairs file with modifed seg2pairs function (I have modified this function to generate multi-contact for one read), I observed that the trans-parental contacts in the same chromosome were more than expected, and I want to figure out why. In general, I want to ask if the hickit::sam2seg functon could deal with the .sam files generated by bwa-sw and make correct phasing decision, and what will sam2seg do if a long mapped read has SNPs derived from opposite phases.