Open xujialupaoli opened 4 weeks ago
Hello,
Yes, you would expect to see much shorter reads or slightly longer reads due to chimeric reads and various mutations. If you enable the chimeric read simulation scenario, the program will randomly choose some reads and treat them as chimeric reads, i.e., reads that expand the restriction cutting sites and sequence into the interacting fragment from a distal region. Before alignment, the DNA sequences from the distal interacting fragment will be trimmed. Therefore, the simulated chimeric reads are like the real chimeric reads, shorter than the original read length. As for longer reads, it is possibly due to insertion mutation added to the simulated sequence.
Hope the above makes sense to you.
Thanks, Ye
OK, thank you very much for your patience in explaining!
Hello! Thank you for providing such a useful software.
My original hic data are all paired reads of equal length of 150bp. I hope that the simulated data are also regular 150bp paired reads.
But when I use freehic for simulation, the reads I get are long and short, unevenly distributed reads. Please help me see how I should simulate?
This is my FreeHiC_parameters file:
This is my original sequencing hic data:![image](https://github.com/yezhengSTAT/FreeHiC/assets/96410022/fc0b850e-4985-4c10-b485-b6300f92fba7)
This is the hic reads with uneven length distribution that I simulated:![image](https://github.com/yezhengSTAT/FreeHiC/assets/96410022/bee689aa-dbbf-4509-a1bc-f9782f1dbdec)
Looking forward to your reply!