I don't understand what the readID naming in the output means? In addition, why do some reads start with "@SIM3C:3C" and some start with "@SIM3C:WGS:", are there any differences between these reads? Do these differences lead to different naming meanings for the subsequent "WGS hap1:3356458..3356840:R" and "HIC hap1:22115 hap1:195624"?
Looking forward to your reply!
Thank you for providing such a useful software. I used sim3C to simulate a hic data for my E. coli genome. Use the following code:
sim3C --profile mycom.txt -n 5000000 -l 150 -e DpnII -m hic /home/work/jialu/tetraploid_assembly/simulate_data/strain_fq/ref_genome/hap4.fa hap4_R1.fq hap4_R2.fq
The content of mycom.txt is as follows:
Here is the readID of the simulated data output:
I don't understand what the readID naming in the output means? In addition, why do some reads start with "@SIM3C:3C" and some start with "@SIM3C:WGS:", are there any differences between these reads? Do these differences lead to different naming meanings for the subsequent "WGS hap1:3356458..3356840:R" and "HIC hap1:22115 hap1:195624"? Looking forward to your reply!