liyu95 / DeepSimulator

The first deep learning based Nanopore simulator which can simulate the process of Nanopore sequencing.
117 stars 40 forks source link

length of the reads #54

Open parsboy66 opened 1 year ago

parsboy66 commented 1 year ago

Thanks for the nice job! I was trying to simulate 100K cDNA reads which each read consist of: BC + UMI + Polya(t)+ 1500 (random seq) at first, i couldn't ran it all over 100K in one fasta file and at the end I spliced reads in 10 batches. so each read consists of 1570 ~ 1590 (variable length cos of the poly a/t) i set the -l or mean read length to 800 but by using this option, I got the error that says the input genome is null, while by just not using -l, it ran correctly.

i am wondering to know while the mean read length in my sample is around 800 while default in deepsimulatre is 8000 , does it affect the output??? and also is there anyways to run 100k in one fasta file?? (error was like list was too long ) .

i ran the instruction like this: ./deep_simulator.sh -B 2 -l 800 -i ../nano_fasta_small_9.fasta -n -1 -o ../output_9

Thanks