Closed arshajii closed 7 years ago
Hi, would you please provide me all the parameter you've been using, thanks.
I used just -r hg19.fa -p sim -x 1 -o
.
I see, that's because you simulated too few reads so the simulator keeps sweep through the 3G slots array to search for usable reads as requested by a formula on three other parameters -f, -t and -m. With this few reads, please reduce -f, -t and -m to their lowest possible values.
Thanks for your help. I tried -x 1 -m 4 -t 3 -o
and it seems to have worked. Just to clarify, the first 16bp of the R1 FASTQs are the barcode, right?
The two one issues I'm facing now are:
first 16bp of the R1 FASTQs are the barcode, right? RE: correct.
you have only generated 1M reads, which means that you need to sweep for 3000bp in average to get the next available read randomly generated from the human genome. You can change the code from
to
in order to fulfill your parameters, the side effect is that the problem will run much slower.
Notice that for the default parameter value 600M pairs for a human genome, which converts to ~40-fold, the original value 10 is safe and efficient.
a suggestion for testing out the simulator without working on huge genomes is to use smaller genomes such as S. Cerevisiae or Arabidopsis. 10X's supernova assembler costs 2 hours to finish the assembly of 40x of arabidopsis data.
Changing the CHK_PREV_SLOT_LIMIT
parameter seems to have worked. Thanks again for your help!
Is it possible to allow smaller simulations (i.e. smaller
-x
)? At the moment, I receive the messageI have tried using
-o
(with-x 1
and-x 5
), but the program seems to hang:My reference is hg19.