Closed rwhetten closed 2 years ago
Hi @rwhetten,
You can deviate a little bit from the default value of that parameter, ideally using 1/ the exact median value, 2/ a greater value or 3/ a lower value (in that order of preference). The difference between 2/ and 3/ is that by using 2/, Ratatosk might pick up a little bit too much "signal" which I think in general is better than not enough.
In your case, median insert is 450 bp and default -i
is 500bp so I wouldn't bother changing the default value of -i
. However, in the case of 250bp median insert and default -i 500
, Ratatosk would definitely pick way too much signal which might confuse it so I would change the default value of -i
. To summarize, changing -i
is unlikely to impact the outcome for small deviations but will most definitely have a negative impact for larger deviations.
Yes and no. What I think you need here is the 4 commands correction of Ratatosk:
index -1
) builds the index for all the short read data you have, irrespective of the long reads in input.correct -1
) to correct each flowcell separately/in parallel.index -2
), you have to use in input all the corrected flowcells from the second command.correct -1
using the index from index -2
. Do not use the -L
option in this case.I hope this is clear enough :) Guillaume
Yes, that is perfect - thank you for the quick reply! I'll close the issue.
Thanks for a great tool! I have two questions to try to optimize my use of Ratatosk.