Closed hungweichen0327 closed 2 weeks ago
Medaka is intended for use with simplex data and those determined as "pass" by the sequencer. Read length isn't typically a consideration.
Thank you for the quick reply. Do you mean using whole simplex data (dx:0 and dx:-1) would be better?
The error modes of duplex reads are different from simplex, so medaka is likely to misinterpret them. There's a minor secondary issue that medaka makes no particular allowance for dependence between reads (so providing both parts of a duplex read might trip it up).
Many thanks for the suggestion!
Hello,
Thank you for the useful software. I have two questions.
Should I use whole reads (dx:1, dx:0, and dx:-1) for
medaka_consensus
? Or both duplex and simplex reads without duplex offspring(dx:0 )?Did I need to filter the reads (like read length > 1kb and quality score Q>10)? Or use the raw reads?
Thank you.