nanoporetech / medaka

Sequence correction provided by ONT Research
https://nanoporetech.com
Other
391 stars 73 forks source link

Suggestions of threshold of minimum mapping quality and read quality score #510

Closed hungweichen0327 closed 2 weeks ago

hungweichen0327 commented 2 weeks ago

Hi,

Do you have any suggestions about the threshold of minimum mapping quality? -M Minimum mapQ (default: None)

About the read quality score, do you suggest filtering the raw ONT reads (e.g. Q>=10) for medaka_consensus?

cjw85 commented 2 weeks ago

For read quality score, medaka is trained with "pass" reads from the sequencer. Deviating from this may provide sub-par results. Counter intuitively filtering to keep only the very highest quality data may not provide optimal results.

Mapping quality is as much to do with the similarity of reads to multiple locations in reference sequences than absolute accuracy of reads. We don't typically find it useful to adjust this filtering.

hungweichen0327 commented 2 weeks ago

I see. Many thanks for your kind suggestions!