marbl / merqury

k-mer based assembly evaluation
Other
280 stars 19 forks source link

USE case: long accurate reads #117

Closed Axze-rgb closed 8 months ago

Axze-rgb commented 9 months ago

Hello,

More and more assemblies come based solely on long reads such as PacBio or Nanopore. Nanopore used to be noisy, but it seems they are not anymore and have an accuracy > 99.99%. Could merqury be reliably used with that kind of reads ? I understand with noisy ONT we would get errors with Kmers, but I don't know what could be wrong using accurate long reads.

would you recommend merqury in this case? It seems KAT works ok with PacBio reads, but I prefer Merqury because your plots are easier to understand for me.

Cheers

arangrhie commented 9 months ago

Hello @Axze-rgb,

It's probably ok to use the ONT kmers for spectra-cn (like KAT) type of analysis. The QV would be slightly inflated though, as a result of platform specific errors observed in the read and in the assembly. Similarly, HiFi assemblies get higher QV when measuring QV from HiFi kmers compared to Illumina kmers. Usually I see decrease of QV of a polished ONT assembly when measuring from ONT kmers.

To put it in other words, 99.99% accuracy is translated as Q40, which means measuring an assembly that is over Q40 would less reliable.

Cheers, Arang