ythuang0522 / homopolish

High-quality Nanopore-only genome polisher
GNU General Public License v3.0
65 stars 12 forks source link

Alternatives to Evaluating Assembly Quality without Reads #46

Closed afvrbanac closed 2 years ago

afvrbanac commented 2 years ago

Hello, I have R10.4 nanopore data (Bifidobacteria isolate genomes) that I assembled with trycycler (using Flye, miniasm+minipolish, raven), polished with Medaka, and then polished with Homopolish. They are all closed, single main contigs with some strains containing plasmids.

I noticed that methods to evaluate assembly quality that use the raw reads (i.e Inspector) report much worse assembly quality after using Homopolish. I assume this is because the reads no longer map well to the assembly in areas that Homopolish has corrected. Are there alternative evaluation tools that are recommended to use with assemblies processed by Homopolish besides CheckM? I unfortunately do not have reference genomes for my strains as they are novel isolates so there is no way for me to evaluate quality in that manner.

ythuang0522 commented 2 years ago

Hi @afvrbanac, I am a bit skeptical of the inspector which used read consensus for evaluation. The real issue of ONT is systematic errors, in which the basecalling algorithm consistently miscalled at the same locus for all reads. Homopolish was designed for solving this obstacle by looking at indel conservation in homologs, which will against the read consensus.

image

Without a reference, the completeness of core genes is also the way we did for reference-free evaluation (i.e., CheckM/Busco/Ideel). Unfortunately, they are not sensitive enough to reflect the accuracy when passing Q40, and I believed all your genomes should be close to Q50 (we had a recent test using Illumina showing consistent Q50 on R9.4 after Homopolish). In our experience, this is good enough for most studies. Hence, I would suggest you just continue with the downstream analysis. When any important mutation related to your study is found, you can always double confirm by IGV later. I hope this helps.

afvrbanac commented 2 years ago

Awesome, this is very helpful--thank you! Really appreciate the graphic and extra info.