refresh-bio / vclust

Fast and accurate tool for calculating Average Nucleotide Identity (ANI) and clustering virus genomes and metagenomes
GNU General Public License v3.0
44 stars 1 forks source link

K-mer and ANI computation between multi-contigs bins #14

Open HanJune-Kim opened 1 month ago

HanJune-Kim commented 1 month ago

Hi thanks for the great tool. I am trying to calculate ANI between multi-contigs bins, where each viral bin consists of multiple contigs. However, length estimate is different from the original one, also ani results from vclust are far different from the outputs from nucmer. vclust seems like a tool only applicable to the singleton viral genome. Is it right?

aziele commented 4 weeks ago

Hi, thanks for your feedback and for using Vclust!

You are correct - currently, Vclust is designed for single-contig viral genomes, so there are discrepancies when applying it to multi-contig bins. However, we recognize the need to handle multi-contig viral bins/genomes, and we're working on adding this feature in a future update.

Best, Andrzej