ParBLiSS / FastANI

Fast Whole-Genome Similarity (ANI) Estimation
Apache License 2.0
368 stars 66 forks source link

many-to-many expected output? #55

Closed mldillon-LBL closed 4 years ago

mldillon-LBL commented 4 years ago

Hello,

I am supplying query list and reference list that are identical to see if my bins match themselves. What I get out as ANI > 80% are the exact matching bins. But, I can’t tell from the log exactly what it’s testing, so it’s difficult for me to know if it’s testing bins against each other that I would expect to have > 80% ANI.

For example, Nome_Creek.metabat_bin.100.genus_NA__0.57.cleaned.fa and Nome_Creek.metabatbin.100.orderSolirubrobacterales__0.61.cleaned.fa are both query/reference bins. So, is fastani not comparing these two? Or are they really < 80% identical? Log file only lists the reference and queries but doesn’t exactly say what it’s comparing.

log.txt

Maybe just need more explicit information about many-to-many expected output?

Thanks!

Megan

cjain7 commented 4 years ago

Hi, based on the log file, FastANI is comparing 34 genomes against the 34 genomes. Let me know if that doesn't answer your question.

mldillon-LBL commented 4 years ago

It does answer my question, but what do you see in the log file that indicates that? Just want to know how to read this correctly.

cjain7 commented 4 years ago

All genomes in the query list are always compared against all genomes in the reference list.