rrwick / Metagenomics-Index-Correction

GNU General Public License v3.0
79 stars 9 forks source link

dereplication thresholds other than 0.005 #11

Open chassenr opened 4 years ago

chassenr commented 4 years ago

Hi, I was wondering if you also tried dereplication threshold other than 0.005 (i.e. something between 0.005 and the original 0.05 from GTDB)? I am trying to find the best compromise between good classification results and size of the database. E.g. do you think that increasing the dereplication threshold to 0.01 would drastically lower classification performance?

Thanks!

Cheers, Christiane