wwood / CoverM

Read coverage calculator for metagenomics
GNU General Public License v3.0

default rep seq picking method #173

Closed itsmisterbrown closed 11 months ago

itsmisterbrown commented 1 year ago

hi,

thanks for making such a great tool with myriad excellent functions. Sorry if I missed this somewhere, but how does CoverM choose a representative sequence during clustering, and is there a way to modify this?

thanks!

wwood commented 1 year ago

Hi,

Thanks for the kind words.

CoverM uses the galah code underneath (https://github.com/wwood/galah); you can see an explanation in the README there. Perhaps the easiest way to rank by an arbitrary metric is to provide the genomes without CheckM stats, but in an order that reflects your preference for each one being a rep. Alternatively, several ranking metrics are provided in galah / coverm.
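As a rough sketch of the "provide the genomes in your preferred order" approach, the Python below sorts genome paths by a user-supplied score and writes them one per line, best first. The file paths, scores, and function names are hypothetical and only there for illustration. How the ordered list is then handed over (for example through an option that reads genome paths from a file, which may be called `--genome-fasta-list`) is an assumption to check against `coverm cluster --help` or the galah README.

```python
from pathlib import Path


def order_genomes_by_preference(scores, descending=True):
    """Return genome paths sorted so the most preferred genome comes first.

    `scores` maps a genome path to a numeric value from any metric you like.
    Ties are broken by path so the output is deterministic.
    """
    sign = -1 if descending else 1
    ranked = sorted(scores.items(), key=lambda kv: (sign * kv[1], kv[0]))
    return [path for path, _ in ranked]


def write_genome_list(paths, out_file):
    """Write one genome path per line, preserving the given order."""
    Path(out_file).write_text("".join(f"{p}\n" for p in paths))


# Hypothetical paths and scores; substitute your own ranking metric.
example_scores = {
    "genomes/bin_a.fna": 0.91,
    "genomes/bin_b.fna": 0.99,
    "genomes/bin_c.fna": 0.75,
}

ordered = order_genomes_by_preference(example_scores)
print(ordered)
```

Calling `write_genome_list(ordered, "genomes_ordered.txt")` then produces a file that can be passed on in place of any CheckM-based quality input, so that the order alone expresses the preference.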

HTH, ben

itsmisterbrown commented 11 months ago

thanks for the speedy response, Ben. Sorry that my closing of this issue wasn't as fast. doing it now!