Open erikrikarddaniel opened 1 year ago
I also know of: https://github.com/wwood/galah that might do something similar
I have a module for Galah that I wrote for a personal pipeline processing the output of mag, as well as a process that takes the busco_summary.tsv file and converts it to the format required by Galah to use the completeness/contamination information.
I had planned to add them to mag at some point, but haven't found time yet.
:tada: awesome!
Description of feature
dRep takes a set of genomes, with CheckM data, and dereplicates them to produce a set of non-overlapping genomes at a specified ANI. As this is basically just pointing to certain MAGs as the representatives of clusters, the output could possibly be summarised by a column in the bin summary table:
dRep
. If the column says e.g.95
and hence indicates the ANI, one could potentially run dRep multiple times. There is no existing nf-core module.