metagentools / GraphBin2

☯️🧬 Refined and Overlapped Binning of Metagenomic Contigs Using Assembly Graphs
https://graphbin2.readthedocs.io/en/latest/
BSD 3-Clause "New" or "Revised" License

Should GraphBin/GraphBin2 operate on the contigs of metaFlye assemblies? #9

Closed schorlton closed 2 months ago

schorlton commented 2 years ago

Hi @Vini2 ,

Thanks for the great tool. This is more of a conceptual question. Your manuscripts all discuss binning of contigs, which makes sense to me (especially for paired-end Illumina data, where read pairs can link contigs into scaffolds); however, the metaFlye implementations operate on edges in the assembly graph. My understanding of the metaFlye assembly graph is that a contig can be made of two connected edges (e.g. A and B) and loop back to one of them, so that the contig is A -> B -> A. This additional information (i.e. the longer, more complete contig) may then help the binning software bin the contig, and also lead to more complete bins (otherwise the bin may only have one copy of A instead of two).
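To make the A -> B -> A case concrete, here is a minimal Python sketch comparing how much sequence ends up in a bin when edges are binned versus when the whole contig is binned. The edge names, lengths, and helper functions are hypothetical and not taken from metaFlye or GraphBin2, and overlaps between adjacent edges are ignored for simplicity.

```python
from collections import Counter

# Hypothetical edge lengths (bp) and contig path; illustrative values only.
edge_lengths = {"A": 5000, "B": 20000}
contig_path = ["A", "B", "A"]  # the contig traverses A, then B, then A again


def edge_level_length(path, lengths):
    """Sequence length in the bin if each distinct edge is binned once."""
    return sum(lengths[edge] for edge in set(path))


def contig_level_length(path, lengths):
    """Sequence length in the bin if the full contig is binned, repeats included."""
    return sum(lengths[edge] for edge in path)


# Number of times each edge occurs along the contig path.
copies = Counter(contig_path)

print("copies of A in contig:", copies["A"])
print("edge-level bin length:", edge_level_length(contig_path, edge_lengths))
print("contig-level bin length:", contig_level_length(contig_path, edge_lengths))
```

Under these assumed lengths, the edge-level bin holds one copy of A while the contig-level bin holds two, which is the completeness difference described above.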

Would it not make sense for GraphBin/GraphBin2 to operate on contigs, propagating labels across scaffold links? Unfortunately/fortunately, there are frequently very few or even no scaffold links in metaFlye assemblies (they are listed in the scaffolds_links.txt file in the 30-contigger folder of the metaFlye output), and I therefore expect the effect of GraphBin/GraphBin2 to be smaller than for Illumina assemblies. Thanks for your thoughts!
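As a rough illustration of what propagating labels across scaffold links could look like, the sketch below spreads existing bin labels to unlabelled contigs through a breadth-first walk over the links. This is not GraphBin/GraphBin2's actual algorithm; the function name, contig names, and the in-memory link list are all hypothetical, and no attempt is made to parse scaffolds_links.txt.

```python
from collections import deque


def propagate_labels(labels, links):
    """Copy bin labels to unlabelled contigs reachable via scaffold links.

    labels: dict mapping contig name -> bin label (initial binning result)
    links:  iterable of (contig, contig) pairs, treated as undirected
    """
    # Build an undirected adjacency map from the link pairs.
    adjacency = {}
    for left, right in links:
        adjacency.setdefault(left, set()).add(right)
        adjacency.setdefault(right, set()).add(left)

    result = dict(labels)
    queue = deque(labels)  # start from every contig that already has a label
    while queue:
        current = queue.popleft()
        for neighbour in sorted(adjacency.get(current, ())):
            if neighbour not in result:  # never overwrite an existing label
                result[neighbour] = result[current]
                queue.append(neighbour)
    return result


# Hypothetical input: two labelled contigs and a short chain of links.
labels = {"contig_1": "bin_1", "contig_4": "bin_2"}
links = [("contig_1", "contig_2"), ("contig_2", "contig_3")]
propagated = propagate_labels(labels, links)
print(propagated)
```

A contig that appears in no link can never receive a label this way, so with few or no scaffold links the propagation step has little to work with.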

-Sam

Vini2 commented 2 months ago

Hi @schorlton,

GraphBin/GraphBin2 now support original contigs from Flye.

Thanks! Closing this issue now. Please re-open if needed.