MicrobialDarkMatter / GraphMB

MIT License
Negative edges in neighborhoods #27

Closed yamada321 closed 1 year ago

yamada321 commented 1 year ago

Hi,

Thank you for the wonderful tool. I wonder how GraphMB handles regions of an assembly graph that are not fully phased or cleaned. For example, from my understanding, GraphMB will not try to split the edges of a simple bubble apart during neighborhood sampling and graph embedding. Intuitively, if the two branches also happen to share similar read coverage, this might lead to a contaminated bin. Does this happen in practice, or does GraphMB's algorithm already have mitigations?

When binning phased long-read assemblies, I see many contaminated bins (from other binners as well). I'm trying to figure out why, and whether anything can be done about it.

Thank you!

AndreLamurias commented 1 year ago

Hi, sorry for the delayed response. GraphMB takes the assembly graph as it is, and it does not try to fix any assembly issues. So, as you said, yes, this can be a source of contamination.
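
For anyone who wants to check their own graph for this situation before binning, here is a minimal sketch of how the case described above (a simple bubble whose two branches have similar coverage) could be flagged. It is not part of GraphMB and does not use its API. The function names, the `max_ratio` threshold, and the toy contig names and coverages are all hypothetical, and the graph is treated as a plain list of directed contig-to-contig edges, ignoring the strand orientation that a real GFA file carries.

```python
from collections import defaultdict


def find_simple_bubbles(edges):
    """Return (source, branch_a, branch_b, sink) for every simple bubble.

    Assumption: a simple bubble is a source with exactly two successors,
    each reachable only from that source and each leading only to the
    same sink.
    """
    succ = defaultdict(set)
    pred = defaultdict(set)
    for u, v in edges:
        succ[u].add(v)
        pred[v].add(u)

    bubbles = []
    for s in sorted(succ):  # sorted() copies the keys, so lookups below are safe
        if len(succ[s]) != 2:
            continue
        a, b = sorted(succ[s])
        # Both branches must be entered only from the source.
        if pred[a] != {s} or pred[b] != {s}:
            continue
        # Both branches must leave to one shared sink.
        if len(succ[a]) != 1 or succ[a] != succ[b]:
            continue
        (t,) = succ[a]
        if t == s:
            continue
        bubbles.append((s, a, b, t))
    return bubbles


def risky_bubbles(edges, coverage, max_ratio=1.5):
    """Return branch pairs whose coverages differ by at most max_ratio.

    max_ratio is an arbitrary illustrative threshold, not a GraphMB setting.
    """
    risky = []
    for _, a, b, _ in find_simple_bubbles(edges):
        lo, hi = sorted((coverage[a], coverage[b]))
        if lo > 0 and hi / lo <= max_ratio:
            risky.append((a, b))
    return risky


# Toy graph: c1 splits into c2 and c3, which rejoin at c4.
edges = [("c1", "c2"), ("c1", "c3"), ("c2", "c4"), ("c3", "c4"), ("c4", "c5")]
coverage = {"c1": 40.0, "c2": 21.0, "c3": 19.0, "c4": 40.0, "c5": 40.0}

print(risky_bubbles(edges, coverage))
```

Branch pairs reported this way are only candidates for manual inspection: neither the graph structure nor the coverage separates them, so a binner that takes the graph as it is may place both in the same bin.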