sanger-pathogens / Roary

Rapid large-scale prokaryote pan genome analysis
http://sanger-pathogens.github.io/Roary

MCL clustering #594

Open mweberr opened 1 year ago

mweberr commented 1 year ago

Hi, I am having trouble understanding how Roary clusters the sequences after the CD-HIT step. The paper states: "Sequences are then clustered with MCL (Enright et al., 2002), and finally, the pre-clustering results from CD-HIT are merged together with the results of MCL."

I have not worked with MCL before. Does MCL cluster the BLAST all-against-all network or the raw input sequences? Thank you for the clarification.

Best, Michael