meyer-lab / DDMC

Clusters phosphoproteomics data based on a combination of the sequence information and abundance changes over conditions.
https://asmlab.org
1 stars 1 forks source link

Notes space for comparing different clustering methods #508

Closed aarmey closed 2 years ago

aarmey commented 2 years ago

Clustering methods should be compared using adjusted_mutual_info_score. It's adjusted for random labelling to always be zero. It's also symmetric and independent of permutation.

Methods:

All hard methods have the X.labels_ attribute.

My thinking is we can run all of these, then calculate the pairwise distances by adjusted_mutual_info_score. Those distances can then be fed into MDS (precomputed) to visualize how similar they all are to each other. I bet they all end up pretty similar and GMM matches DDMC at weight 0. DDMC then becomes quite different and higher weights.

mcreixell commented 2 years ago

Sounds great I'll work on that. Thank you.

mcreixell commented 2 years ago

Just read you're available to implement this. Thanks!