xychang / RecursiveHierarchicalClustering

Use iterative feature pruning to identify hierarchical clusters.
http://sandlab.cs.ucsb.edu/clickstream/index.html
GNU General Public License v3.0

Cluster quality results? #15

Open gmobile15 opened 4 years ago

gmobile15 commented 4 years ago

Hi,

Massive thanks for this great tool; it works absolutely fine with my data. In the paper, you mention that you experimented with different values of k when creating the k-gram sequences. What metric would you recommend for evaluating the resulting clusters?
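To be concrete about what I mean by k-gram sequences, here is a minimal sketch. The function name `kgrams` and the event names are my own hypothetical choices for illustration, not taken from the repository, and the tool's actual preprocessing may differ.

```python
from collections import Counter


def kgrams(events, k):
    """Count every run of k consecutive events in one clickstream.

    Hypothetical helper for illustration only; not the repository's code.
    """
    if k < 1:
        raise ValueError("k must be >= 1")
    # Slide a window of width k over the sequence and count each window.
    return Counter(tuple(events[i:i + k]) for i in range(len(events) - k + 1))


# Made-up session, purely illustrative.
session = ["login", "search", "view", "search", "view"]

for k in range(1, 6):
    print(k, dict(kgrams(session, k)))
```

A sequence shorter than k simply yields no k-grams, so larger k values leave short sessions with little or no signal.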

For example, if I experiment with k in [1, 2, 3, 4, 5], I would have five sets of results (assuming I don't include time gaps at this stage, as that would double the number of results). How would I decide which clustering is the best? Is it simply the modularity score? If so, each cluster has its own modularity value, but is there a way to amalgamate these into a single score for an entire set of results?
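The naive amalgamation I have in mind is a size-weighted mean of the per-cluster modularity values, computed once per value of k. This is only a sketch of my own idea: `weighted_mean_modularity` is a hypothetical helper, the numbers are made-up placeholders rather than real results, and I am not sure that averaging modularity across clusters is statistically meaningful, which is really what I am asking.

```python
def weighted_mean_modularity(clusters):
    """Size-weighted mean of per-cluster modularity for one clustering run.

    clusters: iterable of (size, modularity) pairs.
    Hypothetical helper; the weighting scheme is an assumption, not a
    recommendation from the paper or the tool.
    """
    clusters = list(clusters)
    total = sum(size for size, _ in clusters)
    if total == 0:
        raise ValueError("no users in any cluster")
    return sum(size * q for size, q in clusters) / total


# Placeholder values, NOT real results: k -> [(cluster size, modularity), ...]
results_by_k = {
    1: [(60, 0.30), (40, 0.50)],
    2: [(50, 0.40), (50, 0.60)],
}

scores = {k: weighted_mean_modularity(c) for k, c in results_by_k.items()}
best_k = max(scores, key=scores.get)
print(scores, best_k)
```

Weighting by cluster size keeps a tiny cluster with a high modularity value from dominating the comparison, but other aggregations (for example, an unweighted mean) would be just as easy to swap in.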