Computational-Content-Analysis-2018 / 19-Jan-Flat-Clustering

Manning, Christopher, Prabhakar Raghavan and Hinrich Schütze. 2008. “Flat Clustering” and “Hierarchical Clustering.” Chapters 16 and 17 from Introduction to Information Retrieval.
https://github.com/Computational-Content-Analysis-2018

Hyperparameters - On distance measures and clustering parameters #14

Open Moloq opened 6 years ago

To do clustering, we need a distance measure, and last week's assignment introduced several. However, crossing those options with the clustering parameters themselves (for example, the number of clusters for some algorithms) gives a whole matrix of options. How do researchers commonly decide among them? Do they first settle on a distance measure? Or do they generate the full matrix of results and then choose the combination that they feel provides the best clusters?
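To make the "matrix of options" concrete, here is a minimal sketch of what generating it could look like. Everything in it is an assumption for illustration, not something from the assignment or the reading: the toy `points`, the three distance functions, the tiny hypothetical `k_medoids` helper (chosen only because it accepts an arbitrary distance function), and the mean silhouette score as one possible internal criterion for comparing cells.

```python
import math
from itertools import product

# Three candidate distance measures (illustrative choices).
def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / norm

def k_medoids(points, k, dist, max_iter=50):
    """Hypothetical helper: a tiny k-medoids that works with any distance."""
    n = len(points)
    d = [[dist(points[i], points[j]) for j in range(n)] for i in range(n)]

    def assign(medoids):
        # Label each point with the index of its nearest medoid.
        return [min(range(k), key=lambda c: d[i][medoids[c]]) for i in range(n)]

    # Deterministic farthest-first initialisation, starting from point 0.
    medoids = [0]
    while len(medoids) < k:
        medoids.append(max(range(n), key=lambda i: min(d[i][m] for m in medoids)))

    for _ in range(max_iter):
        labels = assign(medoids)
        new_medoids = []
        for c in range(k):
            members = [i for i in range(n) if labels[i] == c]
            if not members:  # keep the old medoid if a cluster empties out
                new_medoids.append(medoids[c])
                continue
            # New medoid: the member with the smallest total distance to the rest.
            new_medoids.append(min(members, key=lambda i: sum(d[i][j] for j in members)))
        if new_medoids == medoids:
            break
        medoids = new_medoids
    return assign(medoids), d

def silhouette(labels, d):
    """Mean silhouette score computed from a precomputed distance matrix."""
    n = len(labels)
    scores = []
    for i in range(n):
        own = [j for j in range(n) if labels[j] == labels[i] and j != i]
        if not own:
            scores.append(0.0)  # common convention for singleton clusters
            continue
        a = sum(d[i][j] for j in own) / len(own)
        b = min(
            sum(d[i][j] for j in range(n) if labels[j] == c) / labels.count(c)
            for c in set(labels) if c != labels[i]
        )
        scores.append((b - a) / max(a, b))
    return sum(scores) / n

# Toy data: three small, well-separated groups in 2D (made up for this sketch).
points = [
    (1.0, 1.0), (1.3, 0.9), (0.8, 1.2),
    (1.0, 8.0), (1.2, 8.3), (0.7, 7.9),
    (8.0, 1.0), (8.3, 1.2), (7.9, 0.7),
]

metrics = {"euclidean": euclidean, "manhattan": manhattan, "cosine": cosine}
grid = {}
for (name, dist), k in product(metrics.items(), [2, 3, 4]):
    labels, d = k_medoids(points, k, dist)
    grid[(name, k)] = silhouette(labels, d)

for key, score in sorted(grid.items()):
    print(key, round(score, 3))
```

Within one distance measure, the scores can be compared across values of k. Comparing across distance measures is shakier, since each silhouette is computed in its own metric, so a table like this is probably better read as a starting point for inspecting the clusters than as a way of picking a winner automatically.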