Run a grid search over both the number of clusters and the clustering method (e.g. k-means, PAM) and pick the combination with the best cluster quality. We could use something like a similarity score to measure how similar the points within each cluster are.
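A rough sketch of what that grid search could look like, in plain Python so it runs without dependencies. Several things here are assumptions and not part of the proposal above: the silhouette score stands in for the "similarity score", `kmedoids` is a simplified alternating k-medoids and not the full PAM swap procedure, centers are seeded with a deterministic farthest-first rule, and all function names are made up for illustration.

```python
import math


def farthest_first(points, k):
    """Deterministic seeding (an assumption): start at points[0], then keep
    adding the point farthest from every center chosen so far."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    return centers


def assign(points, centers):
    """Label each point with the index of its nearest center."""
    return [min(range(len(centers)), key=lambda j: math.dist(p, centers[j])) for p in points]


def _fit(points, k, update, iters=50):
    """Shared loop: assign points, recompute each center with `update`."""
    centers = farthest_first(points, k)
    for _ in range(iters):
        labels = assign(points, centers)
        new_centers = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            # An empty cluster keeps its previous center.
            new_centers.append(update(members) if members else centers[j])
        if new_centers == centers:
            break
        centers = new_centers
    return assign(points, centers)


def kmeans(points, k):
    """Lloyd's k-means: each center is the mean of its members."""
    return _fit(points, k, lambda m: tuple(sum(c) / len(m) for c in zip(*m)))


def kmedoids(points, k):
    """Alternating k-medoids (simplified stand-in for PAM): each center is
    the member with the smallest total distance to the other members."""
    return _fit(points, k, lambda m: min(m, key=lambda x: sum(math.dist(x, q) for q in m)))


def silhouette(points, labels):
    """Mean silhouette coefficient in [-1, 1]; higher means tighter,
    better separated clusters. Returns -1.0 for fewer than two clusters."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        return -1.0
    total = 0.0
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            continue  # singletons contribute 0 by convention
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        b = min(sum(math.dist(p, q) for q in other) / len(other)
                for m, other in clusters.items() if m != lab)
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(points)


def grid_search(points, methods, k_values):
    """Score every (method, k) pair; ties go to the pair tried first."""
    results = [(silhouette(points, fit(points, k)), name, k)
               for name, fit in methods.items() for k in k_values]
    return max(results, key=lambda r: r[0]), results
```

Calling `grid_search(points, {"kmeans": kmeans, "kmedoids": kmedoids}, range(2, 11))` with `points` as a list of numeric tuples returns the best `(score, method, k)` triple plus the full score table. The silhouette step is quadratic in the number of points, so on a large dataset it would probably need to be computed on a sample.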
Copied from original issue: edgi-govdata-archiving/filtration#8
From @vidkum1 on February 11, 2017 23:56