edgi-govdata-archiving / web-monitoring-processing

Tools for access, "diff"-ing, and analyzing archived web pages
https://edgi-govdata-archiving.github.io/web-monitoring-processing
GNU General Public License v3.0
20 stars 20 forks source link

Score Cluster Quality [5] #17

Closed dcwalk closed 7 years ago

dcwalk commented 7 years ago

From @vidkum1 on February 11, 2017 23:56

Doing a grid search for the optimum number of clusters and clustering method (e.g. Kmeans Pam etc) based on cluster quality. We could use something like a similarity score to determine how similar the points are within the clusters.

Copied from original issue: edgi-govdata-archiving/filtration#8

dcwalk commented 7 years ago

This issue was moved to edgi-govdata-archiving/web-monitoring-processing#16

dcwalk commented 7 years ago

Sorry, looks like a duplicate got brought in here, closing