Waikato / moa

MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.
http://moa.cms.waikato.ac.nz/
GNU General Public License v3.0
610 stars 353 forks source link

made kmeans take the weight of the microclusters into account #116

Closed sybernix closed 4 years ago

sybernix commented 6 years ago

since the kmeans is intended to be used on the micro clustering results from ClusTree or CluStream etc the data points to the kmeans are actually micro clusters which has an associated weight. A dominant micro cluster which has more points in it should be given more weightage when calculating new centers. Also ClusTree handles concept drift by reducing the weight of the micro-clusters. So it is important to use the weights also when finding centers.

sybernix commented 4 years ago

Closing as this PR is old