SOCR / SOCRAT-issues

Project management for SOCRAT

implement Elkan's algorithm for k-means #166

Closed alxndrkalinin closed 6 years ago

alxndrkalinin commented 7 years ago

Elkan's algorithm is an accelerated variant of standard k-means (Lloyd's algorithm) that uses the triangle inequality to avoid many distance calculations when assigning points to clusters.

Note: for the implementation, you can use the scikit-learn Cython code as a reference, and use the ClusterMetrics service to calculate distances.
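
To make the pruning idea concrete, here is a minimal Python sketch of a single assignment pass. It is not the full Elkan algorithm: it applies only the center-to-center test (if d(c_best, c_j) >= 2 * d(x, c_best), then c_j cannot be closer to x than c_best) and does not carry upper and lower bounds across iterations as the complete algorithm does. The names `assign_with_pruning` and `euclidean` are hypothetical, and the `dist` parameter only stands in for whatever distance function is actually used; it does not reflect the ClusterMetrics API.

```python
import math


def euclidean(a, b):
    """Plain Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def assign_with_pruning(points, centers, dist=euclidean):
    """Assign each point to its nearest center, skipping distance
    computations that the triangle inequality proves unnecessary.

    Returns (labels, skipped), where skipped counts the point-to-center
    distances that were never computed.
    """
    k = len(centers)
    # Center-to-center distances, computed once per assignment pass.
    cc = [[dist(centers[i], centers[j]) for j in range(k)] for i in range(k)]

    labels = []
    skipped = 0
    for p in points:
        best = 0
        best_d = dist(p, centers[0])
        for j in range(1, k):
            # Triangle inequality: d(c_best, c_j) <= d(p, c_best) + d(p, c_j).
            # So if d(c_best, c_j) >= 2 * d(p, c_best), then
            # d(p, c_j) >= d(p, c_best) and c_j cannot be strictly closer.
            if cc[best][j] >= 2 * best_d:
                skipped += 1
                continue
            d = dist(p, centers[j])
            if d < best_d:
                best, best_d = j, d
        labels.append(best)
    return labels, skipped


points = [(0.0, 0.0), (0.1, 0.0), (10.0, 10.0), (10.1, 10.0)]
centers = [(0.0, 0.0), (10.0, 10.0)]
labels, skipped = assign_with_pruning(points, centers)
```

In this toy input the first two points sit close to the first center, so the distance to the second center is never computed for them. The assignments match what a brute-force nearest-center search would return; only the number of distance evaluations changes.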