scikit-learn-contrib / hdbscan

A high performance implementation of HDBSCAN clustering.
http://hdbscan.readthedocs.io/en/latest/
BSD 3-Clause "New" or "Revised" License
2.76k stars 497 forks

Streaming Data Clustering #239

Open zeynepkucuk opened 5 years ago

zeynepkucuk commented 5 years ago

Hello, what are the most widely used algorithms for clustering data streams (for example, telecommunications data)?

lmcinnes commented 5 years ago

Streaming clustering is really a topic unto itself. I don't have much experience in that field, so I can't offer many suggestions. Realistically, depending on the update speed you need, I would suggest a Gaussian mixture model or K-Means as the basic options.
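For anyone landing here later, this is a rough sketch of what the K-Means option could look like on a stream, using scikit-learn's `MiniBatchKMeans` and its `partial_fit` method to update the centroids one batch at a time. The `next_batch` function, the three synthetic centers, and the batch sizes are made up for illustration; in practice the batches would come from your own stream, and the number of clusters would have to be chosen for your data.

```python
import numpy as np
from sklearn.cluster import MiniBatchKMeans

rng = np.random.default_rng(0)

# Assumption: synthetic 2-D data around three fixed centers, standing in for a real stream.
centers = np.array([[0.0, 0.0], [10.0, 10.0], [-10.0, 10.0]])


def next_batch(size=200):
    """Hypothetical stream reader: returns the next `size` points as an array."""
    idx = rng.integers(0, len(centers), size=size)
    return centers[idx] + rng.normal(scale=0.5, size=(size, 2))


# n_clusters must be fixed up front; 3 matches the synthetic data above.
model = MiniBatchKMeans(n_clusters=3, random_state=0)

# Update the centroids incrementally as each batch arrives.
for _ in range(50):
    model.partial_fit(next_batch())

# Assign cluster labels to newly arrived points without refitting.
labels = model.predict(next_batch(10))
print(model.cluster_centers_)
print(labels)
```

Unlike HDBSCAN, this needs the number of clusters in advance and has no notion of noise points, so it is only a baseline.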

On Sun, Sep 23, 2018 at 3:58 PM zeynep kucuk notifications@github.com wrote:

Hello, what are the most widely used algorithms for clustering data streams (for example, telecommunications data)?
