Open Asquator opened 1 month ago
You can try the soft clustering options: https://hdbscan.readthedocs.io/en/latest/soft_clustering.html but there really isn't a magical straightforward way to do this.
On Sun, Jun 9, 2024 at 10:23 PM Asquator @.***> wrote:
My application requires total clustering of all data samples, and I would like to assign all outliers to their adjacent clusters (the dataset is very noisy, and after tweaking the two parameters, at least 1/4 of the samples are marked as outliers).
I want to benefit from the advantages of density-based clustering, but also make deterministic decision based on every point's (approximate) cluster.
It seems we just need to assign every outlier to its closest core point's cluster, what is the easiest way to do it?
— Reply to this email directly, view it on GitHub https://github.com/scikit-learn-contrib/hdbscan/issues/640, or unsubscribe https://github.com/notifications/unsubscribe-auth/AC3IUBP4B5ZPAAXHIFL5J3LZGUE3XAVCNFSM6AAAAABJBNMBQOVHI2DSMVQWIX3LMV43ASLTON2WKOZSGM2DENRZG42DAOA . You are receiving this because you are subscribed to this thread.Message ID: @.***>
My application requires total clustering of all data samples, and I would like to assign all outliers to their adjacent clusters (the dataset is very noisy, and after tweaking the two parameters, at least 1/4 of the samples are marked as outliers).
I want to benefit from the advantages of density-based clustering, but also make deterministic decision based on every point's (approximate) cluster.
It seems we just need to assign every outlier to its closest core point's cluster, what is the easiest way to do it?