Open IlyaOrson opened 7 years ago
Most likely there as issue with NaNs creeping in somehow. This could be due to peculiarities of the dataset, particularly if you have more than min_cluster_size points that are all identical (although I believe many of those issues should be caught more elegantly now). Can you share the dataset?
I can't share the data set, but you are right, NaNs appear because I have the default min_cluster_size = 5
and hdbscan identifies two clusters where one has just three members. In my particular case all the values contained in each cluster are identical between them.
You can potentially alleviate such a problem by adding a very small amount of noise to your data (well below the level of data distribution, just enough to jiggle identical points apart).
@lmcinnes It worked for me.
Hello! I am getting a lot of warnings of the following type in the latest tagged version:
Does anyone know why this warning appears and how to avoid the problem?