Closed jolespin closed 6 months ago
Note: The labels are not duplicates
labels.tolist()[:20]
[238,
236,
-1,
160,
236,
160,
-1,
-1,
308,
-1,
197,
-1,
-1,
-1,
312,
312,
-1,
300,
-1,
-1]
Diving a little deeper, this seems to only happen when I change min_cluster_size
I think you're adding the same elements multiple times to the dataset: you should create a new clusterer in each inner for loop.
It seems to me this is not a bug, but feel free to reopen if you think I'm wrong.
Apologies! I hadn't realized that was outside of the for-loop. Thanks for catching this error on my part.
No version available but I'm using the current build which is commit: 96a2c3f
Here's the command I ran to benchmark:
Here's the error:
Not sure why there are 2x more with these params.