scikit-learn-contrib / hdbscan

A high performance implementation of HDBSCAN clustering.
http://hdbscan.readthedocs.io/en/latest/
BSD 3-Clause "New" or "Revised" License
2.77k stars 496 forks source link

Got an unexpected keyword argument 'max_cluster_size' #488

Open mymusise opened 3 years ago

mymusise commented 3 years ago

Hello, I got an error with the following code:

import hdbscan

hdbscan_model = hdbscan.HDBSCAN(min_cluster_size=600,
                                max_cluster_size=20000,
                                  metric='euclidean',
                                  cluster_selection_method='eom',
                                  prediction_data=True)
hdbscan_model.fit(data)

then

~/.local/lib/python3.9/site-packages/hdbscan/hdbscan_.py in _hdbscan_boruvka_kdtree(X, min_samples, alpha, metric, p, leaf_size, approx_min_span_tree, gen_min_span_tree, core_dist_n_jobs, **kwargs)
    272         X = X.astype(np.float64)
    273 
--> 274     tree = KDTree(X, metric=metric, leaf_size=leaf_size, **kwargs)
    275     alg = KDTreeBoruvkaAlgorithm(tree, min_samples, metric=metric,
    276                                  leaf_size=leaf_size // 3,

sklearn/neighbors/_binary_tree.pxi in sklearn.neighbors._kd_tree.BinaryTree.__init__()

sklearn/neighbors/_dist_metrics.pyx in sklearn.neighbors._dist_metrics.DistanceMetric.get_metric()

TypeError: __init__() got an unexpected keyword argument 'max_cluster_size'
lmcinnes commented 3 years ago

What version are you using? Can you also check if this is the case if you install directly from the master branch on github?