Open erico-imgproj opened 8 months ago
Thanks for the issue! I'm not entirely sure what's happening, any chance you could run the script https://github.com/rapidsai/cuml/blob/branch-24.04/print_env.sh and post the output to see what versions of cuml/dask/etc you have, which will be super useful to reproduce.
Hello
Here is my configuration
cubinlinker-cu11 0.3.0.post1
cucim-cu11 24.2.0
cuda-python 11.8.3
cudf-cu11 24.2.2
cugraph-cu11 24.2.0
cuml-cu11 24.2.0
cuproj-cu11 24.2.0
cupy-cuda11x 13.0.0
cuspatial-cu11 24.2.0
cuxfilter-cu11 24.2.0
dask 2024.1.1
dask-cuda 24.2.0
dask-cudf-cu11 24.2.2
dask-glm 0.3.2
dask-ml 2023.3.24
raft-dask-cu11 24.2.0
rapids-dask-dependency 24.2.0
I hope it helps
What is your question? During processing of a large NLP dataset I found an very good example on cuml documentation site example. Following its instructions I wrote my own version for my dataset. My dataset contains 6 million phrases, and I wish to run a clustering algorithm to begin testing.
After preprocessing the data, the X variable is of type
which is the same type that the rapids example presents. Unfortunately, I run into a problem when I load it into the KMeans model.
If I try to run
yhat = kmeans_float.fit_predict(X.compute())
the error changes toChanging the clustering algorithm also does not help. For instance, I tried the following code:
And I get this error
Any help is appreciated