Open hchekired opened 2 months ago
I just changed version to cuDF version: 24.08.02 cuML version: 24.08.00
but it still does not work
Thanks @hchekired, I have moved the issue to cuml. It looks like a cuml issue, for some reason, _get_internal_model()
returns None
when dask uses cloudpickle.dumps()
to serialize cuml's BaseEstimator
.
I won't be around for the next couple of weeks, but @viclafargue will take a look here
Hello @viclafargue, hope everything is going well for you. Do you have an idea where the problem comes from?
Thanks.
Hello @hchekired, sorry for the late reply. Thanks for opening the issue. I could reproduce it successfully. It looks like the GridSearchCV
estimator is serializing the estimator prior to training which causes a bug. I will open a PR to fix the serialization of MNMG estimators prior to training. There is also another issue that I am looking into to make things work. However, I would recommend to either use sklearn's GridSearchCV
with cuML dask estimators, or (if a single GPU can handle it) dask-ml's GridSearchCV
with local cuML estimators.
Thanks for the reply, let me know if find something else to make things work.
Thanks
I fixed the issue that prevented serialization prior to training. But again, I am not quite sure if it is a good idea to use a Dask estimator with a Dask GridSearchCV. Maybe you should try either one of them in Dask and the other without. I will look into solving this when I have more time.
Hello @viclafargue, thanks for fixing the issue, how can I have access to the corrected version?
Also, why it is not a good idea to us a Dask estimator with a Dask GridSearchCV?
Thanks!
Hello, I am a beginner with GPU accelerated computing and I can’t find what is wrong with my code. I am getting this serialization error and don’t understand why.
The error I get is this
I created a case on dask forum but they told me to put my problem here since it seemed to be caused by an incompatibility between dask-cuda and dask-ml.
Here are the info on the system I use:
Python version: 3.10.9 | packaged by conda-forge | (main, Feb 2 2023, 20:20:04) [GCC 11.3.0] CUDA version: nvcc: NVIDIA (R) Cuda compiler driver Copyright (c) 2005-2024 NVIDIA Corporation Built on Thu_Mar_28_02:18:24_PDT_2024 Cuda compilation tools, release 12.4, V12.4.131 Build cuda_12.4.r12.4/compiler.34097967_0
cuDF version: 24.08.00a405 cuML version: 24.08.00a50 Dask version: 2024.7.1 CUDA version (nvidia-smi): Tue Aug 20 14:17:43 2024
+-----------------------------------------------------------------------------------------+ | NVIDIA-SMI 550.106 Driver Version: 552.86 CUDA Version: 12.4 |
Also, I am using a WSL2 environment.
Thanks a lot for your help!!