Closed Huilin-Li closed 3 months ago
cc @dask/gpu
Unfortunately this isn't the right place to report this. Could you open an issue on the cudf repo with a minimal bug report, something that the cudf team can copy/paste to reproduce the problem?
Hi @Huilin-Li!
As @jacobtomlinson suggested, please do raise an issue in dask/dask or cudf.
I think we will need to know more about the data you are calling explode on. I don't think explode is supported by the "cudf" backend when query-planning is enabled. However, there also seem to be problems with explode when the "pandas" backend is used. E.g.:
import dask
import dask.dataframe as dd
dask.config.set({"dataframe.backend": "pandas"})
df = dd.from_dict({"A": [[0, 1, 2], [], [3, 4]]}, 1)
df.explode("A")
(I get an error for both backends here. So please include a specific reproducer like this in your dask issue)
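For comparison, here is a minimal sketch of what plain pandas (no Dask, no cudf) returns for the same data. It is only meant as a reference for the expected result to include alongside the reproducer; the variable names `pdf` and `expected` are made up for illustration.

```python
import pandas as pd

# Same data as the Dask reproducer above, built directly in pandas.
pdf = pd.DataFrame({"A": [[0, 1, 2], [], [3, 4]]})

# Each list element becomes its own row and keeps the original index label;
# the empty list becomes a single row holding NaN.
expected = pdf.explode("A")

print(expected)
```

Pasting this output next to the failing Dask call should make it clear what result the issue is expecting.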
Linking the issue that was opened, for those finding this from search: https://github.com/rapidsai/cudf/issues/16458
Without setting dask.config.set({"dataframe.backend": "cudf"}), the calculation works fine but is very slow, so I then set dask.config.set({"dataframe.backend": "cudf"}). But then I got this error. More details are here: link