Closed by HugoPhibbs 1 month ago
Sorry for the late reply. The only problem I see is that "class DVCupyVector" doesn't have "to_host()". Instead, you can do this:
V = cp.zeros(len(E), dtype=cp.int32)
V_d = trtc.DVCupyVector(V)
trtc.Exclusive_Scan(E_d, V_d)
return V
And it actually works for me. So far, I haven't been able to reproduce the error you are seeing.
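To check the GPU result independently of the CUDA setup, a plain-Python reference for the exclusive scan may help. This is only a sketch: the helper name exclusive_scan_ref is made up for illustration and is not part of ThrustRTC, and it assumes the default behaviour of an additive scan with an initial value of 0.

```python
from itertools import accumulate


def exclusive_scan_ref(values):
    """CPU reference (hypothetical helper, not a ThrustRTC function).

    out[i] is the sum of values[0..i-1], so out[0] is 0.
    """
    values = list(values)
    if not values:
        return []
    # Prepend 0 and drop the last input so each output excludes its own element.
    return list(accumulate([0] + values[:-1]))


print(exclusive_scan_ref([3, 1, 4, 1, 5]))  # [0, 3, 4, 8, 9]
```

If the values in V differ from this reference for the same input, the problem is in the scan call itself; if the call fails before producing any output, the setup is the more likely culprit.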
If you are still seeing "An internal error happend", then it looks more like a configuration issue.
Miniconda works reliably for me. I used:
conda install -c conda-forge cupy cuda-version=11.3
to install cupy.
At the beginning of the script, I set the nvrtc path like this:
trtc.set_libnvrtc_path('/home/fei/miniconda3/lib/libnvrtc.so')
Also check that your GPU driver is new enough for your CUDA version.
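One rough way to compare the two versions from Python is sketched below. It assumes cupy is installed and uses cupy.cuda.runtime.driverGetVersion() and runtimeGetVersion(), which return integers encoded as 1000 * major + 10 * minor. The helper names are made up for illustration, and the comparison is deliberately conservative: it only tells you whether the driver reports at least the CUDA version that cupy was built against.

```python
def decode_cuda_version(v):
    """Decode CUDA's integer version, e.g. 11030 -> (11, 3)."""
    return v // 1000, (v % 1000) // 10


def driver_supports_runtime(driver_v, runtime_v):
    """Conservative check: the driver's CUDA version is at least the runtime's."""
    return driver_v >= runtime_v


def report_versions():
    """Print driver and runtime versions. Needs cupy and a working GPU driver."""
    import cupy as cp  # imported here so the helpers above work without a GPU

    driver_v = cp.cuda.runtime.driverGetVersion()
    runtime_v = cp.cuda.runtime.runtimeGetVersion()
    print("driver supports CUDA up to: %d.%d" % decode_cuda_version(driver_v))
    print("runtime CUDA version:       %d.%d" % decode_cuda_version(runtime_v))
    print("driver new enough:", driver_supports_runtime(driver_v, runtime_v))
```

Calling report_versions() on the affected machine prints both versions; if the last line says False, updating the driver or installing an older cuda-version is worth trying first.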
Hi,
I'm getting an error when using Exclusive_Scan with Python.
My code:
The error:
FYI, my CUDA version is 11.3.