RUCAIBox / RecBole

A unified, comprehensive and efficient recommendation library
https://recbole.io/
MIT License
3.42k stars 612 forks

GPU cannot be used [🐛BUG] Describe your problem in one sentence. #1910

Open ahukmr opened 11 months ago

ahukmr commented 11 months ago

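Describe the bug: When running RecBole 1.2.0, I set both use_gpu=true and gpuid=0 in the configuration file, but at runtime it always reports that the GPU is not being used. Why?

To reproduce: Steps to reproduce the bug:

  1. The extra yaml file you used
  2. Your code
  3. Your run script

Expected behavior: A clear and concise description of what you expected.

Screenshots

Links: Add a link to code that reproduces the bug, such as Colab or another online Jupyter platform. (Optional)

Environment (please complete the following information):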

BoXiaohe commented 11 months ago

Thanks for your attention to RecBole! As for your problem, you can try the advice below.

  1. CUDA and cuDNN compatibility: Verify that your CUDA and cuDNN versions are compatible with RecBole. Different versions of deep learning libraries may require specific versions of CUDA and cuDNN to work properly.
  2. Environment variables: Double-check the setting of the CUDA_VISIBLE_DEVICES environment variable. Setting gpuid=0 in the configuration file should generally be sufficient, but other configurations or environmental factors might affect GPU usage.
  3. GPU availability: Ensure that your GPU is available and not being used by other processes. You can use tools like nvidia-smi to check GPU status and usage. If another process is occupying the GPU, RecBole may not be able to use it.

Hope this helps!
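The first two checks above can be gathered in one short script. This is only a minimal sketch, not part of RecBole: it assumes the environment uses PyTorch (which RecBole is built on), and `gpu_diagnostics` is a hypothetical helper name chosen for illustration. It reports what PyTorch itself can see, which is a reasonable place to start before looking at the RecBole configuration.

```python
import os


def gpu_diagnostics():
    """Collect the facts that decide whether PyTorch can see a GPU.

    Hypothetical helper for illustration; it is not a RecBole API.
    """
    info = {
        # None means the variable is unset, so no GPUs are hidden by it.
        "cuda_visible_devices": os.environ.get("CUDA_VISIBLE_DEVICES"),
        "torch_installed": False,
    }
    try:
        import torch
    except ImportError:
        # Without PyTorch there is nothing further to inspect.
        return info

    info["torch_installed"] = True
    info["torch_version"] = torch.__version__
    # None here means a CPU-only PyTorch build is installed.
    info["torch_cuda_build"] = torch.version.cuda
    info["cuda_available"] = torch.cuda.is_available()
    info["device_count"] = torch.cuda.device_count()
    if info["cuda_available"]:
        info["cudnn_version"] = torch.backends.cudnn.version()
        info["device_names"] = [
            torch.cuda.get_device_name(i) for i in range(info["device_count"])
        ]
    return info


if __name__ == "__main__":
    for key, value in gpu_diagnostics().items():
        print(f"{key}: {value}")
```

If `torch_cuda_build` prints `None`, the installed PyTorch is a CPU-only build, and no configuration setting will enable the GPU until a CUDA-enabled build is installed. If `cuda_available` is `False` while `nvidia-smi` lists the card, a driver mismatch or an empty or incorrect `CUDA_VISIBLE_DEVICES` value is a likely cause.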