huggingface / deep-rl-class

This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
Apache License 2.0
3.87k stars 594 forks source link

[HANDS-ON BUG] mlagents-learn in unit 5 not working #571

Open benbekir opened 2 days ago

benbekir commented 2 days ago

Describe the bug

The command

!mlagents-learn ./config/ppo/SnowballTarget.yaml --env=./training-envs-executables/linux/SnowballTarget/SnowballTarget --run-id="SnowballTarget1" --no-graphics

doesnt work. It always results in the following error:

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! (when checking argument for argument mat1 in method wrapper_CUDA_addmm).

I have tried specifying the device with the --torch-device argument but that didnt help either. Maybe this has something to do with the fact that dev versions are used for the ml-agents and ml-agents-envs packages?

  ml-agents: 1.2.0.dev0,
  ml-agents-envs: 1.2.0.dev0,
  Communicator API: 1.5.0,
  PyTorch: 2.5.0+cu121

Material

TPK-MAKG commented 1 day ago

Experiencing the same issue. Additionally, the runtime requires a restart after the installation of numpy packages, otherwise it can't find the hyperparameter file in /config/ppo/SnowballTarget.yaml. Maybe it's a package version conflict?