Closed fywu85 closed 4 years ago
Have you checked it didn't crash or something? Use ray stack
to see what is going on in the remote workers.
Closing this since it's most likely no longer an issue. But @fywu85 please reopen if still needed. Thanks!
System information
pip install ray==0.5.3
python experiments/mutiagent_hrl/tune.py HalfCheetah-v2 LSTMPolicy -n 1 -s 3
Succeeded command:python experiments/mutiagent_hrl/tune.py Pendulum-v0 LSTMPolicy -n 1 -s 3
Describe the problem
We try to use
ray
to parallelize our runner script. Specifically, we used something belowThis works for simple envs in classical control, such as
Pendulum-v0
, but does not work for all mujoco envs, e.g.,HalfCheetah-v2
. Specifically, in case ofHalf-Cheetah-v2
, it starts with using ~3 CPUs, but quickly collapses into 1 CPU after 5 seconds or so. On the other hand, tests withPendulum-v0
use all 3 CPUs, which is what we intend to do.Source code / logs
The source code is at https://github.com/AboudyKreidieh/h-baselines/. Please use the following steps for installation:
To run the tests, please run:
To see how we use
ray
to implement the parallelization, please check: