I am trying to reproduce the DDPG results in Plappert et al., 2018 arxiv.org/abs/1802.09464.
However, even the simplest environment FetchReach-v1 seems to stay stuck at a return of -49. I have tried many different settings, but the latest one is simply a variation of the instructions for her in https://github.com/openai/baselines/tree/master/baselines/her, but to use just DDPG :
I am trying to reproduce the DDPG results in Plappert et al., 2018 arxiv.org/abs/1802.09464.
However, even the simplest environment
FetchReach-v1
seems to stay stuck at a return of -49. I have tried many different settings, but the latest one is simply a variation of the instructions forher
in https://github.com/openai/baselines/tree/master/baselines/her, but to use just DDPG :mpiexec -np 12 python3 -m baselines.run --alg=ddpg --env=FetchReach-v1 --num_timesteps=50000
It seems to stays stuck at