sweetice / Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
MIT License
3.75k stars 837 forks source link

Cannot solve the Pendulum problem by PPO iml in Chapter 07 #19

Open ryanhuang1014 opened 4 years ago

ryanhuang1014 commented 4 years ago

I cannot solve the continuous control problem of the Pendulum with your implementation in Chapter 07, i.e., PPO.

When the program exits finally, the problem is still not solved. Could you please verify it and tell me how to reproduce your solution? Thx. image