stevenpjg / ddpg-aigym

Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym environments
MIT License
275 stars 74 forks source link

It si very very slow for Pendulum-v0 of classic control environment #6

Closed sarvghotra closed 8 years ago

sarvghotra commented 8 years ago

I ran this code for Pendulum-v0 environment, its too too slow on this particular environment. But its considerably faster on InvertedPendulum-v1. Do you have any idea why is it so ?

stevenpjg commented 8 years ago

It should be fine now. You can accelerate it further using a small wrapper to use normalized environments and reward scaling(link), as you mentioned earlier in your discussion.