I ran this code on the Pendulum-v0 environment, and it is far too slow there, but it is considerably faster on InvertedPendulum-v1. Do you have any idea why that is?
It should be fine now. You can speed it up further with a small wrapper that adds normalized environments and reward scaling (link), as you mentioned earlier in your discussion.
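For anyone wondering what such a wrapper might look like, here is a rough sketch. It is not the wrapper from the link, just one plausible reading of it: "normalized" is taken to mean that the agent emits actions in [-1, 1] and the wrapper maps them onto the environment's real bounds, and rewards are multiplied by a constant. The class name `ScaledEnv` and the default `reward_scale=0.1` are made up for illustration, and the sketch assumes the older Gym interface where `step` returns `(obs, reward, done, info)` and the action space exposes `low` and `high`.

```python
class ScaledEnv:
    """Hypothetical wrapper: rescales actions from [-1, 1] and scales rewards.

    Assumes the wrapped env follows the older Gym API, i.e. step() returns
    (obs, reward, done, info), and has a bounded, Box-like action space
    exposing per-dimension `low` and `high`.
    """

    def __init__(self, env, reward_scale=0.1):
        self.env = env
        self.reward_scale = reward_scale  # illustrative value, tune per task
        self.low = [float(v) for v in env.action_space.low]
        self.high = [float(v) for v in env.action_space.high]

    def reset(self):
        # Observations are passed through unchanged in this sketch.
        return self.env.reset()

    def step(self, action):
        scaled = []
        for a, lo, hi in zip(action, self.low, self.high):
            a = max(-1.0, min(1.0, float(a)))  # clip to [-1, 1]
            # Affine map: -1 -> lo, +1 -> hi.
            scaled.append(lo + (a + 1.0) * 0.5 * (hi - lo))
        obs, reward, done, info = self.env.step(scaled)
        return obs, reward * self.reward_scale, done, info
```

With Gym installed, this could be used as `env = ScaledEnv(gym.make("Pendulum-v0"), reward_scale=0.1)`. Pendulum-v0 rewards can reach roughly -16 per step, so shrinking them may keep the critic targets in a friendlier range, though the best scale is something to tune.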