Closed nottombrown closed 7 years ago
I expect PPO to train significantly faster than TRPO https://github.com/openai/baselines/tree/master/baselines/pposgd
Working on this now.
Added in #11
I expect PPO to train significantly faster than TRPO https://github.com/openai/baselines/tree/master/baselines/pposgd