pat-coady / trpo

Trust Region Policy Optimization with TensorFlow and OpenAI Gym
https://learningai.io/projects/2017/07/28/ai-gym-workout.html
MIT License

Does this code use TRPO? #24

Closed by ghost 5 years ago

ghost commented 5 years ago

The folder says TRPO, but the description mentions PPO. If this repository runs PPO and not TRPO, then it should be renamed to PPO.

ryanmaxwell96 commented 4 years ago

Did you ever come to a consensus on which algorithm is actually being used? Based on the other function names, it seems to me that TRPO is indeed being used, but I was a little confused about this as well.