rlworkgroup / garage

A toolkit for reproducible reinforcement learning research.
MIT License
1.88k stars 310 forks

Make default TF PPO optimizer Adam, not L-BFGS #2149

Closed avnishn closed 4 years ago