heronsystems / adeptRL

Reinforcement learning framework to accelerate research
GNU General Public License v3.0
204 stars 29 forks source link

Fix ppo reward norm and remove unused grad clip arg #98

Closed benbellheron closed 4 years ago

benbellheron commented 4 years ago

With the new ppo api grad clipping is done elsewhere so this is an unused argument.