Fix ppo reward norm and remove unused grad clip arg

heronsystems / adeptRL

Reinforcement learning framework to accelerate research

GNU General Public License v3.0

204 stars 29 forks source link

Closed benbellheron closed 4 years ago

benbellheron commented 4 years ago

With the new ppo api grad clipping is done elsewhere so this is an unused argument.