vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
http://docs.cleanrl.dev
Other
4.84k stars 560 forks source link

feat: add vloss clipping to jax ppo. #426

Open KaleabTessera opened 8 months ago

KaleabTessera commented 8 months ago

Description

Added value loss clipping to Jax ppo. I noticed it was there in the pytorch implementation and not in the Jax implementation, and it made quite a big performance difference on the environments I tested.

Types of changes

Checklist:

If you need to run benchmark experiments for a performance-impacting changes:

vercel[bot] commented 8 months ago

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Comments Updated (UTC)
cleanrl ✅ Ready (Inspect) Visit Preview 💬 Add feedback Oct 27, 2023 11:52am