vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
http://docs.cleanrl.dev
Other
4.84k stars 560 forks source link

Refactor to use tyro #424

Closed vwxyzjn closed 7 months ago

vwxyzjn commented 8 months ago

Description

Better code, IDE support via tyro. Also refactors PPO and closes https://github.com/vwxyzjn/cleanrl/issues/206

Gonna do https://github.com/vwxyzjn/cleanrl/pull/408 separately.

I also got to redocument ppo_atari_multigpu about the scaling log as well.

Types of changes

Checklist:

If you need to run benchmark experiments for a performance-impacting changes:

vercel[bot] commented 8 months ago

The latest updates on your projects. Learn more about Vercel for Git β†—οΈŽ

Name Status Preview Comments Updated (UTC)
cleanrl βœ… Ready (Inspect) Visit Preview πŸ’¬ Add feedback Nov 28, 2023 0:32am
vwxyzjn commented 8 months ago

image

image image

image image

image

image

vwxyzjn commented 8 months ago

image image image

vwxyzjn commented 8 months ago

This PR will also drop mujoco_py support to simplify things with the gymnasium migration.

vwxyzjn commented 7 months ago

Closes #418 too (added CI for 3.8, 3.9, 3.10)