vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
http://docs.cleanrl.dev
Other
4.91k stars 566 forks

Add gymnasium support to C51 #403

Closed by sdpkjc 1 year ago

sdpkjc commented 1 year ago

Description

Types of changes

Checklist:

If you need to run benchmark experiments for a performance-impacting change:

vercel[bot] commented 1 year ago

Vercel deployment status:

Name: cleanrl
Status: ✅ Ready
Updated (UTC): Jun 20, 2023 1:18pm

sdpkjc commented 1 year ago

[Figures: c51, c51_time, c51_jax, c51_jax_time]

sdpkjc commented 1 year ago

[Figures: compare, compare-time]

sdpkjc commented 1 year ago

[Figures: c51_atari, c51_atari_time, c51_atari_jax, c51_atari_jax_time]

sdpkjc commented 1 year ago

[Figures: compare, compare-time]