vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
http://docs.cleanrl.dev
Other
5.54k stars 631 forks source link

JAX + DQN #220

Closed vwxyzjn closed 2 years ago

vwxyzjn commented 2 years ago

See #218. @yooceii and @kinalmehta have expressed interest in working on this. @kinalmehta is also interested in C51 (#221).

@kinalmehta if working with DQN first helps with working on C51, feel free to also work on this issue (saying this because usually working with DQN helps with "warm-up"). That way we will have two DQN PR from both you and @yooceii, and we can review and reconcile the differences between the two PRs.

kinalmehta commented 2 years ago

DQN + JAX pull request is already merged. So closing this issue.