vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
http://docs.cleanrl.dev
Other
5.4k stars 617 forks source link

Implement CrossQ #463

Open noahfarr opened 3 months ago

noahfarr commented 3 months ago

Problem Description

I would like to implement the CrossQ algorithm as proposed in https://openreview.net/pdf?id=PczQtTsTIX. I already did some testing and it looks like its not that much effort to implement it based on SAC. I will implement it for myself anyway, so I might as well contribute to the repo. Please let me know if this contribution is something you are interested in.

Checklist

noahfarr commented 3 months ago

Here is the current state of my implementation: https://github.com/noahfarr/cleanrl/blob/master/cleanrl/crossq.py