vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
http://docs.cleanrl.dev
Other
4.91k stars 566 forks

ManiSkill2 - Fast Visual RL robotics cleanrl baselines #366

Open StoneT2000 opened 1 year ago

StoneT2000 commented 1 year ago

Opening an issue on adding ManiSkill2 CleanRL baselines. The primary reason is to have good baselines that leverage ManiSkill2's fast GPU-accelerated visual rendering for tackling robotics envs. The plan is to include PPO and SAC implementations, following the continuous-action variants already implemented for MuJoCo, with modifications so that SAC supports parallel envs.
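To illustrate one change that running SAC on parallel envs typically needs, here is a minimal sketch of a replay buffer that accepts one batch of transitions per step, with one entry per env. It is not CleanRL's or ManiSkill2's code: `ParallelReplayBuffer`, `Transition`, and `add_batch` are hypothetical names chosen for this example, and it uses only the standard library where a real baseline would use tensors.

```python
import random
from typing import List, NamedTuple, Sequence


class Transition(NamedTuple):
    obs: Sequence[float]
    action: Sequence[float]
    reward: float
    next_obs: Sequence[float]
    done: bool


class ParallelReplayBuffer:
    """Fixed-capacity ring buffer fed by several envs at once (hypothetical helper)."""

    def __init__(self, capacity: int, seed: int = 0) -> None:
        self.capacity = capacity
        self.storage: List[Transition] = []
        self.pos = 0  # next slot to overwrite once the buffer is full
        self.rng = random.Random(seed)

    def add_batch(self, obs, actions, rewards, next_obs, dones) -> None:
        # Each argument holds one entry per parallel env;
        # every env's step is stored as its own transition.
        for fields in zip(obs, actions, rewards, next_obs, dones):
            transition = Transition(*fields)
            if len(self.storage) < self.capacity:
                self.storage.append(transition)
            else:
                self.storage[self.pos] = transition  # overwrite the oldest entry
            self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size: int) -> List[Transition]:
        # Uniform sampling with replacement over everything stored so far.
        return [self.rng.choice(self.storage) for _ in range(batch_size)]

    def __len__(self) -> int:
        return len(self.storage)
```

With N parallel envs, each call to `add_batch` adds N transitions, so the ratio of gradient updates to collected data drops unless the number of updates per step is raised to match.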

vwxyzjn commented 1 year ago

ManiSkill2 looks awesome. We would love to have a CleanRL variant for it. I have updated the contribution process here https://docs.cleanrl.dev/contribution to help get started.

StoneT2000 commented 1 year ago

Quick update: I will make a PR once ManiSkill2 has officially upgraded to Gymnasium. Progress on that upgrade can be followed here: https://github.com/haosulab/ManiSkill2/pull/76