vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
http://docs.cleanrl.dev
Other
5.02k stars 575 forks source link

Added Polyak update rate for soft DQN target network updates #347

Closed manjavacas closed 1 year ago

manjavacas commented 1 year ago

Description

According to #346, the Polyak update rate (tau) has been included as an argument to dqn.py and dqn_atari.py implementations.

It will allow soft updates of the target network, as already done for dqn_jax.py.

Types of changes

Checklist:

If you are adding new algorithm variants or your change could result in performance difference, you may need to (re-)run tracked experiments. See https://github.com/vwxyzjn/cleanrl/pull/137 as an example PR.

vercel[bot] commented 1 year ago

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Comments Updated
cleanrl ✅ Ready (Inspect) Visit Preview 💬 Add your feedback Jan 16, 2023 at 9:37PM (UTC)
manjavacas commented 1 year ago

Sure! I have updated the documentation. Please let me know if you need anything else 😃

vwxyzjn commented 1 year ago

Thanks for the contribution! @manjavacas

vwxyzjn commented 1 year ago

This PR closes https://github.com/vwxyzjn/cleanrl/issues/346.

manjavacas commented 1 year ago

Thanks for the contribution! @manjavacas

Thanks to you for this great project! Looking forward to continue contributing :-)