vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
http://docs.cleanrl.dev
Other
5.54k stars 631 forks

TD3 should also log losses for `qf2` #172

Closed vwxyzjn closed 2 years ago

vwxyzjn commented 2 years ago

Problem Description

Currently, TD3 logs the loss for `qf1` but not for `qf2`. We should log both.
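
A minimal sketch of what the fix could look like, assuming losses are reported through a TensorBoard-style `add_scalar(tag, scalar_value, global_step)` call. The helper `log_critic_losses`, the stand-in `RecordingWriter`, and the tag names `losses/qf1_loss` and `losses/qf2_loss` are illustrative assumptions, not the actual code in the repository.

```python
class RecordingWriter:
    """Hypothetical stand-in for a TensorBoard-style writer; stores calls in memory."""

    def __init__(self):
        self.scalars = []

    def add_scalar(self, tag, scalar_value, global_step=None):
        # Record the call instead of writing an event file.
        self.scalars.append((tag, float(scalar_value), global_step))


def log_critic_losses(writer, qf1_loss, qf2_loss, global_step):
    """Log the loss of each critic under its own tag (tag names are assumptions)."""
    writer.add_scalar("losses/qf1_loss", qf1_loss, global_step)
    # The line this issue asks for: also report the second critic's loss.
    writer.add_scalar("losses/qf2_loss", qf2_loss, global_step)


writer = RecordingWriter()
log_critic_losses(writer, qf1_loss=0.25, qf2_loss=0.5, global_step=100)
```

In a real training loop, the losses would likely be tensors, so each value would be converted to a Python float (for example with `.item()`) before logging.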