High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
5.54k
stars
631
forks
source link
TD3 should also log losses for `qf2` #172
Closed
vwxyzjn closed 2 years ago
Problem Description
Currently, TD3 only logs losses for
qf1
but does not log losses forqf2
, we should fix this.