Allow the last training batch drawn to be smaller than batch_size + add support for more agents in BatchRL by adding softmax with temperature to the corresponding heads + add a CartPole_QR_DQN preset with a golden test + cleanups
The list of supported agents in BatchRL now includes:
DQN
DDQN
Dueling DDQN
Categorical DQN (C51)
Rainbow DQN
QR DQN
Bootstrapped DQN
NEC
MMC
PAL
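
For reviewers unfamiliar with the two mechanisms named in the summary, here is a minimal sketch of what they could look like. It is not the code from this change: `softmax_with_temperature` and `iterate_batches` are hypothetical names, and the sketch uses plain Python lists rather than the framework's tensors. It assumes the softmax is applied to a head's per-action values to obtain a probability distribution over actions, and that the final batch simply holds whatever transitions remain.

```python
import math
from typing import Iterator, List, Sequence


def softmax_with_temperature(q_values: Sequence[float],
                             temperature: float = 1.0) -> List[float]:
    """Turn per-action values into probabilities (hypothetical helper).

    A lower temperature sharpens the distribution towards the greedy
    action; a higher temperature flattens it towards uniform.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [q / temperature for q in q_values]
    # Subtract the maximum before exponentiating for numerical stability.
    max_scaled = max(scaled)
    exps = [math.exp(s - max_scaled) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def iterate_batches(transitions: Sequence, batch_size: int) -> Iterator[Sequence]:
    """Yield consecutive batches; the last one may be shorter than batch_size."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    for start in range(0, len(transitions), batch_size):
        yield transitions[start:start + batch_size]


if __name__ == "__main__":
    print(softmax_with_temperature([1.0, 2.0, 3.0], temperature=0.5))
    print([len(b) for b in iterate_batches(list(range(10)), batch_size=4)])
```

In this sketch, code that consumes a batch should read its size from the batch itself instead of assuming `batch_size`, since the final batch can be shorter.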