High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
1.08k
stars
131
forks
source link
LB-SAC implementation #31
Closed
Howuhh closed 1 year ago
Implementation of Q-ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size.
TODO: