Open javiarrobas opened 3 years ago
Rescaling rewards may be useful to increase sampling efficiency. This can be done using previous rewards, but without shifting mean as that may affect agent's will to live.
Rescaling rewards may be useful to increase sampling efficiency. This can be done using previous rewards, but without shifting mean as that may affect agent's will to live.