Closed daisatojp closed 4 years ago
use Retrace Algorithm (paper) as Policy Evaluation
i observed it is unstable, fail to implement?
I gave up.
use Retrace Algorithm (paper) as Policy Evaluation