Open Bahador-Bakhshi opened 3 years ago
This paper proposed/reviewed different approaches to approximate the rho in the R-Learning
"Average-reward model-free reinforcement learning"
This paper proposed/reviewed different approaches to approximate the rho in the R-Learning
"Average-reward model-free reinforcement learning"