Closed sasforce closed 5 years ago
Hi, the Q-PAMDP class encapsulates a discrete action agent to which it delegates discrete action learning, in this case using Sarsa(λ). This agent is updated in the _rollout
function of qpamdp.py
, where the step
function of the discrete agent is called.
Thank you!
When does the update of the the discrete action parameters happen? There is no usage of '_action_update' function in 'learn' function of the file 'qpamdp.py'. Thank you.