Thanks very much for bsuite, it is a great resource for reproducible research.
I have a question on the framework.
I am setting up some pedagogic implementation of canonical rl algorithms, among which, sarsa.
Is there any design pattern you had in mind for n-step methods or any method that requires access to experience from longer transitions?
I am currently solving the issue with sarsa by computing the next action with the select_action method in the update function.
What about n-step methods or model-based methods?
Hi There!
Thanks very much for
bsuite
, it is a great resource for reproducible research.I have a question on the framework. I am setting up some pedagogic implementation of canonical rl algorithms, among which, sarsa.
Is there any design pattern you had in mind for n-step methods or any method that requires access to experience from longer transitions? I am currently solving the issue with sarsa by computing the next action with the
select_action
method in theupdate
function. What about n-step methods or model-based methods?