The signature for `update` does not allow for sarsa or n-step methods?

Hi There!

Thanks very much for bsuite, it is a great resource for reproducible research.

I have a question on the framework. I am setting up some pedagogic implementation of canonical rl algorithms, among which, sarsa.

Is there any design pattern you had in mind for n-step methods or any method that requires access to experience from longer transitions? I am currently solving the issue with sarsa by computing the next action with the select_action method in the update function. What about n-step methods or model-based methods?

google-deepmind / bsuite

The signature for `update` does not allow for sarsa or n-step methods? #34