-
To implement SARSA with experience replay:
- The memory module should not compute "targets" or TD error. Memory should just store state/action/reward/next state information, and provide it in batches …
-
- [x] Implement SARSA update
- [x] Test it for convergence with a known policy, i.e. our simple controller
- [x] Implement full SARSA policy updates
-
Implement the general SARSA algorithm according to the definition of Barto and Sutton
-
Hi @hartikainen,
Thank you for the super cool repo 👍
I add one question regarding the Sarsa agent implementation. In the official pseudo-algorihtm of Sarsa lambda ([slide 29](https://www.davidsi…
-
I would like to know how can I compile the high_level_sarsa_agent.cpp in the example directory. Are these files already compiled in some place, or do I have to do them manually?
-
-
I would like to get a slightly better understanding regarding the difference between the on-policy and off-policy as well as some clarifications regarding the formulas used to apply them. Namely, what…
-
Unable to save Sarsa machines. Not marked as serializable.
-
Is there any information available about the SARSA weights discussed in the main HFO paper? I'm not necessarily asking if the weights themselves are available, but more any statistical information tha…
-
## 💥 Proposal
Hey, I am GSSOC Contributor.
I want to implement SARSA Algorithm on Cartpole problem.
Kindly, assign to me.