-
To implement SARSA with experience replay:
- The memory module should not compute "targets" or TD error. Memory should just store state/action/reward/next state information, and provide it in batches …
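
A minimal sketch of such a memory module, assuming a fixed-capacity buffer with uniform random sampling. The names `Transition` and `ReplayMemory` are hypothetical and not taken from the repository; the buffer only stores and returns transitions, leaving targets and TD errors to the caller.

```python
import random
from collections import deque, namedtuple

# Hypothetical record type: only the fields the issue names are stored.
Transition = namedtuple("Transition", ["state", "action", "reward", "next_state"])


class ReplayMemory:
    """Stores transitions and hands them out in batches; computes no targets."""

    def __init__(self, capacity, seed=None):
        # A deque with maxlen drops the oldest transition once capacity is reached.
        self._buffer = deque(maxlen=capacity)
        self._rng = random.Random(seed)

    def push(self, state, action, reward, next_state):
        self._buffer.append(Transition(state, action, reward, next_state))

    def sample(self, batch_size):
        # Uniform sampling without replacement (an assumption, not a requirement).
        return self._rng.sample(list(self._buffer), batch_size)

    def __len__(self):
        return len(self._buffer)
```

For SARSA specifically, the caller would still need the next action for each sampled transition, for example by storing it as an extra field or by querying the current policy at `next_state`.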
-
- [x] Implement SARSA update
- [x] Test it for convergence with a known policy, i.e. our simple controller
- [x] Implement full SARSA policy updates
-
Implement the general SARSA algorithm as defined by Sutton and Barto.
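
For reference, the tabular SARSA update can be sketched as below. This is an illustrative version under stated assumptions, not the project's implementation: the function name `sarsa_update`, the dictionary-based Q-table, and the default `alpha` and `gamma` values are all hypothetical.

```python
from collections import defaultdict


def sarsa_update(q, state, action, reward, next_state, next_action,
                 alpha=0.1, gamma=0.99, done=False):
    """Apply one on-policy SARSA step to the Q-table `q` and return the TD error.

    Q(s, a) <- Q(s, a) + alpha * (r + gamma * Q(s', a') - Q(s, a))
    """
    # At a terminal transition there is no bootstrap term.
    target = reward if done else reward + gamma * q[(next_state, next_action)]
    td_error = target - q[(state, action)]
    q[(state, action)] += alpha * td_error
    return td_error


# Usage: Q-values default to 0.0 for unseen (state, action) pairs.
q = defaultdict(float)
q[(1, 0)] = 2.0
sarsa_update(q, state=0, action=1, reward=1.0, next_state=1, next_action=0,
             alpha=0.5, gamma=0.9)
```

The update uses the action actually chosen in the next state (`next_action`), which is what makes it on-policy, in contrast to Q-learning's maximum over next actions.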
-
I would like to know how I can compile high_level_sarsa_agent.cpp in the example directory. Are these files already compiled somewhere, or do I have to compile them manually?
-
Hi @hartikainen,
Thank you for the super cool repo 👍
I have one question regarding the Sarsa agent implementation. In the official pseudo-algorithm of Sarsa lambda ([slide 29](https://www.davidsi…
-
## 💥 Proposal
Hey, I am a GSSOC contributor.
I want to implement the SARSA algorithm on the CartPole problem.
Kindly assign this to me.
-
Unable to save Sarsa machines. Not marked as serializable.
-
Is there any information available about the SARSA weights discussed in the main HFO paper? I'm not necessarily asking if the weights themselves are available, but more any statistical information tha…
-
An overview of all techniques/strategies mentioned:
1. Policy exploration/exploitation
   - $\epsilon$-greedy
   - Softmax
2. Update Q function
   - SARSA
   - (k-step) temporal differen…
luwo9 updated
3 months ago
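
The two exploration strategies listed above ($\epsilon$-greedy and softmax) can be sketched as follows. This is a minimal illustration over a plain list of Q-values; the function names and the `temperature` parameter are assumptions made for the example and do not come from the issue.

```python
import math
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


def softmax_probs(q_values, temperature=1.0):
    """Turn Q-values into action probabilities (Boltzmann distribution)."""
    # Subtracting the maximum keeps exp() numerically stable.
    m = max(q_values)
    exps = [math.exp((q - m) / temperature) for q in q_values]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_action(q_values, temperature=1.0, rng=random):
    """Sample an action index according to the softmax probabilities."""
    probs = softmax_probs(q_values, temperature)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]
```

A lower `temperature` makes the softmax policy greedier, while a higher one pushes it toward uniform random exploration.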