sarsa Search Results - Githubissues

432 results
for sarsa

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

EderSantana/X #4

SARSA

To implement SARSA with experience replay: - The memory module should not compute "targets" or TD error. Memory should just store state/action/reward/next state information, and provide it in batches …

mattsqerror updated 8 years ago
1
peteflorence/Machine-Learning-6.867-homework #32

SARSA

- [x] Implement SARSA update - [x] Test it for convergence with a known policy, i.e. our simple controller - [x] Implement full SARSA policy updates

manuelli updated 8 years ago
1
raharth/PyMatch #24

Implement SARSA

Implement the general SARSA algorithm according to the definition of Barto and Sutton

raharth updated 3 years ago
2
LARG/HFO #46

Compiling sarsa_agent

I would like to know how can I compile the high_level_sarsa_agent.cpp in the example directory. Are these files already compiled in some place, or do I have to do them manually?

JACKHAHA363 updated 7 years ago
2
hartikainen/easy21 #2

Sarsa lambda implementation

Hi @hartikainen, Thank you for the super cool repo 👍 I add one question regarding the Sarsa agent implementation. In the official pseudo-algorihtm of Sarsa lambda ([slide 29](https://www.davidsi…

Matyyas updated 4 years ago
2
kgex/developer-roadmap #490

Add SARSA Algorithm

DineshkumarS05 updated 1 year ago
2
The-Data-Alchemists-Manipal/MindWave #41

SARSA on Cartpole Problem

## 💥 Proposal Hey, I am GSSOC Contributor. I want to implement SARSA Algorithm on Cartpole problem. Kindly, assign to me.

siriarelli updated 1 year ago
1
accord-net/framework #1097

Sarsa not serializable

Unable to save Sarsa machines. Not marked as serializable.

krflol updated 6 years ago
4
LARG/HFO #43

Data re SARSA weights?

Is there any information available about the SARSA weights discussed in the main HFO paper? I'm not necessarily asking if the weights themselves are available, but more any statistical information tha…

drallensmith updated 7 years ago
4
luwo9/bomberman_rl #4

Strategies Overview

An overview over all techniques/strategies mentioned: 1. Policy exploration/exploitation - $\epsilon$-greedy - Softmax 2. Update Q function - SARSA - (k-step) temporal differen…

luwo9 updated 3 months ago
1

上一页 1...1 2 3 4 5 6 7...44 下一页

432 results for sarsa

432 results
for sarsa