sarsa Search Results - Githubissues

429 results
for sarsa

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ethan-oro/super-tic-tac-toe #4

next steps

our few next steps...

ethan-oro updated 5 years ago
4
mcmachado/b-pro #1

Packaging all the featurizers within a single library

In order to avoid code duplication, it might be useful if you could package all of you existing _featurizers_ within a single library. For the moment, I copied all of the following files within the …

pierrelux updated 5 years ago
6
google-deepmind/deepmind-research #187

[RL Unplugged] Is the data stored chronologically? Or how to…

Hi, Thanks for your nice work. I want to train an RNN agent with sequential data and I tested it with dm_control_suite. The paper mentioned that: "For sequence data, we also provide future states,…

zhaoyi11 updated 3 years ago
1
allentran/rl-l2t #5

Generate Policy gradients, wrt u, given sarsas, and Deep Q w…

allentran updated 9 years ago
1
google-deepmind/acme #38

Not enough documentation for EpisodeAdder

Having read the [docs](https://github.com/deepmind/acme/blob/master/docs/components.md) and the code for the [episode adder](https://github.com/deepmind/acme/blob/master/acme/adders/reverb/episode.py#…

drozzy updated 4 years ago
6
rlcode/reinforcement-learning #48

Failing to converge with increase in grid-size (Grid World)

If I increase both the HEIGHT and WIDTH from 5 to 10 keeping the obstacles and the final goal at the same position, Deep SARSA network doesn't seem to converge. What do you think is the problem? Shoul…

akileshbadrinaaraayanan updated 7 years ago
5
szcf-weiya/RLnotes #3

Cliff Walking

Example 6.6 of Sutton's book, ![image](https://user-images.githubusercontent.com/13688320/86022630-36108a80-ba5d-11ea-847e-e67d75b38f1d.png) ![image](https://user-images.githubusercontent.com/136883…

szcf-weiya updated 4 years ago
1
LucasAlegre/sumo-rl #155

Issue when running a custom environment

Hello, I am running your sarsa_resco.py code with a custom environment, representing downtown Athens so around 30-40 traffic lights. When I added the custom environment to the simulation, I get the…

Mehdi-Inane updated 1 year ago
4
itu-square/symsim #76

A Policy abstraction [generalized exploration parameter, ext…

Russel & Norvig p. 842, Fig 21.8 (p.844), all refs to 3rd edition, use a generalized exploration function, which allows for the agent to decrease or stop exploration over time. They define a function…

wasowski updated 2 years ago
1
rihardsk/rl-glue-ext #68

Create a page for "Mines"

``` Create a page for the Mines environment, give a little background, and then link to it from everywhere to give it some context. ``` Original issue reported on code.google.com by `brian.ta...@gmai…

GoogleCodeExporter updated 9 years ago
2

上一页 1...2 3 4 5 6 7 8...43 下一页

429 results for sarsa

429 results
for sarsa