-
If you're unfamiliar with eligibility traces, they basically unify temporal-difference learning with Monte Carlo methods -- essentially you hold a buffer in memory of an agent's experience and perform…
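To make the idea concrete, here is a minimal tabular TD(λ) sketch with an accumulating eligibility trace. It assumes integer states and an old-style gym env interface; all names are illustrative, not from the thread. With `lam=0` it reduces to one-step TD, and with `lam=1` credit is spread over the whole episode, Monte Carlo style:

```python
import numpy as np

def td_lambda_episode(env, V, alpha=0.1, gamma=0.99, lam=0.9):
    """Run one episode of tabular TD(lambda) with accumulating traces."""
    z = np.zeros_like(V)                   # eligibility trace per state
    s = env.reset()
    done = False
    while not done:
        a = env.action_space.sample()      # stand-in for a real policy
        s2, r, done, _ = env.step(a)
        delta = r + gamma * V[s2] * (not done) - V[s]   # TD error
        z[s] += 1.0                        # mark the visited state
        V += alpha * delta * z             # credit all recently visited states
        z *= gamma * lam                   # decay: lam=0 -> TD(0), lam=1 -> MC-like
        s = s2
    return V
```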
-
![auth](https://user-images.githubusercontent.com/8428372/46090268-5279f600-c1eb-11e8-99ef-1494b9f77653.png)
![body](https://user-images.githubusercontent.com/8428372/46090269-5279f600-c1eb-11e8-95a9…
-
@berndbischl @markdumke I think we do not have that much time to wait for a perfect design, so I would suggest:
- I help Markus to finish the first version (at least we wrote those class designs tog…
-
Hello everyone, I am developing a custom RL-based scheduler for TSCH. I took inspiration from Orchestra and ALICE to differentiate between EB packets and RPL packets, but the link selector I use in my case causes que…
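For readers unfamiliar with that setup, here is a schematic of the Orchestra-style split between EB traffic and RPL/unicast traffic (plain Python for illustration only, not Contiki-NG code; the slotframe lengths, names, and hash rule are all assumptions):

```python
# EBs go to a shared broadcast slotframe; unicast/RPL traffic is mapped onto
# a per-neighbor slotframe. Coprime slotframe lengths help avoid persistent
# collisions between the two schedules.
EB_SLOTFRAME_LEN = 397
UNICAST_SLOTFRAME_LEN = 17

def select_slot(packet_type, node_id, neighbor_id):
    if packet_type == 'EB':
        # each node advertises in a slot derived from its own ID
        return ('eb_slotframe', node_id % EB_SLOTFRAME_LEN)
    # RPL / unicast: receiver-based slot, as in Orchestra's receiver-based rule
    return ('unicast_slotframe', neighbor_id % UNICAST_SLOTFRAME_LEN)
```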
-
RL has been added to the original ConvNetJS.
Will you be adding that too?
Any plans for LSTM?
Thanks
-
I have been working through the Agents vignette (https://cran.rstudio.com/web/packages/reinforcelearn/vignettes/agents.html) and found a few errors.
1. In the "Value Functions" section, halfway …
-
Not sure if you are interested, but I have written a tutorial for building a basic agent:
https://medium.com/@skjb/building-a-basic-pysc2-agent-b109cde1477c
https://medium.com/@skjb/building-a-smar…
-
Given this code that's similar to https://github.com/thomasdeneux/param_qt/blob/master/test_ParamQt.py#L98:
```
class PosPar(pm.Parameterized):
    shape = GObjectSelector('star',
…
```
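For context, here is a minimal self-contained sketch of the same pattern using plain param's standard `param.ObjectSelector` in place of `GObjectSelector` (which is specific to param_qt); the extra parameters and values are illustrative assumptions:

```python
import param as pm

class PosPar(pm.Parameterized):
    # selector parameter with a default and a fixed set of allowed choices
    shape = pm.ObjectSelector(default='star', objects=['star', 'circle', 'square'])
    # numeric position parameters with bounds
    x = pm.Number(default=0.0, bounds=(-10.0, 10.0))
    y = pm.Number(default=0.0, bounds=(-10.0, 10.0))

p = PosPar()
p.shape = 'circle'   # assignment is validated against the declared objects
print(p.shape, p.x, p.y)
```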
-
Hi,
I'm trying to integrate a model-based RL algorithm into the d3rlpy library but couldn't find any documentation on where and how to begin. Would really appreciate it if you could please point me w…
-
Hi, it is me again (hhh)
When I run the example like this:
```
import d3rlpy
# prepare dataset
dataset, env = d3rlpy.datasets.get_d4rl('hopper-medium-v0')
# prepare algorithm
cql = d3rlpy.alg…
```
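For reference, a runnable version of this quickstart under the d3rlpy 1.x-era API (the era whose `get_d4rl` helper the snippet uses); the continuation of the truncated line, the algorithm class, and the hyperparameters are my assumptions, not confirmed by the issue:

```python
import d3rlpy

# prepare dataset (requires the d4rl package and its MuJoCo dependencies)
dataset, env = d3rlpy.datasets.get_d4rl('hopper-medium-v0')

# prepare algorithm -- assumed completion of the truncated line above
cql = d3rlpy.algos.CQL(use_gpu=False)

# train offline on the logged dataset; n_epochs is illustrative
cql.fit(dataset, n_epochs=10)
```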