a3c-lstm Search Results

160 results
for a3c-lstm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

dgriff777/rl_a3c_pytorch #2

Can you please list out the difference between your code and…

As far as I can see, model hyperparameters are different. Thanks.

slowbull updated 7 years ago
14
deepchem/deepchem #639

GRU layer with RL

I want to use a TensorGraph GRU layer as part of a model for reinforcement learning. It's not clear how to make that work. I'm pretty sure it's going to need some modifications, but I'm not certain …

peastman updated 7 years ago
3
chainer/chainerrl #65

Examples for stateful q learning and a3c

Would it be possible to provide examples for stateful models for q learning and a3c?

kfeeeeee updated 7 years ago
3
pytorch/pytorch #1601

[Feature Request] Weight Normalization

[Layer normalization](https://arxiv.org/abs/1607.06450) seems to be pretty popular for RNNs nowadays, and it is worth having an implementation available. Several people seem to have already rolled the…

Kaixhin updated 7 years ago
11
ikostrikov/pytorch-a3c #5

How to modify code for continuous actions?

Hello :) I was wondering how to modify the code for continuous actions? So for example it could be compared with your naf implementation on openAI gym pendulum, `env = gym.envs.make("Pendulum-v0…

AjayTalati updated 7 years ago
5
awjuliani/DeepRL-Agents #13

A3C: Questions

Dear Juliani Excellent work! I would like to know for how long you trained the A3C? and Number of frames used? How do you find your your results compared to the original paper? (Denny code …

IbrahimSobh updated 7 years ago
1
NVlabs/GA3C #15

Why is pytorch-a3c implementation so much faster?

https://github.com/ikostrikov/pytorch-a3c has an implementation (CPU ONLY) that can converge PongDeterministic-v3 within 15 minutes while the GPU powered GA3C appears to take 2-3 hours to achieve the …

lolz0r updated 7 years ago
3
NVlabs/GA3C #22

Trying to compare this to universe-starter-agent (A3C)

Setting up openai/universe, I used the "universe starter agent" as a smoke test. After adjusting the number of workers to better utilize my CPU, I saw the default PongDeterministic-v3 start winnin…

nczempin updated 7 years ago
83
openai/universe-starter-agent #56

Trained Agent not performing as wel as TensorBoard claims...

I've been playing around with the example code for a few days now but I keep getting the same issue: eg for Pong, after about 3 hours of training, the TensorBoard global/episode_reward goes to about …

aiXander updated 7 years ago
5
muupan/async-rl #1

A3C LSTM

I should support A3C LSTM.

muupan updated 8 years ago
1

上一页 1...10 11 12 13 14 15 16...16 下一页

160 results for a3c-lstm

160 results
for a3c-lstm