a3c-lstm Search Results

160 results
for a3c-lstm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

chainer/chainerrl #158

test steps is too large or infinite loop on my machine

Hi, I could not finish tests, even though it took over 3 hours. Could you tell me what is my bad? It seems that `test_pcl.TestPCL.test_abc_discrete` is stacking. My guess is `steps` for `_test…

ando-takahiro updated 6 years ago
1
hongzimao/pensieve #4

a few questions about the math

1) The training law of the Actor network (Eq.2) uses the gradient of the network times the reward difference A, to evolve the model of the neural network Is the term "detla_theda log pi_theda (s_t,…

shenyueshi updated 7 years ago
1
mihahauke/deep_rl_vizdoom #5

How can I test the model trained with A3C?

Hi @ebonyclock , I wanna train an agent in health_grathering scenario with A3C algorithm, to train it `python3 train_a3c.py -s settings/health_gathering.yml` with `a3c_defaults.yml` settings. …

GoingMyWay updated 7 years ago
2
rlcode/reinforcement-learning #52

Convergence

How many days/episodes did it take until it converged in breakout_a3c? Did you try using LSTM for faster convergence?

ShaniGam updated 7 years ago
5
tensorforce/tensorforce #135

Add optional batch normalization layer

Batch normalization layer ([paper](https://arxiv.org/pdf/1502.03167.pdf)) is widely used when training deep networks. It appears that batch normalization make the network learning faster and generaliz…

0xSSoul updated 7 years ago
9
ikostrikov/pytorch-a3c #33

What is the purpose of `os.environ['OMP_NUM_THREADS'] = '1'`…

I wonder why `os.environ['OMP_NUM_THREADS'] = '1'` is used in the `main` method: https://github.com/ikostrikov/pytorch-a3c/blob/master/main.py#L43. I ran a demo about CartPole-v0 using openai gym w…

xmfbit updated 7 years ago
1
Kaixhin/NoisyNet-A3C #6

Curious about the values of the sigma

Hi @Kaixhin I wonder if the sigma in NoisyNet-A3C will fast shrink to nearly zero or not. since if the values of the sigma is nearly zero, no exploration will be done by the agent. The reason why …

andrewliao11 updated 7 years ago
1
openai/universe-starter-agent #72

Performance on other Atari games

Hi, Pong is a good sanity check. Has anyone tried/adopted the code (A3C-LSTM) on other Atari games like BreakoutDeterministic-v3 and SpaceInvadersDeterministic-v3, and managed to get average scores 50…

pengsun updated 7 years ago
1
miyosuda/async_deep_reinforce #32

Variable net_-1/BasicLSTMCell/Linear/Matrix does not exist, …

Hi @miyosuda , Thank you for sharing the code. When I tried to run the code, I came across some problem. ''' Traceback (most recent call last): File "a3c.py", line 50, in global_network =…

nanxintin updated 7 years ago
3
ikostrikov/pytorch-a3c #11

Possible memory leak?

Training Breakout goes ok but, memory usage exceeds 25gb after 4 hours of training on 16 cpu cores. I wonder if it's related to sharing memory between processes. I run Python 3.5 on scientific lin…

scientist1642 updated 7 years ago
14

上一页 1...10 11 12 13 14 15 16...16 下一页

160 results for a3c-lstm

160 results
for a3c-lstm