a3c-gpu Search Results - Githubissues

222 results
for a3c-gpu

ikostrikov/pytorch-a3c #42

Mixture of model prediction and update

Hello Ilya, first, thanks for your amazing work. I have one question about how you designed the A3C training. What you basically do is play N steps and store all the …

dohnala updated 6 years ago
4
xiaowei-hu/pysc2-agents #3

System resources issues

Hi @xhujoy, thanks for creating and sharing this code, I immensely appreciate it! I have some questions though. On what system have you been running the simulations? I run it on a desktop with …

avolny updated 6 years ago
1
ray-project/ray #1486

[rllib] Jenkins test failure "cannot import name spaces".

the following test failure seems to have started recently (and is deterministic?) in ``` docker run --rm --shm-size=10G --memory=10G 5856c64c69ea8839954e3e0384a95122d118d54439be82b30d22dacfae013a…

robertnishihara updated 6 years ago
3
ray-project/ray #1090

[rllib] a3c with pysc2, OOM

Env: Ubuntu 16.04, python3.5, GTX1070. I modified the a3c in rllib to run with pysc2, but it crashes on startup and prints the following error message: 2017-10-09 10:23:48.923784: I tensorflow/core/commo…

linshiyx updated 7 years ago
3
tensorflow/tensorflow #6360

Locking mechanisms

Especially when integrating TensorFlow into an existing multi-threaded application, it's not always easy to use queues for synchronization. Currently, we must use Python locks to lock the `sess.run(...…

danijar updated 6 years ago
47
MatheusMRFM/A3C-LSTM-with-Tensorflow #3

Problems with training

Hi. Firstly, really nice code. It helped me to understand A3C fundamentals. However, I do struggle to get it to converge. I tried at least four different implementations and most of them are having …

Palkos83 updated 6 years ago
5
Kaixhin/ACER #4

batch_size for off-policy learning

Hey, in the paper for each off-policy learning only one trajectory is sampled, while here you use 16. For the low-level input this won't be too much slower but for higher dimensions this might be an …

jingweiz updated 7 years ago
4
chainer/chainerrl #158

test steps is too large or infinite loop on my machine

Hi, I could not finish the tests, even after letting them run for over 3 hours. Could you tell me what I am doing wrong? It seems that `test_pcl.TestPCL.test_abc_discrete` is getting stuck. My guess is `steps` for `_test…

ando-takahiro updated 6 years ago
1
mihahauke/deep_rl_vizdoom #5

How can I test the model trained with A3C?

Hi @ebonyclock, I want to train an agent in the health_gathering scenario with the A3C algorithm. To train it, I run `python3 train_a3c.py -s settings/health_gathering.yml` with the `a3c_defaults.yml` settings. …

GoingMyWay updated 7 years ago
2
rlcode/reinforcement-learning #52

Convergence

How many days/episodes did it take until it converged in breakout_a3c? Did you try using LSTM for faster convergence?

ShaniGam updated 7 years ago
5
