-
Hello Ilya,
first, thanks for your amazing work. I have one question about how you designed the A3C training.
What you basically do is play N steps and store all the …
-
Hi @xhujoy,
thanks for creating and sharing this code, I immensely appreciate it!
I have some questions though. On what system have you been running the simulations?
I run it on a desktop with …
-
The following test failure seems to have started recently (and is deterministic?) in
```
docker run --rm --shm-size=10G --memory=10G 5856c64c69ea8839954e3e0384a95122d118d54439be82b30d22dacfae013a…
```
-
env: Ubuntu 16.04, python3.5, GTX1070
I modified the A3C implementation in rllib to run with pysc2, but it crashes on startup and prints the following error message.
2017-10-09 10:23:48.923784: I tensorflow/core/commo…
-
Especially when integrating TensorFlow into an existing multi-threaded application, it's not always easy to use queues for synchronization. Currently, we must use Python locks to lock the `sess.run(...…
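For illustration, the locking pattern described above might look like the following minimal sketch. `StandInSession`, `LockedRunner`, and `run_from_threads` are hypothetical names introduced here, and the stand-in object is not TensorFlow's API; only the `threading.Lock` usage is the point.

```python
import threading


class StandInSession:
    """Hypothetical stand-in for a session object; NOT TensorFlow's API."""

    def __init__(self):
        self.calls = 0

    def run(self, value):
        # Unsynchronized read-modify-write on shared state,
        # i.e. the kind of access a lock is meant to protect.
        current = self.calls
        self.calls = current + 1
        return value * 2


class LockedRunner:
    """Serializes every run() call behind a single Python lock."""

    def __init__(self, session):
        self._session = session
        self._lock = threading.Lock()

    def run(self, value):
        with self._lock:  # only one thread at a time reaches session.run()
            return self._session.run(value)


def run_from_threads(num_threads=8, calls_per_thread=1000):
    """Call the locked runner from several threads and report the call count."""
    session = StandInSession()
    runner = LockedRunner(session)
    results = []
    results_lock = threading.Lock()

    def worker(thread_id):
        out = None
        for _ in range(calls_per_thread):
            out = runner.run(thread_id)
        with results_lock:
            results.append(out)

    threads = [threading.Thread(target=worker, args=(i,)) for i in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return session.calls, sorted(results)
```

Because every call goes through the same lock, the calls are fully serialized, which is the cost this pattern pays for not using queues.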
-
Hi
Firstly, really nice code. It helped me understand the A3C fundamentals. However, I struggle to get it to converge. I tried at least four different implementations and most of them are having …
-
Hey,
in the paper, only one trajectory is sampled for each off-policy learning step, while here you use 16. For the low-level input this won't be much slower, but for higher dimensions this might be an …
-
Hi,
I could not finish the tests, even after more than 3 hours. Could you tell me what I am doing wrong?
It seems that `test_pcl.TestPCL.test_abc_discrete` is getting stuck. My guess is `steps` for `_test…
-
Hi @ebonyclock, I want to train an agent in the health_gathering scenario with the A3C algorithm. To train it, I run
`python3 train_a3c.py -s settings/health_gathering.yml`
with `a3c_defaults.yml` settings.
…
-
How many days/episodes did it take until it converged in breakout_a3c? Did you try using an LSTM for faster convergence?