-
Browsing the code, I can't help but notice that there is no synchronization among workers, i.e., no Lock mechanism to coordinate the updating of shared_model by different workers. Is this how the "ho…
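
Lock-free updates of a shared parameter set are commonly called Hogwild-style training. The sketch below only illustrates the idea and is not taken from the repository: plain Python threads stand in for the worker processes, a one-element list stands in for shared_model, and the names `worker` and `train_lock_free` are hypothetical. Each worker reads and writes the shared parameter without taking any lock, so individual updates may be overwritten, yet the parameter still drifts toward the minimum.

```python
import threading


def worker(shared, target, lr, steps):
    """Run SGD on (w - target)^2 against a shared parameter, with no lock."""
    for _ in range(steps):
        w = shared[0]                # unsynchronized read
        grad = 2.0 * (w - target)    # gradient of (w - target)^2
        shared[0] = w - lr * grad    # unsynchronized write; may clobber another worker


def train_lock_free(num_workers=4, steps=200, lr=0.1, target=3.0):
    shared = [0.0]  # hypothetical stand-in for the shared model's parameters
    threads = [
        threading.Thread(target=worker, args=(shared, target, lr, steps))
        for _ in range(num_workers)
    ]
    for th in threads:
        th.start()
    for th in threads:
        th.join()
    return shared[0]


final_w = train_lock_free()
print("final parameter:", final_w)
```

Because no lock is held, the exact final value depends on how the workers interleave; only the general movement toward `target` is expected.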
-
I can run your DQN code under PyTorch v0.2, but the A3C code does not seem to run correctly. It always logs:
worker xxx exited unexpectedly
worker xxx restarted
xxx is the different proces…
-
File "main.py", line 55, in <module>
env.observation_space.shape[0], env.action_space)
File "/home/bruce/Downloads/pytorch-a3c-master/pytorch-a3c-master/model.py", line 47, in __init__
self.acto…
-
First off, terrific work on the repo and blog post, very detailed and clear.
I was able to solve BipedalWalkerHardcore-v2, averaging 300+ over 100 episodes, with an A3C implementation I made, but it t…
-
I noticed that in your player_util.py action_train function:
```
if self.done:
    if self.gpu_id >= 0:
        with torch.cuda.device(self.gpu_id):
            self.cx = Variable(torch.z…
```
-
I am very interested in your pytorch-a3c because of its compactness and very simple structure.
I tried to follow your excellent work, but I have not been able to run it successfully after struggling for more than a month.…
-
### System information
- **OS Platform and Distribution (e.g., Linux Ubuntu 16.04)**: macOS 10.13.3
- **Ray installed from (source or binary)**: source
- **Ray version**: cloned from repo on Ma…
-
Hello Ilya,
first, thanks for your amazing work. I have one question about how you designed the A3C training.
What you basically do is play N steps and store all the …
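
A common way to turn an N-step rollout into training targets is to walk the stored rewards backwards and accumulate discounted returns, bootstrapping from a value estimate for the state after the last step. The sketch below is generic and not necessarily what the repository does; `n_step_returns`, `bootstrap_value`, and the sample numbers are assumptions chosen for illustration.

```python
def n_step_returns(rewards, bootstrap_value, gamma):
    """Discounted returns for one rollout, computed back to front.

    bootstrap_value is the value estimate for the state after the last
    stored step (0.0 if the episode ended there).
    """
    returns = [0.0] * len(rewards)
    running = bootstrap_value
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


# Three stored steps with reward 1.0 each, episode ended, gamma = 0.5.
example_returns = n_step_returns([1.0, 1.0, 1.0], bootstrap_value=0.0, gamma=0.5)
print(example_returns)
```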
-
#118 is keras-rl dead
@ViktorM
I think Keras-RL is one of the best Keras libraries around and is brilliantly structured. Most of the code I've read is stand-alone, i.e., the researcher implements …
-
I found that in the original [GAE paper](https://arxiv.org/abs/1506.02438),
eq. 16 reads
A_{t}^{GAE} = \sum_{l=0}^{\infty} (\gamma \tau)^{l} \delta_{t+l}^{V}
However, in the code the advantage looks li…
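
For comparison, eq. 16 truncated to a finite rollout can be evaluated either as the direct sum or with a backward recursion, gae_t = delta_t + gamma * tau * gae_{t+1}, and the two agree. The sketch below is a minimal illustration under stated assumptions: `tau` plays the role of the paper's λ, `values` holds one extra entry for the bootstrap value after the last step, and the function names and sample numbers are hypothetical rather than taken from the repository.

```python
def td_residuals(rewards, values, gamma):
    """delta_t = r_t + gamma * V(s_{t+1}) - V(s_t)."""
    return [
        rewards[t] + gamma * values[t + 1] - values[t]
        for t in range(len(rewards))
    ]


def gae_direct(rewards, values, gamma, tau):
    """Eq. 16 with the infinite sum cut off at the end of the rollout."""
    deltas = td_residuals(rewards, values, gamma)
    horizon = len(deltas)
    return [
        sum((gamma * tau) ** l * deltas[t + l] for l in range(horizon - t))
        for t in range(horizon)
    ]


def gae_backward(rewards, values, gamma, tau):
    """Same quantity via the recursion gae_t = delta_t + gamma * tau * gae_{t+1}."""
    deltas = td_residuals(rewards, values, gamma)
    advantages = [0.0] * len(deltas)
    gae = 0.0
    for t in reversed(range(len(deltas))):
        gae = deltas[t] + gamma * tau * gae
        advantages[t] = gae
    return advantages


rewards = [1.0, 0.0, 2.0]
values = [0.5, 0.4, 0.3, 0.0]  # last entry is the bootstrap value
direct = gae_direct(rewards, values, gamma=0.9, tau=0.95)
backward = gae_backward(rewards, values, gamma=0.9, tau=0.95)
print(direct)
print(backward)
```

With `tau = 0` both functions reduce to the one-step TD residual, which is a quick sanity check when comparing against an existing implementation.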