-
Your model has 4 CNN layers and max pooling.
1) DQN (2015) used only 3 CNN layers without pooling
2) A3C (2016) used only 2 CNN layers without pooling
Questions:
1) Don't you think pooling a…
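
For reference, the spatial arithmetic of the two pooling-free stacks mentioned above can be sketched as follows. The 84x84 input and the kernel/stride pairs are assumptions recalled from the published DQN and A3C papers; they are not stated in this issue and are not taken from this repository's model.

```python
def conv_out(size, kernel, stride, padding=0):
    """Output side length of a square convolution (floor division, no dilation)."""
    return (size + 2 * padding - kernel) // stride + 1


def stack_out(size, layers):
    """Apply a list of (kernel, stride) conv layers to an input side length."""
    for kernel, stride in layers:
        size = conv_out(size, kernel, stride)
    return size


# Assumed (kernel, stride) pairs, recalled from the papers, unpadded.
dqn_2015 = [(8, 4), (4, 2), (3, 1)]
a3c_2016 = [(8, 4), (4, 2)]

dqn_side = stack_out(84, dqn_2015)  # 84 -> 20 -> 9 -> 7
a3c_side = stack_out(84, a3c_2016)  # 84 -> 20 -> 9
print(dqn_side, a3c_side)
```

Under these assumptions the strided convolutions already do the downsampling, which is presumably why neither architecture needed a pooling layer.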
-
Are lines 41 and 42 in this file there to detach the previous states?
https://github.com/dgriff777/rl_a3c_pytorch/blob/master/train.py
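
To illustrate what "detaching the previous states" usually means for a recurrent policy, here is a minimal sketch. It is not the code from the linked `train.py`; the `LSTMCell` sizes and the `run_segment` helper are hypothetical and chosen only for illustration.

```python
import torch
import torch.nn as nn

# Hypothetical sizes, for illustration only.
lstm = nn.LSTMCell(input_size=4, hidden_size=8)
hx = torch.zeros(1, 8)
cx = torch.zeros(1, 8)


def run_segment(hx, cx, steps=3):
    """Unroll the cell for a few steps; return a scalar loss and the final state."""
    loss = torch.zeros(())
    for _ in range(steps):
        hx, cx = lstm(torch.randn(1, 4), (hx, cx))
        loss = loss + hx.pow(2).sum()
    return loss, hx, cx


# First rollout segment: backward() frees this segment's graph.
loss, hx, cx = run_segment(hx, cx)
loss.backward()

# Keep the state values but drop their autograd history, so the next
# segment does not try to backpropagate into the freed graph.
hx, cx = hx.detach(), cx.detach()
detached_ok = hx.grad_fn is None and cx.grad_fn is None

# Second segment starts from the detached state and backpropagates cleanly.
lstm.zero_grad()
loss2, hx, cx = run_segment(hx, cx)
loss2.backward()
```

Without the `detach()` calls, the second `backward()` would reach into the first segment's graph, which has already been freed, and raise a `RuntimeError`.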
-
This is really great work! I have some questions about copying local gradients. In train.py, what is the purpose of adding the condition:
`if shared_param.grad is not None`
If I understand correctly…
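
As context for the question, the pattern commonly seen in PyTorch A3C implementations looks roughly like the sketch below. This is an assumption about the general pattern, not a copy of this repository's `train.py`; the function name `ensure_shared_grads` and the tiny `nn.Linear` models are illustrative.

```python
import torch
import torch.nn as nn


def ensure_shared_grads(model, shared_model):
    """Point the shared model's .grad at the local model's gradients, once."""
    for param, shared_param in zip(model.parameters(), shared_model.parameters()):
        if shared_param.grad is not None:
            # Already linked in this process: nothing more to do.
            return
        shared_param.grad = param.grad


# Illustrative stand-ins for the shared and per-worker networks.
shared_model = nn.Linear(3, 1)
local_model = nn.Linear(3, 1)
local_model.load_state_dict(shared_model.state_dict())

local_model(torch.ones(1, 3)).sum().backward()
ensure_shared_grads(local_model, shared_model)

# After the first call, both models reference the same gradient storage.
linked = all(
    sp.grad.data_ptr() == p.grad.data_ptr()
    for p, sp in zip(local_model.parameters(), shared_model.parameters())
)

# A second call hits the early return and changes nothing.
ensure_shared_grads(local_model, shared_model)
```

In this sketch the check only makes the assignment happen on the first call in a given process; whether that is the author's intent in the actual file is exactly what the question asks.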
-
Now that deepchem 1.2 is almost ready to push out (#668), it's a good time to start planning what to put into deepchem 1.3. Here are a few features I'd like to see in deepchem 1.3:
- PyTorch s…
-
Hi,
I've been trying to implement A3C on the roboschool environment with the PyTorch library, but I get an error as soon as I import the library. Here's a simpler version of my code:
http…
-
I ran the code with `python main.py` and got this error:
```
torch/nn/modules/linear.py", line 42, in __init__
self.reset_parameters()
File "/Users/Tiger/projects/NoisyNet-A3C/model.py",…
```
-
I wonder why `os.environ['OMP_NUM_THREADS'] = '1'` is used in the `main` method: https://github.com/ikostrikov/pytorch-a3c/blob/master/main.py#L43.
I ran a demo about CartPole-v0 using openai gym w…
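
One likely reason, offered as an assumption rather than the author's stated intent: when many worker processes each start their own OpenMP thread pool, the machine can end up oversubscribed. A minimal sketch of setting the variable so that child processes inherit it (the child command here is purely illustrative):

```python
import os
import subprocess
import sys

# Set this before importing numerical libraries, since OpenMP runtimes
# typically read the variable once when they initialise.
os.environ["OMP_NUM_THREADS"] = "1"

# Child processes inherit the parent's environment, so each worker
# would see the same single-thread setting.
child_value = subprocess.run(
    [sys.executable, "-c", "import os; print(os.environ['OMP_NUM_THREADS'])"],
    capture_output=True,
    text=True,
    check=True,
).stdout.strip()
print(child_value)
```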
-
I'm planning on writing a lightweight extension to PyTorch. The goal is to speed up my research in 2 ways:
1) Testing many models should be easy, and it should be easy to keep track of the parameters …
-
I tried to evaluate your trained model, but it seems to have no effect on the result:
```
2017-08-01 21:08:13,757 : reward sum: -21.0, reward mean: -21.0000
[2017-08-01 21:08:13,757] reward sum: -21.0, reward mean: …
```
-
Training Breakout goes OK, but memory usage exceeds 25 GB after 4 hours of training on 16 CPU cores.
I wonder if it's related to sharing memory between processes.
I run Python 3.5 on scientific lin…