Closed by ghost 7 years ago
The problem is caused by a type difference between the data actually fed in and the type that is expected.
I think you may be using an older commit.
My latest commit solves the problem, so you should run git pull.
Then run python run_a3c.py --atari (Breakout by default) or python run_a3c.py --env CartPole-v0.
If you still have problems, please ask.
reinforce() is defined in torch/autograd/variable.py#198.
In your environment, I think it is probably at /home/ajay/anaconda3/envs/pyphi/lib/python3.6/site-packages/torch/autograd/variable.py#198.
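If the path differs on your machine, Python can report where a function is defined. The following is a minimal sketch using the standard inspect module; where_defined is a hypothetical helper name, and it is demonstrated on json.dumps only so that it runs without PyTorch installed.

```python
import inspect
import json


def where_defined(obj):
    """Return (source file, first line number) for a Python-level function or class.

    `where_defined` is a hypothetical helper, not part of any library.
    """
    path = inspect.getsourcefile(obj)
    _, first_line = inspect.getsourcelines(obj)
    return path, first_line


# Demonstrated on a standard-library function so the snippet runs without PyTorch.
path, line = where_defined(json.dumps)
print(f"{path}#{line}")
```

The same call should work on a method such as a.reinforce from the question, assuming that method is implemented in Python rather than in a compiled extension.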
REINFORCE is a technique for propagating gradients backward through nodes that are not differentiable because they perform stochastic sampling.
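To make the idea concrete, here is a minimal pure-Python sketch of the REINFORCE (score-function) estimator. It is not the code from this repository or from PyTorch: the Bernoulli policy, the reward function, and the names reinforce_gradient and reward are assumptions chosen for illustration. The sampled action cannot be differentiated, so the gradient of the expected reward is estimated from (reward - baseline) times the gradient of the log-probability of the sampled action.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def reinforce_gradient(theta, reward_fn, baseline=0.0, n_samples=20000, seed=0):
    """Score-function (REINFORCE) estimate of d E[reward] / d theta.

    Assumed toy policy: Bernoulli with P(a = 1) = sigmoid(theta).
    Sampling `a` is not differentiable, so the estimator uses
    d log pi(a) / d theta = a - p instead of differentiating the sample.
    """
    rng = random.Random(seed)
    p = sigmoid(theta)
    total = 0.0
    for _ in range(n_samples):
        a = 1 if rng.random() < p else 0   # stochastic, non-differentiable node
        score = a - p                      # d log pi(a) / d theta
        total += (reward_fn(a) - baseline) * score
    return total / n_samples


def reward(a):
    # Hypothetical reward: 1 for action 1, 0 for action 0.
    return float(a)


estimate = reinforce_gradient(theta=0.0, reward_fn=reward)
# Here E[reward] = p, so the exact gradient is p * (1 - p).
exact = sigmoid(0.0) * (1.0 - sigmoid(0.0))
print(estimate, exact)
```

The call quoted in this thread, a.reinforce(r - v.data.squeeze()), appears to pass a reward minus a value estimate, which plays the role of (reward - baseline) here. Subtracting a baseline leaves the expected gradient unchanged and usually lowers its variance.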
Hi, thank you very much for your help - it works fine now.
I like your implementation of AsyncRMSprop, it's very cool :+1:
Hi, thanks for releasing this project.
I get the following error when running run_a3c.py.
Also, can you tell me where reinforce() is defined? It's called on line 80 as a.reinforce(r - v.data.squeeze()).
Thanks for your help.