-
# Asynchronous Methods for Deep Reinforcement Learning #
- Author: Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, Koray Kavukcuo…
-
I am new to PyTorch. I just cloned your code and ran it, but got an error. I hope you can point me in the right direction to fix this issue.
More specifics:
1. Used a conda env with Python 3.6
…
-
Hi @rarilurelo,
can I ask if you have been able to modify your code to work with continuous actions, e.g. Pendulum or MountainCar? I tried to modify @ikostrikov's implementation, see here
https…
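Not from the thread, but for context: the usual change for continuous-action A3C is to replace the softmax policy head with a diagonal Gaussian, so the policy loss uses the Gaussian log-probability and its closed-form entropy. A minimal pure-Python sketch (function names are my own, not from any of the linked repos):

```python
import math

def gaussian_log_prob(action, mean, std):
    """log pi(a|s) for a 1-D Gaussian policy head, the usual
    replacement for softmax when actions are continuous
    (e.g. Pendulum's scalar torque)."""
    var = std ** 2
    return (-((action - mean) ** 2) / (2 * var)
            - math.log(std) - 0.5 * math.log(2 * math.pi))

def gaussian_entropy(std):
    """Closed-form entropy of the Gaussian policy; used as the
    exploration bonus in place of the categorical entropy."""
    return 0.5 * (1.0 + math.log(2 * math.pi)) + math.log(std)
```

The network then outputs `mean` (and typically a state-independent log-std) instead of action logits; everything else in the actor-critic update stays the same.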
-
This is quite nice!
There are several A3C PyTorch implementations for Atari.
Is it possible to do the same with this Truck environment?
Thank you.
-
Dear Hongzi,
sorry to bother you, but I ran into a few problems with the critic gradient when reproducing Pensieve in PyTorch.
In ```/sim/a3c.py```, you used the mean square error of **R_batc…
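For reference (my own sketch, not code from Pensieve's `/sim/a3c.py`): the standard critic objective is the mean squared error between the n-step return R and the predicted value V, whose gradient with respect to each prediction is easy to check by hand:

```python
def critic_loss_and_grad(values, returns):
    """Critic loss 0.5 * mean((R - V)^2) and its gradient
    with respect to the value predictions V."""
    n = len(values)
    loss = sum(0.5 * (R - v) ** 2 for v, R in zip(values, returns)) / n
    # d loss / d v_i = (v_i - R_i) / n
    grads = [(v - R) / n for v, R in zip(values, returns)]
    return loss, grads
```

Comparing a framework's autograd output against this analytic gradient is a quick way to localize critic-gradient discrepancies when porting between TensorFlow and PyTorch.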
-
Where are the multi-step loss function and the entropy loss function?
-
Hi, I am wondering why there is no break statement in the local_test in this script [https://github.com/uvipen/Super-mario-bros-A3C-pytorch/blob/master/src/process.py]. It seems like the testing loop …
-
Looking at a handful of A3C implementations and their results on Seaquest, they appear to score around 50K:
- https://gym.openai.com/evaluations/eval_pjjgc9POQJK4IuVw8nXlBw (ConvNet)
- https://gym.openai.com…
beniz updated 7 years ago
-
I can't see where the local gradients are ever reset. The values are overwritten by the global weights, but the optimizer `opt` is assigned to the global parameters, so won't this accumulate gradients …
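To illustrate the concern (a sketch with a stand-in `Param` class, not the repo's actual code): PyTorch's `.backward()` adds into `.grad`, so each worker step must zero the local gradients before backprop, then copy the fresh gradients onto the shared parameters that the shared optimizer updates:

```python
class Param:
    """Stand-in for a torch parameter: a value plus an accumulated grad."""
    def __init__(self, value):
        self.value = value
        self.grad = 0.0

def worker_update(local, shared, backprop):
    # Pull the latest shared weights into the local model.
    for lp, gp in zip(local, shared):
        lp.value = gp.value
    # Reset local grads before backprop; autograd otherwise ADDS
    # to whatever grad was left over from the previous step.
    for lp in local:
        lp.grad = 0.0
    backprop(local)  # fills lp.grad (stubbed by the caller here)
    # Hand the fresh grads to the shared params, which the
    # shared optimizer then applies.
    for lp, gp in zip(local, shared):
        gp.grad = lp.grad
```

If the zeroing step is missing, the gradients pushed to the shared model grow with every rollout instead of reflecting only the latest one.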
-
I ran Cart_Pole.py with A3C & A2C on Windows and got the following error.
Traceback (most recent call last):
File "D:/学习/Deep-Reinforcement-Learning-Algorithms-with-PyTorch-master/results/Cart_Pole.py",…