ikostrikov / pytorch-a3c

PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
MIT License
1.23k stars 279 forks

GPU version of A3C algorithm? #44

Closed: bearpaw closed this issue 6 years ago

bearpaw commented 6 years ago

Hi,

Thanks for the code. I was wondering whether there is a GPU version of the A3C algorithm in PyTorch. I noticed that the Hogwild! example in PyTorch is CPU-only.

For your reference, I've found a GPU version of A3C in TensorFlow: https://github.com/caomw/icra2017-visual-navigation

ikostrikov commented 6 years ago

Hi,

A3C is designed specifically to be efficient on CPU.

A2C can be more efficient on GPU: https://github.com/ikostrikov/pytorch-a2c-ppo-acktr
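
To illustrate why the synchronous variant suits a GPU, here is a minimal sketch of a single A2C-style update. It is not taken from either repository. The network, the sizes (`NUM_ENVS`, `OBS_DIM`, `NUM_ACTIONS`), the loss coefficients, and the random placeholder data are all assumptions made for the example. The point is that observations from all environments are stacked into one batch, so the forward and backward passes each run once on the device instead of once per worker.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Hypothetical sizes, chosen only for illustration.
NUM_ENVS, OBS_DIM, NUM_ACTIONS = 16, 8, 4

# Use the GPU when one is available, otherwise fall back to the CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")


class ActorCritic(nn.Module):
    """Tiny actor-critic network with a shared body (illustrative only)."""

    def __init__(self, obs_dim, num_actions):
        super().__init__()
        self.body = nn.Linear(obs_dim, 64)
        self.policy_head = nn.Linear(64, num_actions)
        self.value_head = nn.Linear(64, 1)

    def forward(self, obs):
        hidden = torch.tanh(self.body(obs))
        # Returns action logits and one state-value estimate per observation.
        return self.policy_head(hidden), self.value_head(hidden).squeeze(-1)


def a2c_update(model, optimizer, obs, actions, returns,
               value_coef=0.5, entropy_coef=0.01):
    """Run one synchronous update on a batch gathered from all environments."""
    logits, values = model(obs)
    log_probs = F.log_softmax(logits, dim=-1)
    probs = log_probs.exp()

    # Advantage = observed return minus the critic's value estimate.
    advantages = returns - values
    chosen_log_probs = log_probs.gather(1, actions.unsqueeze(1)).squeeze(1)

    # The advantage is detached so the policy loss does not train the critic.
    policy_loss = -(chosen_log_probs * advantages.detach()).mean()
    value_loss = advantages.pow(2).mean()
    entropy = -(probs * log_probs).sum(dim=-1).mean()

    loss = policy_loss + value_coef * value_loss - entropy_coef * entropy
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()


torch.manual_seed(0)
model = ActorCritic(OBS_DIM, NUM_ACTIONS).to(device)
optimizer = torch.optim.RMSprop(model.parameters(), lr=7e-4)

# Random placeholder data standing in for one step from NUM_ENVS environments.
obs = torch.randn(NUM_ENVS, OBS_DIM, device=device)
actions = torch.randint(0, NUM_ACTIONS, (NUM_ENVS,), device=device)
returns = torch.randn(NUM_ENVS, device=device)

loss = a2c_update(model, optimizer, obs, actions, returns)
```

In a real training loop the placeholder tensors would be replaced by observations, sampled actions, and bootstrapped returns collected from the parallel environments. A3C, by contrast, has each worker process compute gradients on its own small rollout, which is why it maps naturally onto CPU cores.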

bearpaw commented 6 years ago

I got it. Thank you very much!