ikostrikov / pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
MIT License

Look for cuda tensors in step #209

Open rcorbish opened 5 years ago

rcorbish commented 5 years ago

In VecPyTorch::step_async, the input is checked for a Tensor (for discrete actions). That check should include CUDA tensors as well.

e.g. if isinstance(actions, (torch.LongTensor, torch.cuda.LongTensor)):
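
A device-independent variant of the same check could look like the sketch below. It is not the repository's code: the helper name is_discrete_action_tensor is hypothetical, and the sketch assumes discrete actions arrive as int64 (long) tensors. Comparing the dtype instead of the legacy type classes should match both torch.LongTensor and torch.cuda.LongTensor without naming either.

```python
import torch


def is_discrete_action_tensor(actions):
    """Hypothetical helper: True for an int64 tensor on any device.

    Assumption: discrete actions are stored as torch.long tensors,
    whether they live on the CPU or on a CUDA device.
    """
    # torch.is_tensor rejects lists, numpy arrays, and other non-tensors.
    # The dtype comparison ignores the device the tensor is stored on.
    return torch.is_tensor(actions) and actions.dtype == torch.long


if __name__ == "__main__":
    cpu_actions = torch.zeros(4, 1, dtype=torch.long)
    print(is_discrete_action_tensor(cpu_actions))

    # Only exercised when a CUDA device is actually available.
    if torch.cuda.is_available():
        print(is_discrete_action_tensor(cpu_actions.cuda()))
```

The isinstance form proposed above is the smaller change to the existing line; the dtype comparison is an alternative that does not depend on the per-device tensor type names.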