nikhilbarhate99 / PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
MIT License

Convolutional? #51

Closed Bobingstern closed 2 years ago

Bobingstern commented 2 years ago

I want to use this for Atari games, but I'm unsure how to change it to use CNN layers. Can I simply change the actor and critic models to use Conv2d layers, or do I also need to change the replay buffer to handle multi-dimensional arrays?

nikhilbarhate99 commented 2 years ago

You might have to (not sure) change the buffer to handle the images and add Conv2d layers in the actor and critic models. Also, getting RL algorithms that have not been tested on Atari to train on Atari takes quite a bit of work.
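As a rough sketch (not part of this repo), a CNN-based actor-critic for image observations might look something like the following. The layer sizes are the common "Nature DQN" convolutional stack and assume 84x84 stacked grayscale frames; the class and method names are illustrative only:

```python
import torch
import torch.nn as nn
from torch.distributions import Categorical

class CnnActorCritic(nn.Module):
    """Hypothetical CNN actor-critic for image observations (illustrative sketch)."""

    def __init__(self, in_channels: int, action_dim: int):
        super().__init__()
        # Shared convolutional encoder; sizes follow the common "Nature DQN"
        # stack and assume 84x84 input frames.
        self.encoder = nn.Sequential(
            nn.Conv2d(in_channels, 32, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.Conv2d(64, 64, kernel_size=3, stride=1), nn.ReLU(),
            nn.Flatten(),
            nn.Linear(64 * 7 * 7, 512), nn.ReLU(),
        )
        self.actor = nn.Sequential(nn.Linear(512, action_dim), nn.Softmax(dim=-1))
        self.critic = nn.Linear(512, 1)

    def act(self, state):
        # state: (batch, in_channels, 84, 84) float tensor scaled to [0, 1]
        features = self.encoder(state)
        dist = Categorical(self.actor(features))
        action = dist.sample()
        return action, dist.log_prob(action), self.critic(features)
```

The buffer would then store image tensors of shape `(in_channels, 84, 84)` instead of flat state vectors, so whatever stacks the stored states into a batch needs to preserve that shape.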

You can still first try it on a simple Atari game like Pong, or on a similar but lighter env like MinAtar.

You can also use tricks like passing the difference of consecutive observation images to the model; refer to Andrej Karpathy's blog post on deep RL for Pong. Although he implements everything from scratch, you can use PyTorch and apply the same tricks.
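A minimal sketch of that frame-difference trick, loosely following the Pong preprocessing in Karpathy's post (the crop offsets and background pixel values are assumptions specific to Pong, not something from this repo):

```python
import numpy as np

def preprocess(frame):
    # Hypothetical Pong preprocessing: crop the playing field, downsample by 2,
    # and binarize to an 80x80 single-channel image.
    frame = frame[35:195]              # crop out the scoreboard
    frame = frame[::2, ::2, 0].copy()  # downsample, keep one color channel
    frame[frame == 144] = 0            # erase background (type 1)
    frame[frame == 109] = 0            # erase background (type 2)
    frame[frame != 0] = 1              # paddles and ball become 1
    return frame.astype(np.float32)

class DiffObservation:
    """Returns the difference of consecutive preprocessed frames,
    so a feedforward policy can still perceive motion."""

    def __init__(self):
        self.prev = None

    def __call__(self, frame):
        cur = preprocess(frame)
        obs = cur - self.prev if self.prev is not None else np.zeros_like(cur)
        self.prev = cur
        return obs
```

You would wrap the raw env observation with something like this before feeding it to the policy, and reset the stored previous frame at the start of each episode.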