rgilman33 / simple-A2C-PPO

Actor-critic agent trained with PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.
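
For readers new to PPO, the clipped surrogate objective at its core can be sketched in a few lines. This is a minimal illustration in plain Python, not code from this repository: the function name `ppo_clipped_loss`, its arguments, and the default `clip_eps=0.2` are assumptions made for the example, and a real implementation would operate on PyTorch tensors and add value and entropy terms.

```python
import math

def ppo_clipped_loss(new_logps, old_logps, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate loss (to be minimized) over a batch.

    Hypothetical helper for illustration only; inputs are plain lists of
    per-sample log-probabilities and advantage estimates.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_logps, old_logps, advantages):
        # Probability ratio between the updated policy and the old policy.
        ratio = math.exp(new_lp - old_lp)
        # Restrict the ratio to [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Take the more pessimistic of the unclipped and clipped objectives.
        total += min(ratio * adv, clipped * adv)
    # Negate because the objective is maximized but losses are minimized.
    return -total / len(advantages)
```

Taking the minimum of the two terms removes any incentive to push the probability ratio outside the clip range in the direction that would otherwise improve the objective, which keeps each policy update small.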