Open · SofianChay opened 3 years ago
https://openai.com/blog/openai-baselines-ppo/
https://medium.com/intro-to-artificial-intelligence/proximal-policy-optimization-ppo-a-policy-based-reinforcement-learning-algorithm-3cf126a7562d
https://arxiv.org/abs/1707.06347 (original paper)
[x] Adapt the implementation to both one-player and multi-player training (see dqn for an example)
https://github.com/deepmuseum/Algorithms-for-Reinforcement-Learning/commit/3c0272497d1193700c0129a6444df2fb3d667b4e