This code implements a bare-bones version of the Proximal Policy Optimization (PPO) algorithm for the purpose of training an AI bot to play the game of football. The tutorial series for learning step-by-step implementation of this algorithm can be found in video format here or in a blogpost here.
Tested on Ubuntu 18.04 and a single NVIDIA GPU.
pip3 install Keras
.python3 train.py
script to start the PPO training loop.To render the game screen on a remote display (eg. if using Google Colab), execute the instructions in display_server.sh
. For more information, check out this thread.