A simplified, flexible, commented and understandable implementation of self-play based reinforcement learning based like AlphaGo Zero.
Designed to be easy to adopt for any two-player turn-based adversarial game and any deep learning framework of your choice.
A sample implementation has been provided for the game of Othello in PyTorch, Keras, TensorFlow and Chainer.
Have implementations for Connect4, GoBang and TicTacToe.
A simplified, flexible, commented and understandable implementation of self-play based reinforcement learning based like AlphaGo Zero. Designed to be easy to adopt for any two-player turn-based adversarial game and any deep learning framework of your choice. A sample implementation has been provided for the game of Othello in PyTorch, Keras, TensorFlow and Chainer. Have implementations for Connect4, GoBang and TicTacToe.