initial-h / AlphaZero_Gomoku_MPI

An asynchronous/parallel implementation of the AlphaGo Zero algorithm for Gomoku

AlphaZero-Gomoku-MPI

Updating

Overview

This repo is based on junxiaosong/AlphaZero_Gomoku, for which I am sincerely grateful.

I do these things:

Strength

References

Blog

Installation Dependencies

How to Install

Install tensorflow, tensorlayer, and pygame:

pip install tensorflow
pip install tensorlayer
pip install pygame

To install mpi4py, click here.

To install mpi4py on Windows, click here.
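
After installing, a quick check that the four packages named above are visible to the Python interpreter can save a failed run later. The following is a minimal sketch and not part of this repo: the helper name `missing_packages` is hypothetical, and it only looks the packages up by name without importing them or checking versions.

```python
import importlib.util

# Package names taken from the install steps above.
REQUIRED = ["tensorflow", "tensorlayer", "pygame", "mpi4py"]


def missing_packages(names):
    """Return the names that cannot be located on the current Python path."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All dependencies found.")
```

An empty result only means the packages were found; it does not confirm that an MPI runtime is installed for mpi4py to use.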

How to Run

Algorithm

There is almost no difference from AlphaGo Zero, except for APV-MCTS. Slides (PPT) can be found in the demo/slides directory.
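
For readers unfamiliar with the tree search in AlphaGo Zero, the selection step picks, at each node, the move that maximizes Q + U, where U = c_puct * P * sqrt(total visits) / (1 + N). The sketch below illustrates that rule only. It is not taken from this repo's code: the function name `puct_select`, the `(N, Q, P)` tuple layout, and the default `c_puct` value are all assumptions made for the example.

```python
import math


def puct_select(children, c_puct=5.0):
    """Pick the action with the highest Q + U score.

    children maps an action to a tuple (N, Q, P):
      N = visit count, Q = mean value, P = prior from the policy network.
    The layout and the c_puct default are illustrative assumptions.
    """
    total_visits = sum(n for n, _, _ in children.values())

    def score(action):
        n, q, p = children[action]
        # Exploration bonus: large for high-prior, rarely visited moves.
        u = c_puct * p * math.sqrt(total_visits) / (1 + n)
        return q + u

    return max(children, key=score)


if __name__ == "__main__":
    stats = {"a": (10, 0.5, 0.2), "b": (0, 0.0, 0.8)}
    print(puct_select(stats))
```

In this example the unvisited move with the high prior is chosen, because its exploration bonus outweighs the other move's value estimate. Setting `c_puct` to 0 removes the bonus and makes the choice purely greedy on Q.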

Details

Most settings are the same as in AlphaGo Zero; the details are as follows:

Some Tips

Future Work