AlphaGo-Zero-Gobang

You can also read my Blog :)

View a Demo

这是一个基于强化学习的自我博弈模型，运行后的程序如下所示。

python3 MetaZeta.py

我们构建了一个基于MCTS进行决策的 AI玩家，由残差神经网络辅助预测落子。

我们可以和训练有素的 AI玩家 对弈，以测试 AI 的下棋水平。

filename	type	description
`TreeNode.py`	MCTS	nodes of the MCTS decision tree
`MCTS.py`	MCTS	Build MCTS decision tree
`AIplayer.py`	MCTS	Build an AI based on MCTS+NN
`Board.py`	Board	store board information
`Game.py`	Board	defines the game process for selfPlay and play-with-Human
`PolicyNN.py`	NN	constructs a residual neural network
`MetaZeta.py`	Main	GUI synthesis for all parties All in one

首先，我们需要设计一些规则来描述棋盘上的信息

然后，我们需要建立一个残差神经网络 (Network structure)

然后，我们需要了解 AI 是如何做出决策的。他是如何积累下棋的知识，并利用学到的知识进行下棋的

最后，我们需要了解强化学习的整个过程（即自我对弈）