milkpku / BetaElephant

Chinese Chess UCCI engine
MIT License
22 stars 13 forks source link

BetaElephant

Chinese Chess Xboard engine using MCTS and DNN

Introduction

AlphaGo has achieved a high winning rate against other Go programs and defeated the top human player Lee Sedol from South Korea. This inspired us to design BetaElephant, a Chinese Chess AI, to confirm whether the framework of AlphaGo can be properly applied to other domains.

BetaElephant is mainly a combination of Monte Carlo Tree Search and several Deep Neutral Networks. MCTS finds the move with the highest winning rate by expanding the search tree, while Policy and Value DNNs provide MCTS with prior probabilities of each move and the valuation of board position. In each circulation of MCTS, it determines a path by prior probabilities and previous searching results, adds a new leaf node, and updates the path by the board valuation and a playing-through result. The DNNs are trained by a novel combination of supervised learning and reinforcement learning, taking data from human expert games and self-playing results.

Compile & Run

All the codes are still under development ...