mokemokechicken / reversi-alpha-zero

Reversi reinforcement learning by AlphaGo Zero methods.
MIT License
677 stars, 170 forks

fix: Player#moves include only moves in playing games. #4 #5

Closed by mokemokechicken 7 years ago