mokemokechicken / reversi-alpha-zero

Reversi reinforcement learning by AlphaGo Zero methods.
MIT License
678 stars, 170 forks

Feature/share sim of another side #46

Closed: mokemokechicken closed this 6 years ago

mokemokechicken commented 6 years ago