Create basic self-play algorithm

TTitcombe / AlphaDraughts

A PyTorch implementation of AlphaGo Zero applied to Draughts

MIT License

3 stars 2 forks source link

Create basic self-play algorithm #9

Open TTitcombe opened 5 years ago

TTitcombe commented 5 years ago

Create a basic algorithm which takes the board as input and predicts a move. See how well this model can learn playing against itself, using sparse rewards (win or lose) and frequent rewards (take or lose a piece)