Research types of models used for game actors

Research the specifics of model architectures used for making decisions in games such as chess and go. Find out what layers make up models such as AlphaGo and AlphaZero, as well as what other solutions/alternatives exist.

Time tracking

Time Estimate: 2 hours 0 minutes Time spent: 2 hours 30 minutes

Resources

DeepChess: End-to-End Deep Neural Network for Automatic Learning in Chess https://www.cs.tau.ac.il/~wolf/papers/deepchess.pdf
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm https://arxiv.org/pdf/1712.01815.pdf
OpenAI Five Architecture
AlphaZero (Connect4)
AlphaZero: Shedding new light on chess, shogi, and Go https://deepmind.google/discover/blog/alphazero-shedding-new-light-on-chess-shogi-and-go/
Multiple Policy Value Monte Carlo Tree Search https://arxiv.org/pdf/1905.13521v1.pdf
Restricted Boltzmann Machine (RBM) with Practical Implementation https://medium.com/machine-learning-researcher/boltzmann-machine-c2ce76d94da5
Predicting Professional Players’ Chess Moves with Deep Learning https://towardsdatascience.com/predicting-professional-players-chess-moves-with-deep-learning-9de6e305109e
Checkmating One, by Using Many: Combining Mixture of Experts with MCTS to Improve in Chess https://arxiv.org/pdf/2401.16852.pdf

Klazkin / player-zero

Research types of models used for game actors #58

Time tracking

Resources