Closed Klazkin closed 2 months ago
Explain the context of the issue, what is being addressed in detail.
(From "Mastering the Game of Go without Human Knowledge")
The goal
Network architecture modifications:
Self-Play implementations
Actions are chosen probabilistically?
NN Training
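On the self-play question of whether actions are chosen probabilistically: in the AlphaGo Zero paper, as far as I recall, the early moves of each self-play game are sampled in proportion to the MCTS visit counts (temperature 1), and later moves pick the most-visited action (temperature approaching 0). The same visit-count distribution is what the policy head is trained towards, so it ties into NN training as well. Below is a minimal sketch of that idea, not the project's actual code. The function names `visit_counts_to_policy` and `sample_action` and the flat list of visit counts are assumptions made for illustration.

```python
import random


def visit_counts_to_policy(counts, temperature=1.0):
    """Turn MCTS visit counts N(s, a) into a move distribution pi(a | s).

    Hypothetical helper: `counts` is assumed to be a flat list with one
    visit count per action index.
    """
    if not counts or sum(counts) <= 0:
        raise ValueError("need at least one visited action")
    if temperature == 0:
        # Greedy limit: all mass on the most-visited action (ties split evenly).
        best = max(counts)
        winners = [i for i, n in enumerate(counts) if n == best]
        return [1.0 / len(winners) if i in winners else 0.0
                for i in range(len(counts))]
    # pi(a) is proportional to N(s, a) ** (1 / temperature).
    scaled = [n ** (1.0 / temperature) for n in counts]
    total = sum(scaled)
    return [s / total for s in scaled]


def sample_action(counts, temperature=1.0, rng=random):
    """Sample one action index from the visit-count distribution.

    Returns the action and pi, so pi can be stored as a policy target.
    """
    pi = visit_counts_to_policy(counts, temperature)
    action = rng.choices(range(len(pi)), weights=pi, k=1)[0]
    return action, pi
```

With temperature 1 the choice is probabilistic and favours well-explored moves; with temperature 0 it is deterministic apart from ties. Storing `pi` next to the board state and the final game outcome would give one training example per move, under the assumptions above.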
Notes on further optimizations
Time tracking
Time Estimate: 7 hours 0 minutes
Time spent: 7 hours 0 minutes
Resources
A Simple Alpha(Go) Zero Tutorial: https://suragnair.github.io/posts/alphazero.html
https://github.com/suragnair/alpha-zero-general/blob/master/MCTS.py