suragnair / alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
MIT License
3.74k stars 1.01k forks

A fast, parallel and batched fork of this repo #290

Closed Bobingstern closed 1 year ago

Bobingstern commented 1 year ago

https://github.com/kevaday/alphazero-general This repo has all of the improvements listed in the README, although it only supports PyTorch. There are some tiny bugs that I have been trying to sort out, but if anyone wants a general AlphaZero implementation that works for tougher envs, here it is. It's also based on this one! https://github.com/Bobingstern/AlphaZero-Connect-4 is an example I made using tensorflow.js, trained using it. The fork supports FPU reduction, model gating, Dirichlet noise, and Cython for even faster speed. Some envs are a little broken, though; I will eventually send a PR with the fixed ones and more.

suragnair commented 1 year ago

Thanks for letting me know! Added to README.