Open Unimax opened 4 years ago
I guess you can try the DQN algorithm, but the standard AI/RL algorithms for chess are based on or inspired by AlphaZero, which combines MCTS with a learned value estimator. You may find the results from the MuZero paper useful: https://arxiv.org/abs/1911.08265
Note that the Ray framework has released an AlphaZero-like algorithm: https://github.com/ray-project/ray/tree/master/rllib/contrib/alpha_zero
Best regards, Adrien
Thanks, @Hadjubuntu. I knew about AlphaZero and MuZero, but I wanted to compare the strength of algorithms like PPO, SAC and DQN on a game like chess (basically something more useful/interesting than the standard gym envs). Also, thanks for the Ray framework link; I didn't know about that project.
Hi, I was wondering if I can apply the baselines algorithms to the game of chess, using either https://github.com/genyrosk/gym-chess or a custom chess env based on python-chess.
I was thinking of an 18x8x8 input and 64*64 + promotion as the action space.
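To make the action-space idea concrete, here is a rough sketch of one possible reading of "64*64 + promotion". Everything in it is an assumption for illustration, not something taken from gym-chess or python-chess: the helper names (`encode_move`, `decode_action`), the choice to give each underpromotion piece its own block of 64*64 indices (so 4 * 4096 actions in total), and the choice to treat a queen promotion as the default in block 0. Squares are numbered a1 = 0 through h8 = 63, and moves are plain UCI strings such as `e2e4` or `e7e8n`.

```python
# Hypothetical flat action encoding for chess moves given as UCI strings.
# Block 0: from*64 + to (plain moves; queen promotion is assumed as the default).
# Blocks 1-3: the same from/to layout for knight, bishop and rook underpromotions.
FILES = "abcdefgh"
PROMO_BLOCK = {None: 0, "q": 0, "n": 1, "b": 2, "r": 3}
BLOCK_PROMO = {1: "n", 2: "b", 3: "r"}
N_ACTIONS = 4 * 64 * 64  # size of the discrete action space in this sketch


def square_index(name):
    """Map a square name like 'e2' to 0..63 (a1 = 0, h8 = 63)."""
    return FILES.index(name[0]) + 8 * (int(name[1]) - 1)


def square_name(index):
    """Inverse of square_index."""
    return FILES[index % 8] + str(index // 8 + 1)


def encode_move(uci):
    """Turn a UCI move string into an integer in [0, N_ACTIONS)."""
    from_sq = square_index(uci[0:2])
    to_sq = square_index(uci[2:4])
    promo = uci[4] if len(uci) == 5 else None
    return PROMO_BLOCK[promo] * 4096 + from_sq * 64 + to_sq


def decode_action(action):
    """Turn an action index back into a UCI string.

    Block 0 decodes without a promotion suffix, so the environment has to
    append 'q' itself when a pawn reaches the last rank.
    """
    block, rest = divmod(action, 4096)
    from_sq, to_sq = divmod(rest, 64)
    return square_name(from_sq) + square_name(to_sq) + BLOCK_PROMO.get(block, "")


print(encode_move("e2e4"), decode_action(encode_move("e7e8n")))
```

Most of these indices are illegal in any given position, so with PPO or DQN the env would also need some way to handle illegal actions (for example masking them or ending the episode with a penalty).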
Let me know if anyone can help me with this. Note: the goal is not to make the best chess AI in the world :)