caleb-vicente / RL_tutorials

0 stars 0 forks source link

Monte Carlo Model Tree for Connect 4 #20

Open caleb-vicente opened 8 months ago

caleb-vicente commented 8 months ago

Check this implementation: https://medium.com/towards-data-science/deep-reinforcement-learning-and-monte-carlo-tree-search-with-connect-4-ba22a4713e7a

Create the following:

caleb-vicente commented 8 months ago

The main issue with the previous implementations is that it does´t have any type of abstration such as a neural network. It is only saving states and learning from them. Alpha Zero, on the contrary, makes the rollout stage based on a self-learning nn. https://web.stanford.edu/~surag/posts/alphazero.html