Klazkin / player-zero

1 stars 0 forks source link

Implement monte carlo tree search actor #47

Closed Klazkin closed 8 months ago

Klazkin commented 8 months ago

Implement an actor that chooses its action based on estimated action values from running simulations and random roll outs. While a lot of the groundwork has already been done, notably in #44. There are still some significant challenges that may occur, notably we must:

The goal

Time tracking

Time Estimate: 3 hours 30 minutes Time spent: 14 hours 30 minutes

Resources

MCTS https://en.wikipedia.org/wiki/Monte_Carlo_tree_search, https://www.geeksforgeeks.org/ml-monte-carlo-tree-search-mcts/, https://webdocs.cs.ualberta.ca/~hayward/396/jem/mcts.html https://www.cs.swarthmore.edu/~mitchell/classes/cs63/f20/reading/mcts.html Time https://www.geeksforgeeks.org/measure-execution-time-function-cpp/

Klazkin commented 8 months ago

for if I forget why this took over 2x times than expected, I still dont know the cause but for some reason my random generator has a 1 in a 100000 chance to generate a number that is outside of the defined range for which it is supposed to generate numbers.

Klazkin commented 8 months ago

There are still some bugs/improvements that can/need to be made, however, due to this already taking way too much time, those improvements will get their own issues. This one is finally closed.