Closed Klazkin closed 8 months ago
for if I forget why this took over 2x times than expected, I still dont know the cause but for some reason my random generator has a 1 in a 100000 chance to generate a number that is outside of the defined range for which it is supposed to generate numbers.
There are still some bugs/improvements that can/need to be made, however, due to this already taking way too much time, those improvements will get their own issues. This one is finally closed.
Implement an actor that chooses its action based on estimated action values from running simulations and random roll outs. While a lot of the groundwork has already been done, notably in #44. There are still some significant challenges that may occur, notably we must:
The goal
Time tracking
Time Estimate:
3 hours 30 minutes
Time spent:14 hours 30 minutes
Resources
MCTS https://en.wikipedia.org/wiki/Monte_Carlo_tree_search, https://www.geeksforgeeks.org/ml-monte-carlo-tree-search-mcts/, https://webdocs.cs.ualberta.ca/~hayward/396/jem/mcts.html https://www.cs.swarthmore.edu/~mitchell/classes/cs63/f20/reading/mcts.html Time https://www.geeksforgeeks.org/measure-execution-time-function-cpp/