Multi-armed Bandit - Githubissues

@matthewvedder

I filled out the stratsdir with some basic strategies. My understanding of MAB is it optimizes over the simple strategies. So for example something like this:

Play random for first 100 moves
Calculate how each of the strategies would have performed against the opponent for the first 100
Start using the one that would have done the best

matthewvedder / kaggle-rock-paper-scissors

Multi-armed Bandit #3