SanchoGGP / ggp-base

The General Game Playing Base Package
8 stars 4 forks source link

Observation: MCTS is wrong for 0/100 puzzles #368

Open arr28 opened 9 years ago

arr28 commented 9 years ago

MCTS is always the wrong choice for 0/100 puzzles. If we see a 100-scoring solution, we'll take it. Otherwise, the score is 0, so all nodes in the tree have score 0 - but we still select through them and back-propagate the 0 score.

But, by doing MCTS, we suffer selection & back-propagation tests. On a recent test with C4, we gained a 10x raw speed improvement by doing depth-first min-max with alpha-beta pruning. (No select + back-prop. step. No building a tree in memory.) Obviously, in puzzles, the alpha-beta doesn't apply, but if we lack anything better to do, depth-first search is better than MCTS.

When I say "lack anything better to do", we'd obviously want to keep the special rollout policies that we have as part of the depth-first search.

arr28 commented 9 years ago

Some cautions...