Open kmcrage opened 6 years ago
Huang's AB search is done: with k=5 it's a bit weaker than UCT.
In UCT, we could use Q = (total_value/num_visits + v_plus(depth-1))/2, but then do we halve cpuct too? In a combined search, we want to minimax over at least 5 children, all with visits > 0.
Or maybe Q = min(v_plus, total_value/num_visits), i.e. the lower of the upper bounds (or maybe the max). In the opening, we want UCT to dominate, but in the endgame we want AB to dominate.
Or clamp total_value/num_visits between v_minus and v_plus.
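To compare the candidates side by side, here is a rough sketch of the three Q variants as plain functions. The function names are made up for illustration, and it assumes v_minus and v_plus are already on the same scale and from the same player's perspective as total_value/num_visits (the v_plus(depth-1) lookup is left to the caller). It does not address whether cpuct should change.

```python
def mean_value(total_value: float, num_visits: int) -> float:
    """Plain UCT mean; only defined for visited nodes."""
    if num_visits <= 0:
        raise ValueError("num_visits must be > 0")
    return total_value / num_visits


def q_average(total_value: float, num_visits: int, v_plus: float) -> float:
    """Option 1: average of the UCT mean and the AB upper bound."""
    return (mean_value(total_value, num_visits) + v_plus) / 2


def q_min(total_value: float, num_visits: int, v_plus: float) -> float:
    """Option 2: the lower of the UCT mean and the AB upper bound."""
    return min(v_plus, mean_value(total_value, num_visits))


def q_clamp(total_value: float, num_visits: int,
            v_minus: float, v_plus: float) -> float:
    """Option 3: UCT mean clamped into [v_minus, v_plus]."""
    if v_minus > v_plus:
        raise ValueError("v_minus must not exceed v_plus")
    return max(v_minus, min(v_plus, mean_value(total_value, num_visits)))


if __name__ == "__main__":
    # Hypothetical node: 4 visits, total value 3.0, AB bounds [-0.5, 0.25].
    print(q_average(3.0, 4, 0.25))
    print(q_min(3.0, 4, 0.25))
    print(q_clamp(3.0, 4, -0.5, 0.25))
```

With wide bounds (as expected in the opening) the clamp leaves the UCT mean untouched, and as the bounds tighten towards the endgame the AB result takes over, which may be the behaviour we want.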
http://talkchess.com/forum3/viewtopic.php?t=66886
http://se1f330a320707f8e.jimcontent.com/download/version/1467247204/module/12396903227/name/a%20rollout-based%20search%20algorithm%20unifying%20mcts%20and%20alpha-beta.pdf