Root Policy Softmax Temperature

The original KataGo paper used a constant root softmax temperature of 1.03, which is currently the default in our implementation. The subsequent blog post, however, describes something else:

In KataGo's g170 run, this temperature was 1.25 for the early game, decaying exponentially to 1.1 for the rest of the game with a halflife in turns of the board dimensions

We should experiment with this alternative parameterization.

Note that the original idea comes from an academic group's replication of AlphaGo. The rationale of the idea and the experimental evidence in favor of it are described here.

shindavid / AlphaZeroArcade

Root Policy Softmax Temperature #35