yenw / LeelaZero_PhoenixGo

LeelaZero + PhoenixGo's weights
19 stars 4 forks source link

Value of a parameter #9

Closed alreadydone closed 5 years ago

alreadydone commented 6 years ago

https://github.com/alreadydone/lz/commit/74777dcfe879570041328166983c6f716ccddc96#diff-ede74db51bef85d4bbaf2ee65b64a513R97 PhoenixGo uses c_puct = 2.5 but I think they use values in the range [-1,1] while LZ uses values (winrates) in the range [0,1], so in my opinion when you translate that to LZ you should put c_puct = 2.5/2 = 1.25.

alreadydone commented 6 years ago

Also I think policy temperature is not enabled: https://github.com/Tencent/PhoenixGo/blob/4d9c496e05fedba09f3343b27b73e72417d83fd7/etc/mcts_1gpu.conf#L25 https://github.com/Tencent/PhoenixGo/blob/2278ed1e8a3563c8e66d344c93fe75b933e834dc/mcts/mcts_engine.cc#L307 So cfg_softmax_temp should still be 0, not 0.67.

yenw commented 6 years ago

Thank you for your correction.