Closed sotetsuk closed 1 year ago
The original AlphaGo used chinese rules. The chinese rules were used also for Pachi.
Notice that Pachi is weak. We evaluated also versus different versions of AlphaZero (e.g., with the different number of simulations).
Hi! Mctx and the original paper "Policy improvement by planning with Gumbel" are both amazing 👍 I would like to try reproducing them myself. I have a question regarding the original paper.
In the experiments using Go, the evaluation was performed on Pachi. While Pachi supports the rules of
japanese|chinese|aga|new_zealand|simplified_ing
, it does not seem to support the most common rule in computer Go, theTromp-Taylor
rule.So, I have two questions:
(1) What rule was used for scoring in Go during training? (2) When evaluating with Pachi, which rule was specified for playing?