arisliang closed 6 years ago
I apologize, that was an outdated argument. It should be --gtp_policy. You can find it here
There are several strategies you can pick for the underlying Go-playing agent. The default is DNN+MCTS, while picking a random policy lets you verify that the program works without invoking expensive neural networks.
For the implementation of the strategies, see utils/strategies.py. You might also be interested in the training process of this Go agent; the central training strategy can be found here
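To make the idea concrete, here is a minimal sketch of how a --gtp_policy flag could map to a playing strategy. It is not the repository's actual code: the choice names "random" and "mcts", the POLICIES table, and the helper functions are all assumptions made for illustration, and the real logic lives in utils/strategies.py.

```python
import argparse
import random


def random_policy(legal_moves, rng):
    """Pick a uniformly random legal move; no neural network is involved."""
    return rng.choice(legal_moves)


def mcts_policy(legal_moves, rng):
    """Placeholder for the DNN+MCTS strategy, which is not sketched here."""
    raise NotImplementedError("DNN+MCTS is outside the scope of this sketch")


# Hypothetical mapping from flag value to strategy function.
POLICIES = {"random": random_policy, "mcts": mcts_policy}


def build_parser():
    parser = argparse.ArgumentParser(description="Illustrative GTP policy selection")
    # Only --gtp_policy is defined, so passing an outdated --policy flag
    # makes argparse exit with an "unrecognized arguments" error.
    parser.add_argument(
        "--gtp_policy",
        choices=sorted(POLICIES),
        default="mcts",  # assumed default, mirroring "DNN+MCTS by default"
        help="strategy used by the underlying Go-playing agent",
    )
    return parser


def select_policy(argv):
    """Parse argv and return the strategy function chosen by --gtp_policy."""
    args = build_parser().parse_args(argv)
    return POLICIES[args.gtp_policy]


# Usage: select the cheap random policy and ask it for a move.
policy = select_policy(["--gtp_policy", "random"])
print(policy(["A1", "B2", "C3"], random.Random(0)))
```

Running with the random policy is a quick smoke test of the GTP plumbing, since it never touches the network or the search.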
Simply copying and pasting the command from the README produces an "unrecognized arguments" error for the policy argument.
What does this argument do?