如果只有价值网络，没有策略网络可以吗？

junxiaosong / AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

MIT License

3.33k stars 970 forks source link

Open huyp182 opened 6 years ago

huyp182 commented 6 years ago

MCTS根据价值网络来进行模拟和下棋，不知道这样的话效果如何？

junxiaosong commented 6 years ago

我觉得理论上是可以的，AlphaGo出来之前很多人应该就是这么做的，但效果应该会变差吧，因为策略网络其实就是在搜索的时候给出一些更有潜力的分支，这样能提高搜索效率，把有限的搜索次数用在最关键的部分