-
As training networks from scratch is "incredibly wasteful", do people perhaps share their AlphaZero.jl trained networks anywhere? I've read the docs and haven't seen that mentioned!
Also I have a m…
-
您好!我在使用您的代码进行测试的时候(8*8棋盘),训练了4000轮达到了对于4000次模拟的纯蒙特卡洛算法10局全胜的结果,在与5000次模拟的蒙特卡洛算法中5胜5平。但是我在观察与传统五子棋引擎、人类与自我对弈的过程中,发现了两个问题:
1. 可能出现已经有明显的必胜的落子的走法时,选择不落在那里
2. 对于棋盘中还有较多可落子位置时,棋盘边缘的棋子,似乎根本不会考虑落子。。比如一开始就在…
-
ai@ai:~/AlphaZero_Gomoku-master$ python human_play.py
Player 1 with X
Player 2 with O
0 1 2 3 4 5 6 7
7 _ _ _ _ _…
-
I think the API is pretty stable and the implementation is solid. What do you think about 1.0 @tkf ? Do you have any possible breaking changes in mind, we might want to do?
-
I would like to try machine learning attempt, how can I store learning data ?
-
Dear,
thank you for sharing your code. I made some modifications just to let it run under Tensorflow 2 avoiding some errors on placeholders and importing the slim library. I run it on CartPole with…
-
I noticed this paper:
https://goattack.alignmentfund.org/
The authors claim to have found a weakness in Katago. At first I thought these studies were pointless.
But recently they released some n…
-
Trying to debug larger width environments (7 currently).
Things to try:
1. Different metric (Average Q-value from 2015 paper https://arxiv.org/pdf/1312.5602.pdf).
```
5.1 Training and Sta…
-
There was some discussion in #1681 of this, but that's clearly not the right place for it and I think there were some misunderstandings.
It looks to me as if an alarmingly large fraction of LZ cont…
gjm11 updated
6 years ago
-
Does alphazero support reuse-tree?