-
# Goal
As a developer, I want to develop and compare Reinforcement Learning (RL) algorithms, so that I can teach AI agents cryptocurrency trading.
# Consider
* Analyzing work by others ([here](http…
-
I can't find single_play.py.
How can I start training in self-play mode?
Have you trained it with the AlphaGo Zero method, and if so, what were the results?
Thanks.
-
https://github.com/tensorflow/minigo is another open source implementation of AlphaGo Zero. Could you compare minigo and SuperGo? What are the strengths and weaknesses of each?
-
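Are there plans for new training features in the future?
For example:
1. Self-play similar to AlphaGo Zero
2. Multimodality
3. Hugging Face datasets
and other promising features that the project does not have yet.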
-
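I ran into two problems while working through your code and would appreciate your guidance:
1. According to the AlphaGo Zero paper, during the MCTS backup step the policy-value network first produces p and v for the leaf node, and v is then used to update the Q values of the nodes inside the tree. Your code uses the function update_recursive(leaf_value); the leaf_value here should be the v of that leaf node in the paper, right? Why, in mct…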
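
The backup step described in point 1 can be sketched as follows. This is a minimal illustration, not the repository's actual code: the `Node` class and its `n_visits` and `q` fields are hypothetical, and only the name `update_recursive` comes from the question. It assumes a two-player zero-sum game, so the leaf value v changes sign at each level on the way back to the root. Which sign the leaf itself receives depends on whose perspective each node's Q is stored from.

```python
class Node:
    """Hypothetical MCTS tree node holding a visit count and a mean value Q."""

    def __init__(self, parent=None):
        self.parent = parent
        self.n_visits = 0
        self.q = 0.0

    def update(self, value):
        # Incremental running mean: Q <- Q + (value - Q) / N
        self.n_visits += 1
        self.q += (value - self.q) / self.n_visits

    def update_recursive(self, leaf_value):
        # Players alternate, so the parent sees the value with the opposite sign.
        if self.parent is not None:
            self.parent.update_recursive(-leaf_value)
        self.update(leaf_value)


# Three-level chain: root -> child -> leaf
root = Node()
child = Node(parent=root)
leaf = Node(parent=child)

leaf.update_recursive(0.5)  # 0.5 stands in for the network's value v at the leaf
print(leaf.q, child.q, root.q)  # 0.5 -0.5 0.5
```

Each node on the path is visited exactly once per backup, and Q stays the mean of all values propagated through that node.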
-
In the article of 深入浅出看懂AlphaGo元 at [网络输入数据](https://charlesliuyx.github.io/2017/10/18/%E6%B7%B1%E5%85%A5%E6%B5%85%E5%87%BA%E7%9C%8B%E6%87%82AlphaGo%E5%85%83/#%E7%BD%91%E7%BB%9C%E8%BE%93%E5%85%A5%E6%9…
-
This neural network architecture is quite different from the one in the AlphaGo Zero paper; for instance, the latter takes a ResNet approach, using 1 convolutional block and 19 residual blocks.
Simply sta…
-
First, great implementation; this really helped me understand how AlphaGo Zero works, and I’ve used it for other games as well.
There is still one thing I couldn’t understand, though. Why do you re…
-
I started looking into the alpha-zero-general algorithm and found this parameter in the main.py module:
> 'updateThreshold': 0.6, # During arena playoff, new neural net will be accepted if threshold…
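
An acceptance rule of this kind can be sketched as below. This is an assumption-laden illustration rather than the project's actual code: the function name `accept_new_net` and its arguments are hypothetical, and the sketch assumes that draws are left out of the win ratio and that a ratio equal to the threshold is accepted.

```python
def accept_new_net(new_wins, prev_wins, update_threshold=0.6):
    """Return True if the new network should replace the previous one.

    Hypothetical helper: the win ratio is computed over decided games only
    (draws are assumed to be excluded by the caller).
    """
    decided = new_wins + prev_wins
    if decided == 0:
        # No decisive games in the arena: keep the previous network.
        return False
    return new_wins / decided >= update_threshold


print(accept_new_net(13, 7))  # 13/20 = 0.65 -> True
print(accept_new_net(11, 9))  # 11/20 = 0.55 -> False
```

With a threshold of 0.6, the new network has to win clearly more than half of the decided games, which guards against accepting a candidate that is only better by chance.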
-
Using TensorFlow backend.
Traceback (most recent call last):
  File "/Users/fernando/Desenvolvimento/Projetos/alphago/alphago-zero.py", line 3, in <module>
    import keras
  File "/Users/fernando/Desenvo…