alphazero Search Results

jonathan-laurent/AlphaZero.jl #215

Trained networks?

As training networks from scratch is "incredibly wasteful", do people perhaps share their AlphaZero.jl trained networks anywhere? I've read the docs and haven't seen that mentioned! Also I have a m…

tasuki updated 4 months ago

junxiaosong/AlphaZero_Gomoku #54

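Question about the model's move-selection policy

Hello! While testing your code (8*8 board), I trained for 4000 rounds and reached the point of winning all 10 games against the pure Monte Carlo algorithm with 4000 simulations, and scoring 5 wins and 5 draws against the Monte Carlo algorithm with 5000 simulations. However, while observing games against a traditional Gomoku engine, against humans, and in self-play, I found two problems: 1. When there is an obvious winning move, the model may choose not to play it. 2. When many positions on the board are still open, it seems not to consider playing at the edge of the board at all. For example, right from the start, at…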

Egolas updated 6 years ago

junxiaosong/AlphaZero_Gomoku #81

I win , computer lose,hahaha

ai@ai:~/AlphaZero_Gomoku-master$ python human_play.py Player 1 with X Player 2 with O 0 1 2 3 4 5 6 7 7 _ _ _ _ _…

AI-liu updated 4 years ago

jw3126/Setfield.jl #146

1.0 release

I think the API is pretty stable and the implementation is solid. What do you think about 1.0 @tkf ? Do you have any possible breaking changes in mind that we might want to make?

jw3126 updated 2 years ago

SebLague/Chess-Challenge #111

Machine Learning Not Possible (Token Capacity)

I would like to attempt a machine learning approach. How can I store the learning data?

MohammadHomsee updated 1 year ago

tmoer/alphazero_singleplayer #2

Poor performances and getting stuck

Dear, thank you for sharing your code. I made some modifications just to let it run under TensorFlow 2, avoiding some errors on placeholders and importing the slim library. I run it on CartPole with…

fede72bari updated 1 year ago

lightvector/KataGo #705

Adversarial Policies Beat Professional-Level Go AIs

I noticed this paper: https://goattack.alignmentfund.org/ The authors claim to have found a weakness in KataGo. At first I thought these studies were pointless. But recently they released some n…

LL145 updated 1 year ago

ldoshi/rome-wasnt-built-in-a-day #213

Investigate epsilon and sweep hyperparameters for DQN

Trying to debug larger width environments (7 currently). Things to try: 1. Different metric (Average Q-value from 2015 paper https://arxiv.org/pdf/1312.5602.pdf). ``` 5.1 Training and Sta…

josephmaa updated 10 hours ago

leela-zero/leela-zero #1766

Consider speeding up "post-resignation" play in no-resign se…

There was some discussion in #1681 of this, but that's clearly not the right place for it and I think there were some misunderstandings. It looks to me as if an alarmingly large fraction of LZ cont…

gjm11 updated 6 years ago

google-deepmind/open_spiel #1181

Does alphazero support reuse-tree?

Nightbringers updated 5 months ago

981 results for alphazero
