alphazero Search Results

944 results
for alphazero

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

openai/baselines #1085

can i apply any baseline algo to game like chess?

hi, i was wondering if i can apply the baselines algo on the game chess either https://github.com/genyrosk/gym-chess or a custom chess env. based on python-chess. i was thinking 18x8x8 input and 6…

Unimax updated 4 years ago
2
jonathan-laurent/AlphaZero.jl #140

Supervised learning and samples

The idea was said by Jonathan: "I guess what you've have to do is generate many samples of the kind that are stored in AlphaZero's memory buffer. You can take these samples either from human play dat…

StepHaze updated 2 years ago
15
junxiaosong/AlphaZero_Gomoku #24

MCTS最终得出的行为%pi的问题

在敲你的代码过程中遇到了两个问题，麻烦您给指导一下： 1. 根据AlphaGo Zero论文中的描述，在MCTS的backup过程中,首先根据policy-value network得到叶子节点的p，v，之后使用v来更新各个树内节点的Q值。在你的代码中使用的是函数update_recursive(leaf_value),这其中的leaf_value应该就是论文中该叶子节点的v对吧？为什么在mct…

xiaoyangzai updated 6 years ago
5
werner-duvaud/muzero-general #191

Sampled MuZero implementation

### Search before asking - [X] I have searched the MuZero [issues](https://github.com/werner-duvaud/muzero-general/issues) and found no similar feature requests. ### Description Hey, I'm wonder…

matthiaskiller updated 3 months ago
1
mokemokechicken/reversi-alpha-zero #13

Mastering Chess and Shogi by Self-Play with a General Reinfo…

FYI: https://arxiv.org/abs/1712.01815

mokemokechicken updated 6 years ago
7
davidjustice149/Chess #1

Create a list of project goals.

What is the purpose of this project? What is it meant to achieve? What will be the deliverables?

makerbreak updated 4 years ago
1
werner-duvaud/muzero-general #231

Chess and other non-trivial games

### Search before asking - [X] I have searched the MuZero [issues](https://github.com/werner-duvaud/muzero-general/issues) and found no similar feature requests. ### Description Is there a success…

StepHaze updated 3 months ago
2
mlsdpk/alphazero-checkers-pygame #24

Implementation of MiniMax algorithm

We need some kind of AI to play with and evaluate our agent after training. There are many algorithms we can implement and in this project, we can try the **MiniMax** algorithm with and without _alpha…

mlsdpk updated 2 years ago
1
suragnair/alpha-zero-general #226

updateThreshold

I started researching the alpha-zero-general algorithm, but I found this parameter in the main.py module > 'updateThreshold': 0.6, # During arena playoff, new neural net will be accepted if threshold…

Vovak1919 updated 3 years ago
3
werner-duvaud/muzero-general #210

Only One Player: Can we use MuZero?

### Search before asking - [X] I have searched the MuZero [issues](https://github.com/werner-duvaud/muzero-general/issues) and found no similar feature requests. ### Description Perfect ideas and…

1121091694 updated 3 months ago
2

上一页 1...6 7 8 9 10 11 12...95 下一页

944 results for alphazero

944 results
for alphazero