gumbel-alphazero Search Results

14 results
for gumbel-alphazero

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

sotetsuk/pgx #1115

Royal Game of Ur Environment

I would like to request Royal Game of Ur environment for Pgx. Royal Game of Ur is a simple race game with chance and perfect information but it has some distinct features. First, it might be an old…

Alian3785 updated 8 months ago
4
CGLemon/Sayuri #17

selfplay 不使用gpu

按照默认的各个参数， $ cp -r bash selfplay-course $ cd selfplay-course $ bash setup.sh -s .. $ bash selfplay.sh 有gpu但是没使用，是哪里需要改设置吗

Nightbringers updated 4 months ago
18
opendilab/LightZero #215

When will Go be supported？

alphazero , muzero, Gumbel MuZero in go game

Nightbringers updated 1 month ago
1
lowrollr/mctx-az #2

speed issue

hello, In my test, muzero_policy and alphazero_policy both much slower than gumbel_muzero_policy, use same num_simulations. And why not make gumbel_muzero_policy and muzero_policy also subtree p…

Nightbringers updated 5 months ago
11
lowrollr/turbozero #7

speed issue

hello，mctx-az don't have issue button， so i ask here. The speed of alphazero_policy is much slower than gumbel_muzero_policy in my test, do you know why? And muzero_policy also much slower than g…

Nightbringers updated 6 months ago
1
sr5434/AlphaZero #1

where is Dynamic and representation

It looks like still a alphazero, can you Implementing muzero in go game?

Nightbringers updated 4 months ago
6
google-deepmind/mctx #66

Combining Gumbel MuZero and Stochastic MuZero

This library contains implementations of - [Gumbel MuZero](https://openreview.net/forum?id=bERaNdoegnO) (policy improvement) - [Stochastic MuZero](https://openreview.net/forum?id=X6D9bAHhBQ1) (cha…

carlosgmartin updated 3 months ago
4
kobanium/Ray #150

Some question about Gumbel learning.

Yuki Kobayashi, After reading the [paper](https://openreview.net/forum?id=bERaNdoegnO) and [source code](https://github.com/deepmind/mctx), I am still a bit confused. #### 1. What's the Q value…

CGLemon updated 6 months ago
8
kobanium/TamaGo #69

Is it better to use mixed value approximation?

In the paper (Appendix D), DeepMind used the mixed value approximation instead of simple one. It seems that your implementation is simple one. In my experience, the simple one can work on 9x9. But it …

CGLemon updated 11 months ago
5
kobanium/TamaGo #73

search_sequential_halvingの実装に疑問があります

search_sequential_halvingの実装において select_move_by_sequential_halving_for_rootではルート局面でnp.argmaxされておりそれを候補数呼び出していますが、 ”POLICY IMPROVEMENT BY PLANNING WITH GUMBEL”のAlgorithm 2 Sequential Halving with Gum…

bleu48 updated 1 year ago
2

14 results for gumbel-alphazero

14 results
for gumbel-alphazero