-
I would like to request Royal Game of Ur environment for Pgx. Royal Game of Ur is a simple race game with chance and perfect information but it has some distinct features.
First, it might be an old…
-
按照默认的各个参数,
$ cp -r bash selfplay-course
$ cd selfplay-course
$ bash setup.sh -s ..
$ bash selfplay.sh
有gpu但是没使用,是哪里需要改设置吗
-
alphazero , muzero, Gumbel MuZero in go game
-
hello, In my test, muzero_policy and alphazero_policy both much slower than gumbel_muzero_policy, use same num_simulations.
And why not make gumbel_muzero_policy and muzero_policy also subtree p…
-
hello,mctx-az don't have issue button, so i ask here.
The speed of alphazero_policy is much slower than gumbel_muzero_policy in my test, do you know why? And muzero_policy also much slower than g…
-
It looks like still a alphazero, can you Implementing muzero in go game?
-
This library contains implementations of
- [Gumbel MuZero](https://openreview.net/forum?id=bERaNdoegnO) (policy improvement)
- [Stochastic MuZero](https://openreview.net/forum?id=X6D9bAHhBQ1) (cha…
-
Yuki Kobayashi,
After reading the [paper](https://openreview.net/forum?id=bERaNdoegnO) and [source code](https://github.com/deepmind/mctx), I am still a bit confused.
#### 1. What's the Q value…
-
In the paper (Appendix D), DeepMind used the mixed value approximation instead of simple one. It seems that your implementation is simple one. In my experience, the simple one can work on 9x9. But it …
-
search_sequential_halvingの実装において
select_move_by_sequential_halving_for_rootではルート局面でnp.argmaxされておりそれを候補数呼び出していますが、
”POLICY IMPROVEMENT BY PLANNING WITH GUMBEL”のAlgorithm 2 Sequential Halving with Gum…