-
I need to usue the [stochastic muzero policy](https://github.com/google-deepmind/mctx/blob/663455f3d35bfce9cfb0bdfe90a3c72b54093c11/mctx/_src/policies.py#L234).
I inspected the muax class and can s…
-
按照默认的各个参数,
$ cp -r bash selfplay-course
$ cd selfplay-course
$ bash setup.sh -s ..
$ bash selfplay.sh
有gpu但是没使用,是哪里需要改设置吗
-
I would like to request Royal Game of Ur environment for Pgx. Royal Game of Ur is a simple race game with chance and perfect information but it has some distinct features.
First, it might be an old…
-
A person told me that the bottleneck can improve the 30% performance without losing precision. But Kata Go uses the nest bottleneck instead the bottleneck. Is the nest bottleneck significantly better …
-
I've seen others reporting that the `muzero_policy` is slow and I've run into this problem myself so I wanted to add a bit more information. I'm not expecting a solution to this problem, but it might …
-
### Description
I've got a self-play (not important) function that I would like to execute on the CPU, but I'm running into an issue where the execution mysteriously freezes on the 32nd iteration. No…
-
alphazero , muzero, Gumbel MuZero in go game
-
This library contains implementations of
- [Gumbel MuZero](https://openreview.net/forum?id=bERaNdoegnO) (policy improvement)
- [Stochastic MuZero](https://openreview.net/forum?id=X6D9bAHhBQ1) (cha…
-
hello, In my test, muzero_policy and alphazero_policy both much slower than gumbel_muzero_policy, use same num_simulations.
And why not make gumbel_muzero_policy and muzero_policy also subtree p…
-
hello,mctx-az don't have issue button, so i ask here.
The speed of alphazero_policy is much slower than gumbel_muzero_policy in my test, do you know why? And muzero_policy also much slower than g…