gumbel-muzero Search Results

bwfbowen/muax #8

Activate Stochastic Muzero Policy

I need to usue the [stochastic muzero policy](https://github.com/google-deepmind/mctx/blob/663455f3d35bfce9cfb0bdfe90a3c72b54093c11/mctx/_src/policies.py#L234). I inspected the muax class and can s…

Karlheinzniebuhr updated 10 months ago

CGLemon/Sayuri #17

selfplay 不使用gpu

按照默认的各个参数， $ cp -r bash selfplay-course $ cd selfplay-course $ bash setup.sh -s .. $ bash selfplay.sh 有gpu但是没使用，是哪里需要改设置吗

Nightbringers updated 7 months ago

sotetsuk/pgx #1115

Royal Game of Ur Environment

I would like to request Royal Game of Ur environment for Pgx. Royal Game of Ur is a simple race game with chance and perfect information but it has some distinct features. First, it might be an old…

Alian3785 updated 10 months ago

lightvector/KataGo #757

What is performance of bottleneck and nest bottleneck?

A person told me that the bottleneck can improve the 30% performance without losing precision. But Kata Go uses the nest bottleneck instead the bottleneck. Is the nest bottleneck significantly better …

CGLemon updated 1 year ago

google-deepmind/mctx #93

`muzero_policy` search vs `gumbel_muzero_policy` search perf…

I've seen others reporting that the `muzero_policy` is slow and I've run into this problem myself so I wanted to add a bit more information. I'm not expecting a solution to this problem, but it might …

LeonEricsson updated 3 months ago

jax-ml/jax #21946

pmap race condition (?)

### Description I've got a self-play (not important) function that I would like to execute on the CPU, but I'm running into an issue where the execution mysteriously freezes on the 32nd iteration. No…

LeonEricsson updated 3 months ago

opendilab/LightZero #215

When will Go be supported？

alphazero , muzero, Gumbel MuZero in go game

Nightbringers updated 4 months ago

google-deepmind/mctx #66

Combining Gumbel MuZero and Stochastic MuZero

This library contains implementations of - [Gumbel MuZero](https://openreview.net/forum?id=bERaNdoegnO) (policy improvement) - [Stochastic MuZero](https://openreview.net/forum?id=X6D9bAHhBQ1) (cha…

carlosgmartin updated 5 months ago

lowrollr/mctx-az #2

speed issue

hello, In my test, muzero_policy and alphazero_policy both much slower than gumbel_muzero_policy, use same num_simulations. And why not make gumbel_muzero_policy and muzero_policy also subtree p…

Nightbringers updated 8 months ago

lowrollr/turbozero #7

speed issue

hello，mctx-az don't have issue button， so i ask here. The speed of alphazero_policy is much slower than gumbel_muzero_policy in my test, do you know why? And muzero_policy also much slower than g…

Nightbringers updated 8 months ago

35 results for gumbel-muzero

35 results
for gumbel-muzero