-
It still looks like AlphaZero. Can you implement MuZero for the game of Go?
-
I notice that your implementation of Gumbel MuZero has two differences from the original algorithm:
1. The gumbel_scale is fixed to 10.0, while mctx uses 1.0 (or 0.0, but only during evaluation).
2. Dirich…
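For context on what gumbel_scale controls, here is a minimal plain-Python sketch (no mctx dependency; the function names are illustrative, not mctx's API) of how scaling the Gumbel noise changes root action selection in the Gumbel-Top-k trick:

```python
import math
import random

def sample_gumbel(rng: random.Random) -> float:
    """Draw a standard Gumbel(0, 1) sample via inverse transform sampling."""
    u = rng.random()
    return -math.log(-math.log(u))

def gumbel_argmax(logits, gumbel_scale, rng):
    """Pick the action maximising logits + gumbel_scale * Gumbel noise.

    gumbel_scale=1.0 corresponds to unbiased sampling from softmax(logits);
    gumbel_scale=0.0 degenerates to a deterministic argmax over the logits,
    which is why a zero scale is natural at evaluation time. A large scale
    (e.g. 10.0) makes the noise dominate the logits, increasing exploration.
    """
    scored = [l + gumbel_scale * sample_gumbel(rng) for l in logits]
    return max(range(len(scored)), key=lambda i: scored[i])

rng = random.Random(0)
logits = [0.1, 2.0, -1.0]
# With scale 0.0 the choice is always the argmax of the logits (index 1).
print(gumbel_argmax(logits, 0.0, rng))
```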
-
I found that muzero_policy takes much longer than gumbel_muzero_policy for the same num_simulations, roughly three times as long. Is this normal, and why?
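One plausible factor (an assumption from reading the algorithm, not a profiling result) is that Gumbel MuZero only searches a small set of considered root actions via Sequential Halving, while standard MuZero's PUCT selection scores every child at every simulation. A sketch of the Sequential Halving visit schedule, with a hypothetical helper name:

```python
import math

def sequential_halving_schedule(num_simulations: int, num_considered: int):
    """Per-phase (surviving_actions, visits_per_action) of Sequential Halving,
    as used at the root in Gumbel MuZero: the considered-action set is halved
    each phase, so the simulation budget concentrates on promising actions."""
    schedule = []
    phases = max(1, math.ceil(math.log2(num_considered)))
    m = num_considered
    while m > 1:
        # Each phase gets an equal share of the budget, split over m actions.
        visits = max(1, num_simulations // (phases * m))
        schedule.append((m, visits))
        m = m // 2
    return schedule

# 50 simulations over 16 considered actions: early phases spread thin,
# later phases visit the few survivors many times.
print(sequential_halving_schedule(50, 16))
```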
-
when running:
python3 ./zoo/box2d/lunarlander/config/lunarlander_disc_gumbel_muzero_config.py
the following error is raised:
File "/home/1project/LightZero-main/lzero/policy/gumbel_muze…
-
Would you consider adding support for [JAX](https://github.com/google/jax)?
-
In the AlphaZero paper, the Elo rating for Go exceeds 5000, but in the Gumbel paper, the Elo for Go is below 3000. Why?
If an agent is trained with num_simulations==800, and then I continue training with num_simulations==40…
-
When trying to run gumbel_muzero:
python3 ./zoo/board_games/tictactoe/config/tictactoe_gumbel_muzero_bot_mode_config.py
on the main branch, this error pops up.
And it seems that there's no 'gumbe…
-
For any game where the "action_mask" is not all ones, for example when creating the BaseEnv:
if not self._continuous:
action_mask = np.ones(self.discrete_action_num, 'int8'…
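For illustration, a hedged sketch (not LightZero's actual BaseEnv code; the helper names are hypothetical) of building an action mask with only the legal actions set to 1 and applying it to policy logits:

```python
import numpy as np

def make_action_mask(num_actions: int, legal_actions) -> np.ndarray:
    """Mask with 1 at each legal action index and 0 elsewhere."""
    mask = np.zeros(num_actions, dtype='int8')
    mask[list(legal_actions)] = 1
    return mask

def mask_logits(logits: np.ndarray, action_mask: np.ndarray) -> np.ndarray:
    """Set illegal actions to -inf so softmax assigns them zero probability
    and the search can never select them."""
    return np.where(action_mask == 1, logits, -np.inf)

mask = make_action_mask(5, [0, 2, 4])
masked = mask_logits(np.zeros(5), mask)
print(mask, masked)
```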
-
Hello,
I have a question about the [`qtransform_by_parent_and_siblings`](https://github.com/google-deepmind/mctx/blob/main/mctx/_src/qtransforms.py#L53-L84) function [used in `muzero_policy` as def…
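For readers unfamiliar with it, here is a rough pure-Python sketch of the idea behind that qtransform, as I understand it from reading the linked source (an approximation, not the authoritative mctx implementation, which operates on the search tree arrays in JAX):

```python
def qtransform_sketch(parent_value, child_qvalues, child_visits, epsilon=1e-8):
    """Sketch of qtransform_by_parent_and_siblings: normalise child Q-values
    to roughly [0, 1] using the parent's value and the visited siblings, so
    they are on a comparable scale with the prior term in PUCT."""
    # Unvisited children first fall back to the parent's value estimate
    # when computing the normalisation range.
    safe = [q if n > 0 else parent_value
            for q, n in zip(child_qvalues, child_visits)]
    lo = min([parent_value] + safe)
    hi = max([parent_value] + safe)
    # Unvisited children are then completed pessimistically with the minimum,
    # so the search does not favour unexplored actions on Q-value alone.
    completed = [q if n > 0 else lo
                 for q, n in zip(child_qvalues, child_visits)]
    return [(q - lo) / max(hi - lo, epsilon) for q in completed]

# Two visited children (Q = 0.0 and 1.0) and one unvisited child:
print(qtransform_sketch(0.5, [0.0, 1.0, 0.0], [1, 1, 0]))
```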
-
Hello,
I hope this message finds you well. I am reaching out to kindly request the addition of a link to my project, [Pgx](http://github.com/sotetsuk/pgx), in the README file of the Mctx repository…