-
I've seen others report that `muzero_policy` is slow, and I've run into this problem myself, so I wanted to add a bit more information. I'm not expecting a solution to this problem, but it might …
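For reference, a minimal self-contained timing sketch that separates XLA compilation time from the compiled search itself; the dummy model, shapes, and `num_simulations` below are my own assumptions for illustration, not the setup from the report, and it assumes a reasonably recent jax/mctx:
```python
import time

import jax
import jax.numpy as jnp
import mctx


def make_inputs(batch_size=4, num_actions=16):
    # Dummy root: uniform priors, zero values, a scalar embedding per batch element.
    root = mctx.RootFnOutput(
        prior_logits=jnp.zeros([batch_size, num_actions]),
        value=jnp.zeros([batch_size]),
        embedding=jnp.zeros([batch_size]),
    )

    def recurrent_fn(params, rng_key, action, embedding):
        # Dummy dynamics: zero reward, discount 1, uniform priors, unchanged embedding.
        output = mctx.RecurrentFnOutput(
            reward=jnp.zeros_like(embedding),
            discount=jnp.ones_like(embedding),
            prior_logits=jnp.zeros([embedding.shape[0], num_actions]),
            value=jnp.zeros_like(embedding),
        )
        return output, embedding

    return root, recurrent_fn


def main():
    root, recurrent_fn = make_inputs()

    @jax.jit
    def run(rng_key):
        return mctx.muzero_policy(
            params=(),
            rng_key=rng_key,
            root=root,
            recurrent_fn=recurrent_fn,
            num_simulations=128,
        )

    key = jax.random.PRNGKey(0)
    t0 = time.perf_counter()
    jax.block_until_ready(run(key))   # first call pays tracing + XLA compilation
    print("first call (with compile):", time.perf_counter() - t0)
    t0 = time.perf_counter()
    jax.block_until_ready(run(key))   # cached call measures the search itself
    print("second call (cached):", time.perf_counter() - t0)


if __name__ == "__main__":
    main()
```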
-
Hello all! Thank you for creating mctx for the community. I added a performance enhancement to my copy of mctx and am wondering if you're interested in adding it to the official repo.
Computing bot…
-
This library contains implementations of
- [Gumbel MuZero](https://openreview.net/forum?id=bERaNdoegnO) (policy improvement)
- [Stochastic MuZero](https://openreview.net/forum?id=X6D9bAHhBQ1) (cha…
-
We have read the docs carefully. However, we cannot find such an example. All we can find is:
https://github.com/opendilab/LightZero/blob/main/docs/source/tutorials/envs/customize_envs_zh.md
What w…
-
Hi,
I'd like to ask how to make MinAtar environments deterministic for research on MBRL algorithms such as MuZero, which are designed for deterministic environments.
Apart from setting th…
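For context, a minimal sketch of the seeding I have in mind; the `sticky_action_prob` and `random_seed` constructor arguments are how I remember the MinAtar `Environment` API, so please check them against the installed version:
```python
from minatar import Environment

# Fix the seed and disable sticky actions so rollouts are reproducible.
# Games with seeded in-game randomness (e.g. enemy spawns) then repeat
# run-to-run, but are not free of randomness within an episode.
env = Environment("breakout", sticky_action_prob=0.0, random_seed=0)

env.reset()
for _ in range(10):
    reward, done = env.act(0)   # always take action 0
    print(env.state().shape, reward, done)
```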
-
Example:
```python3
import jax
import mctx
from jax import numpy as jnp, random

def main():
    n_actions = 7
    n_outcomes = 3
    batch_size = 1
    root = mctx.RootFnOutput( # ty…
```
-
Would you consider adding support for [JAX](https://github.com/google/jax)?
-
While running
`python3 ./zoo/game_2048/config/stochastic_muzero_2048_config.py`
an assertion error is raised in GameSegment at line 171:
`assert len(next_segment_observations)`
-
I'm trying to adapt the tutorial code to my environment, which has the following dimensions:
```
env.observation_space.shape[0] #Continuous
50
```
```
env.action_space.n #Discrete
3
```
I'm …
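For what it's worth, a sketch of the root shapes these dimensions seem to imply; using the raw 50-dimensional observation as the search embedding is an assumption on my part, a learned representation would be shaped the same way:
```python
import jax.numpy as jnp
import mctx

batch_size = 8
obs_dim = 50      # env.observation_space.shape[0]
num_actions = 3   # env.action_space.n

root = mctx.RootFnOutput(
    prior_logits=jnp.zeros([batch_size, num_actions]),  # [B, num_actions]
    value=jnp.zeros([batch_size]),                       # [B]
    embedding=jnp.zeros([batch_size, obs_dim]),          # [B, obs_dim]; any pytree with a leading batch dim works
)
```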
-
It is possible to automatically determine the `num_actions` and `num_chance_outcomes` parameters to [`stochastic_muzero_policy`](https://github.com/google-deepmind/mctx/blob/main/mctx/_src/policies.py…
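A sketch of what this could look like, reading both sizes off the trailing axes of arrays the caller already has; the helper name and the `chance_logits` array are hypothetical and not part of the mctx API:
```python
import jax.numpy as jnp
import mctx

def infer_sizes(root, chance_logits):
    # Both sizes live on the trailing axis of the corresponding logits arrays.
    num_actions = root.prior_logits.shape[-1]
    num_chance_outcomes = chance_logits.shape[-1]
    return num_actions, num_chance_outcomes

root = mctx.RootFnOutput(
    prior_logits=jnp.zeros([1, 7]),
    value=jnp.zeros([1]),
    embedding=jnp.zeros([1]),
)
chance_logits = jnp.zeros([1, 3])  # hypothetical output of the model's chance head
print(infer_sizes(root, chance_logits))  # (7, 3)
```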