muzero-stochastic Search Results

43 results
for muzero-stochastic

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

DHDev0/Stochastic-muzero #2

training loss: nan

Hi Daniel, I'm trying to run a custom environment (works with muzero) with your Stochastic-muzero version. After creating a config file (just changing env name in experiment_450_config.json) I'm…

ipsec updated 1 year ago
6
google-deepmind/mctx #38

sampling action from tree-search policy

Thanks for releasing this library!! In the Gumbel MuZero paper (appendix F), I read: ![image](https://user-images.githubusercontent.com/16518885/214013465-4f15ec1f-403d-4bd3-a882-b1197958df5b.p…

drasros updated 1 year ago
13
google-deepmind/mctx #33

Understanding RootFnOutput

Hello! I'm trying to use `mctx` library to train MuZero agent and have some troubles with understanding `RootFnOutput`. I have two methods `.root` and `.recurrent`: ```python def root( …

kefirski updated 1 year ago
6
google-deepmind/mctx #50

Question about the improved policy in Gumbel MuZero

Hi, thanks for open sourcing the great library! I'm using it to experiment with MCTS on a project, and I have a question regarding the function \sigma used in constructing the improved policy: \pi'…

karroyan updated 1 year ago
3
DHDev0/Stochastic-muzero #3

Default experiments are not converging

Hi Daniel, I'm running the experiment_450_config.json without modifications with this command: ``` python muzero_cli.py train report config/experiment_450_config.json ``` And I'm getting th…

ipsec updated 1 year ago
7
google-deepmind/mctx #37

Irregular action and chance outcome outputs within search wi…

Hello! Thank you for recently adding an implementation of stochastic muzero. I was testing it, but it seems chance outcome and action outputs within the search are irregular. I made this test and prin…

evanatyourservice updated 1 year ago
9
hr0nix/omega #16

Use MCTS-in-JAX

Just an idea. https://github.com/deepmind/mctx

ipsec updated 2 years ago
3
timoklein/alphazero-gym #10

Should entropy bonus be also calculated during planning?

Recently, I finished reading this repo code. And I found that the entropy bonus of a state value from SAC is only added at the last output step. This routine let me can't help but thinking: If t…

dbsxdbsx updated 1 year ago
6
csabaiBio/elte_ml_journal_club #2

suggestions

Erre kíváncsi lennék: Exploring Weight Agnostic Neural Networks Tuesday, August 27, 2019 Posted by Adam Gaier, Student Researcher and David Ha, Staff Research Scientist, Google Research, Tokyo …

icsabai updated 2 years ago
106
werner-duvaud/muzero-general #124

Question for NONDETERMINISTIC game

As muzero implies that it is only suitable for deterministic game, does it mean that using original muzero would produce worse result than that from classic model-free algo in nondeterministic game? …

dbsxdbsx updated 3 years ago
1

上一页 1...1 2 3 4 5...5 下一页

43 results for muzero-stochastic

43 results
for muzero-stochastic