muzero Search Results - Githubissues

EdanToledo/Stoix #77

[FEATURE] Add stochastic muzero implementation

Add stochastic muzero implementation - [paper](https://openreview.net/pdf?id=X6D9bAHhBQ1) and the [pseudocode](https://gist.github.com/Mononofu/7548d8aa4bf94e12bc7eb7662fd60b56). With this improved …

ipsec updated 3 weeks ago

plkmo/AlphaZero_Connect4 #4

MuZero!!

Hey, since MuZero is very similar but more general, could you PLEASE do a similar article and repo for that? Many applications will do better with a simpler version that doesn't have to scale acr…

AwokeKnowing updated 4 years ago

werner-duvaud/muzero-general #191

Sampled MuZero implementation

### Search before asking - [X] I have searched the MuZero [issues](https://github.com/werner-duvaud/muzero-general/issues) and found no similar feature requests. ### Description Hey, I'm wonder…

matthiaskiller updated 3 months ago

bwfbowen/muax #8

Activate Stochastic Muzero Policy

I need to use the [stochastic muzero policy](https://github.com/google-deepmind/mctx/blob/663455f3d35bfce9cfb0bdfe90a3c72b54093c11/mctx/_src/policies.py#L234). I inspected the muax class and can s…

Karlheinzniebuhr updated 7 months ago

kgex/developer-roadmap #402

Add MuZero resource

DineshkumarS05 updated 1 year ago

JannisFengler/MuZero #1

ValueError: expected sequence of length 4 at dim 1 (got 0)

I got this error while running the script. Thanks for doing it. C:\Users\Predator\Desktop\DeepLearning AI\muzeroNew\muzero.py:81: UserWarning: Creating a tensor from a list of numpy.ndarrays is ext…

science64 updated 2 months ago

pytorch/rl #1845

[Feature Request] Muzero and MCTS implementations

## Motivation It would be great to have an MCTS and Alphazero implementation, including other model-based RL for benchmarking and comparison. ## Solution I can write a loss function of this po…

Prakyathkantharaju updated 5 months ago

werner-duvaud/muzero-general #185

MuZero Unplugged

Hey, I'm wondering if there is any intention to extend the codebase to MuZero Unplugged so that it works in an offline RL setting?

tbskrpmnns updated 1 year ago

XinJingHao/PPO-Continuous-Pytorch #1

extending continuous action to muzero

We appreciate your clean & robust implementation of PPO Continuous Action! We wonder if you could extend Continuous Action to MuZero? There have been implementations of MuZero Continuous Action by o…

meioses updated 2 years ago

Eclectic-Sheep/sheeprl #241

enabling self play

Hi, I tried out this project and it is one of the few that actually works off the shelf, thank you for your work. Is there a way to enable self play when training an agent? My use case is to use Dr…

drblallo updated 2 months ago

394 results for muzero
