-
When I try to train a model using stochastic muzero MLP with chance encoder, I am getting error related to indexing at this line: https://github.com/opendilab/LightZero/blob/f1511fb5cdda4d31e61c5831c8…
-
Add stochastic muzero implementation - [paper](https://openreview.net/pdf?id=X6D9bAHhBQ1) and the [pseudocode](https://gist.github.com/Mononofu/7548d8aa4bf94e12bc7eb7662fd60b56)
With this improved …
ipsec updated
3 weeks ago
-
Sorry to bother you, but could you please let me know if there is any MuZero code for highway-v0 published here?
-
Hey, since MuZero is very similar but more general, could you PLEASE do a similar article and repo for that? many applications will do better with a more simple version that doesn't have to scale acr…
-
Or are you guys working on it on a branch somewhere? I don't even need it to be performant, I just want it to be implemented correctly so if I use your repo to write a paper, no reviewers will request…
-
-
I need to usue the [stochastic muzero policy](https://github.com/google-deepmind/mctx/blob/663455f3d35bfce9cfb0bdfe90a3c72b54093c11/mctx/_src/policies.py#L234).
I inspected the muax class and can s…
-
### Search before asking
- [X] I have searched the MuZero [issues](https://github.com/werner-duvaud/muzero-general/issues) and found no similar feature requests.
### Description
Hey,
I'm wonder…
-
Hey,
I'm wondering if there is any intention to expand the code basis for MuZero unplugged to make it work in an offline RL setting?
-
There's a lot of hot stuff in the pipeline re. MCTS, the vanilla sampler, etc.
One thing I'm afraid of is that there's going to be a lot of spaghetti involving bespoke/subtly different ways to call…