-
Add stochastic muzero implementation - [paper](https://openreview.net/pdf?id=X6D9bAHhBQ1) and the [pseudocode](https://gist.github.com/Mononofu/7548d8aa4bf94e12bc7eb7662fd60b56)
With this improved …
ipsec updated
3 weeks ago
-
Hey, since MuZero is very similar but more general, could you PLEASE do a similar article and repo for that? many applications will do better with a more simple version that doesn't have to scale acr…
-
### Search before asking
- [X] I have searched the MuZero [issues](https://github.com/werner-duvaud/muzero-general/issues) and found no similar feature requests.
### Description
Hey,
I'm wonder…
-
I need to usue the [stochastic muzero policy](https://github.com/google-deepmind/mctx/blob/663455f3d35bfce9cfb0bdfe90a3c72b54093c11/mctx/_src/policies.py#L234).
I inspected the muax class and can s…
-
-
I got this error while running the script. Thanks for doing it.
C:\Users\Predator\Desktop\DeepLearning AI\muzeroNew\muzero.py:81: UserWarning: Creating a tensor from a list of numpy.ndarrays is ext…
-
## Motivation
It would be great to have an MCTS and Alphazero implementation, including other model-based RL for benchmarking and comparison.
## Solution
I can write a loss function of this po…
-
Hey,
I'm wondering if there is any intention to expand the code basis for MuZero unplugged to make it work in an offline RL setting?
-
We appreciate your clean & robust implementation of PPO Continuous Action!
We wonder if you could extend Continuous Action to MuZero?
There have been implementations of MuZero Continuous Action by o…
-
hi,
i tried out this project and it is one of the few that actually works off the shelf, thank you for your work.
Is there a way to enable self play when training an agent? My usecase is to use Dr…