-
Hey,
Are there any plans to extend the codebase to MuZero Unplugged so that it works in an offline RL setting?
-
Hi there,
First of all, great work, and thank you for open-sourcing your code!
I have a question regarding reanalyze: you chose to reanalyze 99% of policy targets and 100% of value targets. I am j…
-
The codebase has training and evaluation scripts, which is great. However, I'm missing an inference script with which I can run the existing weights on the environment and see how it performs visu…
-
Hi! Thanks for implementing that DeepMind paper. What do you think about merging with a highly optimized distributed implementation of a MuZero family member ([SpeedyZero](https://openreview.net/…
-
Selected Weibo content
-
Hey,
I was really impressed by DeepMind's latest progress on their offline RL version of the MuZero algorithm [arXiv:2104.06294](https://arxiv.org/abs/2104.06294). Since it provides SOTA results fo…