muzero-unplugged Search Results

6 results
for muzero-unplugged

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

werner-duvaud/muzero-general #185

MuZero Unplugged

Hey, I'm wondering if there is any intention to expand the code basis for MuZero unplugged to make it work in an offline RL setting?

tbskrpmnns updated 1 year ago
7
YeWR/EfficientZero #14

Question: Why not reanalyze 100% policy targets?

Hi there, First of all, great work and thank you for opensourcing your code! I have a question regarding reanalyze: you chose to reanalyze 99% of policy targets and 100% of value targets. I am j…

Hwhitetooth updated 2 years ago
1
opendilab/LightZero #62

Lacking inference script

In the codebase, there are training and evaluation scripts. This is great. But, I lack an inference script here, in which I can run the existing weights on the environment and see how it performs visu…

samkoesnadi updated 1 year ago
4
DHDev0/Stochastic-muzero #6

What about merging with SpeedyZero code base?

Hi! Thanks for implementing that Deepmind paper. What do you think about merging with some highly optimized distributed implementation of the MuZero family member ([SpeedyZero](https://openreview.net/…

GrigoryEvko updated 1 year ago
4
fly51fly/aicoco #3

爱可可老师24小时热门分享

微博内容精选

fly51fly updated 4 months ago
1907
takuseno/d3rlpy #165

[REQUEST] MuZero Unplugged

Hey, I was really impressed by DeepMind's latest progress in their offline RL version of the MuZero algorithm [arXiv:2104.06294](https://arxiv.org/abs/2104.06294). Since it provides sota results fo…

tbskrpmnns updated 2 years ago
4

6 results for muzero-unplugged

MuZero Unplugged

Question: Why not reanalyze 100% policy targets?

Lacking inference script

What about merging with SpeedyZero code base?

爱可可老师24小时热门分享

[REQUEST] MuZero Unplugged

6 results
for muzero-unplugged