-
I am using the MCTS algorithm and need to replicate an env object at its current step, then run the copies of the env object separately.
I have tried copy.deepcopy() and pickle.dumps(), but it got…
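For reference, this is the pattern `copy.deepcopy()` is meant to support: snapshot the env at the current node, then roll out each copy independently. A minimal sketch with a toy stand-in env (the class here is hypothetical, not any real environment):

```python
import copy

class CounterEnv:
    """Toy stand-in for a real environment (hypothetical)."""
    def __init__(self):
        self.state = 0

    def step(self, action):
        self.state += action
        return self.state

env = CounterEnv()
env.step(1)  # state is now 1

# Snapshot the env at the current step, then roll out copies independently.
branch_a = copy.deepcopy(env)
branch_b = copy.deepcopy(env)
branch_a.step(10)
branch_b.step(-10)

print(env.state, branch_a.state, branch_b.state)  # 1 11 -9
```

Note that deepcopy/pickle fails for envs that wrap unpicklable resources (open handles, C-level simulator state); such envs typically need an explicit state getter/setter instead.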
-
I compared the sampled muzero code with muzero-general, but I couldn't find the code for the number of samples and the policy improvement. Can you tell me what changes you have made?
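For context, a rough sketch of the two pieces being asked about, as described in the Sampled MuZero paper (a simplified reading, not this repository's actual code): sample K actions from the prior at each node, then build the improved policy from visit counts over that subset only.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_actions(prior, k):
    """Sample k distinct actions from the prior policy; the tree search
    is then restricted to this subset (simplified Sampled MuZero idea)."""
    return rng.choice(len(prior), size=k, replace=False, p=prior)

def policy_target(visit_counts, sampled, num_actions):
    """Improved policy from visit counts over the sampled actions only;
    unsampled actions get probability zero in this sketch."""
    target = np.zeros(num_actions)
    target[sampled] = visit_counts / visit_counts.sum()
    return target

prior = np.array([0.5, 0.2, 0.2, 0.1])
sampled = sample_actions(prior, k=2)           # K = number of samples
target = policy_target(np.array([30.0, 10.0]), sampled, num_actions=4)
print(target.sum())  # 1.0
```

The paper additionally corrects the search with a prior/sample-distribution ratio; that correction is omitted here for brevity.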
-
### Summary of issue
The training process gets killed by the kernel; `dmesg` shows an entry stating that the reason is "out of memory".
**Model**: MuZero with self-supervision
**Environment**:…
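That `dmesg` entry means the kernel's OOM killer terminated the process. In MuZero-style training, an unbounded replay buffer is a frequent culprit; a minimal sketch of a bounded buffer (illustrative only, not this repository's code):

```python
from collections import deque

# Bounded replay buffer: once maxlen is reached, the oldest game is
# evicted automatically instead of growing until the OOM killer fires.
replay_buffer = deque(maxlen=1000)

for game_id in range(1500):
    replay_buffer.append({"game": game_id})

print(len(replay_buffer))        # 1000
print(replay_buffer[0]["game"])  # 500 (oldest retained game)
```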
-
The codebase includes training and evaluation scripts, which is great. However, it lacks an inference script in which I can run the existing weights on the environment and see how it performs visu…
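Such an inference script is usually a short render loop. The sketch below is illustrative only: the dummy env and random action selection stand in for the real environment and a trained network, and every name here is an assumption rather than this repository's API.

```python
import random

class DummyEnv:
    """Stand-in for the real environment (hypothetical API)."""
    def __init__(self):
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t

    def step(self, action):
        self.t += 1
        reward = 1.0
        done = self.t >= 5
        return self.t, reward, done

    def render(self):
        pass  # a real env would draw a frame here

def select_action(obs):
    # Placeholder for loading trained weights and running the search;
    # a random policy keeps this sketch self-contained.
    return random.choice([0, 1])

env = DummyEnv()
obs = env.reset()
total_reward, done = 0.0, False
while not done:
    env.render()
    obs, reward, done = env.step(select_action(obs))
    total_reward += reward
print(total_reward)  # 5.0
```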
-
## Problem Description
[Muesli](https://arxiv.org/abs/2104.06159) is a next-generation policy gradient algorithm from DeepMind that performs exceptionally well. Notably, it can match MuZero’s SOTA …
-
Selected Weibo content
-
First of all, I want to thank the developers for this awesome project! It's simple, clean, yet powerful. I really enjoyed playing with it.
I'm currently studying at the University of Alberta under t…
-
Hi Daniel,
I'm trying to run a custom environment (it works with muzero) with your Stochastic-muzero version.
After creating a config file (just changing the env name in experiment_450_config.json), I'm…
-
I found OpenSpiel super useful in my research. I am wondering whether the API is friendly to any of the popular multi-agent RL frameworks (e.g. RLlib, Stable-Baselines3, Tianshou) so that we can use different…
-
Hi, I recently upgraded to a Ryzen 7950 (16 cores / 32 threads, PCIe 5) and converted to SF2. The Nvidia 1050 peaks at ~75% (per nvidia-smi) while the CPUs are not really working hard. It used to be t…