-
I'm using Ubuntu 18.04 and Ray 1.3.
Even when capping `replay_buffer_size`, memory usage increases almost linearly for each `SelfPlay.continuous_self_play()` agent.
Is this a known bug? Are there any su…
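For context on what a capped buffer is expected to do: a minimal sketch of a bounded replay buffer (names hypothetical, not the library's actual class) where the oldest game is evicted once the cap is reached, so steady-state memory should stay roughly constant if each stored game is similar in size:

```python
from collections import deque

class BoundedReplayBuffer:
    """Hypothetical sketch: a replay buffer whose size is hard-capped."""

    def __init__(self, replay_buffer_size):
        # deque with maxlen silently drops the oldest entry on overflow
        self.buffer = deque(maxlen=replay_buffer_size)

    def save_game(self, game_history):
        self.buffer.append(game_history)

buffer = BoundedReplayBuffer(replay_buffer_size=3)
for i in range(10):
    buffer.save_game([i] * 100)

print(len(buffer.buffer))  # prints 3 (never exceeds the cap)
```

If memory still grows with such a cap in place, the growth is likely elsewhere (e.g. per-actor object stores or references kept alive outside the buffer), not in the buffer itself.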
-
This library contains implementations of
- [Gumbel MuZero](https://openreview.net/forum?id=bERaNdoegnO) (policy improvement)
- [Stochastic MuZero](https://openreview.net/forum?id=X6D9bAHhBQ1) (cha…
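The policy-improvement step in Gumbel MuZero relies on sampling k actions without replacement via the Gumbel-Top-k trick. A minimal sketch of just that trick (not the library's implementation):

```python
import numpy as np

def gumbel_top_k(logits, k, rng):
    # Gumbel-Top-k trick: adding i.i.d. Gumbel(0, 1) noise to the logits
    # and taking the k largest yields k distinct actions, sampled without
    # replacement according to softmax(logits).
    gumbels = rng.gumbel(size=len(logits))
    return np.argsort(logits + gumbels)[::-1][:k]

rng = np.random.default_rng(0)
logits = np.array([2.0, 0.5, 0.1, -1.0])
print(gumbel_top_k(logits, 2, rng))
```

Gumbel MuZero then runs its search only over the sampled actions, which is what makes the method efficient at low simulation budgets.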
-
Would you consider adding support for [Sampled MuZero](https://arxiv.org/abs/2104.06303)?
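For reference, the core idea of Sampled MuZero is to draw K candidate actions from a proposal distribution (typically the policy network) and restrict the search to that subset, which makes large or continuous action spaces tractable. A hedged sketch of only the sampling step, with a hypothetical Gaussian policy standing in for the network:

```python
import numpy as np

def sample_candidate_actions(policy_sampler, k, rng):
    # Sampled MuZero (sketch): instead of enumerating every action,
    # draw K candidates from a proposal distribution and search over them.
    return [policy_sampler(rng) for _ in range(k)]

rng = np.random.default_rng(0)
# Hypothetical continuous policy: a Gaussian over a 2-D action.
gaussian_policy = lambda rng: rng.normal(size=2)
actions = sample_candidate_actions(gaussian_policy, k=5, rng=rng)
print(len(actions))  # prints 5
```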
-
As the title says, when opening **observe_manual_run_to_extract_checkpoints.py** it just says Connecting to TMInterface0... and nothing happens. I have tried finishing the map etc., so I don't know what …
-
Hi all,
there have been 500k games now without a network being promoted. This means the training window is "full". I increased it to 750k games and made a final learning-rate reduction (0.00001 @ bs=96). If this …
-
I just wanted to reassure everyone that, if progress stalls, we are going to increase the visits, and we believe that within a few generations the upgrade will restore a good rate of improvement.
-
I'm training the Hex game with AlphaZero right now on a 5x5 board.
My config works but is very slow: it takes about two days to run 100 steps, which brings the model to a sufficient level of play.
…
-
For large input dimensions like Atari, it is mentioned in the codebase that the `downsample` parameter is set to `True`. I want to use `downsample = True` for my custom environment, since the `Sampled…
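For intuition on what `downsample` buys you: the downsampling head repeatedly halves the spatial resolution before the residual tower, so a large observation shrinks to a small latent plane. A toy sketch of the spatial reduction alone, using average pooling in place of the strided convolutions the real network uses:

```python
import numpy as np

def avg_pool2d(x, k=2):
    # Average-pool with stride k: halves each spatial dimension
    # (a stand-in for the network's strided convolutions).
    h, w = x.shape[0] // k, x.shape[1] // k
    return x[:h * k, :w * k].reshape(h, k, w, k).mean(axis=(1, 3))

obs = np.random.rand(96, 96)  # Atari-sized observation plane
x = obs
for _ in range(4):  # four halvings: 96 -> 48 -> 24 -> 12 -> 6
    x = avg_pool2d(x)
print(x.shape)  # prints (6, 6)
```

Whether `downsample = True` helps a custom environment depends on its input actually being large enough to need this reduction.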
-
I'm trying to adapt the tutorial code to my environment which has the following dimensions:
```
env.observation_space.shape[0]  # Continuous
50
env.action_space.n  # Discrete
3
```
I'm …
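The dimensions above (a 50-element continuous observation, 3 discrete actions) can be mirrored in a minimal Gym-style stub to test the adaptation against; everything here is hypothetical, since the real environment isn't shown:

```python
import numpy as np

class StubEnv:
    """Hypothetical stand-in with the same dimensions as the real env."""

    def __init__(self):
        self.observation_shape = (50,)  # continuous: observation_space.shape[0] == 50
        self.n_actions = 3              # discrete: action_space.n == 3

    def reset(self):
        return np.zeros(self.observation_shape, dtype=np.float32)

    def step(self, action):
        assert 0 <= action < self.n_actions
        obs = np.zeros(self.observation_shape, dtype=np.float32)
        reward, done = 0.0, False
        return obs, reward, done

env = StubEnv()
obs = env.reset()
print(obs.shape)  # prints (50,)
```

Note that some MuZero implementations expect image-shaped observations (e.g. `(channels, height, width)`), in which case a 1-D observation may need reshaping to something like `(1, 1, 50)`.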
-
```
2021-03-19 00:04:00,443 ERROR worker.py:1037 -- Possible unhandled error from worker: ray::Trainer.continuous_update_weights() (pid=24804, ip=192.168.1.229)
  File "python/ray/_raylet.pyx",…
```