muzero Search Results - Githubissues

397 results for muzero

werner-duvaud/muzero-general #8

Question: Why use negative board for observation in connect4…

Hi, I'm trying to write a Gomoku game with MuZero. I'm learning from Connect4 since it's also a two-player game. However, I noticed the following code: ``` def get_observation(self): …

littleV updated 4 years ago
5

werner-duvaud/muzero-general #6

value/reward transform issue

I think the way you transform value/reward does not quite match the original paper at this line (https://github.com/werner-duvaud/muzero-general/blob/fe791e8651645ea05f5b582157b4892588ee56ca/trai…

xuxiyang1993 updated 4 years ago
3

werner-duvaud/muzero-general #53

Breakout

Thank you very much for a comprehensive implementation. I ran Breakout with the current configuration, except changing the actors from 350 to 4 since I ran into memory problems with Ray. I am usin…

pdutoit2011 updated 4 years ago
5

werner-duvaud/muzero-general #10

Unknown error related to ray each time on exiting the run

Hi @werner-duvaud, when I run the latest code I always get an error when exiting the process. Please see the attached screen output below: I deleted the old repo and did a fresh checkout, stil…

littleV updated 4 years ago
5

werner-duvaud/muzero-general #19

Determining who is next to play inside the MCST

In https://github.com/werner-duvaud/muzero-general/blob/283e3538485be0e36ef77f402249666f735f5278/self_play.py#L262 you essentially assume actions are taken by players in alternating order for two-play…

fidel-schaposnik updated 4 years ago
2

werner-duvaud/muzero-general #2

Slow playback, Trainable and lunar

1. For some reason the playback is really slow (even if I remove the "press enter to continue" prompt). 2. Also, I'm curious why you didn't implement [Trainable](https://ray.readthedocs.io/en/lates…

drozzy updated 4 years ago
1

tensorflow/minigo #947

does minigo relate with deepmind muzero paper

As you said, it is based on MuGo.

l1t1 updated 4 years ago
1
