-
Hi,
I'm trying to write a Gomoku game with MuZero. I'm learning from Connect4 since it's also a two player game. However I noticed the following code:
```
def get_observation(self):
…
-
I think the way you transform value/reward is a little mismatch with the original paper at this line (https://github.com/werner-duvaud/muzero-general/blob/fe791e8651645ea05f5b582157b4892588ee56ca/trai…
-
Thank you very much for a comprehensive implementation.
I ran Breakout with the current configuration, except changing the actors from 350 to 4 since I ran
into memory problems with Ray. I am usin…
-
Hi @werner-duvaud
When I'm running the latest code I always get an error when exiting the process. Please see attached screen output below:
I did delete old repo and had a fresh check out, stil…
-
In https://github.com/werner-duvaud/muzero-general/blob/283e3538485be0e36ef77f402249666f735f5278/self_play.py#L262 you essentially assume actions are taken by players in alternating order for two-play…
-
1. For some reason the playback is really slow (even if I remove the "press enter to continue" prompt).
2. Also, I'm curious why you didn't implement [Trainable](https://ray.readthedocs.io/en/lates…
-
as you said it is based off mugo.