issues
search
kaesve
/
muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
MIT License
148
stars
23
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
About the x-axis of the CartPole learning curve figure
#8
tjuHaoXiaotian
opened
2 years ago
0
Don't assume file format for Muzero Model files.
#7
frankbryce
closed
3 years ago
1
MountainCar encoder file doesn't exist
#6
frankbryce
closed
3 years ago
5
Game with unbounded number of turns lead to stack overflow
#5
sherpal
closed
3 years ago
2
Missing `/out/MuZeroOut/board\r_temp.pth.tar` after backpropagation
#4
sherpal
closed
3 years ago
2
Unit Tests Deprecated
#3
joeryjoery
opened
3 years ago
0
AlphaZero MemoryLeak on Gym Environments.
#2
joeryjoery
opened
3 years ago
1
Choose environments
#1
kaesve
closed
3 years ago
0