Describe the bug
MuZero shows very good performance in some environment such as cartpole, pong mlagent, atari (pong, breakout). However, it shows bad performance in most of the Atari environment (spaceinvaders, qbert, enduro, seaquest, ...)
To Reproduce
Try running MuZero algorithm in environments other than pong and breakout
Expected behavior
It shows worse performance when compared to other algorithms.
Describe the bug MuZero shows very good performance in some environment such as cartpole, pong mlagent, atari (pong, breakout). However, it shows bad performance in most of the Atari environment (spaceinvaders, qbert, enduro, seaquest, ...)
To Reproduce Try running MuZero algorithm in environments other than pong and breakout
Expected behavior It shows worse performance when compared to other algorithms.
Screenshots
Development Env. (OS, version, libraries): Linux, Python 3.8, jorldy 0.3.0 requirement
Additional context Add any other context about the problem here.