opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
https://huggingface.co/spaces/OpenDILabCommunity/ZeroPal
Apache License 2.0

Custom environment #219

Closed Depresivna-ryza closed 4 months ago

Depresivna-ryza commented 5 months ago

The tutorial for creating a custom environment describes how to add the environment, but it doesn't mention how to run the RL algorithms on it. Is there a guideline I can follow to test my environment on various algorithms? Also, how can I use AlphaZero, given that it assumes a perfect simulator inside its MCTS?

puyuan1996 commented 5 months ago

Hello, thank you for your patience.

Regarding how to test one environment on various algorithms: we provide brief instructions on setting up the config file here, which we hope will be helpful to you.
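For orientation, the example config files under the repo's `zoo` directory are self-contained launch scripts: they build a `main_config` and a `create_config` and pass both to an entry function. The sketch below follows that pattern but is hypothetical; names like `my_custom_env` and the specific field values are placeholders you must replace with your own environment's registered name and spaces:

```python
# Hypothetical config sketch, modeled on the zoo/<env>_config.py files in
# LightZero. 'my_custom_env' and all field values are placeholders.
from easydict import EasyDict

main_config = EasyDict(dict(
    exp_name='my_custom_env_muzero_seed0',
    env=dict(
        env_id='my_custom_env',        # the name your custom env registers under
        collector_env_num=8,
        evaluator_env_num=3,
        n_evaluator_episode=3,
        manager=dict(shared_memory=False),
    ),
    policy=dict(
        model=dict(
            observation_shape=4,       # match your env's observation space
            action_space_size=2,       # match your env's action space
        ),
        cuda=True,
    ),
))

create_config = EasyDict(dict(
    env=dict(
        type='my_custom_env',          # must match your registered env type
        import_names=['zoo.my_custom_env.envs.my_custom_env'],
    ),
    env_manager=dict(type='subprocess'),
    policy=dict(type='muzero', import_names=['lzero.policy.muzero']),
))

if __name__ == "__main__":
    from lzero.entry import train_muzero
    train_muzero([main_config, create_config], seed=0, max_env_step=int(1e6))
```

Testing the same environment on a different algorithm is then mostly a matter of swapping the policy section and the entry function in a copy of the config file.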

Regarding the use of AlphaZero: as you mentioned, it indeed requires the environment to provide a perfect simulator. Specifically, like the board game environments, your environment needs the capability to `reset(state)`, i.e. to be reset to an arbitrary given state; you can refer to the implementation of the Gomoku environment. Therefore, if you want to apply AlphaZero to a custom environment, you need to ensure that your environment implements this function. If the environment does not meet this condition, you will not be able to use AlphaZero. As an alternative, you could consider the MuZero algorithm, which conducts MCTS within a learned model; see the MuZero research paper for details.
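To illustrate the requirement in isolation (this is a minimal toy sketch, not LightZero's actual env API): AlphaZero's MCTS repeatedly snapshots the true game state, simulates hypothetical moves, and then restores the snapshot, which only works if `reset` can accept an arbitrary state.

```python
# Toy illustration of the "perfect simulator" capability AlphaZero needs.
# The class and method names here are hypothetical, chosen for clarity.
import copy

class ToyBoardEnv:
    """Toy 'board': the state is simply the list of moves played so far."""

    def __init__(self):
        self.board = []

    def reset(self, start_state=None):
        # The key requirement: reset(state) restores an ARBITRARY state,
        # not just the initial one.
        self.board = copy.deepcopy(start_state) if start_state is not None else []
        return self.board

    def step(self, action):
        self.board.append(action)
        done = len(self.board) >= 3
        return self.board, done

    def get_state(self):
        return copy.deepcopy(self.board)

# How MCTS uses this: snapshot the real state, simulate, then roll back.
env = ToyBoardEnv()
env.reset()
env.step(0)
snapshot = env.get_state()       # save the true game state
env.step(1)                      # explore a hypothetical move in simulation
env.reset(start_state=snapshot)  # perfectly restore the saved state
assert env.get_state() == [0]
```

An environment that cannot be restored this way (e.g. one backed by an external process whose internal state is hidden) cannot serve as AlphaZero's simulator, which is exactly the case MuZero's learned model is designed to handle.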

If you have other questions, please feel free to contact us. Thank you again for your attention and support!