werner-duvaud / muzero-general

MuZero
https://github.com/werner-duvaud/muzero-general/wiki/MuZero-Documentation
MIT License

Sampled MuZero implementation #191

Open matthiaskiller opened 2 years ago

matthiaskiller commented 2 years ago

Description

Hey,

Is there any plan to extend the code to Sampled MuZero so that it works with continuous action spaces, as described in "Learning and Planning in Complex Action Spaces" by Hubert et al.?

Thanks!

puyuan1996 commented 3 months ago

Hello, and thank you to the contributors for their outstanding work on this repository. Regarding the issue you've raised, you might be interested in the project "LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios". LightZero supports AlphaZero, MuZero, and a series of related algorithms (including Sampled MuZero) across a range of environments, so it might meet your requirements. Best wishes.