opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
https://huggingface.co/spaces/OpenDILabCommunity/ZeroPal
Apache License 2.0
1.15k stars 120 forks source link

feature(pu): add seller env, self-judge pipeline and mcts/alphazero config #276

Open puyuan1996 opened 2 months ago

puyuan1996 commented 2 months ago

image

以 seller 环境为例的 self-judge pipeline.pdf