opendilab/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
https://huggingface.co/spaces/OpenDILabCommunity/ZeroPal
Apache License 2.0
1.15k stars
120 forks
issues
Newest
fix(pu): fix smz compile_args and num_simulations bug in world_model
#297
puyuan1996
closed
17 hours ago
0
feature(pu): add eval_benchmark test
#296
puyuan1996
closed
1 week ago
0
feature(pu): add atari100k metric utils
#295
puyuan1996
closed
1 week ago
0
feature(pu): add wandb support in lz
#294
puyuan1996
closed
4 days ago
0
docker error
#293
huskyth
closed
1 week ago
0
polish(pu): polish efficiency and performance on atari and dmc
#292
puyuan1996
closed
5 days ago
0
Regarding the parameter settings of the `metadrive` environment
#291
pixeli99
opened
3 weeks ago
4
fix(roland): fix typo in model/utils.py
#290
Roland0511
closed
2 weeks ago
1
TMP: polish(pu): polish efficiency and performance on atari and dmc
#289
puyuan1996
closed
2 weeks ago
1
fix(pu): use display_frames_as_gif in cartpole
#288
puyuan1996
closed
1 month ago
0
How do I configure the environment for rendering?
#287
ldepn
closed
1 month ago
1
TMP: polish(pu): polish efficiency and performance on atari and dmc
#286
puyuan1996
closed
2 weeks ago
1
About using the GPU to accelerate MCTS
#285
fixtech
closed
1 month ago
1
fix(sk): fix stochastic_muzero_model_mlp.py with chance encoder
#284
ShivamKumar2002
closed
1 month ago
0
Stochastic MuZero MLP Issues Related to Chance Space
#283
ShivamKumar2002
closed
1 month ago
1
ZeroPal is down.
#282
AbelHutten
closed
1 month ago
3
WIP: polish(pu): polish efficiency and performance on atari and dmc
#281
puyuan1996
closed
2 weeks ago
1
TMP: polish(pu): polish unizero efficiency and tune atari100k performance
#279
puyuan1996
closed
2 weeks ago
1
TMP: polish(pu): polish sampled unizero in continuous action space
#278
puyuan1996
closed
2 weeks ago
1
WIP: polish(pu): polish unizero efficiency and tune atari100k performance
#277
puyuan1996
closed
2 weeks ago
1
feature(pu): add seller env, self-judge pipeline and mcts/alphazero config
#276
puyuan1996
opened
2 months ago
0
fix(sk): fix wrong chance values in stochastic muzero
#275
ShivamKumar2002
closed
2 months ago
0
feature(pu): unizero efficiency optimization and ddp configs
#274
puyuan1996
opened
2 months ago
0
How can I use TeaforN loss
#273
snailma0229
closed
2 months ago
1
feature(pu): polish chess env and its render method, add unittest and configs
#272
puyuan1996
closed
2 months ago
3
feature(wrh): add continuous action space in mt for unizero.
#271
ruiheng123
opened
2 months ago
0
Configs are unrunnable due to ImportError of smz_tree
#270
TianrenWang
closed
2 months ago
2
Requesting Guidance on training and testing in a tetris environment. #265
#267
lunathanael
opened
3 months ago
0
feature(pu): add rope that use the true timestep index as pos_index
#266
puyuan1996
opened
3 months ago
0
Discussion: Requesting assistance and guidance with implementation of RL algorithms and models in the context of Tetris
#265
lunathanael
closed
1 month ago
3
When will Chess be supported for AlphaZero/MuZero?
#264
TianrenWang
closed
2 weeks ago
7
feature(wrh): add RoPE for unizero
#263
ruiheng123
closed
2 months ago
1
feature(wrh): add RoPE for unizero
#262
ruiheng123
closed
3 months ago
0
feature(pu): add rope in unizero's transformer
#261
puyuan1996
closed
3 months ago
1
feature(pu): add Sampled MuZero/UniZero, DMC env and related configs
#260
puyuan1996
closed
3 months ago
0
feature(hus): add self-hosted linux(ubuntu) ci runner
#259
TuTuHuss
closed
3 months ago
0
Unexpected Negative Values for Perceptual Loss in UniZero Implementation
#258
Tiikara
closed
2 months ago
3
Inquiry about Commented Out Loss Calculations in UniZero Implementation
#257
Tiikara
closed
2 months ago
2
feature(wrh): add adaptive batch size for transition
#256
ruiheng123
opened
3 months ago
0
feature(wrh): add harmony dream in unizero
#255
ruiheng123
opened
3 months ago
0
fix(pu): fix DownSample for different obs shape
#254
puyuan1996
closed
3 months ago
0
Flexible Input Image Size Support for UniZero: Implementation Timeline and Contribution Opportunity
#253
Tiikara
closed
3 months ago
5
Implementation of Self-Play Training for Real-Time Environments
#252
Tiikara
closed
1 month ago
2
Unexpected performance drop after resuming UniZero training
#251
Tiikara
closed
2 months ago
3
feature(wrh): update soft modulization in unizero for mt
#250
ruiheng123
opened
3 months ago
0
Need Help with Storing and Using Best Checkpoint with ONNX Model in LightZero
#249
moganli
closed
3 months ago
1
Clarification on the relationship between num_unroll_steps and infer_context_length in UniZero
#248
Tiikara
closed
3 months ago
9
feature(pu): add dmc2gym env and related configs
#247
puyuan1996
closed
2 months ago
0
feature(wrh): add soft modulization in unizero
#246
ruiheng123
closed
3 months ago
1
feature(pu): adopt alphazero to non-zero-sum games
#245
puyuan1996
closed
1 month ago
0