opendilab LightZero issues

opendilab / LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

https://huggingface.co/spaces/OpenDILabCommunity/ZeroPal

Apache License 2.0

1.15k stars 120 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

fix(pu): fix smz compile_args and num_simulations bug in world_model

#297 puyuan1996 closed 17 hours ago
0
feature(pu): add eval_benchmark test

#296 puyuan1996 closed 1 week ago
0
feature(pu): add atari100k metric utils

#295 puyuan1996 closed 1 week ago
0
feature(pu): add wandb support in lz

#294 puyuan1996 closed 4 days ago
0
docker error

#293 huskyth closed 1 week ago
0
polish(pu): polish efficiency and performance on atari and dmc

#292 puyuan1996 closed 5 days ago
0
Regarding the parameter settings of the `metadrive` environment

#291 pixeli99 opened 3 weeks ago
4
fix(roland): fix typo in model/utils.py

#290 Roland0511 closed 2 weeks ago
1
TMP: polish(pu): polish efficiency and performance on atari and dmc

#289 puyuan1996 closed 2 weeks ago
1
fix(pu): use display_frames_as_gif in cartpole

#288 puyuan1996 closed 1 month ago
0
如果想渲染环境如何设置？

#287 ldepn closed 1 month ago
1
TMP: polish(pu): polish efficiency and performance on atari and dmc

#286 puyuan1996 closed 2 weeks ago
1
关于利用GPU加速MCTS

#285 fixtech closed 1 month ago
1
fix(sk): fix stochastic_muzero_model_mlp.py with chance encoder

#284 ShivamKumar2002 closed 1 month ago
0
Stochastic MuZero MLP Issues Related to Chance Space

#283 ShivamKumar2002 closed 1 month ago
1
ZeroPal is down.

#282 AbelHutten closed 1 month ago
3
WIP: polish(pu): polish efficiency and performance on atari and dmc

#281 puyuan1996 closed 2 weeks ago
1
TMP: polish(pu): polish unizero efficiency and tune atari100k performance

#279 puyuan1996 closed 2 weeks ago
1
TMP: polish(pu): polish sampled unizero in continuous action space

#278 puyuan1996 closed 2 weeks ago
1
WIP: polish(pu): polish unizero efficiency and tune atari100k performance

#277 puyuan1996 closed 2 weeks ago
1
feature(pu): add seller env, self-judge pipeline and mcts/alphazero config

#276 puyuan1996 opened 2 months ago
0
fix(sk): fix wrong chance values in stochastic muzero

#275 ShivamKumar2002 closed 2 months ago
0
feature(pu): unizero efficiency optimization and ddp configs

#274 puyuan1996 opened 2 months ago
0
How can i use TeaforN loss

#273 snailma0229 closed 2 months ago
1
feature(pu): polish chess env and its render method, add unittest and configs

#272 puyuan1996 closed 2 months ago
3
feature(wrh): add continuous action space in mt for unizero.

#271 ruiheng123 opened 2 months ago
0
Configs are unrunnable due to ImportError of smz_tree

#270 TianrenWang closed 2 months ago
2
Requesting Guidance on training and testing in a tetris environment. #265

#267 lunathanael opened 3 months ago
0
feature(pu): add rope that use the true timestep index as pos_index

#266 puyuan1996 opened 3 months ago
0
Discussion: Requesting assistance and guidance with implementation of RL algorithms and models in the context of Tetris

#265 lunathanael closed 1 month ago
3
When will Chess be supported for AlphaZero/MuZero?

#264 TianrenWang closed 2 weeks ago
7
feature(wrh): add RoPE for unizero

#263 ruiheng123 closed 2 months ago
1
feature(wrh): add RoPE for unizero

#262 ruiheng123 closed 3 months ago
0
feature(pu): add rope in unizero's transformer

#261 puyuan1996 closed 3 months ago
1
feature(pu): add Sampled MuZero/UniZero, DMC env and related configs

#260 puyuan1996 closed 3 months ago
0
feature(hus): add self-hosted linux(ubuntu) ci runner

#259 TuTuHuss closed 3 months ago
0
Unexpected Negative Values for Perceptual Loss in UniZero Implementation

#258 Tiikara closed 2 months ago
3
Inquiry about Commented Out Loss Calculations in UniZero Implementation

#257 Tiikara closed 2 months ago
2
feature(wrh): add adaptive batch size for transition

#256 ruiheng123 opened 3 months ago
0
feature(wrh): add harmony dream in unizero

#255 ruiheng123 opened 3 months ago
0
fix(pu): fix DownSample for different obs shape

#254 puyuan1996 closed 3 months ago
0
Flexible Input Image Size Support for UniZero: Implementation Timeline and Contribution Opportunity

#253 Tiikara closed 3 months ago
5
Implementation of Self-Play Training for Real-Time Environments

#252 Tiikara closed 1 month ago
2
Unexpected performance drop after resuming UniZero training

#251 Tiikara closed 2 months ago
3
feature(wrh): update soft modulization in unizero for mt

#250 ruiheng123 opened 3 months ago
0
Need Help with Storing and Using Best Checkpoint with ONNX Model in LightZero

#249 moganli closed 3 months ago
1
Clarification on the relationship between num_unroll_steps and infer_context_length in UniZero

#248 Tiikara closed 3 months ago
9
feature(pu): add dmc2gym env and related configs

#247 puyuan1996 closed 2 months ago
0
feature(wrh): add soft modulization in unizero

#246 ruiheng123 closed 3 months ago
1
feature(pu): adopt alphazero to non-zero-sum games

#245 puyuan1996 closed 1 month ago
0