issues
search
opendilab
/
LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
https://huggingface.co/spaces/OpenDILabCommunity/ZeroPal
Apache License 2.0
1.07k
stars
110
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
ZeroPal is down.
#282
AbelHutten
opened
3 days ago
1
WIP: polish(pu): polish efficiency and performance on atari and dmc
#281
puyuan1996
opened
4 days ago
0
polish(pu): polish unizero efficiency and tune atari100k performance
#279
puyuan1996
opened
6 days ago
0
WIP: polish(pu): polish sampled unizero in continuous action space
#278
puyuan1996
opened
1 week ago
0
WIP: polish(pu): polish unizero efficiency and tune atari100k performance
#277
puyuan1996
opened
1 week ago
0
feature(pu): add seller env, self-judge pipeline and mcts/alphazero config
#276
puyuan1996
opened
1 week ago
0
fix(sk): fix wrong chance values in stochastic muzero
#275
ShivamKumar2002
closed
1 week ago
0
feature(pu): unizero efficiency optimization and ddp configs
#274
puyuan1996
opened
1 month ago
0
How can i use TeaforN loss
#273
snailma0229
closed
4 weeks ago
1
feature(pu): polish chess env and its render method, add unittest and configs
#272
puyuan1996
closed
3 weeks ago
3
feature(wrh): add continuous action space in mt for unizero.
#271
ruiheng123
opened
1 month ago
0
Configs are unrunnable due to ImportError of smz_tree
#270
TianrenWang
closed
1 month ago
2
Requesting Guidance on training and testing in a tetris environment. #265
#267
lunathanael
opened
1 month ago
0
feature(pu): add rope that use the true timestep index as pos_index
#266
puyuan1996
opened
1 month ago
0
Discussion: Requesting assistance and guidance with implementation of RL algorithms and models in the context of Tetris
#265
lunathanael
opened
1 month ago
3
When will Chess be supported for AlphaZero/MuZero?
#264
TianrenWang
opened
1 month ago
7
feature(wrh): add RoPE for unizero
#263
ruiheng123
closed
1 week ago
1
feature(wrh): add RoPE for unizero
#262
ruiheng123
closed
1 month ago
0
feature(pu): add rope in unizero's transformer
#261
puyuan1996
closed
1 month ago
1
feature(pu): add Sampled MuZero/UniZero, DMC env and related configs
#260
puyuan1996
closed
1 month ago
0
feature(hus): add self-hosted linux(ubuntu) ci runner
#259
TuTuHuss
closed
2 months ago
0
Unexpected Negative Values for Perceptual Loss in UniZero Implementation
#258
Tiikara
closed
1 month ago
3
Inquiry about Commented Out Loss Calculations in UniZero Implementation
#257
Tiikara
closed
1 month ago
2
feature(wrh): add adaptive batch size for transition
#256
ruiheng123
opened
2 months ago
0
feature(wrh): add harmony dream in unizero
#255
ruiheng123
opened
2 months ago
0
fix(pu): fix DownSample for different obs shape
#254
puyuan1996
closed
1 month ago
0
Flexible Input Image Size Support for UniZero: Implementation Timeline and Contribution Opportunity
#253
Tiikara
closed
1 month ago
5
Implementation of Self-Play Training for Real-Time Environments
#252
Tiikara
opened
2 months ago
2
Unexpected performance drop after resuming UniZero training
#251
Tiikara
closed
1 week ago
3
feature(wrh): update soft modulization in unizero for mt
#250
ruiheng123
opened
2 months ago
0
Need Help with Storing and Using Best Checkpoint with ONNX Model in LightZero
#249
moganli
closed
1 month ago
1
Clarification on the relationship between num_unroll_steps and infer_context_length in UniZero
#248
Tiikara
closed
2 months ago
9
feature(pu): add dmc2gym env and related configs
#247
puyuan1996
closed
1 month ago
0
feature(wrh): add soft modulization in unizero
#246
ruiheng123
closed
2 months ago
1
feature(pu): adopt alphazero to non-zero-sum games
#245
puyuan1996
closed
6 days ago
0
Issues related to the operating environment
#244
he-cw
closed
2 months ago
4
feature(wrh): add soft modulization in unizero
#243
ruiheng123
closed
2 months ago
1
feature(wrh): Add Harmony Dream loss balance in MuZero
#242
ruiheng123
closed
2 months ago
0
feature(pu): add UniZero multitask related pipeline
#241
puyuan1996
opened
2 months ago
0
WIP: feature(pu): add unizero multitask pipeline (only for reference now)
#240
puyuan1996
closed
2 months ago
0
Clipping reward in Atari while using invertible transform for reward and value target
#239
marintoro
closed
2 months ago
2
feature(xcy): add ReZero algo. and related configs
#238
puyuan1996
closed
3 months ago
0
feature(pu): add lightzero sphinx docs
#237
puyuan1996
closed
2 months ago
0
Could you provide detail code example to customize the env and algo that can run successfully in a entire file?
#236
yuuma002
closed
3 months ago
7
Instillation Failure on Mac (Spyder)
#235
JaredDeightonUTK
closed
3 months ago
3
feature(pu): add unizero citation and related info
#234
puyuan1996
closed
3 months ago
0
Bad performance on long run on MsPacman and SpaceInvaders
#233
marintoro
opened
3 months ago
10
feature(pu): add UniZero algo. and related configs/utils/envs/models
#232
puyuan1996
closed
3 months ago
0
Render and plot results: is there a code snippet?
#231
marintoro
closed
3 months ago
1
feature(rjy): add crowd md env new, and multi-head policy
#230
nighood
opened
3 months ago
0
Next