issues
search
vwxyzjn
/
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
http://docs.cleanrl.dev
Other
4.91k
stars
566
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
ManiSkill2 - Fast Visual RL robotics cleanrl baselines
#366
StoneT2000
opened
1 year ago
2
Poetry install fails: WheelFileValidationError: Pytorch 1.12
#365
samholt
closed
1 year ago
2
code error when running dqn.py
#364
jianzuo
closed
1 year ago
4
Remove unnecessary arg in SAC
#362
vwxyzjn
closed
1 year ago
1
Is SAC exploration-noise used?
#361
StoneT2000
closed
1 year ago
1
Bug in RND Intrinsic Reward Normalization
#360
akarshkumar0101
opened
1 year ago
1
add complex observation atari ppo
#359
ttumiel
opened
1 year ago
1
LSTM weights should have separate orthogonal initializations for each gate
#358
Jammf
opened
1 year ago
0
Remove stale algorithm reference to `ppo_lstm_memory_env.py`
#357
vwxyzjn
closed
1 year ago
2
The file 'ppo_memory_env_lstm.py' can't be found
#356
leeivan1007
closed
1 year ago
2
add tianshou-like JAX+PPO+Mujoco
#355
quangr
opened
1 year ago
7
Add Muesli
#354
shermansiu
closed
7 months ago
10
PPO Complex Obs/Action Space
#353
ttumiel
opened
1 year ago
3
About PPO+Procgen code on Jax
#352
sglucas
closed
7 months ago
7
fix pre-commit
#351
vwxyzjn
closed
1 year ago
1
Reproduction of Muesli
#350
vwxyzjn
closed
7 months ago
23
lowercase Jimmy ba ignore in pre-commit
#349
timoklein
closed
1 year ago
1
Parallel-envs-friendly ppo_continuous_action.py
#348
vwxyzjn
opened
1 year ago
3
Added Polyak update rate for soft DQN target network updates
#347
manjavacas
closed
1 year ago
5
Add Polyak update to DQN
#346
manjavacas
closed
1 year ago
1
Dreamer v1 / v2 [Model-based RL]
#345
dosssman
closed
7 months ago
10
Qdagger: Reincarnate RL
#344
vwxyzjn
closed
1 year ago
18
Proper description of v_min and v_max in C51 parser
#343
qgallouedec
closed
1 year ago
1
Hotfix for #331
#342
vwxyzjn
closed
1 year ago
1
docs fix for ddpg and td3 to include jax implementation
#341
kinalmehta
closed
1 year ago
1
Deprecate `ppo_procgen.py` in favor of EnvPool
#340
vwxyzjn
closed
7 months ago
2
Add test cases
#339
vwxyzjn
closed
1 year ago
1
Sebula PPO (EnvPool's async API)
#338
vwxyzjn
closed
11 months ago
16
bug: incorrect logic in GAE calculation
#337
vwxyzjn
closed
1 year ago
1
update paper link to point to JMLR version
#336
kinalmehta
closed
1 year ago
1
update dqn-jax docs with CPU experiments
#335
kinalmehta
closed
1 year ago
1
chore: simplify syntax
#334
vwxyzjn
closed
1 year ago
1
What is the reason for returning mean in SAC get_action function if it's never used?
#333
sudonymously
opened
1 year ago
1
Fix ppo jax documentation links
#332
51616
closed
1 year ago
1
Add RPO to CleanRL
#331
masud99r
closed
1 year ago
24
Cleanrl for MARL
#330
vbaddam
closed
7 months ago
15
Fix target-network-frequency in DQN documentation
#329
qgallouedec
closed
1 year ago
1
Using jax scan for PPO + atari + envpool XLA
#328
51616
closed
1 year ago
17
Using jax scan for PPO + atari + envpool XLA
#327
51616
closed
1 year ago
8
Updated the pip install poetry lines in the docker files to contain -…
#326
LooseTerrifyingSpaceMonkey
closed
1 year ago
4
GitPod instance errors out when running poetry install
#325
LooseTerrifyingSpaceMonkey
closed
1 year ago
3
Typo in c51.py
#324
mynameisjanus
closed
1 year ago
1
Fix DQN target update frequency
#323
qgallouedec
closed
1 year ago
2
Target network isn't updated to the correct frequency when `target_network_frequency % train_frequency != 0`
#322
qgallouedec
closed
1 year ago
0
Torchx integration
#321
vwxyzjn
closed
1 year ago
2
Implement Gymnasium-compliant PPO script
#320
dtch1997
closed
1 year ago
19
Implement Gymnasium-compliant PPO
#319
dtch1997
closed
1 year ago
5
Implement Gymnasium-compliant PPO script
#318
dtch1997
closed
1 year ago
8
Benchmark `dqn_jax.py` using CPU only
#317
vwxyzjn
closed
1 year ago
0
Update cleanrl-supported-papers-projects.md
#316
masud99r
closed
1 year ago
2
Previous
Next