vwxyzjn cleanrl issues - Githubissues

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

http://docs.cleanrl.dev

License: Other

4.91k stars, 566 forks

issues

ManiSkill2 - Fast Visual RL robotics cleanrl baselines

#366 StoneT2000 opened 1 year ago
2
Poetry install fails: WheelFileValidationError: Pytorch 1.12

#365 samholt closed 1 year ago
2
code error when running dqn.py

#364 jianzuo closed 1 year ago
4
Remove unnecessary arg in SAC

#362 vwxyzjn closed 1 year ago
1
Is SAC exploration-noise used?

#361 StoneT2000 closed 1 year ago
1
Bug in RND Intrinsic Reward Normalization

#360 akarshkumar0101 opened 1 year ago
1
add complex observation atari ppo

#359 ttumiel opened 1 year ago
1
LSTM weights should have separate orthogonal initializations for each gate

#358 Jammf opened 1 year ago
0
Remove stale algorithm reference to `ppo_lstm_memory_env.py`

#357 vwxyzjn closed 1 year ago
2
The file 'ppo_memory_env_lstm.py' can't be found

#356 leeivan1007 closed 1 year ago
2
add tianshou-like JAX+PPO+Mujoco

#355 quangr opened 1 year ago
7
Add Muesli

#354 shermansiu closed 7 months ago
10
PPO Complex Obs/Action Space

#353 ttumiel opened 1 year ago
3
About PPO+Procgen code on Jax

#352 sglucas closed 7 months ago
7
fix pre-commit

#351 vwxyzjn closed 1 year ago
1
Reproduction of Muesli

#350 vwxyzjn closed 7 months ago
23
lowercase Jimmy ba ignore in pre-commit

#349 timoklein closed 1 year ago
1
Parallel-envs-friendly ppo_continuous_action.py

#348 vwxyzjn opened 1 year ago
3
Added Polyak update rate for soft DQN target network updates

#347 manjavacas closed 1 year ago
5
Add Polyak update to DQN

#346 manjavacas closed 1 year ago
1
Dreamer v1 / v2 [Model-based RL]

#345 dosssman closed 7 months ago
10
Qdagger: Reincarnate RL

#344 vwxyzjn closed 1 year ago
18
Proper description of v_min and v_max in C51 parser

#343 qgallouedec closed 1 year ago
1
Hotfix for #331

#342 vwxyzjn closed 1 year ago
1
docs fix for ddpg and td3 to include jax implementation

#341 kinalmehta closed 1 year ago
1
Deprecate `ppo_procgen.py` in favor of EnvPool

#340 vwxyzjn closed 7 months ago
2
Add test cases

#339 vwxyzjn closed 1 year ago
1
Sebula PPO (EnvPool's async API)

#338 vwxyzjn closed 11 months ago
16
bug: incorrect logic in GAE calculation

#337 vwxyzjn closed 1 year ago
1
update paper link to point to JMLR version

#336 kinalmehta closed 1 year ago
1
update dqn-jax docs with CPU experiments

#335 kinalmehta closed 1 year ago
1
chore: simplify syntax

#334 vwxyzjn closed 1 year ago
1
What is the reason for returning mean in SAC get_action function if it's never used?

#333 sudonymously opened 1 year ago
1
Fix ppo jax documentation links

#332 51616 closed 1 year ago
1
Add RPO to CleanRL

#331 masud99r closed 1 year ago
24
Cleanrl for MARL

#330 vbaddam closed 7 months ago
15
Fix target-network-frequency in DQN documentation

#329 qgallouedec closed 1 year ago
1
Using jax scan for PPO + atari + envpool XLA

#328 51616 closed 1 year ago
17
Using jax scan for PPO + atari + envpool XLA

#327 51616 closed 1 year ago
8
Updated the pip install poetry lines in the docker files to contain -…

#326 LooseTerrifyingSpaceMonkey closed 1 year ago
4
GitPod instance errors out when running poetry install

#325 LooseTerrifyingSpaceMonkey closed 1 year ago
3
Typo in c51.py

#324 mynameisjanus closed 1 year ago
1
Fix DQN target update frequency

#323 qgallouedec closed 1 year ago
2
Target network isn't updated to the correct frequency when `target_network_frequency % train_frequency != 0`

#322 qgallouedec closed 1 year ago
0
Torchx integration

#321 vwxyzjn closed 1 year ago
2
Implement Gymnasium-compliant PPO script

#320 dtch1997 closed 1 year ago
19
Implement Gymnasium-compliant PPO

#319 dtch1997 closed 1 year ago
5
Implement Gymnasium-compliant PPO script

#318 dtch1997 closed 1 year ago
8
Benchmark `dqn_jax.py` using CPU only

#317 vwxyzjn closed 1 year ago
0
Update cleanrl-supported-papers-projects.md

#316 masud99r closed 1 year ago
2
