issues
search
vwxyzjn
/
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
http://docs.cleanrl.dev
Other
5.54k
stars
631
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
TD3: fixed dimension of clipped_noise for target actions, added noise …
#281
dosssman
closed
2 years ago
11
Problem with multi-agent atari
#280
Matkicail
closed
2 years ago
2
TD3 policy noise bugs
#279
tomjur
closed
2 years ago
2
Algorithm: Option Critic methods
#278
DavidSlayback
opened
2 years ago
1
Update to support Gymnasium
#277
arjun-kg
closed
1 year ago
14
Are you interested in PRs for improvements in performance of PPO LSTM script?
#276
thomasbbrunner
opened
2 years ago
3
Bump oauthlib from 3.2.0 to 3.2.1 in /requirements
#275
dependabot[bot]
closed
2 years ago
2
Bump oauthlib from 3.2.0 to 3.2.1
#274
dependabot[bot]
closed
2 years ago
2
Bump mako from 1.2.1 to 1.2.2
#273
dependabot[bot]
closed
2 years ago
2
Draft: DroQ and TD3+TQC jax implementation
#272
araffin
opened
2 years ago
6
Poetry 1.2
#271
vwxyzjn
closed
2 years ago
5
SAC-discrete implementation
#270
timoklein
closed
1 year ago
35
Multi-objective hyperparameter optimization (DRAFT)
#269
vwxyzjn
opened
2 years ago
1
Update the hyperparameter optimization example script
#268
vwxyzjn
closed
2 years ago
1
WIP: add Diversity is All You Need implementation
#267
kinalmehta
opened
2 years ago
1
SAC discrete
#266
timoklein
closed
10 months ago
3
Multi-objective hyperparameter optimization
#265
vwxyzjn
opened
2 years ago
3
chore: remove unused parameters in jax implementations
#264
kinalmehta
closed
2 years ago
2
Upgrade gym version to 0.26.1
#263
AdityaGudimella
closed
10 months ago
2
Added TQC
#262
AdityaGudimella
closed
10 months ago
5
Working on adding Go1 sim
#261
Neo-X
closed
2 years ago
4
Fix for noise sampling for the TD3 exploration
#260
dosssman
closed
2 years ago
5
Action bias is added twice in TD3 algorithm implementation
#259
implausibleDeniability
closed
2 years ago
2
Add TQC to CleanRL
#258
AdityaGudimella
opened
2 years ago
5
Refactor dqn word choice
#257
vwxyzjn
closed
2 years ago
2
Data corruption due to run naming convention when running on Slurm/GridEngine
#256
Bam4d
opened
2 years ago
1
DQN on MountainCar
#255
qsh-zh
closed
2 years ago
3
Fix docs links in README.md
#254
vwxyzjn
closed
2 years ago
1
Clarify LICENSE info
#253
vwxyzjn
closed
2 years ago
1
Adding unit tests
#252
vwxyzjn
opened
2 years ago
0
Poetry install fails with "isaacgymenvs (rev poetry) is not satisfied"
#251
edlanglois
closed
2 years ago
6
Adding Double DQN
#250
AshwinSankar17
opened
2 years ago
1
RL Formulation
#249
Hannibal046
closed
2 years ago
1
Adding Hierarchical RL Algorithms
#248
DavidSlayback
closed
10 months ago
7
Poetry can't install torch nightly
#247
nyp0x
closed
2 years ago
2
Fix docs (badge, TD3 + JAX, and DQN + JAX)
#246
vwxyzjn
closed
2 years ago
1
Adding TRPO implementation
#245
merak0514
closed
10 months ago
5
AsyncVectorEnv
#244
Jogima-cyber
closed
2 years ago
3
Fix PPO + Isaac Gym Benchmark Script
#243
vwxyzjn
closed
2 years ago
1
Fix links in docs for `ppo_continuous_action_isaacgym.py`
#242
vwxyzjn
closed
2 years ago
1
Remove the github pages CI in favor of vercel
#241
vwxyzjn
closed
2 years ago
1
A question about the `PPO` algorithm
#240
fuyw
closed
2 years ago
5
Replace cloud utilities w/ `torchx`
#239
vwxyzjn
closed
10 months ago
1
Seed envpool environment explicitly
#238
jseppanen
closed
2 years ago
1
Slow `poetry` dependency locking time, and resolution
#237
vwxyzjn
closed
10 months ago
0
Ubuntu runner for poetry lock
#236
vwxyzjn
closed
2 years ago
1
Leverage CI to speed up poetry lock
#235
vwxyzjn
closed
2 years ago
1
Implement PPO-DNA algorithm for Atari
#234
jseppanen
opened
2 years ago
19
Isaac Gym Envs PPO updates
#233
vwxyzjn
closed
2 years ago
5
Isaac Gym Envs PPO updates
#232
markelsanz14
closed
2 years ago
2
Previous
Next