issues
search
AboudyKreidieh
/
h-baselines
A repository of high-performing hierarchical reinforcement learning models and algorithms.
MIT License
277
stars
41
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Ring env merge
#211
AboudyKreidieh
closed
3 years ago
1
Bump tensorflow from 1.15.2 to 2.3.1
#210
dependabot[bot]
closed
3 years ago
1
logdir
#209
AboudyKreidieh
closed
3 years ago
1
bug fix to observation type
#208
AboudyKreidieh
closed
3 years ago
1
replaced the ring env with the original (smaller) one
#207
AboudyKreidieh
closed
3 years ago
1
Dynamic lr 2
#206
AboudyKreidieh
closed
3 years ago
1
Cleanup
#205
AboudyKreidieh
closed
3 years ago
1
Multiagent fixes
#204
AboudyKreidieh
closed
3 years ago
1
Speedups
#203
AboudyKreidieh
closed
3 years ago
1
Cleanup
#202
AboudyKreidieh
closed
3 years ago
1
added Base on-policy object
#201
AboudyKreidieh
closed
3 years ago
1
L2 penalty
#200
AboudyKreidieh
closed
4 years ago
1
Bump tensorflow from 1.15.2 to 1.15.4
#199
dependabot[bot]
closed
3 years ago
2
info at done
#198
AboudyKreidieh
closed
4 years ago
1
Multiagent fixes
#197
AboudyKreidieh
closed
4 years ago
1
Fs env
#196
AboudyKreidieh
closed
4 years ago
1
Model v3
#195
AboudyKreidieh
closed
4 years ago
1
Multiagent fixes
#194
AboudyKreidieh
closed
4 years ago
1
reset from warmup
#193
AboudyKreidieh
closed
4 years ago
1
reduced code duplication in mixed autonomy envs
#192
AboudyKreidieh
closed
4 years ago
1
added SnakeGather environment
#191
AboudyKreidieh
closed
4 years ago
1
SwimmerGather
#190
AboudyKreidieh
closed
4 years ago
1
Renamed OffPolicyRLAlgorithm -> RLAlgorithm
#189
AboudyKreidieh
closed
4 years ago
1
minor cleanup
#188
AboudyKreidieh
closed
4 years ago
1
PPO - multi-fcent / hrl
#187
AboudyKreidieh
closed
2 years ago
2
PPO - fcnet
#186
AboudyKreidieh
closed
4 years ago
1
PPO2
#185
AboudyKreidieh
closed
4 years ago
0
Policy abstraction
#184
AboudyKreidieh
closed
4 years ago
1
minor cleanup
#183
AboudyKreidieh
closed
4 years ago
1
Bug fixes
#182
AboudyKreidieh
closed
4 years ago
1
Brent/exploration
#181
brentgryffindor
closed
2 years ago
0
Cleanup
#180
AboudyKreidieh
closed
4 years ago
0
Ant Sub Goal Visualization
#179
brandontrabucco
closed
4 years ago
0
Brandon merge 2
#178
AboudyKreidieh
closed
4 years ago
1
PPO
#177
AboudyKreidieh
closed
4 years ago
0
removed mentions of fingerprints and centralized_value_functions
#176
AboudyKreidieh
closed
4 years ago
1
Bug fixes
#175
AboudyKreidieh
closed
4 years ago
1
Humanoid
#174
AboudyKreidieh
closed
4 years ago
1
Cleanup
#173
AboudyKreidieh
closed
4 years ago
1
Plotter
#172
AboudyKreidieh
closed
4 years ago
1
updated README in experiments folder
#171
AboudyKreidieh
closed
4 years ago
1
Brandon merge
#170
AboudyKreidieh
closed
2 years ago
1
Cleanup
#169
AboudyKreidieh
closed
4 years ago
1
Multiagent HRL policy
#168
AboudyKreidieh
closed
4 years ago
1
Ray sampler
#167
AboudyKreidieh
closed
4 years ago
1
Multilevel description
#166
AboudyKreidieh
closed
2 years ago
2
Multilevel HIRO
#165
AboudyKreidieh
closed
4 years ago
1
Multiagent tests
#164
AboudyKreidieh
closed
4 years ago
1
Variable inflows
#163
AboudyKreidieh
closed
4 years ago
1
resolved some TODOs and FIXMEs
#162
AboudyKreidieh
closed
4 years ago
1
Previous
Next