AboudyKreidieh h-baselines issues

AboudyKreidieh / h-baselines

A repository of high-performing hierarchical reinforcement learning models and algorithms.

MIT License

277 stars, 41 forks

issues


Ring env merge

#211 AboudyKreidieh closed 3 years ago
1
Bump tensorflow from 1.15.2 to 2.3.1

#210 dependabot[bot] closed 3 years ago
1
logdir

#209 AboudyKreidieh closed 3 years ago
1
bug fix to observation type

#208 AboudyKreidieh closed 3 years ago
1
replaced the ring env with the original (smaller) one

#207 AboudyKreidieh closed 3 years ago
1
Dynamic lr 2

#206 AboudyKreidieh closed 3 years ago
1
Cleanup

#205 AboudyKreidieh closed 3 years ago
1
Multiagent fixes

#204 AboudyKreidieh closed 3 years ago
1
Speedups

#203 AboudyKreidieh closed 3 years ago
1
Cleanup

#202 AboudyKreidieh closed 3 years ago
1
added Base on-policy object

#201 AboudyKreidieh closed 3 years ago
1
L2 penalty

#200 AboudyKreidieh closed 4 years ago
1
Bump tensorflow from 1.15.2 to 1.15.4

#199 dependabot[bot] closed 3 years ago
2
info at done

#198 AboudyKreidieh closed 4 years ago
1
Multiagent fixes

#197 AboudyKreidieh closed 4 years ago
1
Fs env

#196 AboudyKreidieh closed 4 years ago
1
Model v3

#195 AboudyKreidieh closed 4 years ago
1
Multiagent fixes

#194 AboudyKreidieh closed 4 years ago
1
reset from warmup

#193 AboudyKreidieh closed 4 years ago
1
reduced code duplication in mixed autonomy envs

#192 AboudyKreidieh closed 4 years ago
1
added SnakeGather environment

#191 AboudyKreidieh closed 4 years ago
1
SwimmerGather

#190 AboudyKreidieh closed 4 years ago
1
Renamed OffPolicyRLAlgorithm -> RLAlgorithm

#189 AboudyKreidieh closed 4 years ago
1
minor cleanup

#188 AboudyKreidieh closed 4 years ago
1
PPO - multi-fcent / hrl

#187 AboudyKreidieh closed 2 years ago
2
PPO - fcnet

#186 AboudyKreidieh closed 4 years ago
1
PPO2

#185 AboudyKreidieh closed 4 years ago
0
Policy abstraction

#184 AboudyKreidieh closed 4 years ago
1
minor cleanup

#183 AboudyKreidieh closed 4 years ago
1
Bug fixes

#182 AboudyKreidieh closed 4 years ago
1
Brent/exploration

#181 brentgryffindor closed 2 years ago
0
Cleanup

#180 AboudyKreidieh closed 4 years ago
0
Ant Sub Goal Visualization

#179 brandontrabucco closed 4 years ago
0
Brandon merge 2

#178 AboudyKreidieh closed 4 years ago
1
PPO

#177 AboudyKreidieh closed 4 years ago
0
removed mentions of fingerprints and centralized_value_functions

#176 AboudyKreidieh closed 4 years ago
1
Bug fixes

#175 AboudyKreidieh closed 4 years ago
1
Humanoid

#174 AboudyKreidieh closed 4 years ago
1
Cleanup

#173 AboudyKreidieh closed 4 years ago
1
Plotter

#172 AboudyKreidieh closed 4 years ago
1
updated README in experiments folder

#171 AboudyKreidieh closed 4 years ago
1
Brandon merge

#170 AboudyKreidieh closed 2 years ago
1
Cleanup

#169 AboudyKreidieh closed 4 years ago
1
Multiagent HRL policy

#168 AboudyKreidieh closed 4 years ago
1
Ray sampler

#167 AboudyKreidieh closed 4 years ago
1
Multilevel description

#166 AboudyKreidieh closed 2 years ago
2
Multilevel HIRO

#165 AboudyKreidieh closed 4 years ago
1
Multiagent tests

#164 AboudyKreidieh closed 4 years ago
1
Variable inflows

#163 AboudyKreidieh closed 4 years ago
1
resolved some TODOs and FIXMEs

#162 AboudyKreidieh closed 4 years ago
1
