issues
search
jviquerat
/
pbo
Policy-based optimization : single-step policy gradient seen as an evolution strategy
MIT License
17
stars
5
forks
source link
Cartpole
#8
Closed
jviquerat
closed
2 years ago