jviquerat / pbo

Policy-based optimization: single-step policy gradient seen as an evolution strategy
MIT License
17 stars 5 forks

Cartpole #8

jviquerat closed 2 years ago