jviquerat / pbo

Policy-based optimization : single-step policy gradient seen as an evolution strategy
MIT License
17 stars 5 forks source link