thiagopbueno / model-aware-policy-optimization

MAPO: Model-Aware Policy Optimization algorithm
GNU General Public License v3.0
1 stars 0 forks source link

Build ops for optimizing Policy parameters #8

Closed thiagopbueno closed 5 years ago

thiagopbueno commented 5 years ago

Build ops for implementing a step of gradient descent over the Policy parameters given a TensorFlow optimizer and a batch of experiences.

0xangelo commented 5 years ago

Closed after #60