Build ops for optimizing Policy parameters

thiagopbueno / model-aware-policy-optimization

MAPO: Model-Aware Policy Optimization algorithm

GNU General Public License v3.0

1 stars 0 forks source link

Closed thiagopbueno closed 5 years ago

thiagopbueno commented 5 years ago

Build ops for implementing a step of gradient descent over the Policy parameters given a TensorFlow optimizer and a batch of experiences.

0xangelo commented 5 years ago

Closed after #60