thiagopbueno / model-aware-policy-optimization

MAPO: Model-Aware Policy Optimization algorithm
GNU General Public License v3.0
1 stars 0 forks source link

feat: add critic explained variance statistic #91

Closed 0xangelo closed 5 years ago

0xangelo commented 5 years ago

Add an additional debugging log that helps us see if the critic is predicting the returns. From Berkeley's 2017 Deep RL Bootcamp: Screen Shot 2019-08-24 at 23 59 28