cbfinn / gps

Guided Policy Search
http://rll.berkeley.edu/gps/

ValueError #51

Closed watashihageteru closed 7 years ago

watashihageteru commented 7 years ago

Hi, I encountered the following error when I ran the guided policy search algorithm:

ValueError: Failed to find PD solution even for very large eta (check that dynamics and cost are reasonably well conditioned)!

Is there any solution?
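
The message suggests that the optimizer keeps increasing eta while looking for a positive definite (PD) matrix, and gives up once eta becomes very large. A minimal sketch of that kind of check, assuming the matrix has the form `cost_hessian / eta + kl_hessian` and that eta is doubled on each failure. The helper `find_pd_eta`, its parameters, and the example matrices are hypothetical and are not taken from the GPS code:

```python
import numpy as np


def find_pd_eta(cost_hessian, kl_hessian, eta0=1.0, max_eta=1e16):
    """Double eta until cost_hessian / eta + kl_hessian is positive definite.

    Hypothetical helper for illustration only; not the GPS implementation.
    """
    eta = eta0
    while eta < max_eta:
        q_uu = cost_hessian / eta + kl_hessian
        try:
            # Cholesky succeeds only for (numerically) positive definite input.
            np.linalg.cholesky(q_uu)
            return eta
        except np.linalg.LinAlgError:
            eta *= 2.0  # weight the KL term more heavily and retry
    raise ValueError("no positive definite solution found up to max_eta")


# Assumed example: indefinite cost curvature, well-conditioned KL term.
cost_hessian = np.diag([-3.0, 1.0])
kl_hessian = np.eye(2)
eta = find_pd_eta(cost_hessian, kl_hessian)
```

In this sketch the search only fails when the KL term itself is not positive definite, which is consistent with the hint in the message to check the conditioning of the dynamics and cost.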

P.S. The above error occurred with the newest version of GPS. A few months ago, I ran the same algorithm after changing iterations from 12 to 30 in hyperparams.py:

algorithm = {
    ...
    'iterations': 30,  # changed from 12
    ...
}

Then the iLQG trajectory (Trajectory Samples) became very different from that of the neural network policy (Policy Samples). At that point, I got the following warning: "Final KL divergence after DGD convergence is too high."
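
The warning refers to a KL divergence, which measures how far one distribution is from another. For reference, a minimal sketch of the closed-form KL divergence between two multivariate Gaussians. The function name `gaussian_kl` and the example inputs are assumptions for illustration, and this is not necessarily how GPS computes the quantity it reports:

```python
import numpy as np


def gaussian_kl(mu_p, sigma_p, mu_q, sigma_q):
    """KL(p || q) for Gaussians p = N(mu_p, sigma_p) and q = N(mu_q, sigma_q)."""
    k = mu_p.shape[0]
    diff = mu_q - mu_p
    sigma_q_inv = np.linalg.inv(sigma_q)
    # slogdet returns (sign, log|det|); the log-determinant is the second value.
    _, logdet_p = np.linalg.slogdet(sigma_p)
    _, logdet_q = np.linalg.slogdet(sigma_q)
    return 0.5 * (
        np.trace(sigma_q_inv @ sigma_p)
        + diff @ sigma_q_inv @ diff
        - k
        + logdet_q
        - logdet_p
    )


# Identical distributions, and unit-variance 1-D Gaussians whose means differ by 1.
kl_same = gaussian_kl(np.zeros(2), np.eye(2), np.zeros(2), np.eye(2))
kl_shifted = gaussian_kl(np.zeros(1), np.eye(1), np.ones(1), np.eye(1))
```

A large value means the two distributions disagree strongly, which would match the observation that the trajectory samples and the policy samples look very different.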

watashihageteru commented 7 years ago

Additional information: the above error and warning occurred in the MuJoCo simulator:

python python/gps/gps_main.py mjc_badmm_example