google-deepmind / mujoco_mpc

Real-time behaviour synthesis with MuJoCo, using Predictive Control
https://github.com/deepmind/mujoco_mpc
Apache License 2.0

Cross entropy planner changes #284

Closed by thowell 4 months ago

thowell commented 5 months ago

Roll out the nominal "elite average" policy together with the noisy rollouts, plus minor fixes. @alberthli, this addresses #282.
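For readers unfamiliar with the idea, here is a minimal sketch of one way a cross-entropy step can evaluate the noise-free elite average alongside the noisy samples. It is not the code in this repository: the function names (`cem_step`, `optimize`, `quadratic_cost`), the parameters (`num_samples`, `num_elite`, `sigma`), and the toy quadratic cost are all assumptions made for illustration, and a "rollout" is reduced to a single cost evaluation.

```python
import random


def cem_step(nominal, cost, num_samples=32, num_elite=4, sigma=0.1, rng=None):
    """One hypothetical cross-entropy step over a flat parameter list."""
    if rng is None:
        rng = random.Random(0)
    # Candidate 0 is the nominal (previous elite average), evaluated without
    # noise so that it competes directly with the noisy samples.
    candidates = [list(nominal)]
    for _ in range(num_samples - 1):
        candidates.append([p + rng.gauss(0.0, sigma) for p in nominal])
    # "Roll out" every candidate; here that is just a cost evaluation.
    ranked = sorted(candidates, key=cost)
    elites = ranked[:num_elite]
    # The new nominal is the element-wise average of the elite candidates.
    dim = len(nominal)
    elite_avg = [sum(e[i] for e in elites) / num_elite for i in range(dim)]
    return elite_avg, cost(ranked[0])


def quadratic_cost(params):
    """Toy cost (an assumption): squared distance to the all-ones vector."""
    return sum((p - 1.0) ** 2 for p in params)


def optimize(initial, cost, iterations=30, seed=0, **kwargs):
    """Repeat cem_step, feeding each elite average back in as the nominal."""
    rng = random.Random(seed)
    nominal = list(initial)
    for _ in range(iterations):
        nominal, _ = cem_step(nominal, cost, rng=rng, **kwargs)
    return nominal


if __name__ == "__main__":
    result = optimize([0.0, 0.0], quadratic_cost)
    print(result, quadratic_cost(result))
```

Including the nominal among the candidates means its cost is always known, so the averaged policy can be compared with the samples it was built from rather than assumed to be good.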