google-deepmind / mujoco_mpc

Real-time behaviour synthesis with MuJoCo, using Predictive Control
https://github.com/deepmind/mujoco_mpc
Apache License 2.0
898 stars, 130 forks

Bug: `elite_avg` `state` field is always 0 in `cross_entropy` planner #282

Closed. alberthli closed this issue 4 months ago

alberthli commented 5 months ago

As discussed in chat with @thowell: when querying the best trajectory from an agent running CEM (the `cross_entropy` planner), the `state` field is all zeros. A rollout for the `elite_avg` trajectory needs to be computed on every planner iteration, and we should either remove the reset of `elite_avg` or move it to a different point relative to the nominal rollout.
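To illustrate the ordering problem, here is a minimal, self-contained sketch. It is not the mujoco_mpc code: `ToyTrajectory`, `Rollout`, `AverageElites`, `IterateResetLast`, and `IterateResetFirst` are hypothetical names, and the dynamics are a toy integrator. It only shows one way a reset placed after the rollout could leave the stored states at zero, assuming the reset clears the state buffer.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a planner trajectory (NOT the mujoco_mpc class).
struct ToyTrajectory {
  std::vector<double> actions;
  std::vector<double> states;

  // Zero both buffers, as a per-iteration reset might (assumption).
  void Reset(std::size_t horizon) {
    actions.assign(horizon, 0.0);
    states.assign(horizon, 0.0);
  }
};

// Toy dynamics x_{t+1} = x_t + u_t; records the state after each step.
void Rollout(ToyTrajectory& traj, double x0) {
  assert(traj.states.size() == traj.actions.size());
  double x = x0;
  for (std::size_t t = 0; t < traj.actions.size(); ++t) {
    x += traj.actions[t];
    traj.states[t] = x;
  }
}

// Accumulate the element-wise mean of the elite action sequences.
void AverageElites(const std::vector<ToyTrajectory>& elites,
                   ToyTrajectory& avg) {
  assert(!elites.empty());
  const double n = static_cast<double>(elites.size());
  for (const ToyTrajectory& e : elites) {
    for (std::size_t t = 0; t < avg.actions.size(); ++t) {
      avg.actions[t] += e.actions[t] / n;
    }
  }
}

// Problematic order: the reset runs after the rollout and wipes its result.
ToyTrajectory IterateResetLast(const std::vector<ToyTrajectory>& elites,
                               double x0) {
  const std::size_t horizon = elites.front().actions.size();
  ToyTrajectory avg;
  avg.Reset(horizon);
  AverageElites(elites, avg);
  Rollout(avg, x0);
  avg.Reset(horizon);  // states (and actions) are zero again
  return avg;
}

// Reordered: reset first, then average and roll out on every iteration.
ToyTrajectory IterateResetFirst(const std::vector<ToyTrajectory>& elites,
                                double x0) {
  const std::size_t horizon = elites.front().actions.size();
  ToyTrajectory avg;
  avg.Reset(horizon);
  AverageElites(elites, avg);
  Rollout(avg, x0);  // states now reflect the averaged actions
  return avg;
}
```

With two toy elites whose actions are `{1, 1}` and `{3, 1}`, `IterateResetFirst` returns non-zero states for the averaged trajectory, while `IterateResetLast` returns all zeros, which matches the symptom described above.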

thowell commented 5 months ago

Should be fixed in #284