google-deepmind / mujoco_mpc

Real-time behaviour synthesis with MuJoCo, using Predictive Control
https://github.com/deepmind/mujoco_mpc
Apache License 2.0
951 stars 142 forks source link

Improved iLQG nominal rollout #57

Closed yuvaltassa closed 1 year ago

yuvaltassa commented 1 year ago

The iLQG nominal rollout, defined here, is inefficient. We should do this with a threadpool that rolls out with a parallel "line search" on the feedback term, that includes a 0 coefficient (no feedback, just open-loop). For this, the Action method should also get a feedback_scale parameter (defaulting to 1), and this line should use mju_addToScl.

This is a very simple change that I suspect will improve iLQG significantly.

thowell commented 1 year ago

Implemented in #62