Open silvery107 opened 3 weeks ago
The observation in training is mismatched with deployment. The base_pos should be removed.
base_pos
I was trying to align the RL reward to MPC cost, but it turns out it's better to go without position tracking for both stages.
The observation in training is mismatched with deployment. The
base_pos
should be removed.I was trying to align the RL reward to MPC cost, but it turns out it's better to go without position tracking for both stages.