Closed jc-bao closed 1 year ago
Assumption The penalty for velocity is too large
Trail try different reward function
Result Now the control error becomes smaller.
🧑🏫 Lesson Tracking and Hovering are different tasks, and should use different reward functions.