jc-bao / policy-adaptation-survey

This repository is for comparing the prevailing adaptive control method in both control and learning communities.
Apache License 2.0
7 stars 1 forks source link

The drone hovering at a fixed point close to the origin instead of converging to the center. #5

Closed jc-bao closed 1 year ago

jc-bao commented 1 year ago
Test case: fixed mass to 0.01, fixed the constant disturbance force to 0.8 times object mass*gravity, fixed damping parameter to 0.
image image
jc-bao commented 1 year ago

Check the policy at the origin point

Make sure the drone is initialized at the centre, then it can hover here.

jc-bao commented 1 year ago

Check the policy without the fixed disturbance

Final error is large Cannot fully converge to zero along y speed quickly converge to zero during the training
image image image

Meanwhile, the PID controller can steer the agent to zero quite precisely.

Conclusion: RL-related issue.

jc-bao commented 1 year ago

Update RL to use a non-linear reward. ✅

Current reward function regarding the distance to the centre:

image
jc-bao commented 1 year ago

After adding all uncertain parameters

pi vis plot
Experterror=0.027 image image
Vanilla error=0.106 image image