jc-bao / policy-adaptation-survey

This repository is for comparing the prevailing adaptive control method in both control and learning communities.
Apache License 2.0
7 stars 1 forks source link

Policy dose not converge in quadrotor transportation scenario. #14

Closed jc-bao closed 1 year ago

jc-bao commented 1 year ago

The newly designed quadrotor transportation case cannot hover at a fixed point.

Learned policy

image

The policy tends to hit the boundary of flying space.

Learning curve

image

It cannot converge after a long time of training.

Check with PID

The PID controller can track the center point very precisely. pid_attitude_thrust

jc-bao commented 1 year ago

Check: can the drone hover at a fixed point when the object weight is set to zero?

jc-bao commented 1 year ago

Resolved: the dynamic error

Error dynamics cause the problem in the multi-drone case. Be careful with a GPU-based environment during the tensor operation!