jc-bao / policy-adaptation-survey

This repository is for comparing the prevailing adaptive control method in both control and learning communities.
Apache License 2.0
7 stars 1 forks source link

RMA not work as expected #7

Closed jc-bao closed 1 year ago

jc-bao commented 1 year ago

After the adaptation, the training procedure is fairly slow.

image
jc-bao commented 1 year ago

Sanity check1: make sure all parameter is observable given the past trajectories.

Oracle: directly calculate parameter from past experience ✅

image
jc-bao commented 1 year ago

Sanity check2: remove useless prediction parameters and calculate the result again.

Test case: only keep mass and relative input. Need to learn $m = \frac{F_u}{a+g}$ Result: the loss went down (from 0.15 -> 0.015), but the error is still relatively high.

jc-bao commented 1 year ago

current vector

jc-bao commented 1 year ago
jc-bao commented 1 year ago

Progress1: with only one parameter m

jc-bao commented 1 year ago

Trail: compress the input with 3D adaptor

image
jc-bao commented 1 year ago

Trail: permute before reshaping

Problem resolved!

image
jc-bao commented 1 year ago

Result

image