jc-bao / policy-adaptation-survey

This repository compares the prevailing adaptive control methods in the control and learning communities.

Apache License 2.0

RMA does not work as expected #7

Closed jc-bao closed 1 year ago

jc-bao commented 1 year ago

After the adaptation, the training procedure is fairly slow.

jc-bao commented 1 year ago

Sanity check 1: make sure every parameter is observable given the past trajectories.

Oracle: directly calculate the parameters from past experience ✅

jc-bao commented 1 year ago

Sanity check 2: remove the useless prediction parameters and calculate the result again.

Test case: only keep mass and relative input. Need to learn $m = \frac{F_u}{a+g}$.

Result: the loss went down (from 0.15 to 0.015), but the error is still relatively high.
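For reference, the relation $m = \frac{F_u}{a+g}$ can be evaluated directly on a trajectory, which gives a baseline to compare the learned estimate against. The snippet below is a minimal sketch, not code from this repository: it assumes a 1D vertical system with $F_u = m(a+g)$, a constant $g = 9.81$, and a synthetic noise-free trajectory. The names `oracle_mass`, `thrusts`, and `accels` are hypothetical.

```python
import random

G = 9.81  # gravitational acceleration in m/s^2 (assumed constant)


def oracle_mass(thrusts, accels, g=G):
    """Least-squares estimate of m from F_u = m * (a + g) over a trajectory.

    Minimizing sum((F_u - m * (a + g))**2) over m gives
    m = sum(F_u * (a + g)) / sum((a + g)**2).
    """
    num = sum(f * (a + g) for f, a in zip(thrusts, accels))
    den = sum((a + g) ** 2 for a in accels)
    return num / den


# Synthetic, noise-free trajectory for a hypothetical system (not real data).
rng = random.Random(0)
true_mass = 0.8
thrusts = [rng.uniform(5.0, 12.0) for _ in range(50)]
accels = [f / true_mass - G for f in thrusts]  # a = F_u / m - g

estimate = oracle_mass(thrusts, accels)
print(f"true mass: {true_mass}, oracle estimate: {estimate:.6f}")
```

Using a least-squares fit over all time steps rather than a single-step ratio keeps the estimate stable when individual samples have $a + g$ close to zero or carry noise.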

jc-bao commented 1 year ago

current vector

How can we learn the desired function that maps the trajectory to the parameters?

jc-bao commented 1 year ago

Update loss function.
SysID paper.
Sensitivity analysis.

jc-bao commented 1 year ago

Progress 1: with only one parameter, $m$

The formula is learned with an MSE of 0.05.
However, when another parameter is added, the loss goes up to 0.9.

jc-bao commented 1 year ago

Trial: compress the input with 3D adaptor

Fail: the MLP performs better than the 3D adaptor

jc-bao commented 1 year ago

Trial: permute before reshaping

Problem resolved!
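To illustrate why permuting before reshaping matters, here is a minimal NumPy sketch. The shapes are assumptions for illustration only (the thread does not state them): a history buffer of shape `(B, T, D)` that needs to become `(B, D, T)` so that each row holds one feature across time. In PyTorch the analogous call would be `tensor.permute(0, 2, 1)`.

```python
import numpy as np

# Hypothetical history buffer: B trajectories, T time steps, D features per step.
B, T, D = 2, 4, 3
history = np.arange(B * T * D).reshape(B, T, D)

# Reshape alone only reinterprets the flat memory order, so each output row
# mixes values from different features and time steps.
wrong = history.reshape(B, D, T)

# Permuting the axes first keeps each feature's time series together.
right = np.transpose(history, (0, 2, 1))

# Feature 0 of trajectory 0 across all time steps.
feature0_over_time = history[0, :, 0]

print("expected:", feature0_over_time)
print("reshape only:", wrong[0, 0])
print("permute:", right[0, 0])
```

Both results have the same shape, so the bug raises no error; only the contents differ, which is consistent with a loss that decreases but plateaus at a high value.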

jc-bao commented 1 year ago