chufanchen / read-paper-and-code

0 stars 0 forks source link

Continual Reinforcement Learning with Decision Transformer #156

Open chufanchen opened 5 months ago

chufanchen commented 5 months ago
Task L2M LoRA
hammer-v2 Cell Cell
push-wall-v2 Cell Cell
faucet-close-v2 Cell Cell
push-back-v2 Cell Cell
stick-pull-v2 Cell Cell
handle-press-side-v2 Cell Cell
push-v2 Cell Cell
shelf-place-v2 Cell Cell
window-close-v2 Cell Cell
peg-unplug-side-v2 Cell Cell
chufanchen commented 5 months ago

Try Slow Learner #158

chufanchen commented 5 months ago

PCA and t-SNE

chufanchen commented 5 months ago

LoRA init

\theta_t = (1-\alpha)\theta_{t-1}^\star + \alpha \theta_{t-2}^\star
chufanchen commented 5 months ago

Rehearsal

129