chufanchen / read-paper-and-code


NeurIPS 2022 | Disentangling Transfer in Continual Reinforcement Learning #41

Closed chufanchen closed 7 months ago

chufanchen commented 7 months ago

https://arxiv.org/abs/2209.13900

chufanchen commented 7 months ago

What enables transfer between RL tasks?

The study uses multi-head SAC (actor, critic, exploration policy) on the Continual World benchmark.

Transfer in CL

Exploration policy

Data rehearsal

Behavioral cloning (BC) improves transfer in the long-sequence scenario, but not in the two-task scenario.

Regularize the critic

Regularizing the critic deteriorates performance. The practical recommendation is to regularize only the actor.
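To make "regularize only the actor" concrete, here is a minimal sketch of a behavioral cloning penalty on the actor. It assumes diagonal Gaussian policy outputs (before any tanh squashing) and a KL term from the stored policy to the current one, averaged over rehearsal states. The function names, the KL direction, and the data layout are illustrative assumptions, not taken from the paper's code.

```python
import math


def gaussian_kl(mean_old, std_old, mean_new, std_new):
    """KL( N(mean_old, std_old^2) || N(mean_new, std_new^2) ) for diagonal Gaussians.

    Each argument is a list with one entry per action dimension.
    """
    total = 0.0
    for m1, s1, m2, s2 in zip(mean_old, std_old, mean_new, std_new):
        total += math.log(s2 / s1) + (s1 ** 2 + (m1 - m2) ** 2) / (2 * s2 ** 2) - 0.5
    return total


def actor_bc_loss(stored_outputs, current_outputs):
    """Average KL over rehearsal states (hypothetical helper).

    stored_outputs:  list of (mean, std) pairs saved when the old task finished.
    current_outputs: list of (mean, std) pairs from the current actor on the same states.
    Only the actor is penalized; the critic gets no such term.
    """
    kls = [
        gaussian_kl(m_old, s_old, m_new, s_new)
        for (m_old, s_old), (m_new, s_new) in zip(stored_outputs, current_outputs)
    ]
    return sum(kls) / len(kls)
```

In a training loop, a term like this would be added to the usual SAC actor loss with some weighting coefficient, while the critic loss stays unchanged.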

chufanchen commented 7 months ago

Metrics

Average performance

Forward transfer

Forgetting
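The three metrics can be sketched as below, following the definitions as I recall them from the Continual World benchmark. Average performance is the mean final success rate over tasks. Forward transfer is the normalized difference in area under the training curve between the continual run and a single-task reference run. Forgetting is the drop in success rate between the end of a task's training and the end of the whole sequence. Function names and the curve sampling scheme are assumptions for illustration.

```python
def average_performance(final_success):
    """Mean success rate over all tasks, measured at the end of the sequence."""
    return sum(final_success) / len(final_success)


def auc(curve):
    """Normalized trapezoidal area under a success-rate curve.

    `curve` holds success rates in [0, 1] sampled at evenly spaced training steps.
    """
    if len(curve) < 2:
        raise ValueError("need at least two samples to compute an area")
    area = sum((a + b) / 2 for a, b in zip(curve, curve[1:]))
    return area / (len(curve) - 1)


def forward_transfer(cl_curve, ref_curve):
    """(AUC_cl - AUC_ref) / (1 - AUC_ref) for one task.

    cl_curve:  the task's training curve inside the continual run.
    ref_curve: the same task trained from scratch (reference run).
    Undefined when the reference AUC equals 1.
    """
    auc_cl, auc_ref = auc(cl_curve), auc(ref_curve)
    return (auc_cl - auc_ref) / (1.0 - auc_ref)


def forgetting(success_after_training, success_at_end):
    """Drop in a task's success rate from the end of its own training to the end of the sequence."""
    return success_after_training - success_at_end
```

Per-task forward transfer and forgetting values would then be averaged over tasks to report a single number per method.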

chufanchen commented 7 months ago

Method

ClonEx-SAC combines behavioral cloning, improved exploration, and SAC.