NeurIPS 2022 | Disentangling Transfer in Continual Reinforcement Learning

chufanchen commented 7 months ago

chufanchen commented 7 months ago

Use multi-head SAC(actor, critic, exploration policy) on Continual World.

BC improves transfer in long-sequence scenario, but not in two-task scenario.

Regularize the critic deteriorates performance. The practical recommendation is to regularize only the actor.

chufanchen commented 7 months ago

Average performance

Forward transfer

Forgetting

chufanchen commented 7 months ago

ClonEx-SAC: behavioral cloning, improved exploration and SAC.

chufanchen / read-paper-and-code