Open mmatena opened 3 years ago
Add more details, but the idea is say we are given model 1, which is trained on task A. Then we train model 1 on task B to get model 2, which catastrophically forgets task A. We merge model 1 with model 2 to see if we can do well on both tasks. Traditional EWC would be a baseline.
TODO
TODO
Add more details, but the idea is that we partition the train set into N disjoint sets. Then we train models on each partition separately and merge them to cheaply join their work. There could be multiple steps of train then merge. Using the Empirical Fisher and computing it online during training would make this more efficient.
This federated learning paper might be relevant.
TODO
The idea is that I'll add a comment with a motivation and description for experiments I plan to do. I'll keep adding details to the comment as the experiment gets more flushed out. When I actually run the experiment, I'll add details and results of what I actually did to issue #13.