Open daidaiershidi opened 2 weeks ago
Hello,
Thanks for your attention to our work!
Actually, the reported results are all from the adapted models. The adaptation is quite easy and does not require any additional tuning or training, as follows:
For Task i, we apply the corresponding mask and rescaler to the unified task vector:
task_vector_adpt = vector_unified masks[i] rescalers[I]
In Section 4.5: Ablation Study of our paper, we also report the performance of the unified task vector (not adapted).
Hopefully we have addressed your concern.
Thank you for the effective work you've brought. I'd like to know if the result of model fusion is multiple adjusted models or a unified model?