About number of merged models

Hello,

Thanks for your attention to our work!

All of the reported results come from the adapted models. The adaptation is simple and requires no additional tuning or training:

For Task i, we apply the corresponding mask and rescaler to the unified task vector:

task_vector_adpt = vector_unified * masks[i] * rescalers[i]
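As a rough illustration of that line, here is a minimal sketch in plain Python. It assumes the mask is applied element-wise and the rescaler is a single scalar per task; the helper name `adapt_task_vector` and all numbers are hypothetical toy values, not taken from the paper or the repository. In practice these would be tensors rather than lists.

```python
def adapt_task_vector(vector_unified, mask, rescaler):
    """Return vector_unified * mask * rescaler, computed element-wise.

    Assumption: `mask` is a 0/1 list with the same length as
    `vector_unified`, and `rescaler` is one scalar for the whole task.
    """
    if len(vector_unified) != len(mask):
        raise ValueError("mask must have the same length as the unified task vector")
    return [rescaler * m * v for v, m in zip(vector_unified, mask)]


# Hypothetical toy values for two tasks (not real results).
vector_unified = [0.5, -0.2, 0.8, -0.4]
masks = [
    [1, 0, 1, 1],  # mask for task 0
    [0, 1, 1, 0],  # mask for task 1
]
rescalers = [2.0, 0.5]  # one scalar rescaler per task

i = 0  # index of the task to adapt for
task_vector_adpt = adapt_task_vector(vector_unified, masks[i], rescalers[i])
print(task_vector_adpt)
```

Entries where the mask is 0 are zeroed out, and the remaining entries are scaled by the task's rescaler, so no training step is involved.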

In Section 4.5 (Ablation Study) of our paper, we also report the performance of the unified task vector without adaptation.

Hopefully we have addressed your concern.

harveyhuang18 / EMR_Merging, issue #2