arcee-ai / mergekit

Tools for merging pretrained large language models.
GNU Lesser General Public License v3.0
4.8k stars 437 forks source link

Example of a config file for task_arithmetic 'negative' operation and a case for 'Task analogies' #400

Open eunbin079 opened 2 months ago

eunbin079 commented 2 months ago

In the README, the Task Arithmetic paper describes two methods:

**1. Applying a negative operation to the model weights to mitigate a specific behavior.

  1. Using task analogy (Task Vector D ≈ Task Vector C + (Task Vector B − Task Vector A)).** Could I find examples for these?

Detail

Negative Operation

Task Analogy

please help me!! Thank you

NextGenOP commented 1 month ago

have you tried mergekit-mega? As my understanding, It's support multiple merge at once.