harisethuram closed this issue 1 month ago
Yes, of course! To create the `top_heads` list, we aggregate the indirect effects across all the abstractive datasets on which the model outperforms baseline performance (see Appendix G in the paper, and E.2 for an example of baseline ICL performance; we use the majority label as our baseline).
For GPT-J, for example, 18 tasks were used to compute the AIE of each head, based on GPT-J's ICL performance:
```python
gptj_tasks = ['antonym', 'capitalize', 'capitalize_first_letter', 'country-capital',
              'country-currency', 'english-french', 'english-german', 'english-spanish',
              'landmark-country', 'lowercase_first_letter', 'national_parks', 'park-country',
              'person-sport', 'present-past', 'product-company', 'sentiment', 'singular-plural', 'synonym']
```
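The aggregation described above can be sketched roughly as follows. This is an illustrative example, not the repo's actual code: `top_heads_from_aie` and the per-task AIE dictionary are hypothetical names, and the array shapes assume each task contributes an `(n_layers, n_heads)` AIE matrix.

```python
import numpy as np

def top_heads_from_aie(aie_per_task, k=10):
    """Average AIE over tasks and return the k strongest (layer, head) pairs.

    aie_per_task: dict mapping task name -> (n_layers, n_heads) array of
    average indirect effects for that task (hypothetical format).
    """
    # Mean AIE per head, averaged across all qualifying tasks
    mean_aie = np.mean(np.stack(list(aie_per_task.values())), axis=0)
    # Flat indices of the k largest entries, sorted descending by mean AIE
    flat = np.argsort(mean_aie, axis=None)[::-1][:k]
    layers, heads = np.unravel_index(flat, mean_aie.shape)
    return [(int(l), int(h), float(mean_aie[l, h])) for l, h in zip(layers, heads)]

# Toy usage: random data standing in for real per-task AIE tensors
rng = np.random.default_rng(0)
aie = {task: rng.standard_normal((28, 16)) for task in ['antonym', 'capitalize']}
print(top_heads_from_aie(aie, k=3))
```

Only tasks where the model beats the majority-label baseline would be included in `aie_per_task`, per the filtering criterion above.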
Let me know if you have more questions, thanks!
I would like to construct the `top_heads` list in `compute_universal_function_vector` for new models (such as Llama 3). In 1 and 10, you mention the script that computes the activations for one task, and the prompt settings. However, I'm not sure which datasets specifically you aggregate over to compute the average activations. Could you clarify this? Thanks!