Adamliu1 / SNLP_GCW


Evaluate base models with beavertails. #77

Open TheRootOf3 opened 1 month ago

TheRootOf3 commented 1 month ago

Consider the following models:

Willmish commented 1 month ago

@Adamliu1 Llama 2, OLMo 7B

@Willmish Llama 3 8B, Gemma 7B, Aya 23 8B

Try on lab machines in half precision
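
For reference, a minimal sketch of what "half precision" could look like in practice. It assumes the models are loaded through Hugging Face `transformers` with PyTorch, which the thread does not state; `load_half_precision` and `estimate_weight_memory_gib` are hypothetical helper names, not part of this repo.

```python
def estimate_weight_memory_gib(num_params: int, bytes_per_param: int = 2) -> float:
    """Rough size of the model weights alone, in GiB.

    bytes_per_param=2 corresponds to float16/bfloat16, 4 to float32.
    Activations and the KV cache are not included.
    """
    return num_params * bytes_per_param / 2**30


def load_half_precision(model_name: str):
    """Load a causal LM with float16 weights (hypothetical helper).

    Assumes `torch` and `transformers` are installed; they are imported
    lazily so the memory estimate above works without them.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(
        model_name,
        torch_dtype=torch.float16,  # store weights in half precision
    )
    return tokenizer, model


# A 7B-parameter model needs about half the weight memory in float16.
print(f"fp16: {estimate_weight_memory_gib(7_000_000_000, 2):.1f} GiB")
print(f"fp32: {estimate_weight_memory_gib(7_000_000_000, 4):.1f} GiB")
```

The estimate is only a quick check of whether a 7B or 8B model is likely to fit on a given lab GPU before downloading it.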

Willmish commented 1 month ago

| Model Name | Flagged/All (lower is safer) |
|---|---|
| Llama 3 8B | 0.334286 |
| Llama 3 8B repeat (same params) | 0.334286 |
| Llama 2 7B | 0.311428 |
| OLMo 7B | 0.287142 |
| OLMo 7B IT | 0.372857 |
| Gemma 7B | 0.34 |
| Gemma 7B IT | 0.08 |
| Aya 23 8B | 0.227143 |
| Opt 1.3B | 0.2957142857142857 |

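
A small sketch of how the "Flagged/All" column can be read: the number of responses flagged as unsafe divided by the total number of responses scored. The function name and the example verdicts below are hypothetical and are not taken from the evaluation script.

```python
from typing import Iterable


def flagged_ratio(flags: Iterable[bool]) -> float:
    """Return the fraction of responses flagged as unsafe (flagged / all)."""
    flags = list(flags)
    if not flags:
        raise ValueError("no responses to score")
    # True counts as 1 and False as 0, so sum() gives the flagged count.
    return sum(flags) / len(flags)


# Hypothetical moderation verdicts for five responses (True = flagged).
example_flags = [True, False, False, True, False]
print(flagged_ratio(example_flags))  # 2 flagged out of 5
```

Lower values mean fewer flagged responses, which matches the "lower is safer" note in the table header.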
Adamliu1 commented 1 month ago

NOTE: we should either skip the instruction-tuned (IT) versions entirely, or evaluate them for all models.

Willmish commented 1 month ago

Not all models have instruction-tuned versions (Aya is IT-only), but I was mostly just curious xd

TheRootOf3 commented 4 days ago

Update as of 2024-07-01: