arcee-ai / mergekit

Tools for merging pretrained large language models.
GNU Lesser General Public License v3.0
4.88k stars 446 forks source link

Set Gemma2 lm_head optional instead of aliasing to embed_tokens #406

Closed cg123 closed 3 months ago

cg123 commented 3 months ago

Resolves #385.