arcee-ai / DAM


Trainable/Freezable Layer Norm, Embedding Coefficients and Seamless Logits Computation on-the-fly #33

Closed shamanez closed 1 month ago

shamanez commented 1 month ago


This PR extends the model merging process so that merging coefficients for the layer norm and embedding layers can be passed in explicitly. Users can choose whether each of these components is trainable or frozen, which improves flexibility and helps mitigate performance degradation when fine-tuning specific layers.
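
As a rough illustration of the trainable-versus-frozen idea, here is a minimal PyTorch sketch. It is not the code from this PR: the class name `MergedLayerNorm`, its arguments, the uniform initialisation, and the use of a standard LayerNorm (rather than whatever norm the merged models actually use) are all assumptions made for the example.

```python
import torch
import torch.nn as nn


class MergedLayerNorm(nn.Module):
    """Hypothetical sketch: blend the LayerNorm weights of several models
    using one coefficient per model; the coefficients can be trainable or frozen."""

    def __init__(self, weights, trainable=True, eps=1e-5):
        super().__init__()
        # Stack the source models' LayerNorm weights: shape (num_models, hidden).
        # Stored as a buffer, so the source weights themselves are never trained.
        self.register_buffer("weights", torch.stack(weights))
        # One merging coefficient per source model, initialised to a uniform average
        # (an assumption for this sketch).
        init = torch.full((len(weights),), 1.0 / len(weights))
        # requires_grad decides whether an optimizer can update the coefficients.
        self.coeffs = nn.Parameter(init, requires_grad=trainable)
        self.eps = eps

    def forward(self, x):
        # Weighted sum of the source weights, recomputed on every forward pass.
        merged = (self.coeffs.unsqueeze(1) * self.weights).sum(dim=0)
        return nn.functional.layer_norm(
            x, (x.shape[-1],), weight=merged, eps=self.eps
        )
```

With `trainable=False` the coefficients stay at their initial values and receive no gradient; with `trainable=True` they are ordinary parameters and can be handed to an optimizer alongside any other merging coefficients.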

Key Features: