synnada-ai / mithril

Mithril: A Modular Machine Learning Library for Model Composability
Apache License 2.0

[FEATURE] Align Linear Model Weight Shape with Other Frameworks #47

Open aturker-synnada opened 2 days ago

Feature Request

Describe the Feature

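The Linear model currently stores its weight with a shape that is transposed relative to other frameworks. The weight shape should be changed to match theirs.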

Motivation

Aligning the Linear layer's weight shape with other frameworks will make it easier to load pre-trained weights from those frameworks.
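
To illustrate the extra step that a transposed layout forces when loading weights, here is a minimal sketch in plain Python. It does not use Mithril's API. It assumes that the "other frameworks" layout is `(out_features, in_features)` with `y = x @ W.T` (as in PyTorch's `nn.Linear`) and that the transposed layout is `(in_features, out_features)` with `y = x @ W`. The issue does not state either shape explicitly, and the helper names are hypothetical.

```python
# Illustrative only: plain-Python matrices, not Mithril's API.
# Assumed layouts (not stated in the issue):
#   "other frameworks" layout: (out_features, in_features), y = x @ W.T
#   transposed layout:         (in_features, out_features), y = x @ W


def transpose(m):
    """Return the transpose of a list-of-rows matrix."""
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    """Multiply two list-of-rows matrices."""
    b_cols = transpose(b)
    return [[sum(p * q for p, q in zip(row, col)) for col in b_cols] for row in a]


def shape(m):
    """Return (rows, cols) of a list-of-rows matrix."""
    return (len(m), len(m[0]))


# Pretrained weight in the assumed (out_features=2, in_features=3) layout.
w_pretrained = [[1.0, 2.0, 3.0],
                [4.0, 5.0, 6.0]]
x = [[1.0, 0.0, -1.0]]  # one sample with 3 input features

# Reference output in the source framework's convention: y = x @ W.T
y_reference = matmul(x, transpose(w_pretrained))

# A layer that stores the transposed layout cannot take the weight as-is:
# it must be transposed on load before computing y = x @ W.
w_transposed_layout = transpose(w_pretrained)  # shape (3, 2)
y_transposed_layout = matmul(x, w_transposed_layout)
```

Under these assumptions both paths give the same output, but only after the explicit `transpose` call. With aligned shapes, the pretrained weight could be assigned directly, which is the convenience this request is after.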

Additional Context

This change will also impact LSTM and RNN layers.