TransformerLensOrg / TransformerLens

A library for mechanistic interpretability of GPT-style language models
https://transformerlensorg.github.io/TransformerLens/
MIT License
1.17k stars 241 forks source link

Move out pretrained weight conversions #647

Closed richardkronick closed 1 week ago

richardkronick commented 1 week ago

Description

Moving all of the weight conversion functions from loading_from_pretrained.py into their own individual files to improve manageability. Also included one unit test to ensure that the functions are accessible. Future unit tests will be added.

Fixes # (issue)

Type of change

Please delete options that are not relevant.

Screenshots

Please attach before and after screenshots of the change if applicable.

Checklist: