TransformerLensOrg / TransformerLens

A library for mechanistic interpretability of GPT-style language models
https://transformerlensorg.github.io/TransformerLens/
MIT License

[Question] loading Llama3-8B-instruct to HookedTransformer got a warning saying You are not using LayerNorm, so the writing weights can't be centered! Skipping! #653

Closed by wangyifei0047 2 months ago

wangyifei0047 commented 2 months ago

Question

My GPU server can't download checkpoints directly from huggingface.co, so I wrote a function to load the model from a local directory and then pass it to HookedTransformer. This raises the warning above, which confuses me a little, so I have a few questions:

- Is this function configured properly?
- Does this warning affect the results of mechanistic interpretability?

ArthurConmy commented 2 months ago

Read this: https://github.com/TransformerLensOrg/TransformerLens/blob/main/further_comments.md#centering-writing-weights-center_writing_weight

The warning will have no impact on your results. Feel free to pass center_writing_weights=False to the from_pretrained call to stop the warning from yapping at you.