kyegomez / BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
https://discord.gg/qUtxnK2NMf
MIT License
1.69k stars 155 forks source link

added if else statement to handle post_act_ln #54

Closed Hiromasa-H closed 6 months ago

Hiromasa-H commented 6 months ago
Hiromasa-H commented 6 months ago

I feel the solution presented in the original issue may be much cleaner. I will update the code if necessary.