Closed tirthasheshpatel closed 3 months ago
LLaMA and Mistral Layer Norm should always run in float32. This PR corrects this bug in our implementation.
float32
@mattdangerw Addressed the review comments. Let me know if the diff looks good to you now!
Looks good besides that one potential name change. Thanks!
LLaMA and Mistral Layer Norm should always run in
float32
. This PR corrects this bug in our implementation.