tech-srl/layer_norm_expressivity_role
Code for the paper "On the Expressivity Role of LayerNorm in Transformers' Attention" (Findings of ACL'2023)
45 stars, 3 forks
issues
The experimental results cannot be replicated. The scripts do not work!
#1 · liveck opened 1 week ago
2