issues
search
ai-glimpse
/
toyllm
Toy LLM
https://ai-glimpse.github.io/toyllm/
Apache License 2.0
1
stars
0
forks
source link
LLM: BN or LN
#41
Open
shenxiangzhuang
opened
7 months ago
shenxiangzhuang
commented
7 months ago
https://www.pinecone.io/learn/batch-layer-normalization/
https://stats.stackexchange.com/questions/474440/why-do-transformers-use-layer-norm-instead-of-batch-norm
https://arxiv.org/pdf/2201.03545.pdf
https://proceedings.mlr.press/v119/shen20e/shen20e.pdf