issues
search
bkitano
/
llama-from-scratch
Llama from scratch, or How to implement a paper without crying
https://blog.briankitano.com/llama-from-scratch/
477
stars
46
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Spelling
#9
dflock
closed
1 month ago
0
get_rotary_matrix
#8
nkkbr
opened
1 month ago
1
SwiGLU
#7
nkkbr
opened
1 month ago
0
RoPEMaskedAttentionHead
#6
nkkbr
opened
1 month ago
0
no need to softmax before cross_entrpoy
#5
nkkbr
closed
1 month ago
1
Incorrect RMSNorm
#4
arunmallya
opened
3 months ago
3
next level
#3
UmarIgan
closed
1 month ago
1
Just to thank you!
#2
Andreh1982
closed
5 months ago
1
TypeError: MultiheadAttention.forward() got an unexpected keyword argument 'is_causal'
#1
bjpcjp
closed
10 months ago
1