issues
search
XuezheMax
/
megalodon
Reference implementation of Megalodon 7B model
MIT License
485
stars
50
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
The project is very interesting, thanks for publishing it, now I have a few questions
#8
NickyDark1
opened
1 week ago
1
Cuda 11.8/12.1
#7
timmytwoteeth
closed
1 month ago
4
Question about the number of attention chunks in the paper
#6
exhyy
closed
2 months ago
3
How to save model and evaluate on downstream LLM
#5
lose4578
opened
2 months ago
0
Flash Attention V2 vs Megalodon Swift Attention
#4
timmytwoteeth
closed
2 months ago
2
Update README.md
#3
eltociear
opened
2 months ago
0
Failed to install megalodon on V100.
#2
WailordHe
closed
2 months ago
4
any opensource weights?
#1
hijkzzz
opened
2 months ago
2