issues
search
thu-nics
/
MoA
The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>
MIT License
49
stars
2
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
AMD GPU
#1
DJ-Perico
opened
3 weeks ago
2