issues
search
CryVeck
/
QuaRot
Code for Neurips24 paper: QuaRot, an end-to-end 4-bit inference of large language models.
https://arxiv.org/abs/2404.00456
Apache License 2.0
0
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
fixing token_wise rotation size when different from value size
#3
CryVeck
closed
1 day ago
0
Rotation destroy perplexity
#2
CryVeck
opened
1 day ago
3
Rotation hidden size
#1
CryVeck
closed
1 day ago
2