issues
search
AIoT-MLSys-Lab
/
Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
https://arxiv.org/abs/2312.03863
970
stars
82
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Suggest for incorporating a speculative decoding paper
#36
smart-lty
opened
1 week ago
2
There is a paper in structured pruning which I think is not related to Model Pruning
#35
Michael-jze
opened
2 weeks ago
0
Add 1 paper about KV-Cache optimization
#34
shadowpa0327
closed
2 months ago
1
add one paper about SD
#33
callanwu
closed
2 months ago
0
Add two paper
#32
wutaiqiang
closed
3 months ago
1
Add ICLR24 spotlight paper OmniQuant.
#31
ChenMnZ
closed
6 months ago
0
Update README.md
#30
sungnyun
closed
6 months ago
1
Update README.md
#29
qingquansong
closed
7 months ago
0
Kindly ask for adding the QuantEase paper in the list
#28
qingquansong
closed
7 months ago
2
Update README.md
#27
MingLiiii
closed
7 months ago
0
typo
#26
k1rep
closed
8 months ago
2
Systems for LLM
#25
AmberLJC
closed
8 months ago
0
Quantization equation error
#24
LeoMax-Xiong
closed
8 months ago
1
Suggest incorporating one efficient LLM finetuning paper
#23
weitianxin
closed
8 months ago
1
Revert "Update on LLM systems paper"
#22
SUSTechBruce
closed
9 months ago
0
Update on LLM systems paper
#21
AmberLJC
closed
9 months ago
1
Update README.md
#20
alphadl
closed
9 months ago
0
Update README.md
#19
walkerning
closed
9 months ago
0
Update README.md
#18
tuidan
closed
9 months ago
0
wrong Illustration
#17
ludybupt
closed
9 months ago
1
12.12 pr
#16
tuidan
closed
9 months ago
0
Update README.md
#15
samiul272
closed
9 months ago
0
Revert "Tuidan patch 1"
#14
tuidan
closed
9 months ago
0
Tuidan patch 1
#13
tuidan
closed
9 months ago
0
OpenLLM supports fine-tuning now
#12
Jason-cs18
closed
9 months ago
1
Update README.md
#11
eltociear
closed
9 months ago
1
Add paper link and code link of CLEX
#10
lixin4ever
closed
9 months ago
1
Update README.md
#9
tuidan
closed
9 months ago
0
Update README.md
#8
tuidan
closed
9 months ago
0
Update README.md
#7
tuidan
closed
9 months ago
0
Update README.md
#6
tuidan
closed
9 months ago
0
Suggest including an efficient LLM inference work
#5
ZexinLi0w0
closed
9 months ago
1
add MLSys 2023 MegaBlocks
#4
Sunt-ing
closed
9 months ago
0
A NeurIPS paper on efficient architecture
#3
renll
closed
9 months ago
1
Typo
#2
danielz02
closed
9 months ago
2
Add MiniMA for white-box KD
#1
GeneZC
closed
9 months ago
1