AIoT-MLSys-Lab/Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
https://arxiv.org/abs/2312.03863
1.02k stars, 85 forks
issues
Suggest for incorporating a speculative decoding paper
#36
smart-lty
opened
1 month ago
2
There is a paper in structured pruning which I think is not related to Model Pruning
#35
Michael-jze
opened
1 month ago
0
Add 1 paper about KV-Cache optimization
#34
shadowpa0327
closed
3 months ago
1
add one paper about SD
#33
callanwu
closed
4 months ago
0
Add two paper
#32
wutaiqiang
closed
4 months ago
1
Add ICLR24 spotlight paper OmniQuant.
#31
ChenMnZ
closed
7 months ago
0
Update README.md
#30
sungnyun
closed
7 months ago
1
Update README.md
#29
qingquansong
closed
8 months ago
0
Kindly ask for adding the QuantEase paper in the list
#28
qingquansong
closed
8 months ago
2
Update README.md
#27
MingLiiii
closed
8 months ago
0
typo
#26
k1rep
closed
9 months ago
2
Systems for LLM
#25
AmberLJC
closed
9 months ago
0
Quantization equation error
#24
LeoMax-Xiong
closed
10 months ago
1
Suggest incorporating one efficient LLM finetuning paper
#23
weitianxin
closed
10 months ago
1
Revert "Update on LLM systems paper"
#22
SUSTechBruce
closed
10 months ago
0
Update on LLM systems paper
#21
AmberLJC
closed
10 months ago
1
Update README.md
#20
alphadl
closed
10 months ago
0
Update README.md
#19
walkerning
closed
10 months ago
0
Update README.md
#18
tuidan
closed
10 months ago
0
wrong Illustration
#17
ludybupt
closed
11 months ago
1
12.12 pr
#16
tuidan
closed
11 months ago
0
Update README.md
#15
samiul272
closed
11 months ago
0
Revert "Tuidan patch 1"
#14
tuidan
closed
11 months ago
0
Tuidan patch 1
#13
tuidan
closed
11 months ago
0
OpenLLM supports fine-tuning now
#12
Jason-cs18
closed
11 months ago
1
Update README.md
#11
eltociear
closed
11 months ago
1
Add paper link and code link of CLEX
#10
lixin4ever
closed
11 months ago
1
Update README.md
#9
tuidan
closed
11 months ago
0
Update README.md
#8
tuidan
closed
11 months ago
0
Update README.md
#7
tuidan
closed
11 months ago
0
Update README.md
#6
tuidan
closed
11 months ago
0
Suggest including an efficient LLM inference work
#5
ZexinLi0w0
closed
11 months ago
1
add MLSys 2023 MegaBlocks
#4
Sunt-ing
closed
11 months ago
0
A NeurIPS paper on efficient architecture
#3
renll
closed
11 months ago
1
Typo
#2
danielz02
closed
11 months ago
2
Add MiniMA for white-box KD
#1
GeneZC
closed
11 months ago
1