AIoT-MLSys-Lab Efficient-LLMs-Survey issues

AIoT-MLSys-Lab / Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey

https://arxiv.org/abs/2312.03863

1.02k stars 85 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Suggest for incorporating a speculative decoding paper

#36 smart-lty opened 1 month ago
2
There is a paper in structured pruning which I think is not related to Model Pruning

#35 Michael-jze opened 1 month ago
0
Add 1 paper about KV-Cache optimization

#34 shadowpa0327 closed 3 months ago
1
add one paper about SD

#33 callanwu closed 4 months ago
0
Add two paper

#32 wutaiqiang closed 4 months ago
1
Add ICLR24 spotlight paper OmniQuant.

#31 ChenMnZ closed 7 months ago
0
Update README.md

#30 sungnyun closed 7 months ago
1
Update README.md

#29 qingquansong closed 8 months ago
0
Kindly ask for adding the QuantEase paper in the list

#28 qingquansong closed 8 months ago
2
Update README.md

#27 MingLiiii closed 8 months ago
0
typo

#26 k1rep closed 9 months ago
2
Systems for LLM

#25 AmberLJC closed 9 months ago
0
Quantization equation error

#24 LeoMax-Xiong closed 10 months ago
1
Suggest incorporating one efficient LLM finetuning paper

#23 weitianxin closed 10 months ago
1
Revert "Update on LLM systems paper"

#22 SUSTechBruce closed 10 months ago
0
Update on LLM systems paper

#21 AmberLJC closed 10 months ago
1
Update README.md

#20 alphadl closed 10 months ago
0
Update README.md

#19 walkerning closed 10 months ago
0
Update README.md

#18 tuidan closed 10 months ago
0
wrong Illustration

#17 ludybupt closed 11 months ago
1
12.12 pr

#16 tuidan closed 11 months ago
0
Update README.md

#15 samiul272 closed 11 months ago
0
Revert "Tuidan patch 1"

#14 tuidan closed 11 months ago
0
Tuidan patch 1

#13 tuidan closed 11 months ago
0
OpenLLM supports fine-tuning now

#12 Jason-cs18 closed 11 months ago
1
Update README.md

#11 eltociear closed 11 months ago
1
Add paper link and code link of CLEX

#10 lixin4ever closed 11 months ago
1
Update README.md

#9 tuidan closed 11 months ago
0
Update README.md

#8 tuidan closed 11 months ago
0
Update README.md

#7 tuidan closed 11 months ago
0
Update README.md

#6 tuidan closed 11 months ago
0
Suggest including an efficient LLM inference work

#5 ZexinLi0w0 closed 11 months ago
1
add MLSys 2023 MegaBlocks

#4 Sunt-ing closed 11 months ago
0
A NeurIPS paper on efficient architecture

#3 renll closed 11 months ago
1
Typo

#2 danielz02 closed 11 months ago
2
Add MiniMA for white-box KD

#1 GeneZC closed 11 months ago
1