AIoT-MLSys-Lab / Efficient-LLMs-Survey

[TMLR 2024] Efficient Large Language Models: A Survey
https://arxiv.org/abs/2312.03863
970 stars, 82 forks

Add one paper about speculative decoding (SD) #33

Closed: callanwu closed this issue 2 months ago

callanwu commented 2 months ago

SEED: Accelerating Reasoning Tree Construction via Scheduled Speculative Decoding, arXiv, 2024 [Paper]