horseee / Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

[From Author] Adding Two Recent Papers #27

Closed: junchenj closed this 2 weeks ago

junchenj commented 2 weeks ago

If PRs are welcome: I'm one of the authors of CacheGen and CacheBlend, and this PR adds the arXiv and GitHub links for both papers.