horseee / Awesome-Efficient-LLM

A curated list for Efficient Large Language Models
1.24k stars 93 forks source link

Add paper "Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference" #25

Closed FFY0 closed 3 weeks ago

FFY0 commented 3 weeks ago

Add paper "Ada-KV: Optimizing KV Cache Eviction by Adaptive Budget Allocation for Efficient LLM Inference"