microsoft / vattention

Dynamic Memory Management for Serving LLMs without PagedAttention
MIT License
248 stars 16 forks source link

Add more microbenchmarks #5

Closed apanwariisc closed 4 months ago