microsoft/vattention
Dynamic Memory Management for Serving LLMs without PagedAttention
MIT License
248 stars, 16 forks
Add more microbenchmarks #5
Closed
apanwariisc closed 4 months ago