flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving
https://flashinfer.ai
Apache License 2.0

perf: more options for kv tile size #336

Closed: yzh119 closed this 1 week ago

yzh119 commented 1 week ago

For small query sizes, we might want to use a larger KV tile size.
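
To illustrate the idea, here is a minimal sketch of one way a KV tile size could be picked from the query tile size. It is not FlashInfer's actual selection logic: the function name `choose_kv_tile_size`, the candidate list, the 48 KiB shared-memory budget, and the cost model (one Q tile plus one K tile and one V tile resident at once) are all assumptions made for illustration.

```python
def choose_kv_tile_size(q_tile, head_dim=128, dtype_bytes=2,
                        smem_budget_bytes=48 * 1024,
                        candidates=(128, 64, 32, 16)):
    """Return the largest candidate KV tile size that fits the budget.

    Hypothetical cost model: shared memory holds one Q tile of q_tile rows
    plus one K tile and one V tile of kv_tile rows each, every row being
    head_dim elements of dtype_bytes bytes.
    """
    for kv_tile in sorted(candidates, reverse=True):
        smem_bytes = (q_tile + 2 * kv_tile) * head_dim * dtype_bytes
        if smem_bytes <= smem_budget_bytes:
            return kv_tile
    raise ValueError("no candidate KV tile size fits the shared-memory budget")


# A small query tile leaves room for a larger KV tile than a large one does.
small_q = choose_kv_tile_size(q_tile=16)
large_q = choose_kv_tile_size(q_tile=128)
print(small_q, large_q)
```

Under these assumed numbers, the 16-row query tile gets a KV tile of 64 while the 128-row query tile only fits 32, which is the trade-off the PR title refers to: the smaller the query tile, the more of the budget is left for the KV tile.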