flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving
https://flashinfer.ai
Apache License 2.0

fix: disable other warp layout because of large binary size #326

Closed. yzh119 closed this 1 week ago

yzh119 commented 1 week ago

Disable #322 for the v0.0.6 release because the resulting binary size is too large. For now, v0.0.6 will include only bug fixes.