flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving
https://flashinfer.ai
Apache License 2.0
760 stars 64 forks source link

refactor: use sink symbol instead of a placeholder register in row sum mma implementation #347

Closed yzh119 closed 3 days ago

yzh119 commented 3 days ago

This doesn't change performance, just to make code simpler.