flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving
https://flashinfer.ai
Apache License 2.0
760 stars 64 forks source link

ci: update CHANGELOG #344

Closed yzh119 closed 5 days ago

yzh119 commented 5 days ago

Also reduce binary size but limit the maximum number of registers for x_frag and o_frag to 200.