SJTU-IPADS / PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
MIT License
7.96k stars, 412 forks

Hotfix: failed to build sparse FFN with LLM_GATE_SEQ #178

Closed by hodlen 7 months ago