issues
search
SJTU-IPADS
/
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
MIT License
7.96k
stars
412
forks
source link
Use `mul_mat_transpose` at axpy op for large batch
#168
Closed
Begunner
closed
7 months ago