pentium3 / sys_reading

system paper reading notes
229 stars 12 forks source link

PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU #313

Open pentium3 opened 6 months ago

pentium3 commented 6 months ago

https://github.com/SJTU-IPADS/PowerInfer