pentium3 / sys_reading

system paper reading notes
235 stars 12 forks source link

PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU #313

Open pentium3 opened 11 months ago

pentium3 commented 11 months ago

https://github.com/SJTU-IPADS/PowerInfer