issues
search
pentium3
/
sys_reading
system paper reading notes
235
stars
12
forks
source link
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU
#313
Open
pentium3
opened
11 months ago
pentium3
commented
11 months ago
https://github.com/SJTU-IPADS/PowerInfer
https://github.com/SJTU-IPADS/PowerInfer