efeslab / Atom

[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
259 stars 21 forks source link

porting SVD into Atom #10

Closed shadowpa0327 closed 6 months ago