ruikangliu / FlatQuant

Official PyTorch implementation of FlatQuant: Flatness Matters for LLM Quantization
MIT License
25 stars 3 forks source link