OpenGVLab / OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
MIT License
626 stars 49 forks source link

[Model Request] MiniCPM #84

Open RanchiZhao opened 2 weeks ago

RanchiZhao commented 2 weeks ago

https://huggingface.co/openbmb/MiniCPM-2B-sft-bf16 I wonder how well can OmniQuant do on those sota SLMs?