ModelTC / llmc

This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
https://arxiv.org/abs/2405.06001
Apache License 2.0
226 stars 25 forks source link

Fix real_quant zp bug #48

Closed gushiqiao closed 3 weeks ago