ModelTC / llmc

[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
https://arxiv.org/abs/2405.06001
Apache License 2.0
326 stars 34 forks source link

Update quant.py #162

Closed yhhhli closed 3 weeks ago

Harahan commented 4 weeks ago

@yhhhli Please use pre-commit to check the format before pr.