issues
search
ModelTC
/
llmc
This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
https://arxiv.org/abs/2405.06001
Apache License 2.0
226
stars
25
forks
source link
Update vllm.md
#77
Closed
gushiqiao
closed
1 week ago