ModelTC / llmc

[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
https://arxiv.org/abs/2405.06001
Apache License 2.0
328 stars, 36 forks

modify qwen and mixtral #230

Closed (MercuryB1 closed this 1 day ago)

MercuryB1 commented 1 day ago

Add the MoE gate during the merge procedure.