ModelTC/llmc
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
https://arxiv.org/abs/2405.06001
Apache License 2.0
309 stars
33 forks
issues
Swap
#129
SmudgedWings
closed
2 weeks ago
0
update readme
#128
helloyongyang
closed
2 weeks ago
0
update requirements
#127
helloyongyang
closed
2 weeks ago
0
remove assert in Quarot
#126
helloyongyang
closed
2 weeks ago
0
update for _TRANSFORMERS_LN_TYPES_
#125
helloyongyang
closed
2 weeks ago
0
moe mixtral
#124
MercuryB1
closed
3 weeks ago
0
Can this be used for deployment on Qualcomm chips?
#123
xieyi4650
closed
1 week ago
1
vit naive quant
#122
SmudgedWings
closed
3 weeks ago
0
fix bug for BaseDataset without init processor
#121
helloyongyang
closed
3 weeks ago
0
support llama3.1 llama3.2 & transformers 4.45.2 version & fix two dev…
#120
helloyongyang
closed
3 weeks ago
0
update
#119
SmudgedWings
closed
3 weeks ago
0
update docker images
#118
helloyongyang
closed
3 weeks ago
0
update docker images
#117
helloyongyang
closed
3 weeks ago
0
Do awq_w_only.yml & awq_w4a16.yml use the same source code?
#116
LiMa-cas
closed
3 weeks ago
2
Inference per layer mode has no support for Llama model
#115
ZeusXuan
closed
1 month ago
2
Fix asym quant bug
#114
gushiqiao
closed
1 month ago
0
Fix fp-quant bug
#113
gushiqiao
closed
1 month ago
0
Update export_vllm.py
#112
gushiqiao
closed
1 month ago
0
Update __main__.py
#111
gushiqiao
closed
1 month ago
0
Fix utils.py
#110
gushiqiao
closed
1 month ago
0
Fix utils.py bug
#109
gushiqiao
closed
1 month ago
0
Update config files
#108
gushiqiao
closed
1 month ago
0
Support FP8 quant for vllm and sglang
#107
gushiqiao
closed
1 month ago
0
llava support img_txt datasets
#106
SmudgedWings
closed
1 month ago
0
Update README.md
#105
gushiqiao
closed
1 month ago
0
add llama31-405b-quant to readme
#104
zhiwei-dong
closed
1 month ago
0
Support Sglang
#103
gushiqiao
closed
1 month ago
0
Add docker
#102
helloyongyang
closed
1 month ago
0
Fix doc bug
#101
gushiqiao
closed
1 month ago
0
Support MLC-LLM
#100
gushiqiao
closed
1 month ago
0
Support AutoAWQ
#99
gushiqiao
closed
1 month ago
0
Support AutoAWQ
#98
gushiqiao
closed
1 month ago
0
failed to save quantized model
#97
LiMa-cas
closed
1 month ago
17
remove benchmark
#96
helloyongyang
closed
1 month ago
0
update
#95
helloyongyang
closed
1 month ago
0
where is run_awq_llama.sh
#94
LiMa-cas
closed
1 month ago
1
Sparsity update
#93
guanchenl
closed
1 month ago
2
Possible Bug in QuaRot Implementation with remove_mean_from_embed()
#92
A-suozhang
closed
1 month ago
4
Update README_zh.md
#91
Harahan
closed
1 month ago
0
fix README
#90
Harahan
closed
1 month ago
0
fix readme
#89
Harahan
closed
1 month ago
0
Update README_ja.md
#88
Harahan
closed
1 month ago
0
Update README_zh.md
#87
Harahan
closed
1 month ago
0
Update README.md
#86
Harahan
closed
1 month ago
0
Update README_zh.md
#85
Harahan
closed
1 month ago
0
fix README
#84
Harahan
closed
1 month ago
0
Update doc
#83
gushiqiao
closed
1 month ago
0
Update doc
#82
gushiqiao
closed
1 month ago
0
Fix doc bug
#81
gushiqiao
closed
1 month ago
0
Fix doc bug
#80
gushiqiao
closed
1 month ago
0