-
### Search before asking
- [X] I have searched the YOLOv5 [issues](https://github.com/ultralytics/yolov5/issues) and found no similar bug report.
### YOLOv5 Component
Export
### Bug
Hello
When …
-
**Describe the bug**
When I use llm-compressor to quantize a LLaVA model, it fails right at the beginning. (Unrecognized configuration class: 'transformers.models.llava.configuration_llava.LlavaConfig'…
-
# Quantization Impact on Model Accuracy | Slightwind
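Mistral-7B’s performance on 5-shot MMLU. If you are not interested in the test details, you only need to look at the summary table below.
Overview: performance of the quantized and non-quantized versions of Mistral-7B-v0.1 on 5-shot MMLU: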
Quant Type Compute D…
-
### This is my env version:
```
torch:2.2.1
transformers: 4.39.0.dev0
vllm: custom compile at master@24aecf421a4ad5989697010963074904fead9a1b
```
### I use SqueezeLLM to quantize my llama-7B tr…
-
### 🚀 The feature, motivation and pitch
I'm working on applications that must run locally on resource-limited hardware. Therefore, quantization becomes essential. Such applications need from multimodal vi…
-
I've begun work on the ability to edit models and import new models. Here are the remaining features I'd like to complete:
- [x] Write out model files - necessary for everything else
- [ ] Model s…
-
I have completed Stable Diffusion quantization for txt2img, as the demo shows.
The result is very good.
When I try to transfer the SD quantization to the inpainting task, I run into the problem that the quantization r…
-
### 🚀 Feature request
Quantization is a widely used technique to accelerate models, particularly when using the [torch.compile](https://pytorch.org/tutorials/intermediate/torch_compile_tutorial.htm…
-
[Block-wise quantization](https://arxiv.org/abs/2110.02861) divides input tensors into smaller blocks that are independently quantized, resulting in faster optimization and high precision quantization…
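
To illustrate the idea of quantizing blocks independently, here is a minimal sketch in plain Python. It assumes a simple absmax linear mapping to int8 with one scale per block, which is not necessarily the scheme used in the linked paper. The function names `quantize_blockwise` and `dequantize_blockwise` and the default `block_size` are hypothetical, chosen only for this example.

```python
from typing import List, Tuple


def quantize_blockwise(values: List[float], block_size: int = 4) -> Tuple[List[int], List[float]]:
    """Quantize a flat list of floats to int8 codes, using one absmax scale per block.

    Illustrative only: assumes linear absmax scaling, not any specific library's scheme.
    """
    codes: List[int] = []
    scales: List[float] = []
    for start in range(0, len(values), block_size):
        block = values[start:start + block_size]
        absmax = max(abs(v) for v in block)
        # An all-zero block gets a dummy scale of 1.0 to avoid dividing by zero.
        scale = absmax / 127.0 if absmax > 0 else 1.0
        scales.append(scale)
        # Each value is mapped to an integer in [-127, 127] relative to its own block.
        codes.extend(round(v / scale) for v in block)
    return codes, scales


def dequantize_blockwise(codes: List[int], scales: List[float], block_size: int = 4) -> List[float]:
    """Reconstruct approximate floats from the codes and the per-block scales."""
    return [c * scales[i // block_size] for i, c in enumerate(codes)]


# The second block contains a large outlier (100.0). Because each block has its
# own scale, the small values in the first block keep their precision.
values = [0.1, -0.2, 0.05, 0.4, 100.0, -50.0, 25.0, 10.0]
codes, scales = quantize_blockwise(values, block_size=4)
restored = dequantize_blockwise(codes, scales, block_size=4)
```

With a single scale for the whole tensor, the outlier would set the step size to about 0.79 and the first block would collapse to zeros. With per-block scales, the first block's step size is about 0.003.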
-
### System Info
torch 2.5.1+cu121
diffusers 0.31.0
torchao 0.7.0+cpu
Python 3.11.10
Windows 11
### Information
- [X] The official example scr…