-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar feature requests.
### Description
Non-specialized …
-
When I use 8-bit quantization during pre-training, the code throws an error:
You cannot perform fine-tuning on purely quantized models. Please attach trainable adapters on top of the qu…
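The error above reflects how adapter-based fine-tuning works: the quantized base weights stay frozen, and only small trainable matrices added on top receive gradient updates. A minimal NumPy sketch of the idea (this is conceptual, not the actual transformers/peft API; all names and sizes here are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

# Frozen 8-bit base weight: stored as int8 plus a per-tensor scale.
w_fp = rng.standard_normal((16, 16)).astype(np.float32)
scale = np.abs(w_fp).max() / 127.0
w_int8 = np.round(w_fp / scale).astype(np.int8)    # frozen, never updated

# Trainable low-rank adapter (LoRA-style): only A and B get gradients.
r = 4                                              # rank, much smaller than 16
lora_A = rng.standard_normal((r, 16)).astype(np.float32) * 0.01
lora_B = np.zeros((16, r), dtype=np.float32)       # zero init => no-op at start

def forward(x):
    w_deq = w_int8.astype(np.float32) * scale      # dequantize on the fly
    return x @ w_deq.T + x @ (lora_B @ lora_A).T   # frozen path + adapter path

x = rng.standard_normal((2, 16)).astype(np.float32)
y = forward(x)
# With B initialised to zero, the adapter initially leaves the output unchanged.
assert np.allclose(y, x @ (w_int8.astype(np.float32) * scale).T)
```

Training then backpropagates only into `lora_A` and `lora_B`, which sidesteps updating the int8 weights directly.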
-
Some modules are dispatched on the CPU or the disk. Make sure you have enough GPU RAM to fit the quantized model. If you want to dispatch the model on the CPU or the disk while keeping these modules i…
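This warning comes from automatic device placement: when the model does not fit in GPU RAM, remaining modules are spilled to CPU and then disk. A rough sketch of that greedy placement logic (module names, sizes, and the function itself are made up for illustration; the real behaviour lives in accelerate's `device_map="auto"`):

```python
def assign_devices(module_sizes, gpu_budget, cpu_budget):
    """Greedily place modules on 'gpu', then 'cpu', then 'disk'."""
    placement = {}
    for name, size in module_sizes.items():
        if size <= gpu_budget:
            placement[name] = "gpu"
            gpu_budget -= size
        elif size <= cpu_budget:
            placement[name] = "cpu"
            cpu_budget -= size
        else:
            placement[name] = "disk"
    return placement

# Hypothetical module sizes in GiB.
sizes = {"embed": 2.0, "layers.0": 3.0, "layers.1": 3.0, "lm_head": 2.0}
placement = assign_devices(sizes, gpu_budget=6.0, cpu_budget=3.0)
print(placement)
# embed and layers.0 fill the GPU, layers.1 spills to CPU, lm_head to disk
```

If any module lands off-GPU, quantized loading complains unless you explicitly allow offloading those modules in higher precision.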
-
Hi,
I am following the article at https://learn.arm.com/learning-paths/servers-and-cloud-computing/pytorch-llama/pytorch-llama/
but at step
```
python torchchat.py export llama3.1 --output-dso-p…
-
So I have a GPTQ Llama model I downloaded (from TheBloke), and it's already 4-bit quantized. I have to pass in False for the load_in_4bit parameter of:
```
model, tokenizer = FastLlamaModel.from_pr…
-
How can we take a t2t model (or an exported t2t model) and quantize it to make it smaller, sacrificing a bit of accuracy?
(ref: https://www.tensorflow.org/performance/quantization)
Is ther…
ndvbd updated 4 years ago
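The question above is about post-training quantization. The core transformation is simple: map float32 weights to int8 with a scale factor, cutting storage roughly 4x at the cost of a small rounding error. A minimal sketch of the symmetric per-tensor scheme (TFLite's converter automates this; the helper names here are my own):

```python
import numpy as np

def quantize(w, num_bits=8):
    qmin, qmax = -(2 ** (num_bits - 1)), 2 ** (num_bits - 1) - 1
    scale = np.abs(w).max() / qmax                 # symmetric per-tensor scale
    q = np.clip(np.round(w / scale), qmin, qmax).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

w = np.linspace(-1.0, 1.0, 101, dtype=np.float32)
q, s = quantize(w)
err = np.abs(dequantize(q, s) - w).max()
assert q.dtype == np.int8                          # 1 byte per value vs 4
assert err <= s / 2 + 1e-6                         # error bounded by half a step
```

Per-channel scales and activation calibration reduce the error further, which is what production converters actually do.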
-
## Type of issue
- Thanks guys for this awesome work. I was curious to run llama3-8B on my personal CPU, and the performance is quite impressive (nearly 2x llama.cpp for same model size on same HW).
…
-
Can I just change the HF path/model name to Qwen2.5 now that it has been released? I assume the quantization technique is the same?
-
I would like to inquire whether there are plans to support the Qwen2.5 and Qwen2 series, or other popular models from the open-source community such as Yi. Will the framework support the merging of large mo…
-
### Issue Type
Bug
### Source
pip (mct-nightly)
### MCT Version
PR #1186
### OS Platform and Distribution
Linux Ubuntu 22.04
### Python version
3.10
### Describe the issu…