-
Fp8 or AWQ quant
-
I have a question about the following line of code in the notebook:
`model_name = "TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T"`
Question: could you let me know why this model is used for…
-
### 1. System information
- OS Platform and Distribution: Ubuntu 22.04
- TensorFlow installation (pip package or built from source): pip
- TensorFlow library (version, if pip package or github SH…
-
pytorch_quantization supports 4-bit and ONNX supports 4-bit, but torch.onnx.export does not support 4-bit. How can I export a 4-bit pytorch_quantization .pt model to a .engine model?
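Since torch.onnx.export has no native 4-bit dtype, one common workaround is to pack pairs of 4-bit codes into uint8 tensors before export and unpack them on the runtime side (e.g., in a dequantize step or custom plugin). The sketch below is not a TensorRT or pytorch_quantization API, just an illustration of the packing idea in NumPy, assuming the weights are already quantized to unsigned 4-bit codes in [0, 15]:

```python
# Sketch only: pack two unsigned 4-bit codes per uint8 so the tensor can be
# exported through torch.onnx.export as a standard uint8 tensor.
import numpy as np

def pack_int4(values: np.ndarray) -> np.ndarray:
    """Pack an even-length 1-D array of codes in [0, 15] into uint8 pairs."""
    assert values.ndim == 1 and values.size % 2 == 0
    v = values.astype(np.uint8)
    # High nibble holds the even-indexed code, low nibble the odd-indexed one.
    return (v[0::2] << 4) | v[1::2]

def unpack_int4(packed: np.ndarray) -> np.ndarray:
    """Inverse of pack_int4: recover the original 4-bit codes."""
    out = np.empty(packed.size * 2, dtype=np.uint8)
    out[0::2] = packed >> 4
    out[1::2] = packed & 0x0F
    return out

weights = np.array([1, 15, 0, 7, 3, 12], dtype=np.uint8)
packed = pack_int4(weights)            # 3 bytes instead of 6
restored = unpack_int4(packed)
assert np.array_equal(weights, restored)
```

The packed tensor halves the weight storage; the consumer (here, the TensorRT side) must know the packing layout to dequantize correctly.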
-
### 1. System information
- OS Platform and Distribution (e.g., Linux Ubuntu 16.04): Ubuntu 22.04.3 LTS
- TensorFlow installation (pip package or built from source): pip
- TensorFlow library (versi…
-
### Issue type
Bug
### Have you reproduced the bug with TensorFlow Nightly?
No
### Source
source
### TensorFlow version
2.12
### Custom code
Yes
### OS platform and distr…
-
Hi all,
We recently developed a fully open-source quantization method called VPTQ (Vector Post-Training Quantization) [https://github.com/microsoft/VPTQ](https://github.com/microsoft/VPTQ) which en…
-
### Description of the bug:
I'm trying to convert the following (quantized) model:
```python
# Disable GPU for model conversion to tflite.
# Fix for https://github.com/google-ai-edge/ai-edge…
-
**Describe the bug**
I'm compressing a qwen2.5_7b model using `examples/quantization_2of4_sparse_w4a16/llama7b_sparse_w4a16.py`, but I failed to load the stage_sparsity model. The error is shown belo…
-
I fine-tuned a Whisper large-v3 model via the [speechbrain](https://github.com/speechbrain/speechbrain) framework. I want to convert it to a `faster-whisper` model and run inference on it via `faster-whispe…