image-quantization Search Results

1000+ results
for image-quantization

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

NVIDIA/TensorRT-LLM #1172

Failed to quantize Llama2 70b fine tuned model to AWQ Int4

### System Info - CPU archtecture: x86_64 - CPU/Host memory size: 250GB total - GPU properties - GPU name: 2x NVIDIA A100 80GB - GPU memory size: 160GB total - Libraries - tensorrt @ fi…

aikitoria updated 7 months ago
3
xorbitsai/inference #1712

【BUG】xinference升级0.12.2后运行glm4v出现OOM

模型是glm4-v-9b，显卡是3090和4090 启动命令： xinference launch --model-engine Transformers --model-name glm-4v --size-in-billions 9 --model-format pytorch --quantization none 问题描述： xinference刚刚升级到0.12.2版本后，3…

Yog-AI updated 2 months ago
7
hiyouga/LLaMA-Factory #5561

把qwen2-7b训练模型变更成qwen2.5-32b，训练完成后推理结果不会停止

### Reminder - [X] I have read the README and searched the existing issues. ### System Info - `llamafactory` version: 0.9.1.dev0 - Platform: Linux-5.15.0-91-generic-x86_64-with-glibc2.35 - …

wenocy updated 1 week ago
7
allenbai01/ProxQuant #2

How to run multi-bit quantization?

Thanks for sharing the code. A good paper! May I know how to run multi-bit quantization? Do you have the script or code for it? And have you tried multi-bit quantization on image classification? …

haolibai updated 5 years ago
2
joonb14/TFLiteDetection #1

Missing input quantization

Thank you for the sample! I'm not 100% sure, but I believe that the processor is missing input quantization as described at https://www.tensorflow.org/lite/performance/post_training_integer_quant#r…

CIPop updated 1 year ago
2
microsoft/nni #5174

Quantization tutorial error

**Describe the issue**: I was doing a quantization tutorial (quantization_quick_start_mnist, quantization_speedup) However, I runnig the tutorial on Jupyter notebook and an error occurred Both tu…

ayj8655 updated 1 year ago
2
johnsmith0031/alpaca_lora_4bit #113

Differences between QLoRA and this repo

1. Nomal float + Double quantization QLoRA currently uses zero shot quantization which is different from GPTQ. However, unlike GPTQ, it does not require data, but incurs some performance loss. Theref…

qwopqwop200 updated 1 year ago
3
CompVis/stable-diffusion #229

no module named 'torch.ao'

`python scripts/txt2img.py --prompt "a photograph of a huge bear, style of TIME magazine" --plms /home/grayson/miniconda3/envs/ldm/lib/python3.8/site-packages/torchvision/io/image.py:13: UserWarning…

prismspecs updated 1 year ago
3
microsoft/onnxruntime #13872

Dynamic quantization is useless on AMD cpus(AMD EPYC 7K62 48…

### Describe the issue I do dynamic quantization for my model, and then tested it on Intel and amd cpus respectively. The inference speed can be greatly improved on the Intel CPU, but not on the amd …

TonyUSTC updated 5 months ago
2
openvinotoolkit/openvino #27083

[Performance]: How to assign model inference to specific CPU…

### OpenVINO Version 2024.4.0 ### Operating System Ubuntu 20.04 (LTS) ### Device used for inference CPU ### OpenVINO installation Build from source ### Programming Language …

LinGeLin updated 3 hours ago
2

上一页 1...13 14 15 16 17 18 19...100 下一页

1000+ results for image-quantization

1000+ results
for image-quantization