image-quantization Search Results

1000+ results
for image-quantization

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

WongKinYiu/yolov9 #327

YOLOv9 with Quantization-Aware Training (QAT) for TensorRT

**YOLOv9 with Quantization-Aware Training (QAT) for TensorRT** https://github.com/levipereira/yolov9-qat/ This repository hosts an implementation of YOLOv9 integrated with Quantization-Aware Train…

levipereira updated 3 months ago
41
ollama/ollama #5425

Does having the default quant type being Q4_0 (a legacy form…

The Ollama model hub still has the default quant type of Q4_0 which is a legacy format that under-performs compared to K-quants (Qn_K, e.g. Q4_K_M, Q6_K, Q5_K_L etc...). - Would it perhaps make sen…

sammcj updated 3 weeks ago
2
vllm-project/vllm #4760

[Performance]: Why the avg. througput generation is low?

### Report of performance regression Hi I use this: ``` server_vllm.py \ --model "/data/models_temp/functionary-small-v2.4/" \ --served-model-name "functionary" \ --dtype=bfloat16 \ -…

rvsh2 updated 2 weeks ago
3
deepjavalibrary/djl #3343

Model conversion process failed when deploying Mixtral 8x22B…

## Description Model conversion process failed with djl-tensorrtllm and below serving.properties: ``` image_uri = image_uris.retrieve( framework="djl-tensorrtllm", region=sess…

gsjoy8888 updated 3 months ago
3
nbasyl/LLM-FP4 #4

question of scale factors.

Since We adopt per-tensor quantization for activation and per-channel quantization for weight, I am confused that why the the entries in a_scale tensor at line 230 not share the same value, I suppos…

dulvqingyunLT updated 9 months ago
3
ultralytics/ultralytics #14711

Incorrect Output Results for Quantization of my Yolov8n Mode…

### Search before asking - [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and f…

BeyzaSimsekk updated 2 months ago
6
JaidedAI/EasyOCR #1155

Python virtual machine crashes with segfault

Good night. EasyOCR is crashing the Python 3.10 VM Ubuntu 22.04 / Debian Bullseye (identical problem but Python 3.9) Architecture Raspberry Pi 4 4GBytes Error as follows: Python 3.10.12 (ma…

CdAB63 updated 2 months ago
4
ultralytics/ultralytics #16090

model.export to TFLite with int8 doesn't yield a fully int8 …

### Search before asking - [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report. ### Ultralytics YOLO Component …

nicklasb updated 3 weeks ago
14
NVIDIA/FasterTransformer #728

[Question] Is it possible to use my own pretrained weights f…

### Branch/Tag/Commit main ### Docker Image Version nvcr.io/nvidia/pytorch:23.04-py3 ### GPU name T4 ### CUDA Driver 470.141.03 ### Reproduced Steps I'm trying to run `calib…

proevgenii updated 1 year ago
3
microsoft/onnxruntime #21496

[Performance] DequantizeLinear, pad and QuantizeLinear opera…

### Describe the issue The DequantizeLinear, pad, and QuantizeLinear operations in the statically quantized model using the optimization level ORT_ENABLE_EXTENDED are not fused into one operation. My…

flytair updated 1 month ago
5

上一页 1...15 16 17 18 19 20 21...100 下一页

1000+ results for image-quantization

1000+ results
for image-quantization