-
First of all, thanks to all of you for this great project!
Currently, the model does not seem to support int8 quantization. Are there any plans to add it?
-
1. Does the newly released 'TFLite Export with INT8 Quantization' only quantize the YOLOv8 backbone (or image encoder)? I note that you emphasize 'Please use Reparameterized YOLO-World for TFLite!!',…
-
I have used PTQ for int8 export from a PyTorch model, and despite attempts at calibration there is a significant drop in detection accuracy.
I am moving to quantization-aware training to improve the…
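One common cause of large PTQ accuracy drops is the calibration range: with symmetric per-tensor int8 quantization, a single activation outlier inflates the scale and destroys precision for the bulk of values. A minimal pure-Python sketch (a hypothetical helper, not tied to any particular framework) illustrates the effect:

```python
# Sketch of symmetric per-tensor int8 quantization (hypothetical helper,
# not from any framework) showing why the calibration range matters.

def quantize_dequantize(values, clip):
    """Quantize floats to int8 with scale = clip/127, then dequantize."""
    scale = clip / 127.0
    out = []
    for v in values:
        q = max(-127, min(127, round(v / scale)))  # clamp to int8 range
        out.append(q * scale)
    return out

def max_error(values, clip):
    """Largest round-trip error over the given values."""
    deq = quantize_dequantize(values, clip)
    return max(abs(v - d) for v, d in zip(values, deq))

# Typical activations in [-1, 1], plus one outlier at 50.
acts = [i / 100.0 for i in range(-100, 101)] + [50.0]

# Naive min/max calibration: the scale must cover the outlier,
# so the quantization step for the small values is very coarse.
err_naive = max_error(acts[:-1], clip=50.0)

# Percentile-style calibration: clip the outlier,
# giving fine-grained steps for the values that actually matter.
err_clipped = max_error(acts[:-1], clip=1.0)

print(err_naive > 10 * err_clipped)  # clipping outliers reduces error
```

Percentile or entropy-based calibrators mitigate this, but when the drop persists, QAT goes further by letting training compensate for the quantization error directly.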
-
I would like to quantize my model to INT8 precision and then compile it using torch_tensorrt.
Unfortunately, it is a [transformer-based vision model](https://github.com/mit-han-lab/efficientvit/blob/ma…
-
# Platform (include target platform as well if cross-compiling):
aarch64, Ubuntu 20.04
# GitHub version:
commit a980dba3963efb0ad76b0f3caaf5c21556f69ffe (HEAD -> master, origin/master, origin/HEAD)
…
-
# Summary
* We (engineering at @neuralmagic) are working on support for int8 quantized activations.
* This RFC proposes an _incremental_ approach to quantization, where the initial support for q…
-
Hi everyone,
I’m working on a project that involves deploying a YOLOv10 model on a mobile/edge device. To improve inference speed and reduce the model size, I want to convert my YOLOv10 model to Te…
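For full-integer TFLite export, the key step is supplying a representative dataset so the converter can calibrate activation ranges. A minimal self-contained sketch using a toy Keras model (the layer sizes and the random calibration generator are placeholders, not YOLOv10-specific; in practice you would feed real preprocessed images):

```python
import numpy as np
import tensorflow as tf

# Toy stand-in for the real model; layer sizes are arbitrary placeholders.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(8,)),
    tf.keras.layers.Dense(4),
])

def representative_dataset():
    # Calibration samples; replace with real preprocessed inputs.
    for _ in range(10):
        yield [np.random.rand(1, 8).astype(np.float32)]

converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_dataset
# Force integer-only ops and int8 I/O (drop these lines to keep float I/O).
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.inference_input_type = tf.int8
converter.inference_output_type = tf.int8

tflite_model = converter.convert()  # serialized flatbuffer bytes
```

Without the `supported_ops`/`inference_*_type` lines you get a model with quantized weights but float inputs and outputs, which is often easier to integrate; integer-only I/O is mainly needed for accelerators that lack float support.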
-
Previously we did this:
```python
import torch
from torchao.quantization.quant_api import change_linear_weights_to_int8_woqtensors

model = torch.compile(model, mode="max-autotune", fullgraph=True)
change_linear_weights_to_int8_woqtensors(model)
```
…
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…
-
Are the int8 and int4 quantization methods mentioned in the paper open source and supported in this repo?