-
Hi @all
Thank you for your great work!
I noticed that when using the InternVL-Chat-V1.5-Int8 model, inference is very slow, as mentioned in [link](https://github.com/OpenGVLab/InternVL/issues/157)
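For reference, a minimal sketch of how such an int8 checkpoint is commonly loaded through `transformers` with bitsandbytes; the checkpoint id and arguments below are my assumptions, not taken from the issue:

```python
# Hedged sketch: assumes the checkpoint is a bitsandbytes LLM.int8() export
# loadable through transformers; the model id below is a placeholder.
from transformers import AutoModel, AutoTokenizer

path = "OpenGVLab/InternVL-Chat-V1-5-Int8"  # placeholder checkpoint id
tokenizer = AutoTokenizer.from_pretrained(path, trust_remote_code=True)
model = AutoModel.from_pretrained(
    path,
    load_in_8bit=True,       # bitsandbytes LLM.int8() weight loading
    trust_remote_code=True,
).eval()
```

If this is the bitsandbytes LLM.int8() path, slower-than-fp16 generation is a known trade-off of its mixed-precision matmul decomposition, which may be related to what the linked issue observes.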
…
-
Hello,
I am using the model `ssd_mobilenet_v2_fpnlite_035_416_int8.tflite` from [object_detection/pretrained_models/ssd_mobilenet_v2_fpnlite/ST_pretrainedmodel_public_dataset/coco_2017_person/ssd_…
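For context, a minimal sketch of running such an int8 TFLite model; the 416x416 input size is inferred from the file name, and the input frame is a placeholder:

```python
# Minimal TFLite inference sketch for an int8 SSD model. Quantization
# parameters are read from the model rather than assumed.
import numpy as np
import tensorflow as tf

interpreter = tf.lite.Interpreter(
    model_path="ssd_mobilenet_v2_fpnlite_035_416_int8.tflite")
interpreter.allocate_tensors()

inp = interpreter.get_input_details()[0]
scale, zero_point = inp["quantization"]

# Quantize a float image in [0, 1] into the model's int8 input domain.
image = np.random.rand(1, 416, 416, 3).astype(np.float32)  # placeholder frame
q_image = np.clip(image / scale + zero_point, -128, 127).astype(np.int8)

interpreter.set_tensor(inp["index"], q_image)
interpreter.invoke()
outputs = [interpreter.get_tensor(o["index"])
           for o in interpreter.get_output_details()]
```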
-
**Describe the bug**
Trying to use DeepSpeed Inference with int8 does not work for GPTJ. I created an issue with more details on the DeepSpeed MII repo, but due to the nature of the issue, I…
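For reproduction context, a hedged sketch of how DeepSpeed Inference is typically initialized with int8 kernel injection; the model id and `mp_size` here are placeholders, not taken from the report:

```python
# Hedged sketch of the usual DeepSpeed Inference int8 setup.
import torch
import deepspeed
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("EleutherAI/gpt-j-6b")
engine = deepspeed.init_inference(
    model,
    mp_size=1,                       # tensor-parallel degree (placeholder)
    dtype=torch.int8,                # request int8 inference kernels
    replace_with_kernel_inject=True, # inject fused DeepSpeed kernels
)
```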
-
Hello, recently I encountered an issue when deploying the model from [YOLO-World](https://github.com/AILab-CVC/YOLO-World) to a device using TFLM. I found that with the same INT8 per-channel quantized…
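When the TF Lite interpreter and TFLM disagree, a useful first step is to inspect the per-channel quantization parameters the model actually carries; a small sketch (the model path is a placeholder):

```python
# Sketch for listing per-channel quantization parameters of a TFLite model.
import tensorflow as tf

interpreter = tf.lite.Interpreter(model_path="yolo_world_int8.tflite")
interpreter.allocate_tensors()

for detail in interpreter.get_tensor_details():
    params = detail["quantization_parameters"]
    if len(params["scales"]) > 1:  # per-channel: one scale per channel
        print(detail["name"],
              "axis:", params["quantized_dimension"],
              "num_scales:", len(params["scales"]))
```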
-
### Describe the issue
1. Tried running https://github.com/intel/intel-extension-for-pytorch/blob/release/2.3/examples/cpu/inference/python/llm/run.py to generate the q_config_summary file
2. Then…
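For anyone trying to reproduce the calibration step outside of run.py, a minimal IPEX static int8 sketch; API names follow IPEX 2.x, and the toy model, example input, and summary file name are placeholders:

```python
# Hedged IPEX static int8 calibration sketch (IPEX 2.x API).
import torch
import intel_extension_for_pytorch as ipex
from intel_extension_for_pytorch.quantization import prepare, convert

model = torch.nn.Linear(64, 64).eval()       # placeholder model
example_inputs = torch.randn(1, 64)

qconfig = ipex.quantization.default_static_qconfig_mapping
prepared = prepare(model, qconfig, example_inputs=example_inputs)
prepared(example_inputs)                      # calibration pass
prepared.save_qconf_summary(qconf_summary="q_config_summary.json")
quantized = convert(prepared)                 # int8 model for inference
```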
-
- quantize (OK)
```bash
python3 -m modelopt.onnx.quantization --onnx_path encoder.onnx \
--quantize_mode int8 --output_path encoder-w8a8-int8.onnx
/root/anaconda3/envs/modelopt/lib/pyt…
```
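To confirm the quantized model itself is loadable, a quick onnxruntime smoke test; it assumes float32 model inputs, which Q/DQ-style int8 export normally keeps:

```python
# Smoke test: load the quantized encoder and run it on random inputs,
# reading input names/shapes from the model rather than assuming them.
import numpy as np
import onnxruntime as ort

sess = ort.InferenceSession("encoder-w8a8-int8.onnx",
                            providers=["CPUExecutionProvider"])
feeds = {}
for inp in sess.get_inputs():
    # Replace symbolic dims (e.g. batch) with 1 for the smoke test.
    shape = [d if isinstance(d, int) else 1 for d in inp.shape]
    feeds[inp.name] = np.random.rand(*shape).astype(np.float32)
outputs = sess.run(None, feeds)
print([o.shape for o in outputs])
```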
-
Are there any runnable demos of using Sparse-QAT/PTQ (2:4) to accelerate inference, for example applying PTQ to a 2:4-sparse LLaMA? I am curious about the potential speedup rati…
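Not a full Sparse-PTQ demo, but as a starting point, PyTorch ships 2:4 semi-structured sparsity kernels; the sketch below prunes a weight to the 2:4 pattern and runs the accelerated matmul. It requires a recent PyTorch with CUDA on Ampere or newer, and the shapes are placeholders:

```python
# Sketch of PyTorch 2:4 semi-structured sparsity (not Sparse-QAT/PTQ itself).
import torch
from torch.sparse import to_sparse_semi_structured

w = torch.randn(4096, 4096, dtype=torch.float16, device="cuda")

# Enforce the 2:4 pattern: keep the 2 largest of every 4 consecutive weights.
groups = w.abs().view(-1, 4)
idx = groups.topk(2, dim=1).indices
mask = torch.zeros_like(groups, dtype=torch.bool).scatter_(1, idx, True)
w_24 = (w.view(-1, 4) * mask).view_as(w)

w_sparse = to_sparse_semi_structured(w_24)  # compressed 2:4 representation
x = torch.randn(4096, 4096, dtype=torch.float16, device="cuda")

dense_out = torch.mm(w_24, x)
sparse_out = torch.mm(w_sparse, x)          # hits the sparse tensor cores
print(torch.allclose(dense_out, sparse_out, rtol=1e-2, atol=1e-2))
```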
-
### Before Asking
- [X] I have read the [README](https://github.com/meituan/YOLOv6/blob/main/README.md) carefully.
- [X] I want to train my custom dataset, and I have read the …
-
### 🚀 The feature, motivation and pitch
**Feature motivation:**
[Default PyTorch quantization-aware training](https://pytorch.org/docs/stable/quantization.html) uses a "fake-quantization" approach. Fo…
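For concreteness, the fake-quantization flow the request refers to looks roughly like this in eager mode, using standard `torch.ao.quantization` calls on a toy model:

```python
# Eager-mode fake-quantization QAT sketch: observers simulate int8 in the
# forward pass while gradients flow in float.
import torch
import torch.ao.quantization as tq

class Net(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.quant = tq.QuantStub()
        self.fc = torch.nn.Linear(16, 16)
        self.dequant = tq.DeQuantStub()
    def forward(self, x):
        return self.dequant(self.fc(self.quant(x)))

model = Net().train()
model.qconfig = tq.get_default_qat_qconfig("fbgemm")
tq.prepare_qat(model, inplace=True)   # inserts FakeQuantize modules

# ... training loop: one step shown ...
model(torch.randn(2, 16)).sum().backward()

model.eval()
quantized = tq.convert(model)         # real int8 modules for inference
```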
-
### What happened?
Inconsistency found when lowering the Inception_v4_vaiq_int8 model: https://github.com/nod-ai/SHARK-TestSuite/issues/190
1. **Passed**: standalone torch-mlir-opt + iree: onnx -> to…
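A hedged reconstruction of what the "standalone" path typically looks like; tool names and flags reflect common torch-mlir/IREE usage and may differ by version, so treat them as assumptions rather than the exact commands used in the test suite:

```python
# Hedged sketch of the standalone onnx -> torch-mlir -> IREE CPU pipeline.
import subprocess

# ONNX -> torch-onnx dialect MLIR (iree-import-onnx ships with iree-compiler).
subprocess.run(["iree-import-onnx", "Inception_v4_vaiq_int8.onnx",
                "-o", "model.torch_onnx.mlir"], check=True)

# Lower the torch-onnx ops to the torch dialect with torch-mlir-opt.
subprocess.run(["torch-mlir-opt", "--convert-torch-onnx-to-torch",
                "model.torch_onnx.mlir", "-o", "model.torch.mlir"], check=True)

# Compile for CPU with IREE.
subprocess.run(["iree-compile", "model.torch.mlir",
                "--iree-hal-target-backends=llvm-cpu",
                "-o", "model.vmfb"], check=True)
```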