-
### Issue type
Feature Request
### Have you reproduced the bug with TensorFlow Nightly?
No
### Source
binary
### TensorFlow version
v2.13.0-17-gf841394b1b7
### Custom code
No
### OS platform…
-
### Describe the issue
I quantized a simple CNN model in PyTorch and converted it to ONNX. When I tested the runtime of the int8 and fp32 models on CPU, the int8 model was slower. Here is my code:
[Go…
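For context, int8 can run slower than fp32 on CPU when the per-tensor quantize/dequantize overhead outweighs the savings in the integer matmuls. A minimal numpy sketch of the symmetric per-tensor int8 round trip that inserts this extra work (all names are mine, not from the issue's code):

```python
import numpy as np

def quantize_int8(x: np.ndarray):
    """Symmetric per-tensor int8 quantization: scale maps max |x| to 127."""
    scale = float(np.abs(x).max()) / 127.0
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """The matching dequantize step an int8 runtime pays around each op."""
    return q.astype(np.float32) * scale

x = np.random.randn(4, 8).astype(np.float32)
q, scale = quantize_int8(x)
x_hat = dequantize(q, scale)
# Round-trip error is bounded by half a quantization step.
assert np.max(np.abs(x - x_hat)) <= scale * 0.5 + 1e-6
```

On small models these two steps can dominate the runtime, which would explain the measurement.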
-
Nils, is it possible to create an integer-only model so it could run on accelerators or frameworks such as ArmNN?
https://www.tensorflow.org/lite/performance/post_training_quantization#full_integer…
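The full-integer path described in the linked docs replaces floating-point rescaling with an int32 fixed-point multiplier plus a shift, so no float ops remain at inference time. A rough numpy sketch of that requantization step (the decomposition follows the gemmlowp-style scheme; the function names are mine):

```python
import numpy as np

def quantize_multiplier(real_multiplier: float):
    """Decompose a real multiplier in (0, 1) into an int32 fixed-point
    multiplier and a right shift, as integer-only kernels require."""
    shift = 0
    while real_multiplier < 0.5:
        real_multiplier *= 2.0
        shift += 1
    m = int(round(real_multiplier * (1 << 31)))
    return m, shift

def requantize(acc: np.ndarray, multiplier: int, shift: int) -> np.ndarray:
    """Scale an int32 accumulator down to int8 using only integer ops."""
    prod = acc.astype(np.int64) * multiplier
    total_shift = 31 + shift
    # Rounding right shift: add half the divisor, then arithmetic shift.
    rounded = (prod + (1 << (total_shift - 1))) >> total_shift
    return np.clip(rounded, -128, 127).astype(np.int8)

acc = np.array([12345, -6789, 40000], dtype=np.int32)
m, s = quantize_multiplier(0.0003)  # e.g. scale_in * scale_w / scale_out
out = requantize(acc, m, s)
# out == [4, -2, 12], matching round(acc * 0.0003)
```

Since every operation here is an integer multiply, add, or shift, the same graph can execute on integer-only accelerators.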
-
[Info] You are exporting PPQ Graph to TensorRT(Onnx + Json).
Please Compile the TensorRT INT8 engine manually:
from ppq.utils.TensorRTUtil import build_engine
build_engine(onnx_file='Quantized…
-
Are there any runnable demos of using Sparse-QAT/PTQ (2:4) to accelerate inference, such as applying PTQ to a 2:4 sparse LLaMA for inference acceleration? I am curious about the potential speedup rati…
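For reference, the 2:4 pattern itself is simple to sketch: in every contiguous group of four weights along a row, the two smallest magnitudes are zeroed, which is what sparse tensor cores exploit. A minimal numpy illustration, not tied to any particular toolkit:

```python
import numpy as np

def prune_2_4(w: np.ndarray) -> np.ndarray:
    """Zero the 2 smallest-magnitude weights in every group of 4 along
    the last dim, producing the 2:4 structured-sparsity pattern."""
    assert w.shape[-1] % 4 == 0
    groups = w.reshape(-1, 4)
    # Indices of the 2 smallest magnitudes per group.
    drop = np.argsort(np.abs(groups), axis=1)[:, :2]
    mask = np.ones_like(groups, dtype=bool)
    np.put_along_axis(mask, drop, False, axis=1)
    return (groups * mask).reshape(w.shape)

w = np.random.randn(8, 16).astype(np.float32)
sparse_w = prune_2_4(w)
# Every group of 4 keeps exactly 2 nonzero weights.
assert ((sparse_w.reshape(-1, 4) != 0).sum(axis=1) == 2).all()
```

The achievable speedup then depends on whether the runtime actually dispatches to sparse kernels for the pruned layers, which is exactly what a runnable demo would need to show.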
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussi…
-
**Describe the solution you'd like**
I found that the latest release, TensorRT 8.0, supports int8 quantization on GPU, which greatly accelerates inference.
And now onnxruntime is …
-
### 💡 Your Question
Hi,
I am just checking, I see in the provided results that Yolo-NAS-L does not suffer much reduction in performance going to Yolo-NAS-INT8-L. Can I check what exactly is meant …
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report.
### Ultralytics YOLO Component
Val
…
-
Loading the model takes about 5 GB of memory; after a few rounds of conversation it jumps to 6 GB, and each additional exchange adds roughly 300 MB. Is there any way to overcome this problem?
==============================
python realtime_chat.py --role_name 三三
-----PERFORM NORM HEAD
user:你好
/home/allen/miniconda3/envs/index…