tensorrt-int8-python Search Results

1000+ results
for tensorrt-int8-python

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

microsoft/onnxruntime #22328

[Performance] Python inference runs faster than C++

### Describe the issue Observe that ONNX model (FP32) executed in C++ runs slower than Python. it's much much worse with TensorRT execution provider. I've tried exporting a F16 model with `keep…

ashwin-999 updated 2 weeks ago
1
NVIDIA/TensorRT #1847

Failed INT8 quantization.

Dear Developers, I am very new to Tensorrt and quantization. Previously I only use the basic example of Tensorrt to generate engines in FP16 because I thought INT8 will compromise accuracy signific…

deephog updated 1 week ago
7
tensorflow/tensorrt #137

Why no improvemenmt in the object_detection example

tensorflow-gpu1.14 CUDA10.1 GPU gtx1080 my_test.json { "model_config": { "model_name": "ssd_resnet_50_fpn_coco", "input_dir": "/home/liujt/software/tensorrt/data", "batch_size…

chinesesoft8 updated 2 years ago
12
microsoft/onnxruntime #21457

TensorRT EP's inference results are abnormal.

### Describe the issue Inference results are outputting abnormally when using YOLOv7 models with TensorRT EP. We have confirmed that the results are normal when using CPU and CUDA. The issue wa…

c1aude updated 1 week ago
37
onnx/keras-onnx #507

Removal of Identity:0 in Output's name

Hi, I'm trying to convert to tensorRT int8 Model using onnx made by keras2Onnx. My environment is as below: python=3.7, keras2onnx=1.7, tensorflow=2.2.0, onnx=1.7, onnxconverter_common=1.7 My s…

abc3698 updated 4 years ago
1
NVIDIA/TensorRT #3914

Pointers for TensorRT model with uint8/int8 input

By using [pytorch-quantization](https://docs.nvidia.com/deeplearning/tensorrt/pytorch-quantization-toolkit/docs/index.html) i was able to create TensorRT engine models that are (almost) fully int8 and…

Michelvl92 updated 1 month ago
20
NVIDIA-AI-IOT/torch2trt #615

loading and doing inference float32 in c++ API are working…

Firstly, thanks for this project that is of high quality. I converte my model with torch2trt in code: ... model_trt_float32 = torch2trt( my_model,[ims],max_batch_size=32); model_trt…

joberzheng updated 3 years ago
2
NVIDIA/TensorRT-LLM #1478

Failed to build engine with the 70B sq_int model

### System Info 4*A800 80G ### Who can help? @Tracin ### Information - [X] The official example scripts - [ ] My own modified scripts ### Tasks - [X] An officially supported tas…

Opdoop updated 1 week ago
2
NVIDIA/TensorRT #3978

How to make PTQ calibration for a Hybrid Quantization model …

## Description what is the right way to calibrate a hybrid quantization model ？ i built my tensorrt engine from ONNX model by the sub code, i selected the ``` class Calibrator(trt.IInt8EntropyCa…

renshujiajia updated 4 months ago
3
NVIDIA/TensorRT #3754

onnx model convert trt.int8 failure：fallback fp32

## Description When I use TensorRT for int8 quantization, I always encounter the accuracy fallback to fp32. The trt.BuilderFlag.OBEY_PRECISION_CONSTRAINTS parameter does not solve the issue. W…

kakascode updated 7 months ago
6

上一页 1...3 4 5 6 7 8 9...100 下一页

1000+ results for tensorrt-int8-python

1000+ results
for tensorrt-int8-python