-
I used a self-trained llama13b model in a tensorrt-llm 0.7.1 environment with two 4090 GPUs. Running multiple batches of the dataset produces abnormal results; however, when running a single batch i…
-
/code/tensorrt_llm# python examples/llama/convert_checkpoint.py --model_dir /code/tensorrt_llm/Mixtral-8x7B-Instruct-v0.1/ --dtype float16 --output_dir /code/tensorrt_llm/examples/Mixtral-8x7B-Instruc…
-
Hi @wondervictor, I changed the associated config, checkpoint, and img-size in export_onnx.py.
![image](https://github.com/AILab-CVC/YOLO-World/assets/59815166/a9320cc6-19dc-469b-9136-211031244de2)
…
-
## Description
I was trying to use the TensorRT ModelOpt library to quantize a ResNet-18 from PyTorch. The code to reproduce is:
```python
from torchvision import models
from torch import nn, optim
# Def…
-
### Search before asking
- [X] I have searched the issues and found no related answer.
### Please ask your question
How can I compress the SOLOv2 model, or accelerate it with TensorRT?
Following paddleslim's …
-
I have successfully generated a calibration cache file (dataset.cache) for my dataset using Polygraphy. I want to load the generated calibration cache file and create an INT8 engine using C++.
Function I'm using t…
-
### System Info
GPU: RTX 8000
Driver version: 525.85.05
CUDA version: 12.0
System: Ubuntu 20.04
### Who can help?
_No response_
### Information
- [ ] The official example scripts
- [ ] My own mod…
-
Hi Ryan,
I am trying to create a calibration file for the ResNet-18 Caffe model. You mentioned the following statement in another issue:
_I have created a reference for INT8 calibration on Ima…
-
### System Info
- Ubuntu
- GPU A100 / 3090 RTX
- docker nvcr.io/nvidia/tritonserver:24.02-trtllm-python-py3
- Python tensorrt-llm package (version 0.9.0.dev2024030500) installed in the docker im…
-
Following the documentation, I have installed the TRT-enabled Paddle build:
paddlepaddle_gpu-2.2.1-cp37-cp37m-linux_x86_64.whl
When running infer.py on the exported PicoDet model with --run_mode=trt_int8 --trt_calib_mode=True, I get this error:
File "/home/vehicle_detection/PaddleDetecti…