-
Model under test: Llama-2-7b-chat-hf
Following the instructions [here](https://github.com/NVIDIA/TensorRT-LLM/tree/release/0.5.0/examples/llama#awq), I was able to quantize the model and build the engine…
-
python quantize.py --model_dir /qwen-14b-chat --dtype float16 --qformat int4_awq --export_path ./qwen_14b_4bit_gs128_awq.pt --calib_size 32
python build.py --hf_model_dir=/qwen-14b-chat/ --quant…
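For reference, the end-to-end INT4-AWQ flow in the linked release/0.5.0 llama example looks roughly like the sketch below. The paths are the ones from this report; the build.py plugin and weight-only flags are taken from the llama README and are an assumption for the Qwen example, so verify them against your checkout's `python build.py --help`.

```bash
# Step 1: calibrate and export the AWQ-scaled checkpoint.
python quantize.py --model_dir /qwen-14b-chat \
                   --dtype float16 \
                   --qformat int4_awq \
                   --export_path ./qwen_14b_4bit_gs128_awq.pt \
                   --calib_size 32

# Step 2: build the engine from the quantized checkpoint. The plugin and
# weight-only flags follow the release/0.5.0 llama AWQ example (assumed to
# carry over to Qwen); the output directory is a placeholder.
python build.py --hf_model_dir /qwen-14b-chat/ \
                --quant_ckpt_path ./qwen_14b_4bit_gs128_awq.pt \
                --dtype float16 \
                --use_gpt_attention_plugin float16 \
                --use_gemm_plugin float16 \
                --use_weight_only \
                --weight_only_precision int4_awq \
                --per_group \
                --output_dir ./qwen_14b_awq_engine
```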
-
### System Info
- CPU architecture: x86_64
- GPU name: NVIDIA A40, 46GB
- TensorRT-LLM: v0.9.0
- OS: Ubuntu 20.04
- NVIDIA driver: 535.54.03, CUDA: 12.2
### Who can help?
@kaiyux @byshiue…
-
I tried to convert RT-DETR-R18 from ONNX to TensorRT; it succeeded in INT8 but failed in FP16.
torch2onnx (static shapes): python tools/export_onnx.py
onnx2trt: ./trtexec --onnx=rtdetr.onnx --saveEngin…
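A minimal pair of trtexec invocations that exercises both precisions, plus the usual FP16 workaround, is sketched below; `rtdetr.onnx` and the engine/layer names are placeholders, and `--layerPrecisions` requires TensorRT 8.4 or newer.

```bash
# INT8 build (reported to succeed).
./trtexec --onnx=rtdetr.onnx --saveEngine=rtdetr_int8.engine --int8

# FP16 build with verbose logging to identify where it fails.
./trtexec --onnx=rtdetr.onnx --saveEngine=rtdetr_fp16.engine --fp16 --verbose

# If the failure is an FP16 overflow in specific layers, pin them to FP32.
# "/decoder/layers.0/MatMul" is a hypothetical layer name; take the real one
# from the verbose log above.
./trtexec --onnx=rtdetr.onnx --saveEngine=rtdetr_fp16.engine --fp16 \
          --precisionConstraints=obey \
          --layerPrecisions=/decoder/layers.0/MatMul:fp32
```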
-
I am trying to deploy a Baichuan2-7B model on a machine with 2 Tesla V100 GPUs. Unfortunately, each V100 has only 16GB of memory.
I have applied INT8 weight-only quantization, so the size of the engine I…
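Back-of-envelope: 7B parameters at 1 byte each after INT8 weight-only quantization is about 7GB of weights, which is tight on a 16GB V100 once activations and KV cache are added; splitting the engine across both GPUs with 2-way tensor parallelism halves that to roughly 3.5GB per GPU. A sketch, assuming the Baichuan example's build.py takes the same parallelism flags as the llama example of that era (check `--help`):

```bash
# Build a 2-way tensor-parallel INT8 weight-only engine. --world_size and
# --tp_size follow the llama example and are assumed for Baichuan; the
# paths are placeholders.
python build.py --model_dir ./Baichuan2-7B-Chat \
                --dtype float16 \
                --use_weight_only \
                --weight_only_precision int8 \
                --world_size 2 \
                --tp_size 2 \
                --output_dir ./baichuan2_7b_int8_tp2

# Launch one rank per GPU at inference time.
mpirun -n 2 python run.py --engine_dir ./baichuan2_7b_int8_tp2
```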
-
Thanks for this excellent project!
I can generate a bfloat16 model or an int8 weight-only model, but when I tried the following commands:
python ./examples/llama/build.py --model_dir ./Mixtral-8x7B-Inst…
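For comparison, the bfloat16 build that reportedly works would look something like the sketch below; the model path is a placeholder for the truncated one above, and the MoE flags (`--moe_num_experts`, `--moe_top_k`) are assumed from the MoE-enabled llama example and may be absent or renamed in other versions.

```bash
# Hypothetical known-good bfloat16 Mixtral build via the llama example;
# all paths and the MoE flags are assumptions, not taken from this report.
python ./examples/llama/build.py --model_dir ./Mixtral-8x7B-Instruct-v0.1 \
                                 --dtype bfloat16 \
                                 --use_gpt_attention_plugin bfloat16 \
                                 --use_gemm_plugin bfloat16 \
                                 --moe_num_experts 8 \
                                 --moe_top_k 2 \
                                 --output_dir ./mixtral_bf16_engine
```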
-
**What is your question?**
Hello, thanks for your project.
CUTLASS version: 2.10
Device: RTX 3090
I want to implement W4A4 conv quantization in tensorrt_llm with CUTLASS.
Follow the example and do…
-
## Description
## Environment
**TensorRT Version**: 8.5
**CUDA Version**: 11.4
**CUDNN Version**: 8.6
**Operating System**:
**Python Version (if applicable)**: 3.8.10
PyTorch …
-
## Description
I am trying to figure out whether TensorRT and the `pytorch_quantization` module support post-training quantization for vision transformers.
The following piece of code follows the `pyt…
-
### System Info
- GPU Name: T4 x2
- System RAM: 30GB
### Who can help?
_No response_
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### Reproducti…