-
### System Info
- CPU architecture (x86_64)
- GPU name (NVIDIA A100)
- TensorRT-LLM (version: 0.8.0.dev2024013000)
### Who can help?
_No response_
### Information
- [X] The of…
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussion…
-
Thank you for your excellent work!
Recently I have been trying to accelerate Depth Anything with TensorRT on a Jetson Orin NX, but I found that inference with the converted TRT engine is not noticeably faster than with the ONNX file, and is in fact slower. Specifically:
```
ONNX Inference Time: 2.7s per image
TRT Inference Time: 3.0s per image
```
…
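One variable that often skews such comparisons is timing the first run, which includes lazy initialization. Below is a minimal timing sketch with a warm-up pass and synchronous execution; the engine path, input shape, and output shape are assumptions, not taken from the issue.

```python
# Minimal sketch: time a serialized TensorRT engine after a warm-up run.
# Engine path and tensor shapes are placeholders, not from the issue.
import time
import numpy as np
import tensorrt as trt
import pycuda.autoinit  # creates a CUDA context on import
import pycuda.driver as cuda

logger = trt.Logger(trt.Logger.WARNING)
with open("depth_anything.trt", "rb") as f:  # hypothetical engine file
    engine = trt.Runtime(logger).deserialize_cuda_engine(f.read())
context = engine.create_execution_context()

inp = np.random.rand(1, 3, 518, 518).astype(np.float32)  # assumed input shape
out = np.empty((1, 518, 518), dtype=np.float32)          # assumed output shape
d_inp = cuda.mem_alloc(inp.nbytes)
d_out = cuda.mem_alloc(out.nbytes)
cuda.memcpy_htod(d_inp, inp)

context.execute_v2([int(d_inp), int(d_out)])  # warm-up: excludes lazy init
runs = 20
start = time.perf_counter()
for _ in range(runs):
    context.execute_v2([int(d_inp), int(d_out)])
cuda.memcpy_dtoh(out, d_out)
print(f"TRT: {(time.perf_counter() - start) / runs:.3f}s per image")
```

Averaging over repeated runs after the warm-up gives a more representative per-image figure than a single cold measurement.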
-
## Description
When I use TensorRT for int8 quantization, the precision of some layers always falls back to fp32. Setting the trt.BuilderFlag.OBEY_PRECISION_CONSTRAINTS parameter does not solve the issue. W…
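For context, OBEY_PRECISION_CONSTRAINTS only enforces precisions that are explicitly set on individual layers; without per-layer assignments there is nothing to obey, and the builder is free to pick fp32. A sketch of the usual pattern follows; network population and the calibrator are omitted, and the layer filter is illustrative only.

```python
# Sketch: pin eligible layers to int8 so OBEY_PRECISION_CONSTRAINTS has
# explicit constraints to enforce. Network setup is omitted.
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
# ... populate `network`, e.g. with trt.OnnxParser ...

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.INT8)
config.set_flag(trt.BuilderFlag.OBEY_PRECISION_CONSTRAINTS)
# config.int8_calibrator = my_calibrator  # needed unless ranges are set

for i in range(network.num_layers):
    layer = network.get_layer(i)
    # Skip layers that cannot run in int8 anyway.
    if layer.type in (trt.LayerType.CONSTANT, trt.LayerType.SHAPE):
        continue
    layer.precision = trt.DataType.INT8
    layer.set_output_type(0, trt.DataType.INT8)
```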
-
## Description
A clear and concise description of the issue.
## Environment
**TensorRT Version**: 8.5
**NVIDIA GPU**: Jetson Orin Nano
**CUDA Version**: 11.4
**CUDNN Version*…
-
Model under test: Llama-2-7b-chat-hf
Following the instructions [here](https://github.com/NVIDIA/TensorRT-LLM/tree/release/0.5.0/examples/llama#awq), I was able to quantize the model and build the engine…
-
Hi, I'm trying to convert to a TensorRT int8 model from ONNX produced by keras2onnx.
My environment is as below:
python=3.7, keras2onnx=1.7, tensorflow=2.2.0, onnx=1.7, onnxconverter_common=1.7
My s…
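For reference, a minimal conversion sketch matching the versions listed above (keras2onnx 1.7, TensorFlow 2.2); the model is a stand-in, not the reporter's network.

```python
# Minimal keras2onnx conversion sketch; the model is a placeholder.
import tensorflow as tf
import keras2onnx

model = tf.keras.applications.MobileNetV2(weights=None)  # placeholder model
onnx_model = keras2onnx.convert_keras(model, model.name)
keras2onnx.save_model(onnx_model, "model.onnx")
```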
-
Firstly, thanks for this high-quality project.
I converted my model with torch2trt in code:
...
model_trt_float32 = torch2trt(my_model, [ims], max_batch_size=32)
model_trt…
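Alongside the float32 call above, torch2trt exposes fp16_mode and int8_mode keyword arguments; a short sketch with placeholder model and input standing in for `my_model` and `ims`:

```python
# Sketch of torch2trt precision arguments; model and input are placeholders.
import torch
import torchvision
from torch2trt import torch2trt

my_model = torchvision.models.resnet18().cuda().eval()  # placeholder model
ims = torch.randn(32, 3, 224, 224).cuda()               # placeholder input

model_trt_fp16 = torch2trt(my_model, [ims], max_batch_size=32, fp16_mode=True)
# int8 additionally calibrates; by default torch2trt calibrates on the
# example inputs, or pass int8_calib_dataset for a real calibration set.
model_trt_int8 = torch2trt(my_model, [ims], max_batch_size=32, int8_mode=True)
```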
-
## Description
What is the right way to calibrate a hybrid quantization model?
I built my TensorRT engine from the ONNX model with the code below, and I selected the `class Calibrator(trt.IInt8EntropyCa…`
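For reference, a skeleton of such a calibrator, assuming the truncated base class is trt.IInt8EntropyCalibrator2; the batch list and cache path are placeholders, not the issue's actual code.

```python
# Skeleton of an int8 entropy calibrator (assumed IInt8EntropyCalibrator2).
import numpy as np
import tensorrt as trt
import pycuda.autoinit
import pycuda.driver as cuda

class Calibrator(trt.IInt8EntropyCalibrator2):
    def __init__(self, batches, cache_file="calib.cache"):
        # batches: list of contiguous float32 arrays, all the same shape
        super().__init__()
        self.batches = batches
        self.index = 0
        self.cache_file = cache_file
        self.device_input = cuda.mem_alloc(batches[0].nbytes)

    def get_batch_size(self):
        return self.batches[0].shape[0]

    def get_batch(self, names):
        if self.index >= len(self.batches):
            return None  # returning None ends calibration
        cuda.memcpy_htod(self.device_input,
                         np.ascontiguousarray(self.batches[self.index]))
        self.index += 1
        return [int(self.device_input)]

    def read_calibration_cache(self):
        try:
            with open(self.cache_file, "rb") as f:
                return f.read()
        except FileNotFoundError:
            return None  # no cache yet: TensorRT calibrates from data

    def write_calibration_cache(self, cache):
        with open(self.cache_file, "wb") as f:
            f.write(cache)
```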
-
I tried to convert RT-DETR-R18 from ONNX to TensorRT; int8 succeeded, but fp16 failed.
torch2onnx (STATIC): `python tools/export_onnx.py`
onnx2trt: `./trtexec --onnx=rtdetr.onnx --saveEngin…`
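For what it's worth, here is a rough Python-API counterpart of an fp16 build like the trtexec line above, with file names assumed; surfacing parser and builder errors this way can help narrow down why fp16 fails while int8 succeeds.

```python
# Rough Python-API equivalent of an fp16 trtexec build; paths are assumed.
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)
with open("rtdetr.onnx", "rb") as f:
    if not parser.parse(f.read()):
        for i in range(parser.num_errors):
            print(parser.get_error(i))  # show why parsing failed
        raise RuntimeError("ONNX parse failed")

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.FP16)
engine_bytes = builder.build_serialized_network(network, config)
if engine_bytes is None:
    raise RuntimeError("fp16 build failed")
with open("rtdetr_fp16.engine", "wb") as f:
    f.write(engine_bytes)
```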