-
Thanks for your amazing work. It seems the inference time of the ONNX model is better than that of the TensorRT model. Is there anything wrong with my testing? I got an inference time of 150 ms for the ONNX model and …
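One thing worth checking is whether the first call is being timed: both ONNX Runtime and TensorRT do lazy initialization and GPU warm-up on the first run, so a single-shot measurement can be misleading. Below is a minimal sketch of a warm-up-plus-averaging measurement with ONNX Runtime; the model path, input name, and input shape are assumptions, not taken from the issue.

```python
import time
import numpy as np
import onnxruntime as ort

# Hypothetical model path and input shape; replace with the real ones.
sess = ort.InferenceSession("model.onnx", providers=["CUDAExecutionProvider"])
feed = {sess.get_inputs()[0].name: np.random.rand(1, 3, 640, 640).astype(np.float32)}

# Warm-up runs so initialization cost is not counted.
for _ in range(10):
    sess.run(None, feed)

# Average over many iterations instead of timing a single call.
n = 100
start = time.perf_counter()
for _ in range(n):
    sess.run(None, feed)
print(f"mean latency: {(time.perf_counter() - start) / n * 1e3:.2f} ms")
```

The same warm-up and averaging should be applied to the TensorRT measurement before comparing the two numbers.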
-
### Motivation
In some industrial projects, we need multiple models to handle multiple different defect types. In this case, we need one GPU to run inference against different defects with the related mod…
-
Code file name "clip_txt.py":
_MODELS = [
'RN50::openai',
'RN50::yfcc15m',
'RN50::cc12m',
'RN101::openai',
'RN101::yfcc15m',
'RN50x4::openai',
'ViT-B-32::openai',
…
-
Can you help with this issue when I run ./start-triton-server.sh?
I'm using
**nvcr.io/nvidia/tritonserver:21.07-py3**
> root@bf5cff23afa2:/apps# bash ./start-triton-server.sh --models yolov9-e-qat …
-
Hi,
I am doing some research and I am looking for feature matching models that are accurate but can also be optimized for edge devices. For the optimization, TensorRT is used and deployed on a Jetson…
-
Outlines currently supports the vLLM inference engine; it would be great if it could also support the TensorRT-LLM inference engine.
-
The size of the model trained on my own data is about 165M, and the inference time, including post-processing, is approximately 237 ms.
-
Trying to run offline retinanet in a container with one Nvidia GPU:
```
cm run script --tags=run-mlperf,inference,_find-performance,_full,_r4.1-dev --model=retinanet --implementation=nvidia …
```
-
@CarkusL Thanks for your great work. I merged pfe_sim.onnx and rpn.onnx into pointpillars_trt.onnx and used it with TensorRT for inference, but the result is wrong, as shown in the link. Could you help…
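For reference, two ONNX graphs can be joined with `onnx.compose.merge_models`; here is a minimal sketch, assuming the outputs of pfe_sim.onnx feed the inputs of rpn.onnx. The tensor names in `io_map` are placeholders, not the real ones from these models.

```python
import onnx
from onnx import compose

pfe = onnx.load("pfe_sim.onnx")
rpn = onnx.load("rpn.onnx")

# io_map wires outputs of the first graph to inputs of the second.
# The names below are placeholders; look up the real ones via
# [o.name for o in pfe.graph.output] and [i.name for i in rpn.graph.input].
merged = compose.merge_models(
    pfe, rpn,
    io_map=[("pfe_output", "rpn_input")],
)
onnx.save(merged, "pointpillars_trt.onnx")
```

If the merged graph runs but produces wrong results, it is worth checking that the io_map connections and any intermediate processing between the two stages match what the original two-model pipeline did.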
-
**Description**
I have converted an .onnx model file to a .plan (TensorRT) file using the 24.02-py3 docker image, with builder.max_batch_size = 16.
When I tried to deploy this model on Triton Infere…
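For context, recent TensorRT releases (including the one shipped in the 24.02 container) use the explicit-batch API, where the supported batch range is declared through an optimization profile rather than builder.max_batch_size. Below is a minimal sketch of building a plan that accepts batch sizes up to 16; the input tensor name, shape, and file paths are assumptions.

```python
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
)
parser = trt.OnnxParser(network, logger)

# Parse the ONNX model (path is a placeholder).
with open("model.onnx", "rb") as f:
    if not parser.parse(f.read()):
        for i in range(parser.num_errors):
            print(parser.get_error(i))
        raise RuntimeError("failed to parse ONNX model")

config = builder.create_builder_config()

# Declare a dynamic batch dimension from 1 to 16.
# "input" and the 3x224x224 shape are placeholders for the real tensor.
profile = builder.create_optimization_profile()
profile.set_shape("input", (1, 3, 224, 224), (8, 3, 224, 224), (16, 3, 224, 224))
config.add_optimization_profile(profile)

plan = builder.build_serialized_network(network, config)
with open("model.plan", "wb") as f:
    f.write(plan)
```

Note that the batch dimension must already be dynamic in the exported ONNX graph for the profile to take effect, and the model's Triton config then declares a matching max_batch_size.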