-
Hi,
I have been exploring models that I can fine-tune with my own data to produce embeddings for pairwise similarity calculation.
My data looks like: [title][space][url]. I do not ha…
mon28 updated 11 months ago
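Once a model produces an embedding per `[title][space][url]` line, pairwise similarity is usually cosine similarity over the embedding matrix. A minimal NumPy sketch (the embedding values below are made up for illustration; the embedding dimension would come from whatever model is fine-tuned):

```python
import numpy as np

def pairwise_cosine(embeddings: np.ndarray) -> np.ndarray:
    """Return the (n, n) pairwise cosine-similarity matrix for (n, d) embeddings."""
    # Normalize each row to unit length; a dot product then gives cosine similarity.
    norms = np.linalg.norm(embeddings, axis=1, keepdims=True)
    unit = embeddings / np.clip(norms, 1e-12, None)
    return unit @ unit.T

# Hypothetical embeddings for three "[title] [url]" lines (d = 4 for illustration).
emb = np.array([
    [0.1, 0.9, 0.2, 0.4],
    [0.1, 0.8, 0.3, 0.5],
    [0.9, 0.1, 0.7, 0.0],
])
sim = pairwise_cosine(emb)   # sim[i, j] in [-1, 1]; diagonal is 1
```

The normalize-then-matmul form avoids an explicit double loop and scales to large batches.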
-
### System Info
NVIDIA RTX A6000
### Who can help?
@juney-nvidia
Hi
I'm interested in using TensorRT-LLM for multiple inferences, but I'd like to be able to adjust the `num_be…
-
### 💡 Your Question
Hello,
I was wondering if there is any way to export the YOLO-NAS model to ONNX with dynamic image-size axes and then convert to TensorRT with dynamic shapes (if needed I can ex…
-
### System Info
- GPU: NVIDIA H100 80G
- TensorRT-LLM branch main
- TensorRT-LLM commit: 535c9cc6730f5ac999e4b1cb621402b58138f819
### Information
- [x] The official example scripts
- [ ]…
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussion…
-
I'm trying to run FP16 inference with TensorRT 8.5.2.2 on a Xavier NX device, and I'm getting NaN or garbage values. Has anyone encountered a similar issue?
- I'm using B0 and B1 segmentation models (…
-
I am attempting to convert the pretrained weights of the KeepTrack model to TensorRT for inference. As I am new to TensorRT and ONNX, I would greatly appreciate any guidance or suggestions on how to s…
-
When I use Vista3D, I encounter the following problem when running the command `python -m monai.bundle run --config_file "['configs/inference.json', 'configs/inference_trt.json']"`
**environ…
-
tensorrt 8.6.10
If operators are fused by Myelin, how can I analyze the performance of each operator?
Are there any tools or samples?
The core purpose is to optimize the overall inference performance…
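For per-layer timing, `trtexec` can build and profile in one run; `--dumpProfile` prints per-layer average times, `--separateProfileRun` keeps the profiling pass from skewing the throughput numbers, and `--profilingVerbosity=detailed` preserves more layer-name information through fusion. A command-assembly sketch (the model filename is hypothetical; note that a Myelin-fused region still reports as one node in the profile, so for finer detail inside it a system profiler such as Nsight Systems is the usual next step):

```python
# Sketch: assemble a trtexec run that reports per-layer timings.
cmd = [
    "trtexec",
    "--onnx=model.onnx",              # hypothetical model file
    "--profilingVerbosity=detailed",  # keep layer names through fusion
    "--dumpProfile",                  # print per-layer average times
    "--separateProfileRun",           # profile in a second pass
    "--exportProfile=profile.json",   # machine-readable per-layer profile
]
print(" ".join(cmd))
```

The exported `profile.json` can then be sorted by layer time to find where the optimization effort should go.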
-
**Description**
I'm using a simple client inference class based on the client example. My TensorRT inference with batch size 10 takes 150 ms, but Triton with the TensorRT backend took 1100 ms. This is my client:…
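A 150 ms vs 1100 ms gap usually comes from the client path rather than the backend itself (per-request serialization, sending a batch as many single requests, or network round-trips), so the first step is to measure the client call in isolation. A generic timing sketch with warmup; `fake_infer` is a stand-in to be replaced with the real client call:

```python
import time

def time_call(fn, *, warmup=3, runs=20):
    """Average wall-clock latency of fn() in milliseconds, after warmup runs."""
    for _ in range(warmup):
        fn()                          # warm caches / connections first
    start = time.perf_counter()
    for _ in range(runs):
        fn()
    return (time.perf_counter() - start) / runs * 1e3

# Stand-in for the real request, e.g. a Triton client's infer() call.
def fake_infer():
    time.sleep(0.001)

latency_ms = time_call(fake_infer)
print(f"{latency_ms:.2f} ms")
```

Timing the call this way separates request overhead from compute; comparing it against the server-side model latency shows whether the extra ~950 ms is spent in transport/serialization or in the backend.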