-
Hello
I pulled the 0.6.0 Docker image and just tried to run the two demo commands:
1. ```shell
   docker run -it --rm --gpus all \
     -v $PWD:/project ghcr.io/els-rd/transformer-deploy:0.6.0 \
     bash -c "cd /project && \
   …
   ```
-
**Description**
CUDA Graph does not work in the tensorrt backend. The model config is as below:
```
platform: "tensorrt_plan"
version_policy: { latest: { num_versions: 2 } }
parameters { key: "execution_mode"…
```
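For reference, CUDA Graph capture for `tensorrt_plan` models is normally requested through the `optimization` block of `config.pbtxt` rather than a `parameters` entry; a minimal sketch (field names per Triton's `model_config.proto`, shown here only as a plausible baseline to compare against):

```
optimization {
  cuda {
    # Ask the TensorRT backend to capture and replay CUDA graphs.
    graphs: true
  }
}
```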
-
I ran the below cm command several times, and it always failed at the same place:
```shell
(cm) tomcat@tomcat-Dove-Product:~$ cm run script --tags=run-mlperf,inference,_find-performance,_full,_r4.1-dev \
…
```
-
### Checklist
- [X] I have searched related issues but cannot get the expected help.
- [ ] I have read the [FAQ documentation](https://github.com/open-mmlab/mmdeploy/tree/main/docs/en/faq.md) but …
-
### System Info
I have searched this repo and the main server repo but don't see any information on either (a) support for Safetensors (many models on HF are saved in that format) or (b) whether th…
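As background for the Safetensors question: the format itself is easy to inspect with the standard library alone, since a `.safetensors` file is an 8-byte little-endian header length, a JSON header mapping tensor names to `{dtype, shape, data_offsets}`, and then the raw tensor bytes. The sketch below builds a minimal in-memory payload and parses it; the file layout follows the published safetensors format, while the helper name and the demo tensor are made up for illustration.

```python
import json
import struct

def read_safetensors_header(data: bytes) -> dict:
    """Parse the JSON header of a .safetensors payload.

    Per the safetensors format: the first 8 bytes are a little-endian
    u64 giving the byte length of a JSON header that maps tensor names
    to {dtype, shape, data_offsets}; raw tensor data follows the header.
    """
    (header_len,) = struct.unpack("<Q", data[:8])
    return json.loads(data[8:8 + header_len])

# Build a minimal in-memory payload for demonstration:
# one fp32 tensor "w" of shape [2], stored as 8 raw bytes.
header = {"w": {"dtype": "F32", "shape": [2], "data_offsets": [0, 8]}}
header_bytes = json.dumps(header).encode("utf-8")
payload = struct.pack("<Q", len(header_bytes)) + header_bytes + b"\x00" * 8

print(read_safetensors_header(payload))  # tensor metadata keyed by name
```

Because the header carries shapes and dtypes up front, a server can enumerate a checkpoint's tensors without deserializing any weights.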
-
Hello, I have a suggestion for a notebook: an **example of a cuML-trained model being exported so it can be served by TensorRT.**
More information on TensorRT:
- https://docs.nvidia.com/deeplear…
-
```dockerfile
# Base image
FROM nvcr.io/nvidia/tritonserver:24.04-trtllm-python-py3
USER root
RUN apt update && apt install --no-install-recommends rapidjson-dev python-is-python3 git-lfs curl uuid…
-
Checklist
- [x] I've prepended issue tag with type of change: [bug]
- [ ] (If applicable) I've attached the script to reproduce the bug
- [ ] (If applicable) I've documented below the DLC image/doc…
-
### Describe the issue
In a scenario where multiple GPU devices are available, when selecting the TensorrtExecutionProvider and choosing device_id = 0, the model infers perfectly. However, when usi…
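For comparison with the scenario above: in onnxruntime, the target GPU is normally selected through provider options at session creation. A minimal sketch of that structure follows (provider and option names per the onnxruntime API; the session call is commented out because it needs a GPU-enabled build and a model file, and `model.onnx` is a placeholder):

```python
# Sketch: pinning onnxruntime's TensorRT execution provider to a
# specific GPU via provider options, with CUDA and CPU fallbacks.
providers = [
    ("TensorrtExecutionProvider", {"device_id": 1}),  # use the second GPU
    ("CUDAExecutionProvider", {"device_id": 1}),      # CUDA fallback, same GPU
    "CPUExecutionProvider",                           # last-resort fallback
]

# import onnxruntime as ort
# sess = ort.InferenceSession("model.onnx", providers=providers)

print(providers)
```

Keeping the CUDA fallback on the same `device_id` avoids the provider chain silently moving inference back to GPU 0.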
-
Whenever I run MLPerf Inference for Llama2-70b in a Docker container, I get the error below. I deleted the container image and ran it again, but still hit the same error.
Host server is running …