-
@npuichigo I am trying to use [Triton Inference Server with TensorRT-LLM backend](https://nvidia.github.io/TensorRT-LLM/quick-start-guide.html#deploy-with-triton-inference-server) with [openweb-ui](ht…
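A minimal sketch of the request shape the TensorRT-LLM ensemble's generate endpoint expects, used to sanity-check the server before pointing a UI at it; the URL, model name, and field names are assumptions based on the default `tensorrtllm_backend` ensemble config and may differ per deployment.

```python
import requests

# Assumed endpoint of the default "ensemble" model exposed by the
# TensorRT-LLM backend's generate API; adjust host and model name as needed.
TRITON_URL = "http://localhost:8000/v2/models/ensemble/generate"

payload = {
    "text_input": "What is machine learning?",  # field names follow the default ensemble config
    "max_tokens": 64,
    "bad_words": "",
    "stop_words": "",
}

resp = requests.post(TRITON_URL, json=payload, timeout=60)
resp.raise_for_status()
print(resp.json()["text_output"])
```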
-
The Triton Inference Server supports TensorRT models, and our Triton Serving Runtime [indicates this](https://github.com/kserve/modelmesh-serving/blob/main/config/runtimes/triton-2.x.yaml#L28).
…
-
### System Info
**Hardware:**
- CPU architecture: x86_64
- CPU memory size:
- L1d cache: 2 MiB
- L1i cache: 2 MiB
- L2 cache: 64 MiB
- L3 cache: 256 MiB
- GPU name: NVIDIA A100 80GB PCIe
…
-
Hi there,
I have been fine-tuning Whisper models using Hugging Face. Then, to convert the model to TensorRT-LLM format, I use an HF script that converts the model from its HF format to the original …
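For reference, a minimal sketch of how the fine-tuned checkpoint is loaded and re-saved on the Hugging Face side before running the conversion script; the checkpoint paths below are placeholders.

```python
import torch
from transformers import WhisperForConditionalGeneration, WhisperProcessor

# Placeholder path to the fine-tuned Hugging Face Whisper checkpoint.
ckpt = "path/to/finetuned-whisper"

model = WhisperForConditionalGeneration.from_pretrained(ckpt, torch_dtype=torch.float16)
processor = WhisperProcessor.from_pretrained(ckpt)

# Re-save so the conversion script sees a complete save_pretrained()
# directory (config, tokenizer/processor files, and weights together).
model.save_pretrained("whisper-finetuned-hf")
processor.save_pretrained("whisper-finetuned-hf")
```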
-
### System Info
- docker image: nvcr.io/nvidia/tritonserver:24.05-trtllm-python-py3
- tensorrt_llm: 0.9.0
### Who can help?
@kaiyux @byshiue
### Information
- [ ] The official example scripts…
-
**Description**
I am testing tritonserver on the example models fetched using this script:
https://github.com/triton-inference-server/server/blob/main/docs/examples/fetch_models.sh
triton serve…
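As a sanity check before sending inference requests, the server can be queried with the Python HTTP client; `densenet_onnx` is one of the models that fetch_models.sh downloads.

```python
import tritonclient.http as httpclient

# Assumes tritonserver is running locally with the example model repository.
client = httpclient.InferenceServerClient(url="localhost:8000")

print("server ready:", client.is_server_ready())
print("model ready:", client.is_model_ready("densenet_onnx"))
print(client.get_model_metadata("densenet_onnx"))
```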
-
### System Info
I have searched the repo here and the main server repo but don't see any information on either a) support for Safetensors (many models on HF are saved that way) or b) whether th…
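For background, this is what is meant by Safetensors checkpoints: many HF models ship only `.safetensors` shards, which can be inspected like this (the path is a placeholder).

```python
from safetensors.torch import load_file

# Placeholder path to one shard of a Hugging Face checkpoint.
state_dict = load_file("model.safetensors")

for name, tensor in state_dict.items():
    print(name, tuple(tensor.shape), tensor.dtype)
```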
-
Looking at the release of TensorRT 9.1.0, I am very happy to see the integration of openai-triton with TensorRT plugins.
However [one limitation of this integration is that python must be availabl…
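To make the limitation concrete, here is a toy openai-triton kernel of the kind the plugin integration targets; because the kernel is JIT-compiled from its Python source at runtime, the interpreter and the `triton` package have to be present wherever the plugin executes. The kernel below is only an illustrative sketch, not taken from the TensorRT integration itself.

```python
import torch
import triton
import triton.language as tl

@triton.jit
def scale_kernel(x_ptr, out_ptr, scale, n_elements, BLOCK: tl.constexpr):
    # Each program instance handles one BLOCK-sized slice of the tensor.
    pid = tl.program_id(axis=0)
    offsets = pid * BLOCK + tl.arange(0, BLOCK)
    mask = offsets < n_elements
    x = tl.load(x_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x * scale, mask=mask)

x = torch.randn(4096, device="cuda")
out = torch.empty_like(x)
grid = (triton.cdiv(x.numel(), 1024),)
scale_kernel[grid](x, out, 2.0, x.numel(), BLOCK=1024)
```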
-
**Is your feature request related to a problem? Please describe.**
Currently, the fastest way to run Computer Vision models for inference is to use a TensorRT-optimised model. It is widely a…
-
**Environments:**
- os: ubuntu server 22.04 LTS
- gpu: H100*2
- docker-ce: 5:27.1.2
- nvidia-container-toolkit: 1.16.1
- image: styler00dollar/vsgan_tensorrt:latest (08/15/2024)
- commit: …