-
There are some erros based on your Dockerfile.
1. cupy cuml cudf cugraph was not installed.
Here is your Dockerfile
FROM nvcr.io/nvidia/tritonserver:22.07-py3
LABEL maintainer="NVIDIA"
LABEL …
-
Hello everyone,
I encountered an error message (as shown below) while trying to run the Mamba model (code below).
Experimental environment:
Cuda11.8 + Pytorch2.0.0 + Triton=2.2.0
What should…
-
### System Info
I have searched the repo here and the main server repo but don't see any information on either a) support for Safetensors (many models are saved that way on HF) and also b) whether th…
-
### System Info
Hi,
I noticed there is no slack, discord or irc channel for tensorrt - which could offload some future tickets by discussing things in the channel - so I created one.
I hope its…
-
So far the latest publicly available triton inference server with paddle backend is `paddlepaddle/triton_paddle:21.10` and there are lots of bug fixes since then. I'm experiencing an increasing amount…
-
```
➜ fauxpilot git:(main) ./launch.sh
[+] Building 0.6s (16/16) FINISHED
=> [fauxpilot-copilot_proxy internal] load .dockerignore …
-
**Is your feature request related to a problem? Please describe.**
Hi team,
we used to use command line to start a Triton server, so it's easy to enable nsys by running command like below
```…
-
### Your current environment
```text
Collecting environment information...
/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/cuda/__init__.py:611: UserWarning: Can't initialize NVML
warni…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [X] 3. Please note that if the bug-related iss…
-