-
### Your current environment
2024-04-24 06:04:07 (27.2 MB/s) - ‘collect_env.py’ saved [24877/24877]
Collecting environment information...
PyTorch version: 2.2.1+cu121
Is debug build: False
CUDA…
-
同时请求会发生堵塞,一个请求结束之后另一个才出答案
-
Checklist
- [x] I've prepended issue tag with type of change: [feature]
- [x] (If applicable) I've documented below the DLC image/dockerfile this relates to
- [x] (If applicable) I've documented th…
-
**Description**
I deployed a bert_base model from hugging face's transformer library via torchscript and Triton's pytorch backend.
But i found **the GPU utilization is around 0**, and performance is…
-
**Description**
Triton build using `./build.py ` fails due to a warning (`-Werror=sign-compare`) which throws an error. The warning comes from `response_cache_test.cc` in the `core` repo ([here](http…
-
**Description**
Error raised when calling more than one request at the same time. Pipeline Stable Diffusion 2.1. Requests were called with perf_analyzer.
**Triton Information**
I use triton conta…
-
Hello @owulveryck,
i try to run this onnx model:
![model](https://user-images.githubusercontent.com/12670730/193880343-3ac5cf92-6f65-49bb-8358-f981bb5b116e.png)
with this code:
```go
p…
-
Open questions from [RFD 172](https://github.com/joyent/rfd/tree/master/rfd/0172).
1. Should these zone be owned by admin?
1. Should these zones be deployed via sdcadm?
1. Should these zones have…
-
## 🐛 Bug Report
After `from catalyst.data.sampler import DistributedSamplerWrapper`, setting CUDA_VISIBLE_DEVICE will have no effect.
To me, this is a bit counterintuitive. Is this correct, I want…
-
**Description**
When Triton Server is hosted in Big Endian machine, GRPC calls with BYTES input fails.
**Triton Information**
What version of Triton are you using? 23.01
Are you using the Trit…