-
**Description**
The Triton build using `./build.py` fails because `-Werror=sign-compare` turns a sign-comparison warning into an error. The warning comes from `response_cache_test.cc` in the `core` repo ([here](http…
-
**Description**
When Triton Server is hosted on a big-endian machine, gRPC calls with BYTES input fail.
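
One possible explanation, not confirmed in this report, is a byte-order mismatch when the length prefix of a BYTES element is decoded. The sketch below assumes each element is prefixed with its length as a 4-byte little-endian unsigned integer. The helper names `serialize_bytes_element` and `read_length` are hypothetical and are not part of any Triton API.

```python
import struct


def serialize_bytes_element(payload: bytes) -> bytes:
    """Prefix payload with its length as a 4-byte little-endian unsigned int (assumed layout)."""
    return struct.pack("<I", len(payload)) + payload


def read_length(buffer: bytes, byte_order: str) -> int:
    """Decode the 4-byte length prefix with an explicit byte order ('<' or '>')."""
    return struct.unpack(byte_order + "I", buffer[:4])[0]


element = serialize_bytes_element(b"hello")
little = read_length(element, "<")  # length as the sender intended
big = read_length(element, ">")     # length seen if the prefix is read as big-endian
print(little, big)
```

If the two values differ, as they do for any non-zero length, a reader that decodes the prefix in native order on a big-endian host would see a wrong element size.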
**Triton Information**
What version of Triton are you using? 23.01
Are you using the Trit…
-
## 🐛 Bug Report
After `from catalyst.data.sampler import DistributedSamplerWrapper`, setting `CUDA_VISIBLE_DEVICES` has no effect.
To me, this is a bit counterintuitive. Is this correct, I want…
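
A minimal sketch of a common workaround, assuming the import initializes CUDA (which this report does not confirm): the CUDA runtime reads `CUDA_VISIBLE_DEVICES` when it initializes, so the variable is set before any such import. The helper `restrict_visible_gpus` is hypothetical.

```python
import os


def restrict_visible_gpus(device_ids: str) -> str:
    """Set CUDA_VISIBLE_DEVICES and return the value now stored in the environment."""
    os.environ["CUDA_VISIBLE_DEVICES"] = device_ids
    return os.environ["CUDA_VISIBLE_DEVICES"]


# Set the variable first ...
visible = restrict_visible_gpus("0")

# ... and only then import modules that may initialize CUDA, for example:
# from catalyst.data.sampler import DistributedSamplerWrapper
```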
-
### Describe the issue
Hi IPEX team,
I have an application where I want to serve multiple models concurrently, and I want to share weights across concurrent instances. I normally do this with `tor…
-
### Checklist
- [X] I have searched related issues but cannot get the expected help.
- [X] 2. I have read the [FAQ documentation](https://github.com/open-mmlab/mmdeploy/tree/main/docs/en/faq.md) but …
-
**Is this a BUG REPORT or FEATURE REQUEST?**:
> Uncomment only one, leave it on its own line:
>
> /kind bug
> /kind feature
**What happened**:
**What you expected to happen**:
**How…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Are you using forge?
No
### Installed conforming to our guide?
- [X] I have read the installation guide and …
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [x] 2. The bug has not been fixed in the latest version.
### Describe the bug
```
Special tokens have been…
-
### System Info
- **Hardware**: AWS g6.12xlarge (us-east-2) / 4x NVIDIA L4 GPU
- **OS**: Ubuntu 24.04 LTS (Noble Numbat)
- **NVIDIA Driver**: nvidia-open 560.28.03
- **CUDA**: 12.6
- **Docker**: …
-
I'm having trouble interpreting some of the results...
After an Automatic Brute Search analysis, when I analyse the result_summary, I look at the Average GPU Utilization.
How is this value de…