-
http://127.0.0.1:8000/tmp/submission/DeviceInfo/
QAIC
-
http://127.0.0.1:8000/krai_qaic_task/DockerSetup/
QAIC
-
### System Info / 系統信息
H100, CUDA 12.4
### Information / 问题信息
```
[rank0]: File "/opt/venv/lib/python3.10/site-packages/transformers/modeling_utils.py", line 4226, in from_pretrained
[ran…
-
**What would you like to be added/modified**:
Sedna is an edge-cloud synergy AI project incubated in KubeEdge SIG AI. Benefiting from the edge-cloud synergy capabilities provided by KubeEdge, Sed…
-
### MediaPipe Solution (you are using)
Android library:com.google.mediapipe:tasks-genai:0.10.14
### Programming language
Android Java
### Are you willing to contribute it
None
### De…
-
-
This is a tracking issue for us to figure out for the service to process multiple requests in parallel "so users wouldn't notice" and we don't need to heavily invest into multiple GPUs
-
### 🚀 The feature, motivation and pitch
Hi, I'm currently working on **deploying vLLM distributed on multi-node in k8s cluster**. I saw that the official documentation provided a link by using [LWS…
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report.
### YOLOv8 Component
Predict
### Bug
ultral…
-
## Description
I am following this doc: https://awslabs.github.io/data-on-eks/docs/gen-ai/inference/GPUs/vLLM-rayserve
Once I run
```
cd data-on-eks/gen-ai/inference/vllm-rayserve-gpu
en…