ai-inference Search Results

1000+ results
for ai-inference


xihajun/test-comments #24

SUT Info - Qualcomm Cloud AI - MLPerf Inference

http://127.0.0.1:8000/tmp/submission/DeviceInfo/ QAIC

xihajun updated 1 year ago
2
xihajun/test-comments #4

Docker Setup - Qualcomm Cloud AI - MLPerf Inference

http://127.0.0.1:8000/krai_qaic_task/DockerSetup/ QAIC

xihajun updated 1 year ago
1
THUDM/CogVideo #455

Cannot load safetensors OSError: No such device (os error 19…

### System Info H100, CUDA 12.4 ### Information ``` [rank0]: File "/opt/venv/lib/python3.10/site-packages/transformers/modeling_utils.py", line 4226, in from_pretrained [ran…

complexfilter updated 1 day ago
4
kubeedge/sedna #430

Sedna joint inference and federated learning controller opti…

**What would you like to be added/modified**: Sedna is an edge-cloud synergy AI project incubated in KubeEdge SIG AI. Benefiting from the edge-cloud synergy capabilities provided by KubeEdge, Sed…

tangming1996 updated 3 months ago
2
google-ai-edge/mediapipe #5610

Support Gemma2-2b model for inference in Android

### MediaPipe Solution (you are using) Android library: com.google.mediapipe:tasks-genai:0.10.14 ### Programming language Android Java ### Are you willing to contribute it None ### De…

FranzKafkaYu updated 2 days ago
12
VectorSpaceLab/OmniGen #9

Do you think it would be possible to run it at lower prezio…

Manni1000 updated 1 week ago
3
fedora-copr/logdetective #82

Service can concurrently process multiple requests

This is a tracking issue for us to figure out how the service can process multiple requests in parallel "so users wouldn't notice", without us needing to invest heavily in multiple GPUs.

TomasTomecek updated 1 day ago
2
vllm-project/vllm #8074

[Feature]: Support multi-node serving on Kubernetes

### 🚀 The feature, motivation and pitch Hi, I'm currently working on **deploying vLLM distributed on multi-node in k8s cluster**. I saw that the official documentation provided a link by using [LWS…

linnlh updated 1 month ago
5
ultralytics/ultralytics #14377

ultralytics/examples/YOLOv8-CPP-Inference/

### Search before asking - [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report. ### YOLOv8 Component Predict ### Bug ultral…

rathorology updated 1 week ago
7
awslabs/data-on-eks #661

Ray Serve Mistral LLM GPU Deploy Worker fails readiness chec…

## Description I am following this doc: https://awslabs.github.io/data-on-eks/docs/gen-ai/inference/GPUs/vLLM-rayserve Once I run ``` cd data-on-eks/gen-ai/inference/vllm-rayserve-gpu en…

calvinraveenthran updated 2 weeks ago
6

