inference-server Search Results

1000+ results for inference-server

Stability-AI/StableCascade #113

ROCm / HIP support

*TL;DR: Please add support for AMD GPUs on Linux through ROCm.* Greetings, I was wondering if it is being considered to support ROCm for inference? Since an RX 7900 XTX is currently the only option …

SkyyySi updated 6 months ago
1
mono/CppSharp #1860

[feature request] Support `std::shared_ptr` (and complete Gi…

**UPD:** the missing `std::shared_ptr` support context is in the message https://github.com/mono/CppSharp/issues/1860#issuecomment-2297731111. Currently this support seems missing, and `shared_ptr`/`u…

vadimkantorov updated 4 weeks ago
27
aqlaboratory/openfold #233

ValueError: The number of positions must match the number of…

Can someone help me. Why am I getting this error when I run inference： Traceback (most recent call last): File "/data/lwq/openfold-1.0.0/lib/conda/envs/openfold_venv/lib/python3.7/runpy.py", lin…

liweiqing1997 updated 1 year ago
2
triton-inference-server/onnxruntime_backend #245

CPU Throttling when Deploying Triton with ONNX Backend on Ku…

**Description** I am deploying a YOLOv8 model for object-detection using Triton with ONNX backend on Kubernetes. I have experienced significant CPU throttling in the sidecar container ("queue-proxy")…

langong347 updated 3 months ago
6
triton-inference-server/server #5964

Could not load model using mlflow triton plugin with S3/mini…

**Description** Could not load model using mlflow with minIO as model repository. I have tried this AWS S3 bucket and it worked as expected. have followed this article [MLflow Triton Plugin](https://…

pragadeeshraju updated 2 months ago
10
meta-llama/llama-stack #125

llama stack run failed with AssertionError: Could not find c…

using pyenv + venv + Docker, llama stack run failed and seems cannot found model directory ``` $ llama stack run my-local-stack + '[' -n '' ']' + '[' -z '' ']' + docker run -it -p 5000:5000 -v …

kun432 updated 3 days ago
2
huggingface/text-generation-inference #2464

Failing to unpickle the model

### System Info cargo version cargo 1.80.1 (376290515 2024-07-16) Haven't been able to run the docker file to get more details.. I am trying to run the docker on CPU ### Information - [X] Docke…

ksajan updated 3 weeks ago
4
ultralytics/ultralytics #14710

Specify model version in triton

### Search before asking - [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…

pqthong updated 2 months ago
2
immich-app/immich #12888

It takes a long time to open a shared album in the mobile ap…

### The bug English is not a native language. There are about 55,000 objects in the shared album. In the mobile app, opening an album takes about a minute. Once opened it works quickly. If you leave …

Saneckg updated 1 day ago
1
NVIDIA/FasterTransformer #691

Serve Deberta using FasterTransformer in Triton

Hi, Is there any tutorial that we can refer to so that we could serve a deberta model using fastertransformer in Triton? I think the steps would be: 1. Convert a deberta-v2 model into fastertrans…

sfc-gh-zhwang updated 1 year ago
1
