serving Search Results - Githubissues

1000+ results
for serving

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

vllm-project/vllm #457

Support Kuberenetes for Distributed Serving

Only having support for ray for distributed inference will significantly reduce adoption of this tool if it truly is more performant than TGI. TGI can be run as a black-box image on Kubernetes with su…

sam-h-bean updated 3 weeks ago
2
kubernetes-sigs/wg-serving #14

[Serving Catalog] Add HPA configurations

One important (and non-trivial) aspect of running model servers today is to ensure they are able to scale horizontally in response to load. Today, traditional CPU/Memory-based autoscaling are not suff…

raywainman updated 2 months ago
2
MehVahdJukaar/amendments #123

Cauldron Only Makes Single Serving

Me again. Stews made in a cauldron only produce one bowl, the rest of the stew visibly in the cauldron is not accessible.

NotAPotter updated 2 months ago
4
vllm-project/vllm #9739

[Bug]: ValueError: At most 1 image(s) may be provided in one…

### Your current environment vllm-openai/v06.3.1.post-1 ### Model Input Dumps a_request: None, prompt_adapter_request: None. 2024-10-27 23:04:39 INFO 10-27 09:04:39 engine.py:290] Added request ch…

eav-solution updated 4 weeks ago
5
tensorflow/recommenders-addons #467

How can I remove Horovod ops from the savedModel to use with…

Title basically says it, I have trained a model using HorovodAllToAllEmbeddings and saved by doingg: ``` de.keras.models.de_save_model( model, export_dir, overwrit…

alykhantejani updated 1 week ago
2
canonical/knative-operators #243

Can't integrate rocks to `securityContext.runAsNonRoot`: `tr…

### Bug Description While working on `net-istio-webhook` extension rock for knative we had encountered a problem where we can't run rocks in `securityContext.runAsNonRoot`: `true` Kubernetes deploym…

misohu updated 2 weeks ago
3
runpod-workers/worker-vllm #129

Chat completion (template) not working with VLLM 0.6.3 + Ser…

I deployed https://huggingface.co/xingyaoww/Qwen2.5-Coder-32B-Instruct-AWQ-128k model through the Serverless UI, setting max model context window to 129024 and quantization to awq. I deploy it using t…

xingyaoww updated 1 week ago
5
knative-extensions/eventing-kafka-broker #4169

kafka-source-dispatcher pods are not running, they are getti…

**Describe the bug** kafka-source-dispatcher statefulset object is not able to spin up the new pods. It gets deleted immediately after it is first provisioned. Here is the result of kubectl descr…

raswinraaj updated 6 hours ago
3
litestar-org/litestar #3516

Bug: LoggingMiddleware breaks static file serving

### Description If you try to add logging middleware without excluded /static route, then you will get the following error ``` Traceback (most recent call last): File "/workdir/.venv/lib/pyt…

wallseat updated 1 week ago
2
openfoodfacts/openfoodfacts-server #1618

Support Chinese serving size

### Summary - If you enter a serving in chinese, i.e. 38公克, it will not be recognized. And no values per serving are calculated. ### Steps to reproduce - Check out https://world.openfoodfacts.org/pro…

aleene updated 1 month ago
4

上一页 1...3 4 5 6 7 8 9...100 下一页

1000+ results for serving

1000+ results
for serving