-
Trying to run the sample on two devices (a Pixel 8 on Android 15 with "Enable on-device GenAI Features" enabled, and a Pixel 9 Pro running GrapheneOS with Play Services and AICore installed), I get the following error:
…
-
### Description
The _inference service deploys models through the _ml/trained_models API. However, users are still able to manage these deployments using the trained…
-
### Problem Statement
Nowadays, remote model servers such as AWS SageMaker, Bedrock, OpenAI, and Cohere all support batch predict APIs, which allow users to send large numbers of synchronous request…
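To illustrate the batching pattern described above (this is a generic sketch, not any particular vendor's client; the function name and JSONL payload shape are assumptions), a small helper that groups individual predict requests into fixed-size batch payloads:

```python
import json

def build_batches(requests, batch_size=3):
    """Group individual predict requests into fixed-size batches.

    Each batch is serialized as a JSONL payload (one request per line),
    a common input shape for batch predict endpoints. Illustrative
    sketch only -- not any specific provider's API.
    """
    batches = []
    for start in range(0, len(requests), batch_size):
        chunk = requests[start:start + batch_size]
        batches.append("\n".join(json.dumps(r) for r in chunk))
    return batches

# Example: 7 requests split into batches of 3, 3, and 1
reqs = [{"id": i, "input": f"text-{i}"} for i in range(7)]
payloads = build_batches(reqs, batch_size=3)
print(len(payloads))  # 3
```

A batch predict API would then accept each payload as one request instead of seven separate synchronous calls.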
-
### Issues Policy acknowledgement
- [X] I have read and agree to submit bug reports in accordance with the [issues policy](https://www.github.com/mlflow/mlflow/blob/master/ISSUE_POLICY.md)
### Where…
-
Create an API service that can be called to process requests from the app.
We can then host it on a server.
The API shall accept the role and the token.
Instructions for deploying the …
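A minimal sketch of the role/token check such a service might perform. The token store, role names, and response shape here are all hypothetical, introduced only to make the "accept the role and the token" requirement concrete:

```python
# Hypothetical token store mapping token -> role. A real service would
# back this with a database or an identity provider, not a dict.
VALID_TOKENS = {"tok-admin-123": "admin", "tok-user-456": "user"}

def authorize(role: str, token: str) -> dict:
    """Validate a (role, token) pair and return a response dict.

    Status codes follow common HTTP conventions: 401 for an unknown
    token, 403 for a token that does not carry the requested role.
    """
    actual_role = VALID_TOKENS.get(token)
    if actual_role is None:
        return {"status": 401, "error": "invalid token"}
    if actual_role != role:
        return {"status": 403, "error": "role mismatch"}
    return {"status": 200, "role": role}

print(authorize("admin", "tok-admin-123"))  # {'status': 200, 'role': 'admin'}
```

The same check would sit behind whatever web framework the hosted service ends up using.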
-
### Elasticsearch Version
8.14.0-SNAPSHOT
### Installed Plugins
_No response_
### Java Version
JBR-17.0.9+8-1166.2-nomod
### OS Version
23.3.0 Darwin Kernel Version 23.3.0: Wed De…
-
**Describe the bug**
When running as a non-root user within a container, sagemaker-inference fails to start the multi-model-server. This works when all packages are installed as root, and the entry…
-
### Description
The inference API supports the text embedding and rerank task types. If an inference endpoint is created for text embedding, and a request is made to perform inference and the request co…
-
### 🚀 The feature
## Author: Li Ning
## Background
A stateful model can detect interdependencies between successive inference requests. This type of model maintains a persist…
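To make "state carried across successive inference requests" concrete, a toy stateful-model wrapper (the class, the session-keyed cache, and the string "prediction" are illustrative assumptions, not part of the proposal):

```python
class StatefulModel:
    """Toy model that keeps per-session state across inference calls.

    Unlike a stateless model, each call can read and update the state
    left behind by earlier requests in the same session. Illustrative
    sketch only.
    """
    def __init__(self):
        self._sessions = {}  # session_id -> accumulated token history

    def infer(self, session_id: str, token: str) -> str:
        history = self._sessions.setdefault(session_id, [])
        history.append(token)
        # The "prediction" depends on everything seen so far in the session.
        return " ".join(history)

m = StatefulModel()
m.infer("s1", "hello")
print(m.infer("s1", "world"))  # "hello world" - state carried over
print(m.infer("s2", "fresh"))  # "fresh" - separate session, no shared state
```

The serving-side implication is that requests belonging to the same session must be routed to the replica holding that session's state.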
-
/kind bug
**What steps did you take and what happened:**
Deployed inferenceservice iris-classifier-deployment:
```
% kubectl get inferenceservices
NAME URL …