-
/kind bug
**What steps did you take and what happened:**
I ran the inference service on a custom XGBoost model that I trained and saved with a `.joblib` extension, using the PVC storage option, and followed th…
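For context, the artifact was produced roughly like this (a minimal sketch; the dataset, hyperparameters, and file name are placeholders, not the actual setup):
```python
import joblib
import numpy as np
from xgboost import XGBClassifier

# Train a toy XGBoost model (placeholder data standing in for the real dataset).
X = np.random.rand(100, 4)
y = np.random.randint(0, 2, size=100)

model = XGBClassifier(n_estimators=10)
model.fit(X, y)

# Persist with joblib as described above; the resulting file is then
# copied onto the PVC that the InferenceService mounts.
joblib.dump(model, "model.joblib")
```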
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…
-
This PR is part of an effort to improve the integration of Feast with model serving. Also see #4139 and the accompanying draft [RFC](https://docs.google.com/document/d/1PzBbTs_8R73XhuDq3CO0slmGy5S_ci2rwtbx1L-…
-
### System Info
Image: v1.2 CPU
Model used: jinaai/jina-embeddings-v2-base-de
Deployment: Docker / RH OpenShift
### Information
- [X] Docker
- [ ] The CLI directly
### Tasks
- [X] An officiall…
-
### Feature request
Currently, the OTLP service name is hard-coded as `"text-generation-inference.server"`.
Could an environment variable be added to set this? Something like...
resour…
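For illustration only: in the OpenTelemetry Python SDK, the service name reaches the exporter through a `Resource`, so the request amounts to feeding that value from the environment. The variable name below is hypothetical (TGI's tracing is implemented in Rust; this is just a sketch of the pattern):
```python
import os

from opentelemetry.exporter.otlp.proto.grpc.trace_exporter import OTLPSpanExporter
from opentelemetry.sdk.resources import Resource
from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.sdk.trace.export import BatchSpanProcessor

# Hypothetical variable name; falls back to the current hard-coded value.
service_name = os.environ.get("OTLP_SERVICE_NAME", "text-generation-inference.server")

resource = Resource.create({"service.name": service_name})
provider = TracerProvider(resource=resource)
provider.add_span_processor(BatchSpanProcessor(OTLPSpanExporter()))
```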
-
### What happened?
I have tried a couple of different models hosted on NVIDIA NIM, but none of them supports system messages, frequency penalty, or presence penalty. This is causing errors that (I t…
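For reference, a request that exercises all three features at once against an OpenAI-compatible endpoint looks roughly like this (base URL, key, and model name are placeholders):
```python
from openai import OpenAI

# Placeholder endpoint and credentials for a NIM-hosted deployment.
client = OpenAI(base_url="https://example.com/v1", api_key="not-a-real-key")

# Exercises the three features the models reportedly reject:
# a system message, frequency_penalty, and presence_penalty.
response = client.chat.completions.create(
    model="example-model",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello"},
    ],
    frequency_penalty=0.5,
    presence_penalty=0.5,
)
print(response.choices[0].message.content)
```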
-
/kind bug
**What steps did you take and what happened:**
Deployed the InferenceService `iris-classifier-deployment`:
```
% kubectl get inferenceservices
NAME                          URL …
```
-
**Link to the notebook**
In the code below I am clearly passing a different instance type, on which I want to deploy my trained model:
```
finetuned_predictor = estimator.deploy(
    instance_type='ml.…
```
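One way to check which instance type the endpoint actually received is to read back its endpoint config (a sketch using boto3; the endpoint name is a placeholder):
```python
import boto3

sm = boto3.client("sagemaker")

# Placeholder endpoint name; use the name of the deployed endpoint.
endpoint = sm.describe_endpoint(EndpointName="my-endpoint")
config = sm.describe_endpoint_config(
    EndpointConfigName=endpoint["EndpointConfigName"]
)

# Print the instance type each production variant is actually running on.
for variant in config["ProductionVariants"]:
    print(variant["VariantName"], variant["InstanceType"])
```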
-
**Description**
After running with the Python vLLM backend, Triton crashed with signal 11 (SIGSEGV). The model had been loaded and warmed up for some time before the crash occurred.
**Triton Information**
What ve…
-
[TF Lite Micro (link - supported platforms)](https://www.tensorflow.org/lite/microcontrollers#supported_platforms) makes local, on-device ML inference possible, enabling powerful example applications like…