-
**What would you like to be added**:
Similar to KServe's parallel model inference: https://kserve.github.io/website/latest/modelserving/v1beta1/custom/custom_model/#parallel-model-inference
**Why is this needed**:
*…
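To illustrate the request, here is a minimal sketch of what parallel model inference can look like: a fixed pool of model replicas, with each request dispatched to whichever replica is currently free. This is a hypothetical toy, not KServe's actual implementation, and the `ReplicaPool` name and dummy `lambda` models are made up for illustration.

```python
from concurrent.futures import ThreadPoolExecutor
from queue import Queue

class ReplicaPool:
    """Dispatch requests across a fixed set of model replicas so that
    independent requests run in parallel (illustrative sketch only)."""

    def __init__(self, replicas):
        # Free-list of replicas; a request blocks until one is available.
        self.free = Queue()
        for replica in replicas:
            self.free.put(replica)
        self.pool = ThreadPoolExecutor(max_workers=len(replicas))

    def _infer(self, x):
        model = self.free.get()   # wait for a free replica
        try:
            return model(x)
        finally:
            self.free.put(model)  # return the replica to the pool

    def predict(self, x):
        """Submit a request; returns a Future with the replica's output."""
        return self.pool.submit(self._infer, x)

# Two dummy "replicas" standing in for loaded model copies.
pool = ReplicaPool([lambda x: x + 1, lambda x: x + 1])
futures = [pool.predict(i) for i in range(4)]
print([f.result() for f in futures])  # [1, 2, 3, 4]
```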
-
### Configuration
```hcl
resource "databricks_model_serving" "this" {
  name = "e5_${local.STICKY_RANDOM}"
  config {
    served_entities {
      name = "e5_small_v2"
      enti…
-
Title basically says it: I have trained a model using HorovodAllToAllEmbeddings and saved it by doing:
```
de.keras.models.de_save_model(
    model,
    export_dir,
    overwrit…
-
Ray Serve is a phenomenal serving engine that abstracts away serving and provides throughput optimizations like batching, async execution, and pipelining. It supports PyTorch and other popular frameworks. …
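The dynamic batching mentioned above (which Ray Serve exposes via its `@serve.batch` decorator) can be sketched in plain `asyncio`: concurrent requests are queued, then flushed to the model as one batch once the batch fills up or a short timeout expires. This is an illustrative toy under those assumptions, not Ray Serve's implementation.

```python
import asyncio

class Batcher:
    """Collect concurrent requests and run the model on them as one batch."""

    def __init__(self, model_fn, max_batch=8, timeout_s=0.01):
        self.model_fn = model_fn    # batched model: list of inputs -> list of outputs
        self.max_batch = max_batch
        self.timeout_s = timeout_s
        self.queue: asyncio.Queue = asyncio.Queue()

    async def submit(self, item):
        """Enqueue one request and await its individual result."""
        fut = asyncio.get_running_loop().create_future()
        await self.queue.put((item, fut))
        return await fut

    async def run(self):
        """Worker loop: gather up to max_batch items, then run the model once."""
        while True:
            item, fut = await self.queue.get()
            batch, futs = [item], [fut]
            deadline = asyncio.get_running_loop().time() + self.timeout_s
            while len(batch) < self.max_batch:
                remaining = deadline - asyncio.get_running_loop().time()
                if remaining <= 0:
                    break
                try:
                    item, fut = await asyncio.wait_for(self.queue.get(), remaining)
                except asyncio.TimeoutError:
                    break
                batch.append(item)
                futs.append(fut)
            # One batched model call resolves every waiting request.
            for f, out in zip(futs, self.model_fn(batch)):
                f.set_result(out)

async def main():
    batcher = Batcher(lambda xs: [x * 2 for x in xs])  # dummy batched model
    worker = asyncio.create_task(batcher.run())
    results = await asyncio.gather(*(batcher.submit(i) for i in range(5)))
    worker.cancel()
    return results

print(asyncio.run(main()))  # [0, 2, 4, 6, 8]
```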
-
We support lws as the default workload; however, in most cases multi-host is not needed, even with Llama 3.1 405B. So maybe this is a better choice.
-
Opening this issue to track the progress of models supported in candle-vllm.
-
As part of the ongoing development of the meal planner feature, we need to add the ability to display and edit servings and serving units for each food item. This will provide users with more detailed…
-
I'm trying to deploy my SetFit model in TorchServe using a custom handler for this task. The problem is that I'm not able to do this, since I'm getting multiple errors while registering the model on …
-
I realize this is an orthogonal question, but what's a simple way to stand up llama.c model serving so I can access it from LangChain?
-
### Feature request
[Kserve](https://github.com/kserve/kserve) is a Kubernetes-based engine for predictive and generative AI models and provides an abstraction over popular model servers like Huggingface…