-
# Latency in OpenVINO Model Server Inside Kubernetes Cluster
## To Reproduce
**Steps to reproduce the behavior:**
1. **Prepare Models Repository**: Followed standard procedures to set up the mode…
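For context, here is a minimal latency probe against the server, assuming it exposes the TensorFlow-Serving-compatible REST API that OVMS implements; the service address, REST port, model name, and input shape below are all placeholder assumptions to adjust for the actual deployment:

```python
# Minimal latency probe against OVMS's TF-Serving-compatible REST endpoint.
# Placeholders/assumptions: service address, REST port 8000, model name
# "resnet", and a single float32 NHWC input; adjust for the real model.
import time

import numpy as np
import requests

OVMS_URL = "http://ovms-service:8000/v1/models/resnet:predict"  # placeholder

payload = {"instances": np.random.rand(1, 224, 224, 3).astype("float32").tolist()}

latencies_ms = []
for _ in range(100):
    start = time.perf_counter()
    requests.post(OVMS_URL, json=payload, timeout=10).raise_for_status()
    latencies_ms.append((time.perf_counter() - start) * 1000.0)

print(f"p50={np.percentile(latencies_ms, 50):.1f} ms  "
      f"p99={np.percentile(latencies_ms, 99):.1f} ms")
```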
-
This is a tracker for all the work items needed to complete the feature work for Model Serving Metrics - Round 2.
# Requirements
Add requirements
# Individual Efforts
* UX…
-
Can you share the best practices for serving DGL models in production? (Which of the frameworks is preferred/fully supported - TorchServe, TensorFlow Serving, KServe, or anything Kubeflow-based, N…
-
## Description
Unable to use the OpenAI-compatible endpoint; I am getting the error below.
### Error Message
PyProcess W-100-model-stdout: The following parameters are not supported by neuron with rolling batch: {'…
-
### 🐛 Describe the bug
I have 2 model archive files in my model store, gender_model.mar and age_model.mar. Each one of these works for inference individually with torchserve. Individually I start t…
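For reference, a minimal sketch of registering and querying both archives on a single torchserve instance via the management API. It assumes the default ports (8081 management, 8080 inference), that the registered model names match the archive names, and that token authorization (enabled by default in newer TorchServe releases) is disabled or handled separately:

```python
# Register both archives on a running torchserve instance, then query each.
# Assumes torchserve was started with --model-store pointing at the directory
# containing both .mar files; sample.jpg is a placeholder input.
import requests

MGMT = "http://localhost:8081"
INFER = "http://localhost:8080"

for mar in ("gender_model.mar", "age_model.mar"):
    resp = requests.post(f"{MGMT}/models", params={"url": mar, "initial_workers": 1})
    resp.raise_for_status()

with open("sample.jpg", "rb") as f:  # placeholder input image
    image = f.read()

for name in ("gender_model", "age_model"):
    resp = requests.post(f"{INFER}/predictions/{name}", data=image)
    print(name, resp.status_code, resp.text[:200])
```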
-
I'd like us to move to an experience like this when serving models with torchserve, following projects like FastAPI. The benefit of this is that people can integrate torchserve much more easily into their ex…
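To make the target experience concrete, here is a minimal FastAPI app shaped like torchserve's `/predictions/{model}` route. This is purely illustrative of the requested developer experience; `load_model` and the model object are hypothetical placeholders, not an existing torchserve API:

```python
# Illustrative sketch only: the FastAPI-style experience being asked for.
# `load_model` is a hypothetical stand-in, not a real torchserve function.
from fastapi import FastAPI

app = FastAPI()

def load_model(name: str):
    # Stand-in for whatever model loading such an API would expose.
    return lambda data: {"model": name, "input_len": len(data)}

model = load_model("gender_model")

@app.post("/predictions/gender_model")
async def predict(payload: dict) -> dict:
    # Route shape mirrors torchserve's /predictions/{model} endpoint.
    return {"prediction": model(payload.get("data", ""))}
```

Run with `uvicorn main:app`; the appeal is that a handful of decorated functions is the entire serving surface.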
-
Envoy supports sending the full request body to the external authorization server via the with_request_body filter configuration. Do you think that it is possible to expose such a feature on the Securit…
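For reference, the underlying Envoy option looks roughly like this on the HTTP ext_authz filter (field names from Envoy's v3 API; the cluster name and buffer sizes are arbitrary examples):

```yaml
http_filters:
- name: envoy.filters.http.ext_authz
  typed_config:
    "@type": type.googleapis.com/envoy.extensions.filters.http.ext_authz.v3.ExtAuthz
    grpc_service:
      envoy_grpc:
        cluster_name: ext-authz      # placeholder cluster for the authz server
    with_request_body:
      max_request_bytes: 8192        # example buffer size
      allow_partial_message: true    # forward a truncated body instead of erroring
```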
-
I'm trying to use a saved model in TensorFlow Serving, but without success.
**I exported the model:**
```
from yolov4.tf import YOLOv4  # import not shown in the excerpt; assumes the `yolov4` PyPI package

yolo = YOLOv4()
yolo.config.parse_names("yolov4-data/coco.names")
yolo.config.parse_cfg("yolo…
```
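The excerpt cuts off before the actual export, but for context, the usual flow is to write a versioned SavedModel directory and then query it through TF Serving's REST API. A minimal sketch, assuming the underlying Keras model is reachable as `yolo.model` (the attribute name may differ by yolov4 package version) and TF Serving listens on its default REST port 8501:

```python
import numpy as np
import requests
import tensorflow as tf

# Export under a numeric version subdirectory, as TF Serving expects.
tf.saved_model.save(yolo.model, "export/yolov4/1")

# After starting: tensorflow_model_server --model_name=yolov4 \
#   --model_base_path=/abs/path/to/export/yolov4 --rest_api_port=8501
frame = np.random.rand(1, 416, 416, 3).astype("float32").tolist()
resp = requests.post(
    "http://localhost:8501/v1/models/yolov4:predict",
    json={"instances": frame},
)
print(resp.status_code, resp.text[:300])
```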
-
**Description**
I am currently using the Triton vLLM backend in my Kubernetes cluster. There are 2 GPUs that Triton is able to see; however, it seems to load the model weights only onto GPU 0.
I h…
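In case it is useful context for triage: with the vLLM backend, multi-GPU usage is typically driven by vLLM's tensor parallelism set in the model's model.json, rather than by Triton's instance placement. A sketch with a placeholder model name and example values:

```json
{
  "model": "facebook/opt-125m",
  "tensor_parallel_size": 2,
  "gpu_memory_utilization": 0.9
}
```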
-
https://docs.google.com/presentation/d/1jxj9zjeRRu1BJSf8tzaWoQVr5h7HZbOUsSh39Rcwv80/edit#slide=id.g10be2c57ddf_7_3
Aha! Link: https://nvaiinfa.aha.io/features/MERLIN-672