-
KFServing has released version 0.1.0; we are looking to integrate it with arena.
-
Hi!
I think that, in the current implementation, the engine cannot serve models that have fewer than five outputs. The [classify](https://github.com/NVIDIA/gpu-rest-engine/blob/d8d2255884f965b2feca855cb9e18…
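A minimal sketch of why a fixed top-5 selection would fail for smaller output vectors (the constant and function below are illustrative assumptions, not the engine's actual code):

```python
import heapq

TOP_CLASSES = 5  # assumed fixed top-k used by the engine


def top_classes(probs, k=TOP_CLASSES):
    """Return indices of the k highest-probability outputs.

    With fewer than k outputs there are not enough classes to fill the
    top-k list, which is presumably where serving breaks down.
    """
    if len(probs) < k:
        raise ValueError(f"model has {len(probs)} outputs, fewer than k={k}")
    return heapq.nlargest(k, range(len(probs)), key=probs.__getitem__)


# Works with five or more outputs, raises with fewer.
print(top_classes([0.1, 0.2, 0.05, 0.4, 0.15, 0.1]))
```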
-
Hello! I use this simulator for LLM serving, but when I run the following command:
```shell
python3 -u main.py --model_name 'gpt3-6.7b' --npu_num 1 --npu_group 1 --npu_mem 24 --dataset 'dataset/share-gp…
```
-
It would be nice to have a new parameter in the `InferenceService` CRD that allows users to specify the model size (in bytes), avoiding the `MODEL_MULTIPLIER` factor used to estimate the size.
**Is …
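An illustrative sketch (not KServe code) contrasting the multiplier-based estimate with an explicitly declared size; `MODEL_MULTIPLIER`'s value and the `modelSizeBytes` field name are assumptions:

```python
MODEL_MULTIPLIER = 1.5  # assumed heuristic factor applied to the on-disk size


def estimated_size_bytes(on_disk_bytes: int) -> int:
    """Current behaviour: scale the stored model size by a fixed factor."""
    return int(on_disk_bytes * MODEL_MULTIPLIER)


def declared_size_bytes(spec: dict, on_disk_bytes: int) -> int:
    """Proposed behaviour: prefer a user-declared size, else fall back
    to the multiplier-based estimate."""
    declared = spec.get("modelSizeBytes")  # hypothetical CRD field name
    return declared if declared is not None else estimated_size_bytes(on_disk_bytes)


print(declared_size_bytes({"modelSizeBytes": 7_000_000_000}, 4_000_000_000))
```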
-
/kind feature
**Describe the solution you'd like**
Is there any config to modify the imagePullPolicy of queue-proxy? This question has stumped me for a long time, and I've read the docs of kserve & knat…
-
-----------------------
## Feature Request
### Describe the problem the feature is intended to solve
TensorFlow is promoting Apple's M1 Macs; it would be great to have TFServing running on M1 Macs as…
-
ONNX export (e.g. with https://onnx.ai/sklearn-onnx/) would be very beneficial for deploying trained models to any environment and programming language. Do you have such export options considering ON…
-
### Summary
### Steps to Reproduce
1. Deploy the latest `incubation` of odh-operator sources using manifests from [here](https://github.com/opendatahub-io/opendatahub-operator/blob/d4ba37e4b041977…
-
### Your current environment
```text
The output of `python collect_env.py`
```
### 🐛 Describe the bug
I used the OpenAI-compatible server deployed with vLLM:
```bash
python -m vllm.entr…
```
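Assuming the server exposes the standard OpenAI-style `/v1/completions` route, a request can be sketched as follows (the URL and model name are assumptions for illustration):

```python
import json
import urllib.request


def build_request(url: str, model: str, prompt: str, max_tokens: int = 16):
    """Build a POST request for an OpenAI-compatible completions endpoint."""
    payload = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = build_request("http://localhost:8000/v1/completions", "my-model", "Hello")
# urllib.request.urlopen(req) would send it once the server is running.
print(json.loads(req.data.decode())["model"])
```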
-
## Description
djl-serving version: djl-inference:0.26.0-tensorrtllm0.7.1
models:
- meta-llama/Llama-2-7b-chat (see https://huggingface.co/meta-llama/Llama-2-7b-chat; used in this report)
- meta-lla…