-
/kind bug
**What steps did you take and what happened:**
init storage-initializer log output:
```
2024-08-13 01:02:12.899 1 kserve INFO [in…
-
### Your current environment
```text
The output of `python collect_env.py`
Collecting environment information...
PyTorch version: 2.2.1+cu121
Is debug build: False
CUDA used to build PyT…
-
At the moment, inference using a model from Hugging Face is only possible for autrainer models, transforms, loaders, etc., as we do not download any `.py` files.
To support custom models, the corresp…
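The current behavior can be pictured as a filter over the repo's file list that drops Python sources; a minimal sketch, assuming a hypothetical helper (the function name, flag, and filenames are illustrative, not autrainer's actual code):

```python
# Hypothetical sketch of the current download behavior: `.py` files are
# skipped, so custom model code never reaches the local cache. Supporting
# custom models would mean allowing them through (opt-in here).
def files_to_download(repo_files, allow_python=False):
    skipped = set() if allow_python else {".py"}
    return [f for f in repo_files
            if not any(f.endswith(ext) for ext in skipped)]
```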
-
When testing the curl command for the mistral7b model, this URL will not work; it is internal-only, given the 'Token authentication service not installed' error. This appears to be an issue in the case …
-
When asking a question, please provide the following information where possible:
### Basic information
- The **pretrained model** you loaded: [Robert-tiny-clue](https://github.com/CLUEbenchmark/CLUE)
### Problem description
I trained a classification model with bert4keras and saved a checkpoint via save_weights; loading it with ```model.load_weights``` and predicting works fine.
…
-
### 🚀 The feature, motivation and pitch
I need a way to specify exactly which GPU vLLM should use when multiple GPUs are available. Currently, it automatically occupies all available GPUs (https://do…
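A common workaround, pending a first-class option, is to hide the unwanted GPUs via the standard CUDA mechanism before vLLM initializes; a minimal sketch (the GPU index and model name are assumptions):

```python
import os

# Workaround sketch, not a vLLM feature: restrict which GPUs are visible
# by setting CUDA_VISIBLE_DEVICES before vLLM/torch touches CUDA.
# "1" below is an illustrative device index.
os.environ["CUDA_VISIBLE_DEVICES"] = "1"

# Import vLLM only after the variable is set, e.g.:
# from vllm import LLM
# llm = LLM(model="mistralai/Mistral-7B-v0.1")  # hypothetical model name
```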
-
Validating the glm4-9b-chat model's output as follows; the serving side reports an error:
```
curl --request POST \
  --url http://127.0.0.1:8000/v1/chat/completions \
  --header 'content-type: application/json' \
  --data '{
  "model": "glm-4-9…
```
-
## Willingness to contribute
The MLflow Community encourages new feature contributions. Would you or another member of your organization be willing to contribute an implementation of this feature (ei…
-
**Describe the bug**
Mixtral-based models (e.g. Prometheus) don't allow system messages to precede user messages in their template. We merge the first system message with the first user message so mess…
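The described merge can be sketched as a small message-list transform; a hypothetical illustration (function name and concatenation format are assumptions, not the actual implementation):

```python
def merge_leading_system(messages):
    # Sketch: fold a leading system message into the first user message so
    # templates that forbid system-before-user (e.g. Mixtral's) still work.
    # Messages are OpenAI-style dicts with "role" and "content" keys.
    if messages and messages[0]["role"] == "system":
        system, rest = messages[0], list(messages[1:])
        if rest and rest[0]["role"] == "user":
            rest[0] = {
                "role": "user",
                "content": system["content"] + "\n\n" + rest[0]["content"],
            }
            return rest
    return messages
```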
-
### 🚀 The feature, motivation and pitch
Hi, I'm currently working on **deploying vLLM distributed across multiple nodes in a k8s cluster**. I saw that the official documentation provides a link to [LWS…