-
One important (and non-trivial) aspect of running model servers today is ensuring that they can scale horizontally in response to load. Today, traditional CPU/memory-based autoscaling is not suff…
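For context on why a custom metric matters here: the standard Kubernetes HPA scaling rule is `desired = ceil(currentReplicas × currentMetric / targetMetric)`, and a model-server autoscaler would apply the same rule to a serving-specific metric such as per-replica queue depth. A minimal sketch (the function name and the queue-depth numbers are illustrative, not from any issue above):

```python
import math

def desired_replicas(current_replicas: int, current_metric: float,
                     target_metric: float) -> int:
    """Kubernetes HPA scaling rule:
    desired = ceil(current * currentMetric / targetMetric)."""
    return math.ceil(current_replicas * (current_metric / target_metric))

# Hypothetical GPU-bound model server: 4 replicas, observed queue depth
# of 30 requests per replica against a target of 10 -> scale to 12.
print(desired_replicas(4, 30.0, 10.0))  # -> 12
```

With a CPU-based metric, a GPU-bound server can sit near-idle on CPU while its request queue grows, which is why the rule only helps once it is fed a load-correlated metric.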
-
### 🚀 The feature, motivation and pitch
Recently, larger models have appeared that cannot be deployed on a single machine, such as Grok. Can we support efficient multi-node serving?
### Alternatives
_No …
-
ERROR:
`λ localhost /work/Serving/build-server-npu {v0.9.0} make TARGET=ARMV8 -j16
[ 3%] Built target extern_gflags
[ 9%] Built target extern_snappy
[ 9%] Built target extern_zlib
[ 13%] Perfo…
-
### Proposal to improve performance
I am using vLLM version 0.6.3.post1 with four 4090 GPUs to run inference on the qwen2-72B-chat-int4 model. A single request is served very fast, but the perf…
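The gap being reported is between single-request latency and many-request throughput. With continuous batching, N concurrent requests should each finish in roughly one request's latency rather than queuing serially; a toy simulation of that ideal (the 0.05 s latency and request count are made-up numbers, and `asyncio.sleep` stands in for a model forward pass):

```python
import asyncio
import time

async def handle(request_id: int, latency: float = 0.05) -> int:
    # Stand-in for one model forward pass (hypothetical fixed latency).
    await asyncio.sleep(latency)
    return request_id

async def sequential(n: int) -> float:
    # Requests served one at a time: total time ~ n * latency.
    t0 = time.perf_counter()
    for i in range(n):
        await handle(i)
    return time.perf_counter() - t0

async def concurrent(n: int) -> float:
    # Requests served concurrently: total time ~ 1 * latency.
    t0 = time.perf_counter()
    await asyncio.gather(*(handle(i) for i in range(n)))
    return time.perf_counter() - t0

seq = asyncio.run(sequential(8))
par = asyncio.run(concurrent(8))
print(f"sequential ~{seq:.2f}s, concurrent ~{par:.2f}s")
```

If measured throughput degrades well below this ideal as request rate rises, the bottleneck is typically batch scheduling, KV-cache pressure, or dequantization cost rather than raw per-request latency.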
ljwps updated
2 weeks ago
-
## 🚀 The feature
Implement support for Detectron2 models within the TorchServe object detection examples. This includes:
1. Developing a custom handler that works seamlessly with both CPU and GP…
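TorchServe custom handlers follow an initialize/preprocess/inference/postprocess contract. A framework-free sketch of that shape, with a dummy predictor standing in for a real Detectron2 model (the class names, labels, and device handling here are illustrative assumptions, not TorchServe or Detectron2 code):

```python
class DummyDetector:
    """Stand-in for a Detectron2 predictor; a real handler would build
    one from a Detectron2 config instead."""
    def __call__(self, image):
        return [{"label": "person", "score": 0.9, "box": [0, 0, 10, 10]}]

class DetectionHandler:
    """Sketch of TorchServe's handler contract without the ts.* imports."""
    def initialize(self, context=None):
        # Real handler: choose "cuda" if torch.cuda.is_available() else "cpu".
        self.device = "cpu"
        self.model = DummyDetector()

    def preprocess(self, data):
        # Real handler: decode request bytes into image tensors on self.device.
        return [d.get("body") for d in data]

    def inference(self, images):
        return [self.model(img) for img in images]

    def postprocess(self, outputs):
        # Real handler: convert detections to JSON-serializable results.
        return outputs

    def handle(self, data, context=None):
        return self.postprocess(self.inference(self.preprocess(data)))

handler = DetectionHandler()
handler.initialize()
print(handler.handle([{"body": b"fake-image-bytes"}]))
```

The CPU/GPU requirement in the request mostly reduces to the device choice in `initialize` plus moving decoded inputs to that device in `preprocess`.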
-
Here's an overview of the features we intend to work on in the near future.
## Core Keras
### Saving & export
- Implement saving support for sharded models (sharded weights files).
- Improve…
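On the sharded-saving item: the common scheme packs weights into files under a size cap and writes an index mapping each weight name to its shard, as in the `*.index.json` convention used for sharded checkpoints. A minimal sketch, assuming weights are already serialized to bytes (the file-naming pattern and size cap are illustrative):

```python
import json
import os
import tempfile

def save_sharded(weights: dict, out_dir: str,
                 max_shard_bytes: int = 1024) -> dict:
    """Greedily pack weights into shards no larger than max_shard_bytes
    (an oversized single weight gets its own shard) and write an index
    mapping each weight name to the shard file holding it."""
    shards, current, size = [], {}, 0
    for name, blob in weights.items():
        if current and size + len(blob) > max_shard_bytes:
            shards.append(current)
            current, size = {}, 0
        current[name] = blob
        size += len(blob)
    if current:
        shards.append(current)

    weight_map = {}
    for i, shard in enumerate(shards):
        fname = f"model-{i + 1:05d}-of-{len(shards):05d}.bin"
        with open(os.path.join(out_dir, fname), "wb") as f:
            for name, blob in shard.items():
                weight_map[name] = fname
                f.write(blob)
    index = {"weight_map": weight_map}
    with open(os.path.join(out_dir, "model.bin.index.json"), "w") as f:
        json.dump(index, f)
    return index

with tempfile.TemporaryDirectory() as d:
    idx = save_sharded({"a": b"x" * 700, "b": b"y" * 700, "c": b"z" * 100}, d)
    print(idx["weight_map"])
```

Loading is the inverse: read the index, then open only the shards containing the weights actually requested.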
-
### Use-cases
I would like the Databricks Terraform Provider to support the creation of a feature_spec object/function within the Unity Catalog. This is essential for serving lookup tables in online …
-
According to #2388, it should be possible to push and pull models to a Docker/OCI registry (without authentication).
Even though it's an unsupported feature, I find it very useful and would like to…
mitja updated
2 weeks ago
-
### Describe the issue
When I try to validate a bundle that deploys a model serving endpoint, the CLI fails with a runtime error.
### Steps to reproduce the behavior
Please list the steps required to repr…
-
**Describe the bug**
While following the tutorial '[Creating a custom serving runtime in KServe ModelMesh](https://developer.ibm.com/tutorials/awb-creating-custom-runtimes-in-modelmesh/)' from th…