-
/kind feature
**Describe the solution you'd like**
[A clear and concise description of what you want to happen.]
Add a sample of MMS using XGBoost model server similar to [sklearn](https://github…
-
[X] I have checked the [documentation](https://docs.ragas.io/) and related resources and couldn't resolve my bug.
**Describe the bug**
I am unable to create test data set, using Ollama models , it…
-
/kind feature
**Describe the solution you'd like**
Crrently KServe only supports one StorageUri, which fit most of the cases. However, in some scenarios like serving fine-tuned models like LoRA,…
-
### 🚀 The feature, motivation and pitch
Thanks for fixing the soft-capping issue of the Gemma 2 models in the last release! I noticed there's still a [comment](https://github.com/vllm-project/vllm/bl…
-
Hey there,
First of all, I want to thank you for this amazing piece of art! Your work has been incredibly valuable.
I noticed that there is a gRPC plugin available in the source code, and I’m in…
-
### Describe the problem the feature is intended to solve
We would like to serve fully-convolutional segmentation models whose input and output tensor sizes are flexible, but not identical. In this …
-
Click to expand!
### Issue Type
Bug
### Source
source
### Tensorflow Version
2.8.2
### Custom Code
Yes
### OS Platform and Distribution
Colab
### Mobile device
_No response_
### Python…
-
### Issues Policy acknowledgement
- [X] I have read and agree to submit bug reports in accordance with the [issues policy](https://www.github.com/mlflow/mlflow/blob/master/ISSUE_POLICY.md)
### W…
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Is your feature request related to a problem? Please describe.
ONNX is a computation backend serving models.
…
-
Following the pattern from the simple RAG Example in the docs, I've created a DatabricksRM which works when calling like
`rm(query="Model serving API", query_type="text")`
But when trying to use d…