-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Deploy type
Downstream version (eg. `OpenShift AI 2.4`)
### Version
2.5 RC1
### Current Behavior
When tryi…
lugi0 updated
10 months ago
-
On the [recent AutoML and Training WG call](https://docs.google.com/document/d/1MChKfzrKAeFRtYqypFbMXL6ZIc_OgijjkvbqmwRV-64/edit#heading=h.1tzqrxjljjm3) we discussed how we can improve the documentation…
-
We need to support TensorRT 8 models, which run in Triton Inference Server 21.11-py3 (or above).
I tried to update the Triton Inference Server image to nvcr.io/nvidia/tritonserver:21.09-py3 and got the b…
-
/kind feature
**Describe the solution you'd like**
In order to support Modelcars as described in this [design document](https://docs.google.com/document/d/1Bs4fnP8rhPMaoPoLSYVvuRq-z9vkGPQ0rKbmfH…
rhuss updated
6 months ago
-
ODH-model-controller already monitors `InferenceService` and `ServingRuntime` resources to create routes.
The goal of this spike is to:
- extend the controller to also reconcile `InferenceService`/`…
-
**What happened**:
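I am currently evaluating technology options for a similar use case. Suppose I need to build a service, deployed on edge nodes, that provides image inference.
If my end devices use [openyurt](https://github.com/openyurtio/openyurt), how should they choose the edge node nearest to them?
Is this done through configuration, or do I need to determine on every request which live edge node is nearest and then submit the image request to it?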
…
-
**Describe the bug**
I would like to enable gRPC inference with ModelMesh. When I follow the instructions [[1](https://github.com/kserve/modelmesh-serving/tree/main/docs/configuration#exposing-an-e…
-
https://github.com/kserve/kserve/blame/8d5f574f49f91ae700b282a23716f6445aa44f7a/docs/samples/v1beta1/sklearn/v1/sklearn.yaml#L8
Good afternoon,
I've attempted to run my first inference service f…
-
As part of enabling E2E tests in openshift-ci, two Dockerfiles were changed:
* The [controller Dockerfile](https://github.com/opendatahub-io/kserve/blob/master/Dockerfile): https://github.com/opend…