-
**Describe the bug**
When deploying a HuggingFace model with model data on disk, sagemaker SDK still tries to access the AWS API to determine the Sagemaker default bucket. I don't currently have ac…
-
Hi,
I tried to execute this sample on a local k8s.
The KFP version on my cluster is more than 2.0, so I have updated the digit-recognizer code to be compatible with kfp-sdk-v2.
After applying th…
-
I encountered a "No module named xxx" error when loading parameter of my model is called when launching an inference job. Here is the error trace:
2019-07-10 02:21:07,256 rafiki.utils.service INFO …
-
Checklist:
* [x] I've searched in the docs and FAQ for my answer: https://bit.ly/argocd-faq.
* [x] I've included steps to reproduce the bug.
* [x] I've pasted the output of `argocd version`.
*…
-
## Bug Report
Does Tensorflow Serving support XLA compiled SavedModels ? or am I doing something wrong ?
### System information
- **OS Platform and Distribution (e.g., Linux Ubuntu 16.04)**: [D…
-
A presentation for the proposed B&A inference capabilities has been scheduled for 24th Apr '24.
Pre-read: B&A Inference overview ([explainer](https://github.com/privacysandbox/protected-auction-serv…
-
### 🚀 The feature
Torchserve automatically loads and unloads the model on the basis of the request. If I have registered 3 models in torchserve. If one of the models does not get any hit in like 1 da…
-
**Context**
I use Tabby VSCode extension with a local Tabby server.
Currently, when I start VSCode and the Tabby server is not running, it reminds me of that through the yellow indicated extension i…
-
**Description**
The Triton Inference server is deployed on the only CPU device.
There are about 32 models (onnxruntime).
The Triton Inference server outage during the long load testing. It stops …
-
Using model eos4zfy in [this run](https://github.com/ersilia-os/model-inference-pipeline/actions/runs/9202245571/job/25311754747#step:6:78), writing to DynamoDB fails on some workers.
Upon closer i…