-
### Terraform Core Version
1.6.5
### AWS Provider Version
5.31.0
### Affected Resource(s)
Sagemaker Engpoint config.
### Expected Behavior
When creating a jumpstart endpoint through the SageMak…
-
triton nvcr.io/nvidia/tritonserver 23.12-py3
I have 4 RTX4090 Nvidia graphics cards, and my model is an ensemble model, which can be understood as preprocessing+inference. In config.pbtxt, the gpus…
-
**What would you like to be added/modified**:
Sedna is an edge-cloud synergy AI project incubated in KubeEdge SIG AI. Benefiting from the edge-cloud synergy capabilities provided by KubeEdge, Sed…
-
### System Info
Attempting to reuse an existing OpenAI client to stream responses from HF endpoint doesn't work due to a couple of differences. In my case the differences break the .NET client in Azu…
-
**Describe the bug**
When deploying a HuggingFace model with model data on disk, sagemaker SDK still tries to access the AWS API to determine the Sagemaker default bucket. I don't currently have ac…
-
Hi,
I tried to execute this sample on a local k8s.
The KFP version on my cluster is more than 2.0, so I have updated the digit-recognizer code to be compatible with kfp-sdk-v2.
After applying th…
-
Add support to [Wav2vec2](https://huggingface.co/docs/transformers/model_doc/wav2vec2) / Connectionist Temporal Classification (CTC) phoneme models (`Wav2Vec2ForCTC` HuggingFace CTC model class)
**…
-
Thanks for your work and the repo!
As I understand, the inference for multimodal llm (eg llava, qwen-vl) can only be run in batch via the provided scripts here: https://github.com/modelscope/swift…
-
A presentation for the proposed B&A inference capabilities has been scheduled for 24th Apr '24.
Pre-read: B&A Inference overview ([explainer](https://github.com/privacysandbox/protected-auction-serv…
-
The kubeflow models web app (v0.6.0) does not display models although it gets a 200 response for `/models/api/namespaces/namespace/inferenceservices`
```
{
"inferenceServices":[
{
…