Azure / AML-Kubernetes

AzureML customer managed k8s compute samples

AzureML Extension installation fails when setting inference=true #293

Open mialtamo opened 2 months ago

mialtamo commented 2 months ago

All,

Getting an error while attempting to install the AzureML Extension with inference enabled.

Receiving this error: Helm Upgrade Failed : : InnerError [release azureml failed, and has been rolled back due to atomic being set: pre-upgrade hooks failed: pod healthcheck failed].

Further logs show this error:

Status: Failed Name: inferencev1 ErrorCode: E45005 ErrorMessage: Azureml Fe deployment confict
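
For anyone triaging several of these, here is a minimal Python sketch that splits the status line quoted above into its fields. It assumes the line always uses the four labels seen here (Status, Name, ErrorCode, ErrorMessage); the function name `parse_status_line` is hypothetical and not part of any AzureML tooling.

```python
import re

# Field labels exactly as they appear in the status line from the logs.
# Assumption: other status lines use the same four labels.
FIELDS = ("Status", "Name", "ErrorCode", "ErrorMessage")


def parse_status_line(line: str) -> dict:
    """Split 'Status: ... Name: ... ErrorCode: ... ErrorMessage: ...' into a dict."""
    pattern = r"(%s):\s*" % "|".join(FIELDS)
    # With a capture group, re.split returns [prefix, key1, value1, key2, value2, ...].
    parts = re.split(pattern, line)
    return {key: value.strip() for key, value in zip(parts[1::2], parts[2::2])}


line = (
    "Status: Failed Name: inferencev1 ErrorCode: E45005 "
    "ErrorMessage: Azureml Fe deployment confict"
)
info = parse_status_line(line)
print(info["ErrorCode"], "-", info["ErrorMessage"])
```

Having the error code as its own field makes it easier to look up in the extension's troubleshooting documentation.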

This error seems to say that inference is already in use elsewhere, and the documentation says to contact support for help migrating.

Any guidance would be helpful here!

Everything shows as running except the inference-operator-controller-manager, which is not present at all.
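
A rough way to confirm which component is absent is to compare the pod names in the extension's namespace against the components you expect. The sketch below is an illustration only: `missing_components` is a hypothetical helper, the sample pod names other than inference-operator-controller-manager are made up, and it assumes the names were collected beforehand (for example with `kubectl get pods -n <namespace> -o name`).

```python
def missing_components(pod_names, expected_prefixes):
    """Return the expected component prefixes that no pod name starts with."""
    # `kubectl ... -o name` prints entries like "pod/<name>", so drop that prefix.
    names = [n.split("/", 1)[-1] for n in pod_names]
    return [
        prefix
        for prefix in expected_prefixes
        if not any(name.startswith(prefix) for name in names)
    ]


# Hypothetical pod listing; only the missing component's name comes from this issue.
pods = ["pod/example-gateway-7c9f", "pod/example-metrics-5d4b"]
expected = ["example-gateway", "inference-operator-controller-manager"]

missing = missing_components(pods, expected)
print(missing)
```

In the situation described here, the inference-operator-controller-manager would be the only entry reported as missing.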