Open aadadeyST opened 3 months ago
Thanks for the feedback! We are routing this to the appropriate team for follow-up. cc @Azure/azure-ml-sdk @azureml-github.
Thanks for the detailed issue, @azureml-github will take a look and get back to you as soon as possible.
Describe the bug
When I create an endpoint using the Python SDK (`ml_client.online_endpoints.begin_create_or_update(endpoint).wait(120)`), I receive the following error (details have been scrubbed):

I had previously set up the endpoint `foo-bar` on a Kubernetes cluster (we'll call it `aks-one`). I then detached `aks-one` and created a new cluster (`aks-two`). I tried to recreate the endpoint in the `aks-two` cluster, but I received an error that it couldn't find `aks-one`, which had been detached. So I deleted the `foo-bar` endpoint (using the ML Studio UI), but when I ran the Python code to create the endpoint, it gave me the above error. I've checked the Kubernetes service and deleted all of the workloads, services, and configurations related to the previous `foo-bar` deployment, but that didn't change anything. I also recreated `aks-one` and tried to create the endpoint there, but still received the same error message.

Running `az ml online-endpoint list` against the workspace/resource group returns an empty list.

To Reproduce
Steps to reproduce the behavior:
Expected behavior
I should be able to create the endpoint with the same name as the previously deleted one. When I create an endpoint, delete it, and recreate it, all on the same Kubernetes cluster, I have no issues. This only happens when two clusters are involved.
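
For anyone trying to check the same state from the SDK, here is a minimal sketch, not a confirmed diagnosis. It assumes `ml_client` is an authenticated `azure.ai.ml.MLClient`; the helper names `endpoint_name_state` and `create_and_wait` are hypothetical and only for illustration. The first helper compares what `online_endpoints.list()` returns with whether `online_endpoints.get(name)` still resolves the name. The second checks `done()` after `wait(...)`, since, as far as I understand the poller, `wait` with a timeout returns without raising even if the operation has not finished.

```python
from typing import Any, Dict


def endpoint_name_state(ml_client: Any, name: str) -> Dict[str, Any]:
    """Compare list() and get() for one endpoint name.

    `ml_client` is assumed to behave like azure.ai.ml.MLClient.
    """
    # Does the name show up when listing endpoints in the workspace?
    listed = any(ep.name == name for ep in ml_client.online_endpoints.list())

    # Does a direct lookup still resolve? With the real SDK a free name is
    # expected to raise azure.core.exceptions.ResourceNotFoundError; the
    # exception type is recorded so other failures are visible too.
    get_error = None
    try:
        ml_client.online_endpoints.get(name)
    except Exception as exc:
        get_error = type(exc).__name__

    return {"listed": listed, "resolvable": get_error is None, "get_error": get_error}


def create_and_wait(ml_client: Any, endpoint: Any, timeout_s: float = 120) -> Any:
    """Start the create/update operation and fail loudly if it times out."""
    poller = ml_client.online_endpoints.begin_create_or_update(endpoint)
    poller.wait(timeout_s)
    if not poller.done():
        raise TimeoutError(f"endpoint operation still running after {timeout_s}s")
    # result() returns the created endpoint or raises the service error.
    return poller.result()


# Hypothetical usage, with ml_client and endpoint built as in the report:
# print(endpoint_name_state(ml_client, "foo-bar"))
# created = create_and_wait(ml_client, endpoint)
```

If `listed` is false while `resolvable` is true, the name would still be known to the service even though the workspace listing is empty, which would be useful detail to attach here.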
Screenshots
There are currently no endpoints in my workspace:
![image](https://github.com/Azure/azure-sdk-for-python/assets/161866816/cc65bd63-7274-4bfd-a418-008da4125e61)