Closed garymm closed 2 weeks ago
It seems that the certification was not set to webhook configurations appropriately.
Could you check the controller state with kubectl get pods -n kubeflow
?
Ah yeah the controller pod can't run because:
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Warning FailedMount 3m11s (x527 over 17h) kubelet MountVolume.SetUp failed for volume "cert" : secret "katib-webhook-cert" not found
So it seems a secret needs to be created. Is it possible for the katib-standalone kube configs can handle this? If not then I guess instructions need to be added as to how the user can do this on their own before applying the kube configs.
Hmm re-applied and it seems to work now. Not sure what happened the first time. I will close and re-open if I can reproduce.
What happened?
Installed as per the instructions from the docs:
Then used the katib python SDK as per the example in the docs. Creating an experiment fails with:
From some related thread on Slack I gather that the MutatingWebhookConfiguration having empty caBundle may be related:
Outputs:
What did you expect to happen?
I expect to be able to use the Python SDK after installing Katib standalone.
Environment
Kubernetes version:
Katib controller version: 0.17.0
Katib Python SDK version: 0.17.0
Impacted by this bug?
Give it a 👍 We prioritize the issues with most 👍