Closed rummens closed 5 years ago
Turns out we had an mistake in our automation script. The metadb was looking for mysql in the wrong namespace.
@rummens I'm also seeing this. Any tips on figuring out where the problem is?
Sure, for us it was a minor and stupid mistake. We forgot to change the namespace of the metdata db component. So when we ran our tests we changed the namespace to something else then the default and this caused the issue. If you closely look at the error it even says so:
Failed to create ML Metadata Store with config mysql:<host:"metadata-db.kubeflow"
Our correct configuration would be something like metadata-db.kubeflowCustomNamespace
If you want I can ask the developer who actually did it to comment?
All good @rummens ! Thanks for your help.
Yeah, turns out it was something dumb on our end too. We were using kustomize
to add the app: our-app
label to all of the specs. Removed that and now everything works as expected.
Glad to help ;-)
@sahilprasad Could you tell me the exact steps to fix this problem?
/kind bug
What steps did you take and what happened: Deployed Kubeflow 0.6 and the followings pods crash loop: metadata-deployment-6cf77db994-dtbhd, metadata-deployment-6cf77db994-kzv52, metadata-deployment-6cf77db994-mr2gh
The error message is the following:
What did you expect to happen: Deploy without crashing.
Anything else you would like to add: We had the idea that it might be the internal DNS that failed? Any idea how to narrow down the problem?
Environment:
kubectl version
):/etc/os-release
): Cent OS 7