However, when I try to run the MNIST E2E example, I get errors. I think Katib is missing from the deployment...
In the step with component_id = kubeflow-launch-experiment, I get the error below:
INFO:root:Generating experiment template.
INFO:root:Creating kubeflow.org/experiments mnist-demo in namespace wow-ai.
ERROR:root:Exception when create kubeflow.org/experiments: 404 page not found
Traceback (most recent call last):
  File "/ml/launch_experiment.py", line 137, in <module>
    main()
  File "/ml/launch_experiment.py", line 120, in main
    create_response = experiment.create(inst)
  File "/ml/launch_crd.py", line 98, in create
    self._log_and_raise_exception(e, "create")
  File "/ml/launch_crd.py", line 128, in _log_and_raise_exception
    body = json.loads(ex.body)
  File "/usr/lib/python2.7/json/__init__.py", line 339, in loads
    return _default_decoder.decode(s)
  File "/usr/lib/python2.7/json/decoder.py", line 367, in decode
    raise ValueError(errmsg("Extra data", s, end, len(s)))
ValueError: Extra data: line 1 column 5 - line 2 column 1 (char 4 - 19)
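The ValueError at the bottom of the traceback looks secondary to me: the server answered with the plain-text body "404 page not found", and launch_crd.py then tried to parse that body as JSON. A minimal sketch that reproduces this on Python 3 (the helper name parse_error_body is hypothetical and not part of launch_crd.py):

```python
import json


def parse_error_body(body):
    """Return the decoded JSON error body, or the raw text if it is not JSON.

    Hypothetical helper for illustration; not part of launch_crd.py.
    """
    try:
        return json.loads(body)
    except ValueError:
        # json.JSONDecodeError subclasses ValueError on Python 3.
        # The parser reads "404" as a number, then rejects the rest as extra data.
        return {"raw": body.strip()}


if __name__ == "__main__":
    # Plain-text body, like the one in the log above.
    print(parse_error_body("404 page not found\n"))
    # A JSON body decodes normally.
    print(parse_error_body('{"code": 404, "reason": "NotFound"}'))
```

If that reading is right, the real failure is the 404 itself, which would suggest that the kubeflow.org/experiments resource is not served by the cluster.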
Running k get po -n kubeflow | grep katib returns nothing.
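A complementary check to the pod listing is whether the Experiment CRD is registered at all. A rough sketch, assuming kubectl is on the PATH and that Katib registers its CRD as experiments.kubeflow.org (the function names are made up for illustration):

```python
import subprocess

# Assumed CRD name: Katib's Experiment kind in the kubeflow.org API group.
EXPERIMENT_CRD = "experiments.kubeflow.org"


def has_experiment_crd(crd_lines):
    """Return True if any `kubectl get crd -o name` line names the Experiment CRD."""
    # Lines look like "customresourcedefinition.apiextensions.k8s.io/<crd-name>".
    return any(line.strip().split("/")[-1] == EXPERIMENT_CRD for line in crd_lines)


def list_crds():
    """List CRD names in the current cluster context via kubectl."""
    result = subprocess.run(
        ["kubectl", "get", "crd", "-o", "name"],
        check=True, capture_output=True, text=True,
    )
    return result.stdout.splitlines()


if __name__ == "__main__":
    print("Experiment CRD present:", has_experiment_crd(list_crds()))
```

A False result here would be consistent with the 404 on kubeflow.org/experiments in the log.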
I deployed on OKD (open source OpenShift) with the Tekton backend.
This deployment manifest seems to be the most complete (includes dashboard, notebook server, etc.)
There is also no endpoint for /experiments.
I've tried looking through the Kubeflow docs for how to include Katib in my KfDef, but it's not clear.