IBM / KubeflowDojo

Repository to hold code, instructions, demos and pointers to presentation assets for Kubeflow Dojo
Apache License 2.0
133 stars 63 forks source link

Katib Missing? #80

Open stobias123 opened 3 years ago

stobias123 commented 3 years ago

I deployed on OKD (open source Openshift) with tekton backend.

This deployment manifest seems to be the most complete (includes dashboard, notebook server, etc.)

However, when I try to run the MNIST E2E example, I get errors. I think Katib is missing from the deployment...

In component_id = kubeflow-launch-experiment I get the below error.

INFO:root:Generating experiment template.
INFO:root:Creating kubeflow.org/experiments mnist-demo in namespace wow-ai.
ERROR:root:Exception when create kubeflow.org/experiments: 404 page not found
Traceback (most recent call last):
  File "/ml/launch_experiment.py", line 137, in <module>
    main()
  File "/ml/launch_experiment.py", line 120, in main
    create_response = experiment.create(inst)
  File "/ml/launch_crd.py", line 98, in create
    self._log_and_raise_exception(e, "create")
  File "/ml/launch_crd.py", line 128, in _log_and_raise_exception
    body = json.loads(ex.body)
  File "/usr/lib/python2.7/json/__init__.py", line 339, in loads
    return _default_decoder.decode(s)
  File "/usr/lib/python2.7/json/decoder.py", line 367, in decode
    raise ValueError(errmsg("Extra data", s, end, len(s)))
ValueError: Extra data: line 1 column 5 - line 2 column 1 (char 4 - 19)

k get po -n kubeflow | grep katib returns nothing.

stobias  kubeflow   master ●  k get virtualservice -o yaml | grep prefix:
          prefix: /
          prefix: /istio/grafana/
            x-forwarded-prefix: /jupyter
          prefix: /jupyter/
            x-forwarded-prefix: /kfam
          prefix: /kfam/
          prefix: /ml_metadata
          prefix: /metadata
          prefix: /pipeline

Shows no endpoint for /experiments

I've tried looking through kubeflow docs for how to include katib to my kfdef but it's not clear.