project-codeflare / codeflare

Simplifying the definition and execution, scaling and deployment of pipelines on the cloud.
https://codeflare.dev
Apache License 2.0
222 stars 36 forks source link

Ray cluster on OpenShift fails due to missing file #36

Open cmisale opened 3 years ago

cmisale commented 3 years ago

Describe the bug Cannot bring up Ray cluster as defined in the OCP tutorial

To Reproduce Steps to reproduce the behavior:

  1. Go to https://codeflare.readthedocs.io/en/latest/getting_started/starting.html#Openshift-Ray-Cluster-Operator
  2. Run pip3 install --upgrade codeflare
  3. Create namespace oc create namespace codeflare
  4. Run ray up ray/python/ray/autoscaler/kubernetes/example-full.yaml fails:
    $ ray up ray/python/ray/autoscaler/kubernetes/example-full.yaml
    Provided cluster configuration file (ray/python/ray/autoscaler/kubernetes/example-full.yaml) does not exist

Expected behavior Bring up Ray cluster on OCP

Desktop (please complete the following information):

Additional context OCP Cluster running on IBM Cloud.

$ oc cluster-info
Kubernetes master is running at https://c100-e.jp-tok.containers.cloud.ibm.com:31129

To further debug and diagnose cluster problems, use 'kubectl cluster-info dump'.

CodeFlare commit hash commit a2b290a115b0cc1317270cef6059d5281215842e