red-hat-data-services / distributed-workloads

Artifacts for installing the Distributed Workloads stack as part of ODH
Apache License 2.0

Fix the failing KubeRay tests running with downstream RHODS operator #5

Closed by winklerm 1 year ago

winklerm commented 1 year ago

Investigate and fix the following test failure, which occurs when creating a RayJob with the downstream RHODS operator. It might be related to [1]:

=== NAME  TestRayJobSubmissionRest
    ray_test.go:185: 
        Unexpected error:
            <*errors.errorString | 0xc0004bbe70>: 
            incorrect response code: 503 for creating Ray Job, response body: <html>
...
              <body>
                <div>
                  <h1>Application is not available</h1>
                  <p>The application is currently not serving requests at this endpoint. It may not have been started or is still starting.</p>

                  <div class="alert alert-info">
                    <p class="info">
                      Possible reasons you are seeing this page:
                    </p>
                    <ul>
                      <li>
                        <strong>The host doesn't exist.</strong>
                        Make sure the hostname was typed correctly and that a route matching this hostname exists.
                      </li>
                      <li>
                        <strong>The host exists, but doesn't have a matching path.</strong>
                        Check if the URL path was typed correctly and that the route was created using the desired path.
                      </li>
                      <li>
                        <strong>Route and path matches, but all pods are down.</strong>
                        Make sure that the resources exposed by this route (pods, services, deployment configs, etc) have at least one pod running.
                      </li>
...
            }
        occurred

[1] https://issues.redhat.com/browse/RHODS-11106
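
The 503 page in the log above is what the OpenShift router returns when a route has no backend serving requests, so one possibility worth ruling out is that the test submits the job before anything behind the route is ready. The root cause has not been confirmed here, so the following is only a minimal sketch of retrying the submission while the route returns 503. The helper names `isTransient` and `submitWithRetry`, the `/api/jobs/` path, the attempt count, and the delay are illustrative assumptions and are not taken from `ray_test.go`. A local `httptest` server stands in for the route.

```go
package main

import (
	"bytes"
	"fmt"
	"io"
	"net/http"
	"net/http/httptest"
	"sync/atomic"
	"time"
)

// isTransient reports whether a status code suggests the backend behind the
// route is not serving yet. Assumption: a 503 here means "not ready yet".
func isTransient(status int) bool {
	return status == http.StatusServiceUnavailable
}

// submitWithRetry (hypothetical helper) POSTs body to url and retries while the
// response is transient. It returns the last status code and the attempts made.
func submitWithRetry(client *http.Client, url string, body []byte, attempts int, delay time.Duration) (int, int, error) {
	status := 0
	for i := 1; i <= attempts; i++ {
		resp, err := client.Post(url, "application/json", bytes.NewReader(body))
		if err != nil {
			return 0, i, err
		}
		// Drain and close the body so the connection can be reused.
		io.Copy(io.Discard, resp.Body)
		resp.Body.Close()

		status = resp.StatusCode
		if !isTransient(status) {
			return status, i, nil
		}
		if i < attempts {
			time.Sleep(delay)
		}
	}
	return status, attempts, fmt.Errorf("still receiving %d after %d attempts", status, attempts)
}

func main() {
	// Stand-in for the route: answers 503 twice, then 200.
	var calls int32
	srv := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		if atomic.AddInt32(&calls, 1) < 3 {
			w.WriteHeader(http.StatusServiceUnavailable)
			return
		}
		w.WriteHeader(http.StatusOK)
	}))
	defer srv.Close()

	// The path is a placeholder for the job submission endpoint.
	status, attempts, err := submitWithRetry(srv.Client(), srv.URL+"/api/jobs/", []byte(`{}`), 5, 10*time.Millisecond)
	fmt.Println(status, attempts, err) // prints "200 3 <nil>"
}
```

If the 503 persists after a generous number of retries, the cause is more likely a route or service that does not match the pods than a startup race, which is the second and third case listed in the router's error page.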