If RayJob custom resources cannot complete indefinitely due to the lack of gang scheduling, it appears that the clusterloader process does not time out even after 20 minutes.
Related issue number
2102
Checks
[ ] I've made sure the tests are passing.
Testing Strategy
[ ] Unit tests
[ ] Manual tests
[ ] This PR is not tested :(
I modified the script a bit to create 10 RayJob CRs on my local Kind cluster.
Why are these changes needed?
TODO:
clusterloader
process does not time out even after 20 minutes.Related issue number
2102
Checks
I modified the script a bit to create 10 RayJob CRs on my local Kind cluster.