ray-project / kuberay

A toolkit to run Ray applications on Kubernetes
Apache License 2.0

[Bug] KubeRay Deployment Failure with Large ServeZip File in Working_Dir #2072

Status: Closed (closed by USER-HFC 2 months ago)

USER-HFC commented 2 months ago

KubeRay Component

ray-operator

What happened + What you expected to happen

I am using KubeRay with the ray_ml:2.9.0 image. I built a Ray Serve application zip that is 92 MB in size. When I set it as the working_dir, the head node does not fully pull the zip file, and the deployment fails. When I switch to a smaller Serve zip, the problem does not occur.
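One way to narrow this down is to check whether the zip that ends up on the head node is a complete archive. The following is only a minimal sketch using the Python standard library. `check_zip` and the file path in the usage note are hypothetical names for illustration and are not part of Ray or KubeRay.

```python
import os
import zipfile


def check_zip(path):
    """Report the size of the file at `path` and whether it is a complete zip.

    Hypothetical helper for illustration; not a Ray or KubeRay API.
    """
    report = {
        "size_bytes": os.path.getsize(path),
        # A zip's central directory sits at the end of the file, so a
        # truncated download usually fails this check.
        "is_zip": zipfile.is_zipfile(path),
        "members": 0,
        "bad_member": None,
    }
    if report["is_zip"]:
        with zipfile.ZipFile(path) as zf:
            report["members"] = len(zf.namelist())
            # testzip() reads every member and returns the name of the
            # first one with a bad CRC, or None if all members are intact.
            report["bad_member"] = zf.testzip()
    report["ok"] = report["is_zip"] and report["bad_member"] is None
    return report
```

Running `check_zip("serve_app.zip")` (a placeholder path) against both the original file and the copy on the head node, then comparing `size_bytes` and `ok`, would show whether the pulled file was truncated.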

Reproduction script

pass

kevin85421 commented 2 months ago

This seems to be a Ray question. Would you mind opening an issue in the Ray repository and cc me in the issue? Thanks!

kevin85421 commented 2 months ago

Closing this one; it is now tracked in the Ray repository at https://github.com/ray-project/ray/issues/44614.