Closed surenyufuz closed 9 months ago
So should I set different ports for each in the ray start params for ray worker nodes ?
I think so.
Out of curiosity, why do you decide to use the host network mode? If you must use this mode, I suggest running only one Ray Pod on a single Kubernetes node. There could be many issues if multiple Pods on a single Kubernetes node share the same network namespace.
So should I set different ports for each in the ray start params for ray worker nodes ?
I think so.
Out of curiosity, why do you decide to use the host network mode? If you must use this mode, I suggest running only one Ray Pod on a single Kubernetes node. There could be many issues if multiple Pods on a single Kubernetes node share the same network namespace.
Thanks for your reply.
Due to the container port scarcity in my Kubernetes network environment, I have to use the host network mode to deploy ray jobs as many as possible.
Running only one Ray Pod on a single Kubernetes node may be not suitable for a common Kubernetes clusters, because other workloads may exist, but thank you anyway.
By the way, the document described the dashboard-agent-listen-port
settings may be not correct, as the Ray start script shows:
dashboard-agent-listen-port
would be set a default value, rather than a random one,it bothers me a lot. If the document could be fixed, this issue will have nothing to do with bugs, thanks.
Would you mind opening a PR in the Ray repository to update the doc? You can also cc me there.
Would you mind opening a PR in the Ray repository to update the doc? You can also cc me there.
My pleasure, I have raised a related PR https://github.com/ray-project/ray/pull/42060 .
Search before asking
KubeRay Component
ray-operator
What happened + What you expected to happen
I have deployed a ray job with host network mode.
To avoid port conflicts between ray node pods on a kubernetes node, I should set different ports for them, such as
ray-client-server-port
for the head node.But for the ray worker nodes, I am not sure which ports should be set. As the document shows, the
dashboard-agent-listen-port
should be the random value by default.But when I deployed my ray job , I found that two ray worker nodes on a kubernetes node were using the same port
52365
fordashboard-agent-listen-port
:And this prevented me from viewing a node's logs from the dashboard, when I set different
dashboard-agent-listen-port
for ray worker nodes, this problem can be solved.So should I set different ports for each in the ray start params for ray worker nodes ? My guess is that
dashboard-agent-grpc-port
and other ports likenode-manager-port
will have similar issues.Reproduction script
Anything else
No response
Are you willing to submit a PR?