Open Joshuaalbert opened 1 week ago
hi , @Joshuaalbert
Okay, I solved but not by changing the Ray side, but the docker side. Which makes me suspect that Ray has some strange potential unwanted behaviour with docker. I'll explain.
I noticed the node that showed 50B in the above screenshot was the same size as the disk that stores docker images. Now, this is really weird because nowhere am I mounting that partition as a volume in the container. So I tried moving the docker data dir to a different disk, and low-and-behold the storage shown in the dashboard changed to reflect that.
Another useful info: the head and worker docker storage drivers are different between the two nodes. On the head node it is using overlay2 (which is kernel space), and on the worker fuse-overlay (which is user space). When I updated the storage driver on the head node to fuse-overlay it started showing the correct storage size.
What happened + What you expected to happen
When setting
temp-dir
to a value on head it doesn't seem to be reflected in dashboard.on worker it's started like this:
The dashboard shows this
The first line is the head node, the second is a worker node. Both are run in container in working directories volumes mounted to 10TB size disks. Why is the head showing only 50GB. That's the size of
/
on the host, which it shouldn't have access to.Versions / Dependencies
ray 2.37 also same on 2.39
Reproduction script
look at dashboard
Issue Severity
High: It blocks me from completing my task.