ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
https://ray.io
Apache License 2.0

[Object Spilling] Use subdirectories to avoid large top level inodes for file spilling #14166

Open rkooo567 opened 3 years ago

rkooo567 commented 3 years ago

What is the problem?

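For cases like a 100 TB shuffle, writing every spilled file into one top-level directory could be problematic, since that directory's inode grows very large. Spark already uses subdirectories for its spill files.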
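A minimal sketch of the idea, not Ray's actual spilling code: hash each spill file name into one of a fixed number of subdirectories so no single directory holds every file. The names `spill_path`, `base_dir`, and `num_subdirs`, and the default of 64 buckets, are hypothetical and chosen only for illustration.

```python
import hashlib
import os
import tempfile


def spill_path(base_dir: str, file_name: str, num_subdirs: int = 64) -> str:
    """Return a path for file_name inside one of num_subdirs hashed subdirectories.

    Hypothetical helper for illustration; not part of Ray's API.
    """
    # Use a stable hash (not Python's per-process randomized hash()) so the
    # same file name maps to the same subdirectory across processes.
    digest = hashlib.sha1(file_name.encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:4], "big") % num_subdirs
    subdir = os.path.join(base_dir, f"{bucket:02x}")
    # Create the subdirectory lazily, the first time a file lands in it.
    os.makedirs(subdir, exist_ok=True)
    return os.path.join(subdir, file_name)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as base:
        for i in range(1000):
            with open(spill_path(base, f"object-{i}.bin"), "wb") as f:
                f.write(b"x")
        # The top-level directory now holds at most num_subdirs entries
        # instead of one entry per spilled file.
        print(len(os.listdir(base)))
```

Because the subdirectory is derived from the file name alone, a reader can recompute the path when restoring an object without keeping a separate index.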

cc @clarkzinzow

wuisawesome commented 3 years ago

Btw, we could also consider just using a different file system for this (IIRC Facebook's Haystack used XFS to solve this before?)