mars-project / mars

Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
https://mars-project.readthedocs.io
Apache License 2.0
2.69k stars 325 forks source link

Ray-based Shuffle For Mars #2916

Open chaokunyang opened 2 years ago

chaokunyang commented 2 years ago

This issue is used for tracking Ray-based Shuffle For Mars progress:

wjsi commented 2 years ago

I've implemented a half-done branch https://github.com/wjsi/mars/tree/feature/shuffle_svc for shuffle. The reason why it is not merged is that currently storage service lacks the capability of remote readers and writers. Now working on it.

chaokunyang commented 2 years ago

https://github.com/wjsi/mars/tree/feature/shuffle_svc

Is there a design doc for this?

wjsi commented 2 years ago

https://github.com/wjsi/mars/tree/feature/shuffle_svc

Is there a design doc for this?

Not yet. I'll write it soon.