uber / RemoteShuffleService

Remote shuffle service for Apache Spark to store shuffle data on remote servers.
Other
321 stars 100 forks source link

Does Rss support YARN executor preemption? #98

Closed YutingWang98 closed 1 year ago

YutingWang98 commented 1 year ago

We use Rss in our adhoc cluster with preemption enabled, which mean some tasks may fail due to resource being preempted while writing shuffle data.

I am wondering if the shuffle writer has written only with part of the shuffle data when it is preempted, and the server doesn't mark it as corrupted since the stream has been closed properly. Will this cause any shuffle data inconsistency in the partition file? Thanks.