apache / celeborn

Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
https://celeborn.apache.org/
Apache License 2.0
896 stars 361 forks source link

[CELEBORN-1716] Sleeping in CelebornShuffleReader while loop should have an upper bound time #2915

Closed zaynt4606 closed 3 days ago

zaynt4606 commented 2 weeks ago

What changes were proposed in this pull request?

There should be a failure ceiling in development scenarios.

Why are the changes needed?

Usually it won't get stuck in this while loop. I wrote some bugs and it stuck there... 🥹🥹🥹

Does this PR introduce any user-facing change?

How was this patch tested?

image

turboFei commented 2 weeks ago

Usually it won't get stuck in this while loop. I wrote some bugs and it stuck there.

Do you meet this issue in production use case?

Is the bug for testing mode?

zaynt4606 commented 2 weeks ago

Usually it won't get stuck in this while loop. I wrote some bugs and it stuck there.

Do you meet this issue in production use case?

Is the bug for testing mode?

For testing.

It wont happen in production.