apache / celeborn

Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
https://celeborn.apache.org/
Apache License 2.0
886 stars 359 forks source link

[CELEBORN-1673] Support retry create client #2855

Closed RexXiong closed 6 days ago

RexXiong commented 1 week ago

What changes were proposed in this pull request?

As title

Why are the changes needed?

Currently, only Flink retries establishing a client when a connection problem occurs. This would be beneficial for all other engines to implement as well.

Does this PR introduce any user-facing change?

No

How was this patch tested?

UT

SteNicholas commented 6 days ago

Merged to main(v0.6.0) and branch-0.5(v0.5.3).