intelligent-machine-learning / dlrover

DLRover: An Automatic Distributed Deep Learning System

optimize check abnormal nodes #1265

Closed BalaBalaYi closed 2 weeks ago

BalaBalaYi commented 2 weeks ago

What changes were proposed in this pull request?

Skip the abnormal-node check if the rendezvous manager has not been initialized.

Why are the changes needed?

To avoid an error during master failover.
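The guard described above can be sketched roughly as follows. This is only an illustration of the "skip if not initialized" pattern, not the code in this PR: the class `RendezvousManagerSketch`, the `_initialized` flag, and the method names are all hypothetical and do not come from DLRover.

```python
from typing import Dict, List


class RendezvousManagerSketch:
    """Hypothetical stand-in for a rendezvous manager (not DLRover's class)."""

    def __init__(self) -> None:
        # Assumption: right after a master failover the manager object exists,
        # but its state has not been restored yet.
        self._initialized = False
        self._node_healthy: Dict[int, bool] = {}

    def initialize(self, node_healthy: Dict[int, bool]) -> None:
        # Record per-node health and mark the manager as ready.
        self._node_healthy = dict(node_healthy)
        self._initialized = True

    def check_abnormal_nodes(self) -> List[int]:
        # Guard clause: skip the check instead of failing on missing state.
        if not self._initialized:
            return []
        # Return the ranks of all nodes reported as unhealthy.
        return sorted(
            rank for rank, healthy in self._node_healthy.items() if not healthy
        )
```

Calling `check_abnormal_nodes()` on a fresh instance returns an empty list, so a caller that polls the check during failover sees "nothing to report" rather than an exception.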

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Unit tests (UT).

codecov[bot] commented 2 weeks ago

Codecov Report

Attention: Patch coverage is 33.33333% with 2 lines in your changes missing coverage. Please review.

Project coverage is 80.56%. Comparing base (f31137e) to head (a79569f). Report is 1 commit behind head on master.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| ...ver/python/master/elastic_training/rdzv_manager.py | 33.33% | 2 Missing :warning: |

Additional details and impacted files

```diff
@@            Coverage Diff             @@
##           master    #1265      +/-   ##
==========================================
- Coverage   80.57%   80.56%   -0.01%
==========================================
  Files         218      218
  Lines       20009    20012       +3
==========================================
+ Hits        16122    16123       +1
- Misses       3887     3889       +2
```
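As a rough cross-check, the patch coverage figure can be recomputed from the line and hit counts in the report. The sketch below assumes the patch only added lines and left the coverage of existing lines unchanged, so that the deltas in the diff correspond to the patch itself. The helper `coverage_percent` is a hypothetical name, not part of Codecov.

```python
def coverage_percent(hits: int, lines: int) -> float:
    """Return coverage as a percentage of covered lines."""
    return 100.0 * hits / lines


# Counts taken from the coverage diff in the report.
base_hits, base_lines = 16122, 20009
head_hits, head_lines = 16123, 20012

# Assumption: the deltas are attributable entirely to the patch.
patch_lines = head_lines - base_lines      # lines added by the patch
patch_hits = head_hits - base_hits         # added lines that are covered
patch_missing = patch_lines - patch_hits   # added lines without coverage

patch_coverage = coverage_percent(patch_hits, patch_lines)
base_coverage = coverage_percent(base_hits, base_lines)
head_coverage = coverage_percent(head_hits, head_lines)
```

Under that assumption, one of the three added lines is covered and two are not, which is consistent with the reported patch coverage and the "2 Missing" entry.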
