intelligent-machine-learning / dlrover

DLRover: An Automatic Distributed Deep Learning System

Optimize logging #1276

Closed. BalaBalaYi closed this pull request 1 week ago.

BalaBalaYi commented 1 week ago

What changes were proposed in this pull request?

  1. Log information about the running and pending nodes when a pending timeout occurs.
  2. Use the maximum node count (max nodes) to calculate the lacking ranks.
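The two changes above can be illustrated with a minimal sketch. It is not the code from this PR: the names `lacking_ranks` and `format_pending_timeout`, the node names, and the message layout are all hypothetical. It only assumes that ranks are numbered from 0 up to max nodes minus 1 and that a rank is "lacking" when no running node holds it.

```python
import logging
from typing import Dict, Iterable, List

logger = logging.getLogger(__name__)


def lacking_ranks(max_nodes: int, running_ranks: Iterable[int]) -> List[int]:
    """Return the ranks in [0, max_nodes) that no running node occupies.

    Hypothetical helper: the range is derived from the maximum node
    count rather than from the number of nodes currently alive.
    """
    running = set(running_ranks)
    return [rank for rank in range(max_nodes) if rank not in running]


def format_pending_timeout(
    max_nodes: int,
    running: Dict[int, str],
    pending: List[str],
    timeout_secs: int,
) -> str:
    """Build a log message listing running nodes, pending nodes, and lacking ranks.

    `running` maps rank -> node name; `pending` lists node names that
    have not been scheduled yet. Both shapes are assumptions.
    """
    missing = lacking_ranks(max_nodes, running.keys())
    return (
        f"Pending timeout after {timeout_secs}s: "
        f"running nodes: {sorted(running.values())}, "
        f"pending nodes: {sorted(pending)}, "
        f"lacking ranks: {missing} (max nodes: {max_nodes})"
    )


if __name__ == "__main__":
    logging.basicConfig(level=logging.WARNING)
    logger.warning(
        format_pending_timeout(4, {0: "worker-0", 2: "worker-2"}, ["worker-1"], 600)
    )
```

Deriving the range from max nodes means a rank that never started still shows up as lacking, which a count based only on currently known nodes could miss.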

Why are the changes needed?

To improve logging.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Unit tests (UT).

codecov[bot] commented 1 week ago

Codecov Report

All modified and coverable lines are covered by tests :white_check_mark:

Project coverage is 80.59%. Comparing base (4df3679) to head (4e70c10). Report is 18 commits behind head on master.

Additional details and impacted files

```diff
@@            Coverage Diff             @@
##           master    #1276      +/-   ##
==========================================
+ Coverage   80.58%   80.59%   +0.01%
==========================================
  Files         219      219
  Lines       20042    20058      +16
==========================================
+ Hits        16150    16165      +15
- Misses       3892     3893       +1
```
