intelligent-machine-learning / dlrover

DLRover: An Automatic Distributed Deep Learning System

More logging on device using. #1309

Status: Closed (BalaBalaYi closed 3 weeks ago)

BalaBalaYi commented 3 weeks ago

What changes were proposed in this pull request?

Please describe the changes you have made or proposed in this pull request.

Why are the changes needed?

Explain the purpose or motivation behind these changes. What problem are you trying to solve?

Does this PR introduce any user-facing change?

Specify whether this pull request introduces any changes that users will directly interact with or notice.

How was this patch tested?

Detail the testing process you have undertaken to ensure the changes in this pull request are valid and working as intended.

codecov[bot] commented 3 weeks ago

Codecov Report

Attention: Patch coverage is 85.71429% with 1 line in your changes missing coverage. Please review.

Project coverage is 80.51%. Comparing base (0a77136) to head (cc608a0). Report is 2 commits behind head on master.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| dlrover/trainer/torch/node_check/nvidia_gpu.py | 80.00% | 1 Missing :warning: |
Additional details and impacted files

```diff
@@            Coverage Diff             @@
##           master    #1309      +/-   ##
==========================================
- Coverage   80.51%   80.51%   -0.01%
==========================================
  Files         222      222
  Lines       20698    20704       +6
==========================================
+ Hits        16666    16670       +4
- Misses       4032     4034       +2
```
