determined-ai / determined

Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
https://determined.ai
Apache License 2.0
2.93k stars 347 forks source link

chore: add Nvidia drivers version matching test and bump env [MD-413] #9567

Closed jgongd closed 3 weeks ago

jgongd commented 3 weeks ago

Ticket

Description

In order for fabric manager to work properly, it will need to match the version of the Nvidia driver. Added a test in e2e_tests/tests/environment/test_nvidia_driver.py to make sure. The rest of the changes are bump env.

The related environments repo PR: https://github.com/determined-ai/environments/pull/269

Test Plan

CI tests pass.

Checklist

netlify[bot] commented 3 weeks ago

Deploy Preview for determined-ui canceled.

Name Link
Latest commit bac45220d6771d58f27286c5a9f43a421ec8dd53
Latest deploy log https://app.netlify.com/sites/determined-ui/deploys/667c80257f109d0008321911
codecov[bot] commented 3 weeks ago

Codecov Report

All modified and coverable lines are covered by tests :white_check_mark:

Project coverage is 49.80%. Comparing base (da9025b) to head (bac4522). Report is 13 commits behind head on main.

Additional details and impacted files ```diff @@ Coverage Diff @@ ## main #9567 +/- ## ======================================= Coverage 49.79% 49.80% ======================================= Files 1247 1247 Lines 162235 162235 Branches 2888 2887 -1 ======================================= + Hits 80793 80794 +1 + Misses 81270 81269 -1 Partials 172 172 ``` | [Flag](https://app.codecov.io/gh/determined-ai/determined/pull/9567/flags?src=pr&el=flags&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=determined-ai) | Coverage Δ | | |---|---|---| | [backend](https://app.codecov.io/gh/determined-ai/determined/pull/9567/flags?src=pr&el=flag&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=determined-ai) | `43.87% <100.00%> (-0.01%)` | :arrow_down: | | [harness](https://app.codecov.io/gh/determined-ai/determined/pull/9567/flags?src=pr&el=flag&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=determined-ai) | `63.75% <100.00%> (+<0.01%)` | :arrow_up: | | [web](https://app.codecov.io/gh/determined-ai/determined/pull/9567/flags?src=pr&el=flag&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=determined-ai) | `46.16% <ø> (ø)` | | Flags with carried forward coverage won't be shown. [Click here](https://docs.codecov.io/docs/carryforward-flags?utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=determined-ai#carryforward-flags-in-the-pull-request-comment) to find out more. | [Files](https://app.codecov.io/gh/determined-ai/determined/pull/9567?dropdown=coverage&src=pr&el=tree&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=determined-ai) | Coverage Δ | | |---|---|---| | [harness/determined/deploy/gcp/constants.py](https://app.codecov.io/gh/determined-ai/determined/pull/9567?src=pr&el=tree&filepath=harness%2Fdetermined%2Fdeploy%2Fgcp%2Fconstants.py&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=determined-ai#diff-aGFybmVzcy9kZXRlcm1pbmVkL2RlcGxveS9nY3AvY29uc3RhbnRzLnB5) | `100.00% <100.00%> (ø)` | | | [master/internal/config/provconfig/aws\_config.go](https://app.codecov.io/gh/determined-ai/determined/pull/9567?src=pr&el=tree&filepath=master%2Finternal%2Fconfig%2Fprovconfig%2Faws_config.go&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=determined-ai#diff-bWFzdGVyL2ludGVybmFsL2NvbmZpZy9wcm92Y29uZmlnL2F3c19jb25maWcuZ28=) | `11.20% <ø> (ø)` | | | [master/internal/config/provconfig/gcp\_config.go](https://app.codecov.io/gh/determined-ai/determined/pull/9567?src=pr&el=tree&filepath=master%2Finternal%2Fconfig%2Fprovconfig%2Fgcp_config.go&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=determined-ai#diff-bWFzdGVyL2ludGVybmFsL2NvbmZpZy9wcm92Y29uZmlnL2djcF9jb25maWcuZ28=) | `26.76% <100.00%> (ø)` | | ... and [2 files with indirect coverage changes](https://app.codecov.io/gh/determined-ai/determined/pull/9567/indirect-changes?src=pr&el=tree-more&utm_medium=referral&utm_source=github&utm_content=comment&utm_campaign=pr+comments&utm_term=determined-ai)