kubeflow / training-operator

Distributed ML Training and Fine-Tuning on Kubernetes
https://www.kubeflow.org/docs/components/training
Apache License 2.0
1.62k stars 700 forks source link

Upgrade Deepspeed demo dependencies #2294

Closed Syulin7 closed 1 month ago

Syulin7 commented 1 month ago

What this PR does / why we need it: Upgrade deepspeed demo dependencies and remove unused dependencies.

Which issue(s) this PR fixes (optional, in Fixes #<issue number>, #<issue number>, ... format, will close the issue(s) when PR gets merged): Fixes #2287 #2288

Checklist:

Syulin7 commented 1 month ago

cc @andreyvelich @tenzen-y

coveralls commented 1 month ago

Pull Request Test Coverage Report for Build 11426360935

Details


Totals Coverage Status
Change from base Build 11412887414: 0.0%
Covered Lines: 73
Relevant Lines: 73

💛 - Coveralls
google-oss-prow[bot] commented 1 month ago

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: tenzen-y

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files: - ~~[OWNERS](https://github.com/kubeflow/training-operator/blob/master/OWNERS)~~ [tenzen-y] Approvers can indicate their approval by writing `/approve` in a comment Approvers can cancel approval by writing `/approve cancel` in a comment