volcano-sh / volcano

A Cloud Native Batch System (Project under CNCF)
https://volcano.sh
Apache License 2.0

Proposal of dynamic GPU slice plugin #3820

Open sailorvii opened 1 week ago

sailorvii commented 1 week ago

NVIDIA's official GPU sharing modes include time-slicing, MPS, and MIG. Dynamic MPS and dynamic MIG are currently not supported; we want to add them as a Volcano scheduler plugin.

volcano-sh-bot commented 1 week ago

Welcome @sailorvii!

It looks like this is your first PR to volcano-sh/volcano.

Thank you, and welcome to Volcano. :smiley:

volcano-sh-bot commented 1 week ago

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
To complete the pull request process, please assign lowang-bh.
You can assign the PR to them by writing `/assign @lowang-bh` in a comment when ready.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

- **[OWNERS](https://github.com/volcano-sh/volcano/blob/master/OWNERS)**

Approvers can indicate their approval by writing `/approve` in a comment.
Approvers can cancel approval by writing `/approve cancel` in a comment.

Monokaix commented 1 week ago

Hi, please squash to one commit and sign off.

archlitchi commented 4 days ago

A nice feature, but I have a few recommendations:

  1. Please add a user guide for using dynamic MIG and MPS.
  2. Please clarify whether the 'dynamicgpuslice' annotation is a pod annotation or a node annotation.

sailorvii commented 1 day ago

@archlitchi

Thanks for your time and review.

  1. Add the usage part.
  2. They're all node annotations (the title says “Node labels and annotations”).