kubeflow / arena

A CLI for Kubeflow.
Apache License 2.0
730 stars 177 forks source link

Support setting shared memory for training job. #1104

Closed Syulin7 closed 2 months ago

Syulin7 commented 2 months ago

What this PR does / why we need it: Support setting shared memory for training job(tfjob, pytorchjob)

Which issue(s) this PR fixes (optional, in Fixes #<issue number>, #<issue number>, ... format, will close the issue(s) when PR gets merged): Fixes #1044

ChenYi015 commented 2 months ago

/lgtm /approve

google-oss-prow[bot] commented 2 months ago

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ChenYi015

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files: - ~~[OWNERS](https://github.com/kubeflow/arena/blob/master/OWNERS)~~ [ChenYi015] Approvers can indicate their approval by writing `/approve` in a comment Approvers can cancel approval by writing `/approve cancel` in a comment