Closed bjwswang closed 3 months ago
…cluster
Fixes #
Verified on:
Single node with 2 GPUs (with a resource limit of "nvidia.com/gpu = 2")
Two nodes, each with 1 GPU
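For reference, the single-node case corresponds to a container that is limited to two GPUs through the `nvidia.com/gpu` extended resource. Below is a minimal sketch of such a container fragment, built as a plain Python dict. The helper `gpu_container`, the container name, and the image are hypothetical placeholders and are not taken from this PR.

```python
import json


def gpu_container(name: str, image: str, gpus: int) -> dict:
    """Return a container spec fragment limited to `gpus` NVIDIA GPUs."""
    return {
        "name": name,
        "image": image,
        "resources": {
            # Extended resources such as nvidia.com/gpu are set under limits;
            # Kubernetes uses the limit as the request when no request is given.
            "limits": {"nvidia.com/gpu": gpus},
        },
    }


# Hypothetical name and image, for illustration only.
container = gpu_container("worker", "example.com/worker:latest", gpus=2)
print(json.dumps(container, indent=2))
```

For the two-node case, the same fragment with `gpus=1` would apply to each pod, assuming one pod is scheduled per node.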
Pod logs:
@nkwangleiGIT FYI