AliyunContainerService / gpushare-scheduler-extender

GPU Sharing Scheduler for Kubernetes Cluster
Apache License 2.0

What does `Pending(Allocated)` mean when using `kubectl inspect gpushare`? #151

Open cailun01 opened 3 years ago

cailun01 commented 3 years ago

Hello! Following the Installation Guide, I installed gpushare-scheduler-extender successfully. When I run `kubectl inspect gpushare`, I get this:

NAME   IPADDRESS    GPU0(Allocated/Total)  PENDING(Allocated)  GPU Memory(GiB)
abcd9  146.12.9.23  0/10                   3                   3/10
--------------------------------------------------
Allocated/Total GPU Memory In Cluster:
3/10 (30%)

The PENDING(Allocated) column shows 3, while GPU0(Allocated/Total) shows 0/10. Could anyone tell me what PENDING is supposed to mean? Thanks!

I created the pod with this YAML:

apiVersion: v1
kind: Pod
metadata:
  name: gpu-share-pod1
spec:
  restartPolicy: OnFailure
  containers:
    - name: gpu-share-pod1
      image: "cheyang/gpu-player:v2"
      env:
        - name: NVIDIA_VISIBLE_DEVICES
          value: "0"
      resources:
        limits:
          aliyun.com/gpu-mem: 3

duokuiwang commented 3 years ago

I have the same question. Did you solve it?

asctb commented 2 years ago

I have this exact same question. Does anyone have an update on it?

icovej commented 1 year ago

I have this question too. Have you solved it?