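You'll need to install the NVIDIA device plugin, which is responsible for registering the GPU resources on the node. Until the plugin is running, the node does not advertise any GPUs to the scheduler, so pods that request them stay Pending even though the instance itself came up. See the note here for more information.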
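One way to confirm whether the plugin has registered GPUs is to look at each node's allocatable resources. The snippet below is only a rough sketch, not something taken from the Karpenter docs: it assumes kubectl is on your PATH and pointed at the affected cluster, and that the GPUs show up under the nvidia.com/gpu resource name that the NVIDIA device plugin registers. The helper names fetch_nodes and gpu_allocatable are made up for illustration.

```python
import json
import subprocess


def fetch_nodes() -> dict:
    """Return the parsed output of `kubectl get nodes -o json`.

    Assumption: kubectl is installed and its current context is the
    cluster where the GPU pods are pending.
    """
    completed = subprocess.run(
        ["kubectl", "get", "nodes", "-o", "json"],
        check=True,
        capture_output=True,
        text=True,
    )
    return json.loads(completed.stdout)


def gpu_allocatable(node_list: dict) -> dict:
    """Map each node name to its allocatable nvidia.com/gpu count.

    A node that does not advertise the resource at all (for example,
    because the device plugin is not running on it) is reported as 0.
    """
    counts = {}
    for node in node_list.get("items", []):
        name = node["metadata"]["name"]
        allocatable = node.get("status", {}).get("allocatable", {})
        # Resource quantities are serialized as strings, e.g. "8".
        counts[name] = int(allocatable.get("nvidia.com/gpu", "0"))
    return counts
```

Calling gpu_allocatable(fetch_nodes()) against the cluster and seeing 0 for the p-type node would be consistent with the device plugin not being installed (or not yet ready) on that node.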
Version
Karpenter: v0.14.0
Kubernetes: v1.21.14
Expected Behavior
Pods using GPU are scheduled to a provisioned p-type instance.
Actual Behavior
The p-type instances were provisioned successfully, but the pods remained in Pending status. After the pending pods were deleted, the provisioned instances were not terminated, even after the termination grace period.
Steps to Reproduce the Problem
gpu_privisioner.yaml
gpu_deployment.yaml
Resource Specs and Logs