Open Hokwang opened 5 months ago
I have been watching this daemonset for a long time.
# kubectl -n gpu-operator get pod -l app.kubernetes.io/component=nvidia-driver --field-selector spec.nodeName=gn10 -w
NAME READY STATUS RESTARTS AGE
nvidia-driver-daemonset-jgnzk 0/1 Init:0/1 0 14s
nvidia-driver-daemonset-jgnzk 0/1 PodInitializing 0 35s
nvidia-driver-daemonset-jgnzk 0/1 Running 0 36s
nvidia-driver-daemonset-jgnzk 0/1 Terminating 0 81s
nvidia-driver-daemonset-jgnzk 0/1 Terminating 0 91s
nvidia-driver-daemonset-bd5g7 0/1 Pending 0 0s
nvidia-driver-daemonset-bd5g7 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-jgnzk 0/1 Terminating 0 92s
nvidia-driver-daemonset-jgnzk 0/1 Terminating 0 92s
nvidia-driver-daemonset-jgnzk 0/1 Terminating 0 92s
nvidia-driver-daemonset-bd5g7 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-bd5g7 0/1 PodInitializing 0 34s
nvidia-driver-daemonset-bd5g7 0/1 Running 0 35s
nvidia-driver-daemonset-bd5g7 0/1 Terminating 0 86s
nvidia-driver-daemonset-bd5g7 0/1 Terminating 0 92s
nvidia-driver-daemonset-7hnkw 0/1 Pending 0 0s
nvidia-driver-daemonset-7hnkw 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-bd5g7 0/1 Terminating 0 93s
nvidia-driver-daemonset-bd5g7 0/1 Terminating 0 93s
nvidia-driver-daemonset-bd5g7 0/1 Terminating 0 93s
nvidia-driver-daemonset-7hnkw 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-7hnkw 0/1 Terminating 0 22s
nvidia-driver-daemonset-7hnkw 0/1 Terminating 0 22s
nvidia-driver-daemonset-7hnkw 0/1 Terminating 0 23s
nvidia-driver-daemonset-7hnkw 0/1 Terminating 0 23s
nvidia-driver-daemonset-7hnkw 0/1 Terminating 0 23s
nvidia-driver-daemonset-7hnkw 0/1 Terminating 0 23s
nvidia-driver-daemonset-nn2bd 0/1 Pending 0 0s
nvidia-driver-daemonset-nn2bd 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-nn2bd 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-nn2bd 0/1 PodInitializing 0 11s
nvidia-driver-daemonset-nn2bd 0/1 Running 0 12s
nvidia-driver-daemonset-nn2bd 0/1 Terminating 0 33s
nvidia-driver-daemonset-nn2bd 0/1 Terminating 0 63s
nvidia-driver-daemonset-whthp 0/1 Pending 0 0s
nvidia-driver-daemonset-whthp 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-nn2bd 0/1 Terminating 0 64s
nvidia-driver-daemonset-nn2bd 0/1 Terminating 0 64s
nvidia-driver-daemonset-nn2bd 0/1 Terminating 0 64s
nvidia-driver-daemonset-whthp 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-whthp 0/1 PodInitializing 0 34s
nvidia-driver-daemonset-whthp 0/1 Running 0 35s
nvidia-driver-daemonset-whthp 0/1 Terminating 0 36s
nvidia-driver-daemonset-whthp 0/1 Terminating 0 37s
nvidia-driver-daemonset-xwpjv 0/1 Pending 0 0s
nvidia-driver-daemonset-xwpjv 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-whthp 0/1 Terminating 0 37s
nvidia-driver-daemonset-whthp 0/1 Terminating 0 37s
nvidia-driver-daemonset-whthp 0/1 Terminating 0 37s
nvidia-driver-daemonset-xwpjv 0/1 Terminating 0 1s
nvidia-driver-daemonset-xwpjv 0/1 Terminating 0 1s
nvidia-driver-daemonset-xwpjv 0/1 Terminating 0 2s
nvidia-driver-daemonset-xwpjv 0/1 Terminating 0 2s
nvidia-driver-daemonset-xwpjv 0/1 Terminating 0 2s
nvidia-driver-daemonset-xwpjv 0/1 Terminating 0 2s
nvidia-driver-daemonset-xwpjv 0/1 Terminating 0 2s
nvidia-driver-daemonset-qtvvz 0/1 Pending 0 0s
nvidia-driver-daemonset-qtvvz 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-qtvvz 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-qtvvz 0/1 PodInitializing 0 34s
nvidia-driver-daemonset-qtvvz 0/1 Running 0 35s
nvidia-driver-daemonset-qtvvz 0/1 Terminating 0 98s
nvidia-driver-daemonset-qtvvz 0/1 Terminating 0 2m9s
nvidia-driver-daemonset-2wgn4 0/1 Pending 0 0s
nvidia-driver-daemonset-2wgn4 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-qtvvz 0/1 Terminating 0 2m9s
nvidia-driver-daemonset-qtvvz 0/1 Terminating 0 2m9s
nvidia-driver-daemonset-qtvvz 0/1 Terminating 0 2m9s
nvidia-driver-daemonset-2wgn4 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-2wgn4 0/1 PodInitializing 0 34s
nvidia-driver-daemonset-2wgn4 0/1 Running 0 35s
nvidia-driver-daemonset-2wgn4 0/1 Terminating 0 37s
nvidia-driver-daemonset-2wgn4 0/1 Terminating 0 38s
nvidia-driver-daemonset-zw46q 0/1 Pending 0 0s
nvidia-driver-daemonset-zw46q 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-2wgn4 0/1 Terminating 0 39s
nvidia-driver-daemonset-2wgn4 0/1 Terminating 0 39s
nvidia-driver-daemonset-2wgn4 0/1 Terminating 0 39s
nvidia-driver-daemonset-zw46q 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-zw46q 0/1 Terminating 0 26s
nvidia-driver-daemonset-zw46q 0/1 Terminating 0 27s
nvidia-driver-daemonset-zw46q 0/1 Terminating 0 27s
nvidia-driver-daemonset-zw46q 0/1 Terminating 0 27s
nvidia-driver-daemonset-zw46q 0/1 Terminating 0 27s
nvidia-driver-daemonset-zw46q 0/1 Terminating 0 27s
nvidia-driver-daemonset-j5l4x 0/1 Pending 0 0s
nvidia-driver-daemonset-j5l4x 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-j5l4x 0/1 Terminating 0 0s
nvidia-driver-daemonset-j5l4x 0/1 Terminating 0 2s
nvidia-driver-daemonset-j5l4x 0/1 Terminating 0 2s
nvidia-driver-daemonset-j5l4x 0/1 Terminating 0 3s
nvidia-driver-daemonset-j5l4x 0/1 Terminating 0 3s
nvidia-driver-daemonset-j5l4x 0/1 Terminating 0 3s
nvidia-driver-daemonset-j5l4x 0/1 Terminating 0 3s
nvidia-driver-daemonset-l4qfh 0/1 Pending 0 0s
nvidia-driver-daemonset-l4qfh 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-l4qfh 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-l4qfh 0/1 PodInitializing 0 4s
nvidia-driver-daemonset-l4qfh 0/1 Running 0 5s
nvidia-driver-daemonset-l4qfh 0/1 Terminating 0 11s
nvidia-driver-daemonset-l4qfh 0/1 Terminating 0 19s
nvidia-driver-daemonset-jcgcr 0/1 Pending 0 0s
nvidia-driver-daemonset-jcgcr 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-l4qfh 0/1 Terminating 0 20s
nvidia-driver-daemonset-l4qfh 0/1 Terminating 0 20s
nvidia-driver-daemonset-l4qfh 0/1 Terminating 0 20s
nvidia-driver-daemonset-jcgcr 0/1 Terminating 0 1s
nvidia-driver-daemonset-jcgcr 0/1 Terminating 0 2s
nvidia-driver-daemonset-jcgcr 0/1 Terminating 0 2s
nvidia-driver-daemonset-jcgcr 0/1 Terminating 0 3s
nvidia-driver-daemonset-jcgcr 0/1 Terminating 0 3s
nvidia-driver-daemonset-jcgcr 0/1 Terminating 0 3s
nvidia-driver-daemonset-jcgcr 0/1 Terminating 0 3s
nvidia-driver-daemonset-mnst5 0/1 Pending 0 0s
nvidia-driver-daemonset-mnst5 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-mnst5 0/1 Terminating 0 0s
nvidia-driver-daemonset-mnst5 0/1 Terminating 0 1s
nvidia-driver-daemonset-mnst5 0/1 Terminating 0 1s
nvidia-driver-daemonset-mnst5 0/1 Terminating 0 2s
nvidia-driver-daemonset-mnst5 0/1 Terminating 0 2s
nvidia-driver-daemonset-mnst5 0/1 Terminating 0 2s
nvidia-driver-daemonset-mnst5 0/1 Terminating 0 2s
nvidia-driver-daemonset-4k7wb 0/1 Pending 0 0s
nvidia-driver-daemonset-4k7wb 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-4k7wb 0/1 Init:0/1 0 1s
nvidia-driver-daemonset-4k7wb 0/1 Terminating 0 16s
nvidia-driver-daemonset-4k7wb 0/1 Terminating 0 16s
nvidia-driver-daemonset-4k7wb 0/1 Terminating 0 17s
nvidia-driver-daemonset-4k7wb 0/1 Terminating 0 17s
nvidia-driver-daemonset-4k7wb 0/1 Terminating 0 17s
nvidia-driver-daemonset-4k7wb 0/1 Terminating 0 17s
nvidia-driver-daemonset-4rmhj 0/1 Pending 0 0s
nvidia-driver-daemonset-4rmhj 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-4rmhj 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-4rmhj 0/1 PodInitializing 0 17s
nvidia-driver-daemonset-4rmhj 0/1 Terminating 0 17s
nvidia-driver-daemonset-4rmhj 0/1 Terminating 0 18s
nvidia-driver-daemonset-4rmhj 0/1 Terminating 0 18s
nvidia-driver-daemonset-s7j8x 0/1 Pending 0 0s
nvidia-driver-daemonset-s7j8x 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-4rmhj 0/1 Terminating 0 19s
nvidia-driver-daemonset-4rmhj 0/1 Terminating 0 19s
nvidia-driver-daemonset-4rmhj 0/1 Terminating 0 19s
nvidia-driver-daemonset-s7j8x 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-s7j8x 0/1 Terminating 0 4s
nvidia-driver-daemonset-s7j8x 0/1 Terminating 0 4s
nvidia-driver-daemonset-s7j8x 0/1 Terminating 0 5s
nvidia-driver-daemonset-s7j8x 0/1 Terminating 0 5s
nvidia-driver-daemonset-s7j8x 0/1 Terminating 0 5s
nvidia-driver-daemonset-s7j8x 0/1 Terminating 0 5s
nvidia-driver-daemonset-5hrpq 0/1 Pending 0 0s
nvidia-driver-daemonset-5hrpq 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-5hrpq 0/1 Init:0/1 0 1s
nvidia-driver-daemonset-5hrpq 0/1 PodInitializing 0 30s
nvidia-driver-daemonset-5hrpq 0/1 Running 0 31s
nvidia-driver-daemonset-5hrpq 0/1 Terminating 0 105s
nvidia-driver-daemonset-5hrpq 0/1 Terminating 0 2m16s
nvidia-driver-daemonset-5xmh8 0/1 Pending 0 0s
nvidia-driver-daemonset-5xmh8 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-5hrpq 0/1 Terminating 0 2m16s
nvidia-driver-daemonset-5hrpq 0/1 Terminating 0 2m16s
nvidia-driver-daemonset-5hrpq 0/1 Terminating 0 2m16s
nvidia-driver-daemonset-5xmh8 0/1 Init:0/1 0 1s
nvidia-driver-daemonset-5xmh8 0/1 Terminating 0 25s
nvidia-driver-daemonset-5xmh8 0/1 Terminating 0 25s
nvidia-driver-daemonset-5xmh8 0/1 Terminating 0 25s
nvidia-driver-daemonset-5xmh8 0/1 Terminating 0 26s
nvidia-driver-daemonset-5xmh8 0/1 Terminating 0 26s
nvidia-driver-daemonset-5xmh8 0/1 Terminating 0 26s
nvidia-driver-daemonset-4z46z 0/1 Pending 0 0s
nvidia-driver-daemonset-4z46z 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-4z46z 0/1 Init:0/1 0 1s
nvidia-driver-daemonset-4z46z 0/1 Terminating 0 7s
nvidia-driver-daemonset-4z46z 0/1 Terminating 0 7s
nvidia-driver-daemonset-4z46z 0/1 Terminating 0 7s
nvidia-driver-daemonset-4z46z 0/1 Terminating 0 7s
nvidia-driver-daemonset-4z46z 0/1 Terminating 0 7s
nvidia-driver-daemonset-4z46z 0/1 Terminating 0 7s
nvidia-driver-daemonset-hmvhx 0/1 Pending 0 0s
nvidia-driver-daemonset-hmvhx 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-hmvhx 0/1 Init:0/1 0 1s
nvidia-driver-daemonset-hmvhx 0/1 PodInitializing 0 3s
nvidia-driver-daemonset-hmvhx 0/1 Running 0 4s
nvidia-driver-daemonset-hmvhx 0/1 Terminating 0 93s
nvidia-driver-daemonset-hmvhx 0/1 Terminating 0 2m5s
nvidia-driver-daemonset-m5jt6 0/1 Pending 0 0s
nvidia-driver-daemonset-m5jt6 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-hmvhx 0/1 Terminating 0 2m6s
nvidia-driver-daemonset-hmvhx 0/1 Terminating 0 2m6s
nvidia-driver-daemonset-hmvhx 0/1 Terminating 0 2m6s
nvidia-driver-daemonset-m5jt6 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-m5jt6 0/1 PodInitializing 0 34s
nvidia-driver-daemonset-m5jt6 0/1 Running 0 35s
nvidia-driver-daemonset-m5jt6 0/1 Terminating 0 58s
nvidia-driver-daemonset-m5jt6 0/1 Terminating 0 90s
nvidia-driver-daemonset-5z7qw 0/1 Pending 0 0s
nvidia-driver-daemonset-5z7qw 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-m5jt6 0/1 Terminating 0 90s
nvidia-driver-daemonset-m5jt6 0/1 Terminating 0 90s
nvidia-driver-daemonset-m5jt6 0/1 Terminating 0 90s
nvidia-driver-daemonset-5z7qw 0/1 Init:0/1 0 1s
nvidia-driver-daemonset-5z7qw 0/1 Terminating 0 14s
nvidia-driver-daemonset-5z7qw 0/1 Terminating 0 14s
nvidia-driver-daemonset-5z7qw 0/1 Terminating 0 15s
nvidia-driver-daemonset-5z7qw 0/1 Terminating 0 15s
nvidia-driver-daemonset-5z7qw 0/1 Terminating 0 15s
nvidia-driver-daemonset-5z7qw 0/1 Terminating 0 15s
nvidia-driver-daemonset-z4756 0/1 Pending 0 0s
nvidia-driver-daemonset-z4756 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-z4756 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-z4756 0/1 PodInitializing 0 19s
nvidia-driver-daemonset-z4756 0/1 Running 0 20s
nvidia-driver-daemonset-z4756 0/1 Terminating 0 2m25s
nvidia-driver-daemonset-z4756 0/1 Terminating 0 2m55s
nvidia-driver-daemonset-5789r 0/1 Pending 0 0s
nvidia-driver-daemonset-5789r 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-z4756 0/1 Terminating 0 2m56s
nvidia-driver-daemonset-z4756 0/1 Terminating 0 2m56s
nvidia-driver-daemonset-z4756 0/1 Terminating 0 2m56s
nvidia-driver-daemonset-5789r 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-5789r 0/1 PodInitializing 0 34s
nvidia-driver-daemonset-5789r 0/1 Running 0 35s
nvidia-driver-daemonset-5789r 0/1 Terminating 0 93s
nvidia-driver-daemonset-5789r 0/1 Terminating 0 2m4s
nvidia-driver-daemonset-sxwhc 0/1 Pending 0 0s
nvidia-driver-daemonset-sxwhc 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-5789r 0/1 Terminating 0 2m5s
nvidia-driver-daemonset-5789r 0/1 Terminating 0 2m5s
nvidia-driver-daemonset-5789r 0/1 Terminating 0 2m5s
nvidia-driver-daemonset-sxwhc 0/1 Terminating 0 1s
nvidia-driver-daemonset-sxwhc 0/1 Terminating 0 2s
nvidia-driver-daemonset-sxwhc 0/1 Terminating 0 2s
nvidia-driver-daemonset-sxwhc 0/1 Terminating 0 3s
nvidia-driver-daemonset-sxwhc 0/1 Terminating 0 3s
nvidia-driver-daemonset-sxwhc 0/1 Terminating 0 3s
nvidia-driver-daemonset-sxwhc 0/1 Terminating 0 3s
nvidia-driver-daemonset-6z9lz 0/1 Pending 0 0s
nvidia-driver-daemonset-6z9lz 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-6z9lz 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-6z9lz 0/1 Terminating 0 2s
nvidia-driver-daemonset-6z9lz 0/1 Terminating 0 4s
nvidia-driver-daemonset-6z9lz 0/1 Terminating 0 5s
nvidia-driver-daemonset-6z9lz 0/1 Terminating 0 5s
nvidia-driver-daemonset-6z9lz 0/1 Terminating 0 5s
nvidia-driver-daemonset-6z9lz 0/1 Terminating 0 5s
nvidia-driver-daemonset-btppt 0/1 Pending 0 0s
nvidia-driver-daemonset-btppt 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-btppt 0/1 Init:0/1 0 1s
nvidia-driver-daemonset-btppt 0/1 PodInitializing 0 29s
nvidia-driver-daemonset-btppt 0/1 Running 0 30s
nvidia-driver-daemonset-btppt 0/1 Terminating 0 2m43s
nvidia-driver-daemonset-btppt 0/1 Terminating 0 3m15s
nvidia-driver-daemonset-s7z9t 0/1 Pending 0 0s
nvidia-driver-daemonset-s7z9t 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-btppt 0/1 Terminating 0 3m16s
nvidia-driver-daemonset-btppt 0/1 Terminating 0 3m16s
nvidia-driver-daemonset-btppt 0/1 Terminating 0 3m16s
nvidia-driver-daemonset-s7z9t 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-s7z9t 0/1 Terminating 0 33s
nvidia-driver-daemonset-s7z9t 0/1 Terminating 0 34s
nvidia-driver-daemonset-s7z9t 0/1 Terminating 0 34s
nvidia-driver-daemonset-s7z9t 0/1 Terminating 0 34s
nvidia-driver-daemonset-s7z9t 0/1 Terminating 0 34s
nvidia-driver-daemonset-s7z9t 0/1 Terminating 0 34s
nvidia-driver-daemonset-5sccm 0/1 Pending 0 0s
nvidia-driver-daemonset-5sccm 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-5sccm 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-5sccm 0/1 PodInitializing 0 3s
nvidia-driver-daemonset-5sccm 0/1 Terminating 0 4s
nvidia-driver-daemonset-5sccm 0/1 Terminating 0 4s
nvidia-driver-daemonset-5sccm 0/1 Terminating 0 5s
nvidia-driver-daemonset-sdc7k 0/1 Pending 0 0s
nvidia-driver-daemonset-sdc7k 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-5sccm 0/1 Terminating 0 6s
nvidia-driver-daemonset-5sccm 0/1 Terminating 0 6s
nvidia-driver-daemonset-5sccm 0/1 Terminating 0 6s
nvidia-driver-daemonset-sdc7k 0/1 Terminating 0 1s
nvidia-driver-daemonset-sdc7k 0/1 Terminating 0 2s
nvidia-driver-daemonset-sdc7k 0/1 Terminating 0 2s
nvidia-driver-daemonset-sdc7k 0/1 Terminating 0 3s
nvidia-driver-daemonset-sdc7k 0/1 Terminating 0 3s
nvidia-driver-daemonset-sdc7k 0/1 Terminating 0 3s
nvidia-driver-daemonset-sdc7k 0/1 Terminating 0 3s
nvidia-driver-daemonset-fp5l2 0/1 Pending 0 0s
nvidia-driver-daemonset-fp5l2 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-fp5l2 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-fp5l2 0/1 PodInitializing 0 34s
nvidia-driver-daemonset-fp5l2 0/1 Running 0 35s
nvidia-driver-daemonset-fp5l2 0/1 Terminating 0 70s
nvidia-driver-daemonset-fp5l2 0/1 Terminating 0 90s
nvidia-driver-daemonset-dwcn5 0/1 Pending 0 0s
nvidia-driver-daemonset-dwcn5 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-fp5l2 0/1 Terminating 0 90s
nvidia-driver-daemonset-fp5l2 0/1 Terminating 0 90s
nvidia-driver-daemonset-fp5l2 0/1 Terminating 0 90s
nvidia-driver-daemonset-dwcn5 0/1 Init:0/1 0 1s
nvidia-driver-daemonset-dwcn5 0/1 Terminating 0 3s
nvidia-driver-daemonset-dwcn5 0/1 Terminating 0 3s
nvidia-driver-daemonset-dwcn5 0/1 Terminating 0 4s
nvidia-driver-daemonset-dwcn5 0/1 Terminating 0 4s
nvidia-driver-daemonset-dwcn5 0/1 Terminating 0 4s
nvidia-driver-daemonset-dwcn5 0/1 Terminating 0 4s
nvidia-driver-daemonset-x9wr6 0/1 Pending 0 0s
nvidia-driver-daemonset-x9wr6 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-x9wr6 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-x9wr6 0/1 PodInitializing 0 29s
nvidia-driver-daemonset-x9wr6 0/1 Running 0 30s
nvidia-driver-daemonset-x9wr6 0/1 Terminating 0 3m4s
nvidia-driver-daemonset-x9wr6 0/1 Terminating 0 3m30s
nvidia-driver-daemonset-b44rq 0/1 Pending 0 0s
nvidia-driver-daemonset-b44rq 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-x9wr6 0/1 Terminating 0 3m30s
nvidia-driver-daemonset-x9wr6 0/1 Terminating 0 3m30s
nvidia-driver-daemonset-x9wr6 0/1 Terminating 0 3m30s
nvidia-driver-daemonset-b44rq 0/1 Terminating 0 0s
nvidia-driver-daemonset-b44rq 0/1 Terminating 0 1s
nvidia-driver-daemonset-b44rq 0/1 Terminating 0 2s
nvidia-driver-daemonset-b44rq 0/1 Terminating 0 2s
nvidia-driver-daemonset-b44rq 0/1 Terminating 0 2s
nvidia-driver-daemonset-b44rq 0/1 Terminating 0 2s
nvidia-driver-daemonset-b44rq 0/1 Terminating 0 2s
nvidia-driver-daemonset-9sgg2 0/1 Pending 0 0s
nvidia-driver-daemonset-9sgg2 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-9sgg2 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-9sgg2 0/1 PodInitializing 0 34s
nvidia-driver-daemonset-9sgg2 0/1 Running 0 35s
nvidia-driver-daemonset-9sgg2 0/1 Terminating 0 84s
nvidia-driver-daemonset-9sgg2 0/1 Terminating 0 90s
nvidia-driver-daemonset-d476x 0/1 Pending 0 0s
nvidia-driver-daemonset-d476x 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-9sgg2 0/1 Terminating 0 91s
nvidia-driver-daemonset-9sgg2 0/1 Terminating 0 91s
nvidia-driver-daemonset-9sgg2 0/1 Terminating 0 91s
nvidia-driver-daemonset-d476x 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-d476x 0/1 Terminating 0 11s
nvidia-driver-daemonset-d476x 0/1 Terminating 0 11s
nvidia-driver-daemonset-d476x 0/1 Terminating 0 11s
nvidia-driver-daemonset-d476x 0/1 Terminating 0 11s
nvidia-driver-daemonset-d476x 0/1 Terminating 0 11s
nvidia-driver-daemonset-d476x 0/1 Terminating 0 11s
nvidia-driver-daemonset-4kwk4 0/1 Pending 0 0s
nvidia-driver-daemonset-4kwk4 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-4kwk4 0/1 Terminating 0 0s
nvidia-driver-daemonset-4kwk4 0/1 Terminating 0 1s
nvidia-driver-daemonset-4kwk4 0/1 Terminating 0 1s
nvidia-driver-daemonset-4kwk4 0/1 Terminating 0 2s
nvidia-driver-daemonset-4kwk4 0/1 Terminating 0 2s
nvidia-driver-daemonset-4kwk4 0/1 Terminating 0 2s
nvidia-driver-daemonset-4kwk4 0/1 Terminating 0 2s
nvidia-driver-daemonset-lrv77 0/1 Pending 0 0s
nvidia-driver-daemonset-lrv77 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-lrv77 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-lrv77 0/1 PodInitializing 0 21s
nvidia-driver-daemonset-lrv77 0/1 Running 0 22s
nvidia-driver-daemonset-lrv77 0/1 Terminating 0 54s
nvidia-driver-daemonset-lrv77 0/1 Terminating 0 78s
nvidia-driver-daemonset-klrb7 0/1 Pending 0 0s
nvidia-driver-daemonset-klrb7 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-lrv77 0/1 Terminating 0 78s
nvidia-driver-daemonset-lrv77 0/1 Terminating 0 78s
nvidia-driver-daemonset-lrv77 0/1 Terminating 0 78s
nvidia-driver-daemonset-klrb7 0/1 Init:0/1 0 1s
nvidia-driver-daemonset-klrb7 0/1 Terminating 0 4s
nvidia-driver-daemonset-klrb7 0/1 Terminating 0 4s
nvidia-driver-daemonset-klrb7 0/1 Terminating 0 4s
nvidia-driver-daemonset-klrb7 0/1 Terminating 0 5s
nvidia-driver-daemonset-klrb7 0/1 Terminating 0 5s
nvidia-driver-daemonset-klrb7 0/1 Terminating 0 5s
nvidia-driver-daemonset-sc7td 0/1 Pending 0 0s
nvidia-driver-daemonset-sc7td 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-sc7td 0/1 Terminating 0 0s
nvidia-driver-daemonset-sc7td 0/1 Terminating 0 1s
nvidia-driver-daemonset-sc7td 0/1 Terminating 0 2s
nvidia-driver-daemonset-sc7td 0/1 Terminating 0 2s
nvidia-driver-daemonset-sc7td 0/1 Terminating 0 2s
nvidia-driver-daemonset-sc7td 0/1 Terminating 0 2s
nvidia-driver-daemonset-sc7td 0/1 Terminating 0 2s
nvidia-driver-daemonset-gzqdr 0/1 Pending 0 0s
nvidia-driver-daemonset-gzqdr 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-gzqdr 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-gzqdr 0/1 Terminating 0 24s
nvidia-driver-daemonset-gzqdr 0/1 Terminating 0 25s
nvidia-driver-daemonset-gzqdr 0/1 Terminating 0 25s
nvidia-driver-daemonset-gzqdr 0/1 Terminating 0 25s
nvidia-driver-daemonset-gzqdr 0/1 Terminating 0 25s
nvidia-driver-daemonset-gzqdr 0/1 Terminating 0 25s
nvidia-driver-daemonset-sjbjk 0/1 Pending 0 0s
nvidia-driver-daemonset-sjbjk 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-sjbjk 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-sjbjk 0/1 PodInitializing 0 3s
nvidia-driver-daemonset-sjbjk 0/1 Running 0 4s
nvidia-driver-daemonset-sjbjk 0/1 Terminating 0 33s
nvidia-driver-daemonset-sjbjk 0/1 Terminating 0 59s
nvidia-driver-daemonset-g9jgm 0/1 Pending 0 0s
nvidia-driver-daemonset-g9jgm 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-sjbjk 0/1 Terminating 0 59s
nvidia-driver-daemonset-sjbjk 0/1 Terminating 0 59s
nvidia-driver-daemonset-sjbjk 0/1 Terminating 0 59s
nvidia-driver-daemonset-g9jgm 0/1 Init:0/1 0 1s
nvidia-driver-daemonset-g9jgm 0/1 Terminating 0 34s
nvidia-driver-daemonset-g9jgm 0/1 Terminating 0 35s
nvidia-driver-daemonset-g9jgm 0/1 Terminating 0 35s
nvidia-driver-daemonset-g9jgm 0/1 Terminating 0 35s
nvidia-driver-daemonset-g9jgm 0/1 Terminating 0 36s
nvidia-driver-daemonset-g9jgm 0/1 Terminating 0 36s
nvidia-driver-daemonset-bh4rj 0/1 Pending 0 0s
nvidia-driver-daemonset-bh4rj 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-bh4rj 0/1 Init:0/1 0 1s
nvidia-driver-daemonset-bh4rj 0/1 Terminating 0 2s
nvidia-driver-daemonset-bh4rj 0/1 Terminating 0 4s
nvidia-driver-daemonset-bh4rj 0/1 Terminating 0 4s
nvidia-driver-daemonset-bh4rj 0/1 Terminating 0 5s
nvidia-driver-daemonset-bh4rj 0/1 Terminating 0 5s
nvidia-driver-daemonset-bh4rj 0/1 Terminating 0 5s
nvidia-driver-daemonset-4p8zh 0/1 Pending 0 0s
nvidia-driver-daemonset-4p8zh 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-4p8zh 0/1 Init:0/1 0 1s
nvidia-driver-daemonset-4p8zh 0/1 Terminating 0 18s
nvidia-driver-daemonset-4p8zh 0/1 Terminating 0 18s
nvidia-driver-daemonset-4p8zh 0/1 Terminating 0 19s
nvidia-driver-daemonset-4p8zh 0/1 Terminating 0 19s
nvidia-driver-daemonset-4p8zh 0/1 Terminating 0 19s
nvidia-driver-daemonset-4p8zh 0/1 Terminating 0 19s
nvidia-driver-daemonset-4sxd5 0/1 Pending 0 0s
nvidia-driver-daemonset-4sxd5 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-4sxd5 0/1 Init:0/1 0 2s
nvidia-driver-daemonset-4sxd5 0/1 PodInitializing 0 10s
nvidia-driver-daemonset-4sxd5 0/1 Running 0 11s
nvidia-driver-daemonset-4sxd5 0/1 Terminating 0 110s
nvidia-driver-daemonset-4sxd5 0/1 Terminating 0 2m23s
nvidia-driver-daemonset-4sxd5 0/1 Terminating 0 2m23s
nvidia-driver-daemonset-k69dh 0/1 Pending 0 0s
nvidia-driver-daemonset-k69dh 0/1 Init:0/1 0 0s
nvidia-driver-daemonset-4sxd5 0/1 Terminating 0 2m23s
nvidia-driver-daemonset-4sxd5 0/1 Terminating 0 2m23s
nvidia-driver-daemonset-k69dh 0/1 Init:0/1 0 2s
From the driver container logs, we see the following error message:
Error: Failed to download metadata for repo 'rhel-8-for-x86_64-baseos-eus-rpms': Cannot download repomd.xml: Cannot download repodata/repomd.xml: All mirrors were tried
Can you confirm your cluster networking is healthy?
@cdesiniotis Hi, yes. Everything is fine except for the *-eus-* rpm repo. Is it needed?
Also, I don't know why the server cannot download metadata for the -eus- repo only. The URL is the same as for the other rpms.
Hello,
One of our nodes ran into the same issue during a driver upgrade. The upgrade went well for 5 servers, but on one of them the nvidia-driver-daemonset pods are restarting too fast and I cannot catch the logs.
It looks like we are stuck in "pod-restart-required", judging by the labels described here: https://docs.nvidia.com/datacenter/cloud-native/gpu-operator/24.3.0/_images/upgrade-controller-state-machine.png All nodes have the correct version of the driver deployed, but they still show "upgrade-required", and the node being processed by the driver ds restarter is flagged as "pod-restart-required".
We didn't have any issue with "535.161.08", but "535.183.06" does not work as expected (operator 24.3.0).
I've disabled the auto-upgrade flags in the helm config. The driver is still deployed, but at least the nodes are running okay (another way is to label the node with nvidia.com/gpu-driver-upgrade.skip=true).
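For anyone landing here with the same symptom, the label workaround above could be scripted roughly as follows. This is only a sketch: the function names and the `KUBECTL` override variable are hypothetical, and the state label key `nvidia.com/gpu-driver-upgrade-state` is an assumption based on the upgrade state machine linked above, not something confirmed in this thread. Only the `nvidia.com/gpu-driver-upgrade.skip=true` label comes from the comment itself.

```shell
#!/bin/sh
# Sketch only: thin wrappers around kubectl for inspecting and pausing the
# driver upgrade on a node. Set KUBECTL to override the binary (hypothetical
# knob, handy for a dry run with KUBECTL=echo).

# Show the upgrade state and skip labels as extra columns for every node.
# ASSUMPTION: the state label key is nvidia.com/gpu-driver-upgrade-state.
show_upgrade_state() {
  ${KUBECTL:-kubectl} get nodes \
    -L nvidia.com/gpu-driver-upgrade-state \
    -L nvidia.com/gpu-driver-upgrade.skip
}

# Ask the upgrade controller to skip the given node (label from the comment above).
skip_driver_upgrade() {
  ${KUBECTL:-kubectl} label node "$1" nvidia.com/gpu-driver-upgrade.skip=true --overwrite
}

# Remove the skip label again (a trailing "-" deletes a label in kubectl).
unskip_driver_upgrade() {
  ${KUBECTL:-kubectl} label node "$1" nvidia.com/gpu-driver-upgrade.skip-
}
```

After sourcing the file, `skip_driver_upgrade <node-name>` would label a single node, and `show_upgrade_state` lists what the controller currently reports for each node. Running with `KUBECTL=echo` prints the commands instead of executing them.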
1. Quick Debug Information
2. Issue or feature description
nvidia-driver-daemonset does not work.
3. Steps to reproduce the issue
One day a node had a memory hang, so I rebooted that server. Since then, nvidia-driver-daemonset has not changed to Running status.
4. Information to attach (optional if deemed irrelevant)
When the pod is 0/1 Running, its log is
but every pod shows something a little different; sometimes a pod shows
and sometimes I can see a Caught signal message after the output of other commands...