I am using multus-cni, which defaults to using host-device to mount RDMA network card devices. However, some exceptional scenarios have arisen, causing an error when kubelet executes the RunPodSandBox interface: 'failed to find device name for pci address'. When kubelet automatically performs sandbox cleanup, it continuously outputs the 'Link not found' error in the netns of the erroneous sandbox, as it cannot find the corresponding network card device. This, in turn, causes the pod to be blocked in the Terminating state.
I am using multus-cni, which defaults to using host-device to mount RDMA network card devices. However, some exceptional scenarios have arisen, causing an error when kubelet executes the RunPodSandBox interface: 'failed to find device name for pci address'. When kubelet automatically performs sandbox cleanup, it continuously outputs the 'Link not found' error in the netns of the erroneous sandbox, as it cannot find the corresponding network card device. This, in turn, causes the pod to be blocked in the Terminating state.