As you know, sriov-device-plugin pods are creating when SriovNetworkNodePolicy deployed.
After then, my H100 nodes' status are changed from sriovnetwork.openshift.io/state: Idle to sriovnetwork.openshift.io/state: Reboot_Required and rebooted after elapsed some time.
But in v1.4.0, it seems that VFs were created but eventually these were not shown and even PF disappeared. Here's the logs from dmesg:
Hi, I'm facing an issue while creating VFs in v1.4.0 version - IB devices disappears at the end of VF creation (It works in v1.3.0 btw).
I used same configuration (e.g.
SriovNetworkNodePolicy
) for creating VFs.Here's
SriovNetworkNodePolicy
that I used:And I'm using H100 node with ConnectX-7 IB:
And I pulled v1.3.0 and v1.4.0 Helm charts from
oci://ghcr.io/k8snetworkplumbingwg/sriov-network-operator-chart
and image tags are different:v1.3.0
v1.4.0
As you know,
sriov-device-plugin
pods are creating whenSriovNetworkNodePolicy
deployed. After then, my H100 nodes' status are changed fromsriovnetwork.openshift.io/state: Idle
tosriovnetwork.openshift.io/state: Reboot_Required
and rebooted after elapsed some time.But in v1.4.0, it seems that VFs were created but eventually these were not shown and even PF disappeared. Here's the logs from
dmesg
:** Above messages shown when I pointed out
ibp65s0
to create VFs. Sorry for confusion. This behavior happens regardless of PF names.After then, when I tried to execute
mst status -v
then even the node can't find PF itself:Do you know anything about this situation? Anything would be very helpful.
Thanks.