k8snetworkplumbingwg / sriov-network-operator

Operator for provisioning and configuring SR-IOV CNI plugin and device plugin
Apache License 2.0

Mellanox plugin should handle Bluefield card in DPU mode correctly #662

Open ykulazhenkov opened 8 months ago

ykulazhenkov commented 8 months ago


Currently, the Mellanox plugin tries to change the firmware (FW) configuration of the card and then reboots the node. For a Bluefield (BF) card in DPU mode, a change to the FW configuration requires a power cycle of the entire host, so a plain reboot is not enough. The operator can't handle this correctly, which leads to an infinite reboot loop.
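To make the failure mode concrete, here is a minimal sketch of why such a loop would never converge. It is a toy model, not the operator's code: the `card` type, its fields, and the `reconcile` function are hypothetical names invented for illustration. The model assumes the plugin keeps requesting a reboot for as long as the active FW configuration differs from the desired one.

```go
package main

import "fmt"

// card is a hypothetical, simplified model of a NIC's firmware state.
// It is NOT a type from sriov-network-operator.
type card struct {
	dpuMode   bool   // true for a Bluefield card running in DPU mode
	activeCfg string // FW configuration currently in effect
	nextCfg   string // FW configuration stored for the next boot
}

// reboot models a plain node reboot. The stored configuration becomes
// active, except in DPU mode, where (per this issue) only a power cycle
// of the entire host applies it.
func (c *card) reboot() {
	if !c.dpuMode {
		c.activeCfg = c.nextCfg
	}
}

// reconcile models the assumed loop: write the desired configuration and
// reboot until the active configuration matches. maxReboots exists only so
// the sketch terminates. It returns the number of reboots performed and
// whether the card converged.
func reconcile(c *card, desired string, maxReboots int) (int, bool) {
	reboots := 0
	for c.activeCfg != desired {
		if reboots == maxReboots {
			return reboots, false
		}
		c.nextCfg = desired
		c.reboot()
		reboots++
	}
	return reboots, true
}

func main() {
	nic := &card{activeCfg: "old"}
	fmt.Println(reconcile(nic, "new", 5)) // prints "1 true"

	dpu := &card{dpuMode: true, activeCfg: "old"}
	fmt.Println(reconcile(dpu, "new", 5)) // prints "5 false"
}
```

In this model the regular card converges after one reboot, while the card in DPU mode exhausts every allowed reboot without the configuration ever taking effect. Without a cap like `maxReboots`, the loop would run forever.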

Proposed behavior:

Alternative 1:

Alternative 2:

I personally prefer the option from the "Proposed behavior" section.

@e0ne @SchSeba @zeeke Please share your thoughts.

adrianchiris commented 7 months ago

+1 for the Proposed behavior

Hopefully, for DPU we will eventually be able to load the configuration with mstfwreset + reboot and avoid the power cycle. Then we can remove this check from the Mellanox plugin.