defenseunicorns / uds-bundle-software-factory-nutanix

A UDS Bundle
Apache License 2.0
2 stars 0 forks source link

bug: velero Backups for RWX PVCs fail #113

Open justicorn opened 6 months ago

justicorn commented 6 months ago

Changes are being pushed to address Velero working with the Nutanix CSI plugin. However, there is currently an issue with backing up RWX PVCs (nutanix-dynamicfile). Sample error below:

time="2024-05-21T19:16:26Z" level=warning msg="Volumesnapshotcontent snapcontent-5849b822-88fa-42ac-af3d-cc2f089b8369 has error: Failed to check and update snapshot content: failed to take snapshot of the volume NutanixFiles-dynamic-ZTEyNjY1YTEtMzJkZS00MWMyLTlhZDktNmJlYWM4OGY2YzQ3-pvc-25c708dd-44a6-4854-9047-09ba60d0b4e7: \"rpc error: code = Aborted desc = NutanixVolumes: Error creating snapshot snapshot-5849b822-88fa-42ac-af3d-cc2f089b8369 : Failed to create VG Snapshot: json: cannot unmarshal array into Go struct field MessageResource.message_list.details of type string\"" backup=velero/manual-keycloak-velero-backup-1716317444 cmd=/plugins/velero-plugin-for-csi logSource="/go/src/velero-plugin-for-csi/internal/util/util.go:261" pluginName=velero-plugin-for-csi

In order to reproduce, kick off backup and restore as noted in the docs in a namespace with a RWX PVC.

blancharda commented 6 months ago

The nutanix docs look like it should be supported 🤔

justicorn commented 6 months ago

Image

https://portal.nutanix.com/page/documents/solutions/details?targetId=NVD-2177-Cloud-Native-6-5-OpenShift:application-backup-and-disaster-recovery.html#:~:text=As%20an%20alternative%20solution

blancharda commented 6 months ago

😭 That's pretty unfortunate.

On the bright side, I don't believe anything depends on RWX volume backups today. We should look at adding restic (or some other alternative) to address in the future though

jacobbmay commented 2 months ago

Moving to No Status so it is still tracked, but this isn't something we can resolve ourselves. It is a limitation of the Nutanix Files CSI provisioner.