Open Kidswiss opened 2 months ago
I guess there is a potential race condition between the cloned PVC and the snapshot. ZFS does not allow deleting snapshots that are used by a cloned volume. Because the CSI driver receives separate requests for deleting the volume and the snapshot, it might try to delete the snapshot first, which then fails. After deleting the volume, I guess LINSTOR should try to delete the snapshot again.
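To make the ordering problem concrete, here is a minimal sketch in Python. It is a toy in-memory model, not LINSTOR or ZFS code: `FakePool`, `DatasetBusy`, and the dataset names are all hypothetical, and the model only captures the one rule discussed above (a snapshot cannot be destroyed while a clone depends on it).

```python
class DatasetBusy(Exception):
    """Raised when a snapshot still has dependent clones."""


class FakePool:
    """Toy stand-in for a ZFS pool (hypothetical, not a real API)."""

    def __init__(self):
        self.volumes = set()
        self.snapshots = set()
        self.origin = {}  # clone volume name -> snapshot it was cloned from

    def create_volume(self, name):
        self.volumes.add(name)

    def snapshot(self, volume, snap):
        full_name = f"{volume}@{snap}"
        self.snapshots.add(full_name)
        return full_name

    def clone(self, snapshot, name):
        self.volumes.add(name)
        self.origin[name] = snapshot

    def destroy_volume(self, name):
        # Simplification: the model ignores snapshots of the volume itself.
        self.volumes.remove(name)
        self.origin.pop(name, None)

    def destroy_snapshot(self, snapshot):
        dependents = [v for v, s in self.origin.items() if s == snapshot]
        if dependents:
            raise DatasetBusy(f"{snapshot} has dependent clones: {dependents}")
        self.snapshots.remove(snapshot)


pool = FakePool()
pool.create_volume("pool/pvc-source")
snap = pool.snapshot("pool/pvc-source", "backup")
pool.clone(snap, "pool/pvc-restored")

# The snapshot delete request arrives first and fails.
try:
    pool.destroy_snapshot(snap)
    first_attempt = "deleted"
except DatasetBusy:
    first_attempt = "busy"

# Once the cloned volume is gone, the same request succeeds.
pool.destroy_volume("pool/pvc-restored")
pool.destroy_snapshot(snap)
```

In this model the first snapshot delete fails only because of the order in which the two requests arrive, which is why repeating it after the volume delete is enough.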
Yeah, makes sense.
Usually K8s operators retry on every reconcile loop until the desired state is reached.
However, I don't know if this is the operator's concern, or if this should be handled upstream in linstor itself?
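For illustration, a retry-until-converged loop could look roughly like the sketch below. This is an assumption about the general pattern, not how the operator or LINSTOR actually implements it; `reconcile`, `try_delete`, and `max_rounds` are hypothetical names.

```python
def reconcile(pending, try_delete, max_rounds=5):
    """Retry every pending deletion once per round until none remain.

    Returns the number of rounds that were needed.
    """
    for round_no in range(1, max_rounds + 1):
        # Keep only the items whose deletion failed in this round.
        pending = [item for item in pending if not try_delete(item)]
        if not pending:
            return round_no
    raise RuntimeError(f"still pending after {max_rounds} rounds: {pending}")


# Toy state: the snapshot cannot be deleted while the clone exists.
existing = {"snapshot", "clone"}


def try_delete(item):
    if item == "snapshot" and "clone" in existing:
        return False  # blocked by the dependent clone, retry later
    existing.discard(item)
    return True


# The snapshot request comes first, fails, and is retried in round 2.
rounds = reconcile(["snapshot", "clone"], try_delete)
```

A real controller would typically add a delay or backoff between rounds instead of looping immediately.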
> if this should be handled upstream in linstor itself
Ideally yes. There might be some trickery with zfs promote that could even make this work the same as LVM for most cases.
Hi
I've been playing around with the operator a bit. I rely heavily on snapshots for my backups and noticed that they no longer get cleaned up. I'm using 2 replicas for the disks.
Some more information:
ZFS destroy
Error report:
I'll gladly provide more information if necessary.
EDIT:
I can reproduce the issue with https://kubestr.io/:
Let it fail and check the volume snapshots on the cluster.