Open Dipanshu-Sehjal opened 2 years ago
How do you get the resource name for a drbd resource? Seems we are using RES: * but every name we try appears to say no valid resource name.
We also tried drbdadm down all but no resource defined is also returned.
Hi, After a serial/rolling upgrade of k8s cluster, one of the DRBD resources was found stuck in SyncTarget.
Linstor version - [root@flex-103 ~]# k exec --namespace=piraeus deployment/piraeus-op-piraeus-operator-cs-controller -- linstor --version linstor 1.13.0; GIT-hash: 840cf57c75c166659509e22447b2c0ca6377ee6d
DRBD version - [root@flex-103 ~]# k exec -n piraeus piraeus-op-piraeus-operator-ns-node-jrhwv -c linstor-satellite -- drbdadm --version DRBDADM_BUILDTAG=GIT-hash:\ 087ee6b4961ca154d76e4211223b03149373bed8\ build\ by\ @buildsystem\,\ 2022-01-28\ 12:19:33 DRBDADM_API_VERSION=2 DRBD_KERNEL_VERSION_CODE=0x090106 DRBD_KERNEL_VERSION=9.1.6 DRBDADM_VERSION_CODE=0x091402 DRBDADM_VERSION=9.20.2
Piraeus 1.8.0
Setup details -
K8s cluster is a 3 nodes setup - 2 disk nodes and 1 Diskless node with Protocol C replication. Disk node with InUse resource and SyncTarget is flex-106 (shorted as 106) Disk node with Unused resource and UpToDate is flex-107 (shorted as 107) Diskless node with Unused resource is flex-108 (shorted as 108)
Some relevant info -
DRBD logs from the disk node (flex-106) that has this PVC resource stuck in SyncTarget -
Pod using this PVC is deployed on disk node with this SyncTarget resource -
To avoid cluttering, I have attached all logs related to this PVC in logs here -
Here are attached drbd states, DRBD kernel logs, sc def, and linstor r l output files
drbadm-status-verbose-on-replica-disk-node-107.log drbdadm-cstate-disk-node-106.log drbdadm-cstate-diskless.log drbdadm-cstate-replica-disk-node-107.log drbdadm-dstate-disk-node-106.log drbdadm-dstate-diskless.log drbdadm-dstate-replica-disk-node-107.log drbdadm-show-resource-on-InUse.log linstor-resource-list.log node-associated-with-pod-using-this-pvc.log pvc-description.log sc-info.yaml.log diskless-node-stuck-target-108.log disk-node-stuck-target-primary-106.log disk-node-stuck-target-secondary-107.log drbadm-status-verbose-on-diskless-node-108.log drbadm-status-verbose-on-InUse-node.log