ceph / ceph-csi

CSI driver for Ceph
Apache License 2.0
1.26k stars 536 forks source link

ceph-csi shows errors regarding umount of volumes #1620

Closed andersosthus closed 3 years ago

andersosthus commented 3 years ago

Describe the bug

In our ceph-csi logs, we see errors that looks like this: 2020-10-22 12:40:26 | E1022 10:40:26.173335 1 utils.go:163] ID: 2262 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/pods/9fd8d5d0-13bf-11eb-975b-0a362473693d/volumes/kubernetes.io~csi/pvc-1f462319-1397-11eb-97a0-0258a0a0acd1/mount]

It occurs in several ceph-csi containers (but not all), and over the last 15 minutes, we have 300 entries of this in the logs. I've investigated a few of them. The first one I looked at, the pvc in question was not mounted on the node, but the directory still existed in /var/lib/kubelet/plugins/kubernetes.io/csi/pv/. The errors lasted for about 1 hour. Full log of that event is below marked as [1].

I then investigated another instance of this error. This time, the volume was still mounted on the node, and I did a manual umount of it without error. Looking at the ceph-csi logs, for this specific issue, it has been going on for over 12 hours. Logs from ceph-csi below marked as [2] (I've cut out a lot of duplicate logs in the middle).

Not really sure if this has any direct effect on Ceph or not, but we've had issues with CephFS lately, so we're looking into everything that seems not normal, and thus we found this in the logs.

If any more logs are needed, let me know.

Environment details

Logs

[1] csi-cephfsplugin:

  |   | 2020-10-22 11:15:03 | E1022 09:15:03.658218       1 utils.go:163] ID: 2623 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 11:13:01 | E1022 09:13:01.605311       1 utils.go:163] ID: 2616 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 11:10:59 | E1022 09:10:59.564526       1 utils.go:163] ID: 2604 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 11:08:57 | E1022 09:08:57.483271       1 utils.go:163] ID: 2599 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 11:06:55 | E1022 09:06:55.444130       1 utils.go:163] ID: 2595 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 11:04:53 | E1022 09:04:53.385999       1 utils.go:163] ID: 2591 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 11:02:51 | E1022 09:02:51.361462       1 utils.go:163] ID: 2584 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 11:00:49 | E1022 09:00:49.253230       1 utils.go:163] ID: 2580 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:58:47 | E1022 08:58:47.205973       1 utils.go:163] ID: 2572 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:56:45 | E1022 08:56:45.138325       1 utils.go:163] ID: 2464 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:54:43 | E1022 08:54:43.048169       1 utils.go:163] ID: 2460 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:52:40 | E1022 08:52:40.950785       1 utils.go:163] ID: 2455 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:50:38 | E1022 08:50:38.867794       1 utils.go:163] ID: 2448 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:48:36 | E1022 08:48:36.816126       1 utils.go:163] ID: 2444 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:46:34 | E1022 08:46:34.714009       1 utils.go:163] ID: 2286 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:44:32 | E1022 08:44:32.662215       1 utils.go:163] ID: 2281 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:42:30 | E1022 08:42:30.600554       1 utils.go:163] ID: 2277 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:40:28 | E1022 08:40:28.551307       1 utils.go:163] ID: 2270 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:38:26 | E1022 08:38:26.383005       1 utils.go:163] ID: 2158 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:36:24 | E1022 08:36:24.238817       1 utils.go:163] ID: 2142 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:34:22 | E1022 08:34:22.214360       1 utils.go:163] ID: 2138 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:32:20 | E1022 08:32:20.071057       1 utils.go:163] ID: 2130 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:30:17 | E1022 08:30:17.943734       1 utils.go:163] ID: 2122 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:28:15 | E1022 08:28:15.821786       1 utils.go:163] ID: 2114 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:26:13 | E1022 08:26:13.690989       1 utils.go:163] ID: 1980 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:24:11 | E1022 08:24:11.561196       1 utils.go:163] ID: 1961 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:22:09 | E1022 08:22:09.317058       1 utils.go:163] ID: 1948 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:20:07 | E1022 08:20:07.252616       1 utils.go:163] ID: 1934 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:19:03 | E1022 08:19:03.151341       1 utils.go:163] ID: 1929 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:18:30 | E1022 08:18:30.945487       1 utils.go:163] ID: 1921 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:18:14 | E1022 08:18:14.874316       1 utils.go:163] ID: 1919 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:18:06 | E1022 08:18:06.840722       1 utils.go:163] ID: 1916 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/plugins/kubernetes.io/csi/pv/pvc-ea1fb499-75ff-11ea-9ed8-0292c30af1ba/globalmount]
  |   | 2020-10-22 10:18:02 | E1022 08:18:02.821235       1 utils.go:163] ID: 1914 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Aborted desc = an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 already exists
  |   | 2020-10-22 10:18:02 | E1022 08:18:02.821195       1 nodeserver.go:290] ID: 1914 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 already exists
  |   | 2020-10-22 10:18:00 | E1022 08:18:00.812718       1 utils.go:163] ID: 1912 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Aborted desc = an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 already exists
  |   | 2020-10-22 10:18:00 | E1022 08:18:00.812683       1 nodeserver.go:290] ID: 1912 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 already exists
  |   | 2020-10-22 10:17:59 | E1022 08:17:59.709602       1 utils.go:163] ID: 1910 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 GRPC error: rpc error: code = Aborted desc = an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 already exists
  |   | 2020-10-22 10:17:59 | E1022 08:17:59.709566       1 nodeserver.go:290] ID: 1910 Req-ID: 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-f1c11904-75ff-11ea-b148-0a580ac83104 already exists

[2] csi-cephfsplugin:

  |   | 2020-10-22 13:25:11 | E1022 11:25:11.998059       1 utils.go:163] ID: 2329 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/pods/9fd8d5d0-13bf-11eb-975b-0a362473693d/volumes/kubernetes.io~csi/pvc-1f462319-1397-11eb-97a0-0258a0a0acd1/mount]
  |   | 2020-10-22 13:23:09 | E1022 11:23:09.937381       1 utils.go:163] ID: 2326 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/pods/9fd8d5d0-13bf-11eb-975b-0a362473693d/volumes/kubernetes.io~csi/pvc-1f462319-1397-11eb-97a0-0258a0a0acd1/mount]
  |   | 2020-10-22 13:21:07 | E1022 11:21:07.849732       1 utils.go:163] ID: 2322 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/pods/9fd8d5d0-13bf-11eb-975b-0a362473693d/volumes/kubernetes.io~csi/pvc-1f462319-1397-11eb-97a0-0258a0a0acd1/mount]
  |   | 2020-10-22 13:19:05 | E1022 11:19:05.773535       1 utils.go:163] ID: 2319 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/pods/9fd8d5d0-13bf-11eb-975b-0a362473693d/volumes/kubernetes.io~csi/pvc-1f462319-1397-11eb-97a0-0258a0a0acd1/mount]
  |   | 2020-10-22 13:17:03 | E1022 11:17:03.672512       1 utils.go:163] ID: 2316 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/pods/9fd8d5d0-13bf-11eb-975b-0a362473693d/volumes/kubernetes.io~csi/pvc-1f462319-1397-11eb-97a0-0258a0a0acd1/mount]
----CUT HERE----
  |   | 2020-10-21 23:20:54 | E1021 21:20:54.820958       1 utils.go:163] ID: 1019 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/pods/9fd8d5d0-13bf-11eb-975b-0a362473693d/volumes/kubernetes.io~csi/pvc-1f462319-1397-11eb-97a0-0258a0a0acd1/mount]
  |   | 2020-10-21 23:18:52 | E1021 21:18:52.750573       1 utils.go:163] ID: 1016 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/pods/9fd8d5d0-13bf-11eb-975b-0a362473693d/volumes/kubernetes.io~csi/pvc-1f462319-1397-11eb-97a0-0258a0a0acd1/mount]
  |   | 2020-10-21 23:16:50 | E1021 21:16:50.521826       1 utils.go:163] ID: 1013 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/pods/9fd8d5d0-13bf-11eb-975b-0a362473693d/volumes/kubernetes.io~csi/pvc-1f462319-1397-11eb-97a0-0258a0a0acd1/mount]
  |   | 2020-10-21 23:14:45 | E1021 21:14:45.735270       1 utils.go:163] ID: 986 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/pods/9fd8d5d0-13bf-11eb-975b-0a362473693d/volumes/kubernetes.io~csi/pvc-1f462319-1397-11eb-97a0-0258a0a0acd1/mount]
  |   | 2020-10-21 23:12:43 | E1021 21:12:43.629389       1 utils.go:163] ID: 971 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Internal desc = an error (exit status 32) occurred while running umount args: [/var/lib/kubelet/pods/9fd8d5d0-13bf-11eb-975b-0a362473693d/volumes/kubernetes.io~csi/pvc-1f462319-1397-11eb-97a0-0258a0a0acd1/mount]
  |   | 2020-10-21 23:10:41 | E1021 21:10:41.576857       1 utils.go:163] ID: 944 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Aborted desc = an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 23:10:41 | E1021 21:10:41.576832       1 nodeserver.go:261] ID: 944 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 23:08:39 | E1021 21:08:39.486300       1 utils.go:163] ID: 917 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Aborted desc = an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 23:08:39 | E1021 21:08:39.486006       1 nodeserver.go:261] ID: 917 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 23:06:37 | E1021 21:06:37.434819       1 utils.go:163] ID: 904 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Aborted desc = an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 23:06:37 | E1021 21:06:37.434796       1 nodeserver.go:261] ID: 904 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 23:04:35 | E1021 21:04:35.410276       1 utils.go:163] ID: 893 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Aborted desc = an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 23:04:35 | E1021 21:04:35.410139       1 nodeserver.go:261] ID: 893 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 23:02:33 | E1021 21:02:33.321903       1 utils.go:163] ID: 876 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Aborted desc = an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 23:02:33 | E1021 21:02:33.321871       1 nodeserver.go:261] ID: 876 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 23:00:31 | E1021 21:00:31.260813       1 utils.go:163] ID: 817 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Aborted desc = an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 23:00:31 | E1021 21:00:31.260776       1 nodeserver.go:261] ID: 817 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 22:59:27 | E1021 20:59:27.185629       1 utils.go:163] ID: 815 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Aborted desc = an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 22:59:27 | E1021 20:59:27.185596       1 nodeserver.go:261] ID: 815 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 22:58:55 | E1021 20:58:55.112311       1 utils.go:163] ID: 805 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Aborted desc = an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 22:58:55 | E1021 20:58:55.112283       1 nodeserver.go:261] ID: 805 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 22:58:39 | E1021 20:58:39.033132       1 utils.go:163] ID: 804 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Aborted desc = an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 22:58:39 | E1021 20:58:39.033022       1 nodeserver.go:261] ID: 804 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 22:58:30 | E1021 20:58:30.938560       1 utils.go:163] ID: 803 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Aborted desc = an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 22:58:30 | E1021 20:58:30.938528       1 nodeserver.go:261] ID: 803 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 22:58:26 | E1021 20:58:26.740326       1 utils.go:163] ID: 802 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Aborted desc = an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 22:58:26 | E1021 20:58:26.740299       1 nodeserver.go:261] ID: 802 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 22:58:24 | E1021 20:58:24.640302       1 utils.go:163] ID: 801 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Aborted desc = an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 22:58:24 | E1021 20:58:24.640272       1 nodeserver.go:261] ID: 801 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 22:58:23 | E1021 20:58:23.593221       1 utils.go:163] ID: 800 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 GRPC error: rpc error: code = Aborted desc = an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists
  |   | 2020-10-21 22:58:23 | E1021 20:58:23.593194       1 nodeserver.go:261] ID: 800 Req-ID: 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 an operation with the given Volume ID 0001-0009-rook-ceph-0000000000000001-1ff293cc-1397-11eb-9662-0a580ac80704 already exists

[3] driver-registrar for pod used in [2] above:

  |   | 2020-10-21 16:14:08 | I1021 14:14:08.956151       1 main.go:87] Received NotifyRegistrationStatus call: &RegistrationStatus{PluginRegistered:true,Error:,}
  |   | 2020-10-21 16:14:08 | I1021 14:14:08.861003       1 main.go:77] Received GetInfo call: &InfoRequest{}
  |   | 2020-10-21 16:14:08 | I1021 14:14:08.860400       1 node_register.go:67] Registration Server started at: /registration/rook-ceph.cephfs.csi.ceph.com-reg.sock
  |   | 2020-10-21 16:14:08 | I1021 14:14:08.860249       1 node_register.go:58] Starting Registration Server at: /registration/rook-ceph.cephfs.csi.ceph.com-reg.sock
  |   | 2020-10-21 16:14:02 | W1021 14:14:02.635119       1 connection.go:170] Still connecting to unix:///csi/csi.sock
  |   | 2020-10-21 16:13:52 | W1021 14:13:52.635139       1 connection.go:170] Still connecting to unix:///csi/csi.sock
  |   | 2020-10-21 16:13:42 | W1021 14:13:42.635133       1 connection.go:170] Still connecting to unix:///csi/csi.sock
  |   | 2020-10-21 16:13:32 | I1021 14:13:32.634987       1 connection.go:151] Connecting to unix:///csi/csi.sock
  |   | 2020-10-21 16:13:32 | I1021 14:13:32.634940       1 main.go:110] Version: v1.2.0-0-g6ef000ae
Madhu-1 commented 3 years ago

@andersosthus is this cluster is in a healthy state? also the above logs are having only error logs it will be difficult to analyze the issue. can you please increase the log level from 0 to 5 [rook}(https://github.com/rook/rook/blob/master/cluster/examples/kubernetes/ceph/operator.yaml#L35) and provide the logs again?

Madhu-1 commented 3 years ago

and also can you run the umount command manually and see what error you are getting?

andersosthus commented 3 years ago

The cluster is healthy now, but we do get spikes of "MDS_SLOW_REQUESTS" that causes everything to grind to an halt. I did an umount in my second example above, and it unmounted successfully without any errors. In the first example, the volume wasn't mounted at all.

I'll increase the log level and get you some better logs.

stale[bot] commented 3 years ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in a week if no further activity occurs. Thank you for your contributions.

stale[bot] commented 3 years ago

This issue has been automatically closed due to inactivity. Please re-open if this still requires investigation.