ceph / ceph-csi

CSI driver for Ceph
Apache License 2.0
1.25k stars 537 forks source link

CephFS: ceph-fuse: when cephfs nodeplugin pod restart, mount point cannot ls #4271

Closed VictoryAnn closed 9 months ago

VictoryAnn commented 9 months ago

Describe the bug

A clear and concise description of what the bug is.

Environment details

Steps to reproduce

Steps to reproduce the behavior:

  1. create pvc https://github.com/ceph/ceph-csi/blob/devel/examples/cephfs/pvc.yaml
  2. create deployment https://github.com/ceph/ceph-csi/blob/devel/examples/cephfs/deployment.yaml
  3. restart all cephfs nodeplugin pods
  4. enter demo pod and ls mountPath (ls /var/lib/www/html)

Actual results

Transport endpoint is not connected

ps -ef | grep ceph-fuse found ceph-fuse process not exists

Expected behavior

when cephfs-nodeplugin container or pod restart , my demo pod access normal mounPath.

Logs

If the issue is in PVC creation, deletion, cloning please attach complete logs of below containers.

If the issue is in PVC resize please attach complete logs of below containers.

If the issue is in snapshot creation and deletion please attach complete logs of below containers.

If the issue is in PVC mounting please attach complete logs of below containers.

Note:- If its a rbd issue please provide only rbd related logs, if its a cephFS issue please provide cephFS logs.

Additional context

ps -ef | grep ceph-fuse ceph-fuse process not exists rbd-nbd when nodeplugin restart , it will heal rbd-nbd process. cephfs also need to heal when restart or upgrade

For example:

Any existing bug report which describe about the similar issue/behavior

Rakshith-R commented 9 months ago

@VictoryAnn This is a known issue, please refer to https://github.com/ceph/ceph-csi/blob/devel/docs/ceph-mount-corruption.md. It requires manual steps, it cannot be handled within Cephcsi.