Open uzhinskiy opened 2 years ago
Hi, sorry for the long wait. I'm not sure what is happening in your case, but geesefs itself definitely handles network failures well. CSI unmounting the volume after a period of inactivity also looks rather strange. It should at least show up in the pod log. Maybe the pod was stopped? You can check with kubectl describe pod ... Or maybe you have already found the answer yourself? =)
Hello. We are trying to use CSI-S3 with geesefs as the storage backend for Elasticsearch. We use this Elasticsearch instance as a snapshot checker, so most of the time it is idle and not processing any data. We noticed that after a few hours of inactivity, all IO operations in the Elasticsearch pod failed, with the following log lines in kube-system/csi-s3-XXX:
After we manually restarted this pod, everything was fine again. We suspect the problem is caused by a network disruption that terminates the TCP connection, and the connection is not reestablished once the network problem is gone.
How do we prevent this behavior of CSI-S3?
Thank you.
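
For anyone debugging a similar symptom, here is a minimal sketch of a check that reports whether a mounted path still answers basic IO, so a broken mount is noticed without waiting for the application to fail. It is only an illustration: the function name `mount_is_healthy` and the path in the usage note are hypothetical, nothing here is part of CSI-S3 or geesefs, and it does not address the root cause.

```python
import errno
import os


def mount_is_healthy(path):
    """Return True if `path` can be stat'ed and listed, False on any OSError.

    Hypothetical helper for illustration; not part of CSI-S3 or geesefs.
    """
    try:
        os.stat(path)      # fails if the mount point is missing or broken
        os.listdir(path)   # forces an actual directory read through the mount
    except OSError as exc:
        # Translate the numeric errno into its symbolic name when known.
        name = errno.errorcode.get(exc.errno, str(exc.errno))
        print(f"mount check failed for {path}: {name}")
        return False
    return True
```

Calling `mount_is_healthy("/path/to/mounted/volume")` from a periodic job or a probe script would return False once IO on the volume starts failing. Note that a hung FUSE mount can block these calls instead of returning an error, so a real probe would likely also need a timeout around the check.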