kubernetes-sigs / alibaba-cloud-csi-driver

CSI Plugin for Kubernetes, Support Alibaba Cloud EBS/NAS/OSS/CPFS
Apache License 2.0
539 stars 241 forks source link

Still connecting to unix:///var/lib/kubelet/csi-plugins/*.csi.alibabacloud.com/csi.sock #1127

Open lliiang opened 3 months ago

lliiang commented 3 months ago

What happened:

集群上其中两个节点一直csi-plugin-h4qhz 报错重启,以下是日志截图

图片 图片 图片 图片

以下是container日志 csi-plugin-h4qhz-nas-driver-registrar.log csi-plugin-h4qhz-disk-driver-registrar.log

csi-plugin-h4qhz-csi-plugin.log csi-plugin-h4qhz-oss-driver-registrar.log

What you expected to happen:

集群有十几个节点,就其中两个节点报错,下面是DaemonSet的yaml `kind: DaemonSet apiVersion: apps/v1 metadata: name: csi-plugin namespace: kube-system uid: 509d3cfc-0dbe-4ebd-8d79-3b8c52774d17 resourceVersion: '601102482' generation: 5 creationTimestamp: '2023-03-21T14:45:10Z' annotations: deprecated.daemonset.template.generation: '5' spec: selector: matchLabels: app: csi-plugin template: metadata: creationTimestamp: null labels: app: csi-plugin annotations: kubectl.kubernetes.io/restartedAt: '2024-06-19T22:22:37+08:00' spec: nodeSelector: kubernetes.io/os: linux restartPolicy: Always serviceAccountName: csi-admin hostPID: true schedulerName: default-scheduler hostNetwork: true affinity: nodeAffinity: requiredDuringSchedulingIgnoredDuringExecution: nodeSelectorTerms:

How to reproduce it (as minimally and precisely as possible):

Anything else we need to know?:

Environment:

huww98 commented 3 months ago

Why is your filesystem read-only? Is it intentional? What OS are you using?

lliiang commented 3 months ago

Why is your filesystem read-only? Is it intentional? What OS are you using?

my cluster is openshift 4.13

the node os is coreos

Comparing logs between normal pods and abnormal pods. 图片

huww98 commented 3 months ago

OK, maybe we should never write file into /usr, which is expected to be managed by OS package manager.

You can try set env DISABLE_CSIPLUGIN_CONNECTOR=true. Or upgrade CSI, we have limited the number of retries to 5.

Comparing logs between normal pods and abnormal pods.

I think these logs come from different CSI version.

lliiang commented 3 months ago

hello, does csi-plugin has debug log config? how to open debug log,i want to collect debug log to platform

huww98 commented 3 months ago

No. The default log level already outputs almost all the logs.

huww98 commented 3 months ago

OK, maybe we should never write file into /usr, which is expected to be managed by OS package manager.

We decided not to fix this one. Because we have planned to remove the connector all together in the future.

k8s-triage-robot commented 2 weeks ago

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

You can:

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale