sergelogvinov / proxmox-csi-plugin

Proxmox CSI Plugin
Apache License 2.0
271 stars 25 forks source link

Getting error in name resolution - but don't know from where #108

Closed markussiebert closed 10 months ago

markussiebert commented 10 months ago

Bug Report

I see a lot of the following error:

 GRPC error: rpc error: code = Internal desc = 500 hostname lookup 'default' failed - failed to get address info for: default: Temporary failure in name resolution                                                                                                          │
│ csi-provisioner E1102 13:16:21.644338       1 capacity.go:551] CSI GetCapacity for {segment:0xc000712d80 storageClassName:proxmox-data-xfs}: rpc error: code = Internal desc = 500 hostname lookup 'default' failed - failed to get address info for: default: Temporary failure in name resolution                                          │
│ csi-provisioner W1102 13:16:21.644356       1 capacity.go:552] Retrying capacity.workItem{segment:(*topology.Segment)(0xc000712d80), storageClassName:"proxmox-data-xfs"} after 1 failures                                                                                                                                                   │
│ csi-provisioner I1102 13:16:21.644398       1 capacity.go:574] Capacity Controller: refreshing {segment:0xc000712d80 storageClassName:proxmox-data-xfs}                                                                                                            

my config looks like:

    config:
      clusters:
        - url: https://pve.redacted.de:8006/api2/json
          insecure: true
          token_id: "kubernetes-csi@pve!csi"
          token_secret: redacted
          region: default
    storageClass:
      - name: proxmox-data-xfs
        storage: vm-storage
        ssd: true
        cache: none
        reclaimPolicy: Delete
        fstype: xfs

Description

I have no idea why 'default' will be looked up

Logs

see apove

Environment

erver Version: v1.28.1 (Talos)

sergelogvinov commented 10 months ago

Hello,

config:
      clusters:
        - url: https://pve.redacted.de:8006/api2/json
          insecure: true
          token_id: "kubernetes-csi@pve!csi"
          token_secret: redacted
          region: default

region is your proxmox node name. You can get node list by command pvecm nodes

sergelogvinov commented 10 months ago

Oh, sorry,

region is your proxmox cluster name... It can be every thing...

sergelogvinov commented 10 months ago

So, what's the pod has such message - GRPC error: rpc error: code = Internal

markussiebert commented 10 months ago

trying your latest suggestions ...

markussiebert commented 10 months ago

Maybe I should describe my setup ... this is proxmox: image This is my k8s node:

 Name:               talos                                                                                                                                                                                                                                                                                                                     │
│ Roles:              control-plane                                                                                                                                                                                                                                                                                                             │
│ Labels:             beta.kubernetes.io/arch=amd64                                                                                                                                                                                                                                                                                             │
│                     beta.kubernetes.io/os=linux                                                                                                                                                                                                                                                                                               │
│                     kubernetes.io/arch=amd64                                                                                                                                                                                                                                                                                                  │
│                     kubernetes.io/hostname=talos                                                                                                                                                                                                                                                                                              │
│                     kubernetes.io/os=linux                                                                                                                                                                                                                                                                                                    │
│                     node-role.kubernetes.io/control-plane=                                                                                                                                                                                                                                                                                    │
│                     topology.kubernetes.io/region=pve-cluster                                                                                                                                                                                                                                                                                 │
│                     topology.kubernetes.io/zone=pve                                                                                                                                                                                                                                                                                           │
│ Annotations:        csi.volume.kubernetes.io/nodeid: {"csi.proxmox.sinextra.dev":"talos"}                                                                                                                                                                                                                                                     │
│                     node.alpha.kubernetes.io/ttl: 0                                                                                                                                                                                                                                                                                           │
│                     talos.dev/owned-labels: ["node-role.kubernetes.io/control-plane","topology.kubernetes.io/region","topology.kubernetes.io/zone"]   

And this my config at the moment:

    config:
      clusters:
        - url: https://pve.redacted.de:8006/api2/json
          insecure: true
          token_id: "kubernetes-csi@pve!csi"
          region: pve-cluster
    storageClass:
      - name: proxmox-data-xfs
        storage: vm-storage
        ssd: true
        cache: none
        reclaimPolicy: Delete
        fstype: xfs

The following error:

csi-provisioner I1102 13:59:04.740477       1 connection.go:194] GRPC request: {"accessible_topology":{"segments":{"topology.kubernetes.io/region":"pve-cluster","topology.kubernetes.io/zone":"pve"}},"parameters":{"cache":"none","csi.storage.k8s.io/fstype":"xfs","ssd":"true","storage":"vm-storage"},"volume_capabilities":[{"AccessTyp │
│ csi-provisioner I1102 13:59:04.741716       1 connection.go:200] GRPC response: {}                                                                                                                                                                                                                                                            │
│ csi-provisioner I1102 13:59:04.741747       1 connection.go:201] GRPC error: rpc error: code = Internal desc = proxmox cluster pve-cluster not found                                                                                                                                                                                          │
│ csi-provisioner E1102 13:59:04.741794       1 capacity.go:551] CSI GetCapacity for {segment:0xc000285860 storageClassName:proxmox-data-xfs}: rpc error: code = Internal desc = proxmox cluster pve-cluster not found                                                                                                                          │
│ csi-proxmox-proxmox-csi-plugin-controller I1102 13:59:04.741345       1 controller.go:454] GetCapacity: region=pve-cluster, zone=pve, storageName=vm-storage                                                                                                                                                                                  │
│ csi-proxmox-proxmox-csi-plugin-controller E1102 13:59:04.741366       1 controller.go:458] failed to get proxmox cluster: proxmox cluster pve-cluster not found  

so the container complaining are csi-provisioner and plugin-controller

The pvc itself has the following error message: failed to provision volume with StorageClass "proxmox-data-xfs": rpc error: code = Internal desc = proxmox cluster pve-cluster not found

And btw. thank you a lot for your help...

wait a moment ... might be a failure in my dns setup

markussiebert commented 10 months ago

Thank you - after getting region and zone right (cluster / node) it works!