k8ssandra / k8ssandra-operator

The Kubernetes operator for K8ssandra
https://k8ssandra.io/
Apache License 2.0
157 stars 73 forks source link

Pods stuck in CrashLoopBackOff after disk migration to other VMs #1208

Open networkingana opened 6 months ago

networkingana commented 6 months ago

What did you do? I have deploymnet of k8ssandra over 5 kubernetes nodes, 3 of them were on CentOS 7 and 2 of them were on Ubuntu 20.04. The plan was to migrate all of the k8ssandra nodes from the CentOS 7 to new Kubernetes nodes on Ubuntu 20.04. The persistent volume of cassandra is created using the local-path-provisioner from rancher. Because my Kubernetes nodes are virtual machine, I moved the hdd device from one VM to another on the virtualization infrastructure, then rescheduled the pod of K8ssandra on that node, the same steps for all of the nodes.

I changed the node affinity in the PV to match the new node, and changed the PVC label to match the new node the same. The PVC were sucessfully bound but the Pods are in the CrashLoopBackOff state. I can see that URGENT MESSAGES on port 7000 are send on wrong IP address.

I tried restarting the k8ssandra cluster using the stopped: true and than stopped: false parameter in the k8ssandraclusters.k8ssandra.io CRD, but the issue is the same.

kubectl get events -n k8ssandra-operator

47m         Normal    Killing                  pod/k8ssandra-dc2-reaper-5cd7db5bfd-pvms9                       Stopping container reaper
46m         Warning   FailedToUpdateEndpoint   endpoints/k8ssandra-dc2-service                                 Failed to update endpoint k8ssandra-operator/k8ssandra-dc2-service: Operation cannot be fulfilled on endpoints "k8ssandra-dc2-service": the object has been modified; please apply your changes to the latest version and try again
46m         Warning   FailedToUpdateEndpoint   endpoints/k8ssandra-seed-service                                Failed to update endpoint k8ssandra-operator/k8ssandra-seed-service: Operation cannot be fulfilled on endpoints "k8ssandra-seed-service": the object has been modified; please apply your changes to the latest version and try again

Here is the repeating log

INFO  [GossipStage:1] 2024-02-14 07:26:58,103 Gossiper.java:1930 - Adding /192.168.135.10:7000 as there was no previous epState; new state is EndpointState: HeartBeatState = HeartBeat: generation = 0, version = -1, AppStateMap = {}
INFO  [GossipStage:1] 2024-02-14 07:26:58,105 Gossiper.java:1930 - Adding /192.168.217.96:7000 as there was no previous epState; new state is EndpointState: HeartBeatState = HeartBeat: generation = 0, version = -1, AppStateMap = {}
INFO  [GossipStage:1] 2024-02-14 07:26:58,106 Gossiper.java:1930 - Adding /192.168.104.14:7000 as there was no previous epState; new state is EndpointState: HeartBeatState = HeartBeat: generation = 0, version = -1, AppStateMap = {}
INFO  [GossipStage:1] 2024-02-14 07:26:58,106 Gossiper.java:1930 - Adding /192.168.150.122:7000 as there was no previous epState; new state is EndpointState: HeartBeatState = HeartBeat: generation = 0, version = -1, AppStateMap = {}
INFO  [main] 2024-02-14 07:26:58,378 InboundConnectionInitiator.java:127 - Listening on address: (/192.168.166.162:7000), nic: eth0, encryption: unencrypted
WARN  [main] 2024-02-14 07:26:58,603 K8SeedProvider4x.java:47 - Seed provider couldn't lookup host k8ssandra-dc2-additional-seed-service
WARN  [main] 2024-02-14 07:26:58,615 K8SeedProvider4x.java:47 - Seed provider couldn't lookup host k8ssandra-dc2-additional-seed-service
INFO  [Messaging-EventLoop-3-1] 2024-02-14 07:27:05,803 NoSpamLogger.java:92 - /192.168.166.162:7000->/192.168.150.122:7000-URGENT_MESSAGES-[no-channel] failed to connect
io.netty.channel.ConnectTimeoutException: connection timed out: /192.168.150.122:7000
        at io.netty.channel.epoll.AbstractEpollChannel$AbstractEpollUnsafe$2.run(AbstractEpollChannel.java:576)
        at io.netty.util.concurrent.PromiseTask.runTask(PromiseTask.java:98)
        at io.netty.util.concurrent.ScheduledFutureTask.run(ScheduledFutureTask.java:170)
        at io.netty.util.concurrent.AbstractEventExecutor.safeExecute(AbstractEventExecutor.java:164)
        at io.netty.util.concurrent.SingleThreadEventExecutor.runAllTasks(SingleThreadEventExecutor.java:472)
        at io.netty.channel.epoll.EpollEventLoop.run(EpollEventLoop.java:384)
        at io.netty.util.concurrent.SingleThreadEventExecutor$4.run(SingleThreadEventExecutor.java:989)
        at io.netty.util.internal.ThreadExecutorMap$2.run(ThreadExecutorMap.java:74)
        at io.netty.util.concurrent.FastThreadLocalRunnable.run(FastThreadLocalRunnable.java:30)
        at java.base/java.lang.Thread.run(Unknown Source)
WARN  [main] 2024-02-14 07:27:29,770 K8SeedProvider4x.java:47 - Seed provider couldn't lookup host k8ssandra-dc2-additional-seed-service
INFO  [main] 2024-02-14 07:27:29,770 StorageService.java:651 - Unable to gossip with any peers but continuing anyway since node is in its own seed list
INFO  [main] 2024-02-14 07:27:29,782 StorageService.java:958 - Starting up server gossip
INFO  [main] 2024-02-14 07:27:29,786 ColumnFamilyStore.java:878 - Enqueuing flush of local: 0.459KiB (0%) on-heap, 0.000KiB (0%) off-heap
INFO  [PerDiskMemtableFlushWriter_0:1] 2024-02-14 07:27:29,792 Memtable.java:469 - Writing Memtable-local@1147930153(0.048KiB serialized bytes, 1 ops, 0%/0% of on/off-heap limit), flushed range = (min(-9223372036854775808), max(9223372036854775807)]
INFO  [PerDiskMemtableFlushWriter_0:1] 2024-02-14 07:27:29,793 Memtable.java:498 - Completed flushing /var/lib/cassandra/data/system/local-7ad54392bcdd35a684174e047860b377/nb-126-big-Data.db (0.028KiB) for commitlog position CommitLogPosition(segmentId=1707895595884, position=1351370)
INFO  [MemtableFlushWriter:1] 2024-02-14 07:27:29,806 LogTransaction.java:240 - Unfinished transaction log, deleting /var/lib/cassandra/data/system/local-7ad54392bcdd35a684174e047860b377/nb_txn_flush_82b203c0-cb0a-11ee-aec4-d9c36dc552ae.log
WARN  [main] 2024-02-14 07:27:29,817 K8SeedProvider4x.java:47 - Seed provider couldn't lookup host k8ssandra-dc2-additional-seed-service
INFO  [main] 2024-02-14 07:27:29,821 TokenMetadata.java:505 - Updating topology for /192.168.166.162:7000
INFO  [main] 2024-02-14 07:27:29,821 TokenMetadata.java:505 - Updating topology for /192.168.166.162:7000
WARN  [main] 2024-02-14 07:27:29,864 K8SeedProvider4x.java:47 - Seed provider couldn't lookup host k8ssandra-dc2-additional-seed-service

Here are the pods with wide output, here we can see that the IP address from the above log is not found on the actual pods

[root@master-node ~]# kubectl get pod -o wide -n k8ssandra-operator
NAME                                                READY   STATUS    RESTARTS        AGE     IP                NODE     NOMINATED NODE   READINESS GATES
k8ssandra-dc2-rack-1-sts-0                          3/3     Running   1 (39s ago)     3m3s    192.168.166.162   node1    <none>           <none>
k8ssandra-dc2-rack-2-sts-0                          3/3     Running   1 (39s ago)     3m3s    192.168.104.4     node2    <none>           <none>
k8ssandra-dc2-rack-3-sts-0                          2/3     Running   0               3m3s    192.168.217.113   node-4   <none>           <none>
k8ssandra-dc2-rack-4-sts-0                          2/3     Running   1 (39s ago)     3m3s    192.168.135.44    node3    <none>           <none>
k8ssandra-dc2-rack-5-sts-0                          2/3     Running   0               3m3s    192.168.150.115   node-5   <none>           <none>
k8ssandra-operator-5f7b4dfd94-f4n5h                 1/1     Running   0               6d17h   192.168.104.59    node2    <none>           <none>
k8ssandra-operator-cass-operator-6f7cb8ff67-b5gt5   1/1     Running   2 (6d12h ago)   6d17h   192.168.104.58    node2    <none>           <none>

Environment Kubernetes version: v1.22.3 OS: Ubuntu 20.04.2 LTS Docker version: docker://20.10.24

# Please edit the object below. Lines beginning with a '#' will be ignored,
# and an empty file will abort the edit. If an error occurs while saving this file will be
# reopened with the relevant failures.
#
apiVersion: k8ssandra.io/v1alpha1
kind: K8ssandraCluster
metadata:
  annotations:
    k8ssandra.io/initial-system-replication: '{"dc1":3,"dc2":3}'
    kubectl.kubernetes.io/last-applied-configuration: |
      {"apiVersion":"k8ssandra.io/v1alpha1","kind":"K8ssandraCluster","metadata":{"annotations":{},"name":"k8ssandra","namespace":"k8ssandra-operator"},"spec":{"cassandra":{"additionalSeeds":["192.168.139.81","192.168.150.87"],"datacenters":[{"config":{"jvmOptions":{"heapNewGenSize":"256M","heapSize":"512M"}},"metadata":{"name":"dc2"},"racks":[{"name":"rack-1"},{"name":"rack-2"},{"name":"rack-3"},{"name":"rack-4"},{"name":"rack-5"}],"size":5,"storageConfig":{"cassandraDataVolumeClaimSpec":{"accessModes":["ReadWriteOnce"],"resources":{"requests":{"storage":"98Gi"}},"storageClassName":"local-path-cassandra"}}}],"mgmtAPIHeap":"64Mi","serverVersion":"4.0.1","superuserSecretRef":{"name":"superuser-secret"}},"externalDatacenters":["dc1"],"medusa":{"cassandraUserSecretRef":{"name":"medusa-secret"},"storageProperties":{"bucketName":"k8ssandra-medusa","host":"minio.minio.svc.cluster.local","maxBackupCount":10,"port":9000,"prefix":" ","secure":false,"storageProvider":"s3_compatible","storageSecretRef":{"name":"medusa-bucket-key"}}},"reaper":{"autoScheduling":{"enabled":true},"cassandraUserSecretRef":{"name":"reaper-secret"},"jmxUserSecretRef":{"name":"reaper-jmx-secret"},"uiUserSecretRef":{"name":"reaper-ui-secret"}},"stargate":{"size":1}}}
  creationTimestamp: "2023-08-17T22:08:18Z"
  finalizers:
  - k8ssandracluster.k8ssandra.io/finalizer
  generation: 13
  name: k8ssandra
  namespace: k8ssandra-operator
  resourceVersion: "791254837"
  uid: f020eb0e-2ee1-4321-9b16-291800a1ecb2
spec:
  auth: true
  cassandra:
    datacenters:
    - config:
        jvmOptions:
          heapNewGenSize: 256M
          heapSize: 512M
      jmxInitContainerImage:
        name: busybox
        registry: docker.io
        tag: 1.34.1
      metadata:
        name: dc2
      racks:
      - name: rack-1
        nodeAffinityLabels:
          cassandra-node: "yes"
      - name: rack-2
        nodeAffinityLabels:
          cassandra-node: "yes"
      - name: rack-3
        nodeAffinityLabels:
          cassandra-node: "yes"
      - name: rack-4
        nodeAffinityLabels:
          cassandra-node: "yes"
      - name: rack-5
        nodeAffinityLabels:
          cassandra-node: "yes"
      size: 5
      stopped: false
      storageConfig:
        cassandraDataVolumeClaimSpec:
          accessModes:
          - ReadWriteOnce
          resources:
            requests:
              storage: 98Gi
          storageClassName: local-path-cassandra
    jmxInitContainerImage:
      name: busybox
      registry: docker.io
      tag: 1.34.1
    mgmtAPIHeap: 64Mi
    serverVersion: 4.0.1
    superuserSecretRef:
      name: k8ssandra-superuser
  medusa:
    cassandraUserSecretRef:
      name: k8ssandra-medusa
    storageProperties:
      bucketName: k8ssandra-medusa
      concurrentTransfers: 1
      host: 172.17.5.210
      maxBackupAge: 0
      maxBackupCount: 10
      multiPartUploadThreshold: 104857600
      port: 32496
      prefix: prod
      secure: false
      storageProvider: s3_compatible
      storageSecretRef:
        name: medusa-bucket-key
      transferMaxBandwidth: 50MB/s
  reaper:
    ServiceAccountName: default
    autoScheduling:
      enabled: true
      initialDelayPeriod: PT15S
      percentUnrepairedThreshold: 10
      periodBetweenPolls: PT10M
      repairType: AUTO
      scheduleSpreadPeriod: PT6H
      timeBeforeFirstSchedule: PT5M
    cassandraUserSecretRef:
      name: k8ssandra-reaper
    containerImage:
      name: cassandra-reaper
      registry: docker.io
      repository: thelastpickle
      tag: 3.2.0
    deploymentMode: PER_DC
    heapSize: 2Gi
    initContainerImage:
      name: cassandra-reaper
      registry: docker.io
      repository: thelastpickle
      tag: 3.2.0
    jmxUserSecretRef:
      name: k8ssandra-reaper-jmx
    keyspace: reaper_db
    uiUserSecretRef:
      name: reaper-ui-secret
  stargate:
    allowStargateOnDataNodes: false
    containerImage:
      registry: docker.io
      repository: stargateio
      tag: v1.0.45
    heapSize: 256Mi
    serviceAccount: default
    size: 1
status:
  conditions:
  - lastTransitionTime: "2023-08-18T07:06:30Z"
    status: "True"
    type: CassandraInitialized
  datacenters:
    dc2:
      cassandra:
        cassandraOperatorProgress: Ready
        conditions:
        - lastTransitionTime: "2024-02-05T12:42:19Z"
          message: ""
          reason: ""
          status: "True"
          type: Healthy
        - lastTransitionTime: "2024-02-14T07:26:01Z"
          message: ""
          reason: ""
          status: "False"
          type: Stopped
        - lastTransitionTime: "2023-08-18T07:06:28Z"
          message: ""
          reason: ""
          status: "False"
          type: ReplacingNodes
        - lastTransitionTime: "2024-02-14T07:31:56Z"
          message: ""
          reason: ""
          status: "False"
          type: Updating
        - lastTransitionTime: "2023-08-18T07:06:28Z"
          message: ""
          reason: ""
          status: "False"
          type: RollingRestart
        - lastTransitionTime: "2024-02-14T07:31:56Z"
          message: ""
          reason: ""
          status: "False"
          type: Resuming
        - lastTransitionTime: "2023-08-18T07:06:28Z"
          message: ""
          reason: ""
          status: "False"
          type: ScalingDown
        - lastTransitionTime: "2023-08-18T07:06:28Z"
          message: ""
          reason: ""
          status: "True"
          type: Valid
        - lastTransitionTime: "2023-08-18T07:06:28Z"
          message: ""
          reason: ""
          status: "True"
          type: Initialized
        - lastTransitionTime: "2024-02-14T07:32:16Z"
          message: ""
          reason: ""
          status: "True"
          type: Ready
        - lastTransitionTime: "2024-02-05T14:01:09Z"
          message: ""
          reason: ""
          status: "False"
          type: ScalingUp
        lastServerNodeStarted: "2024-02-14T07:30:02Z"
        nodeStatuses:
          k8ssandra-dc2-rack-1-sts-0:
            hostID: 42ce04c8-116e-44be-8d65-f6698b388d42
          k8ssandra-dc2-rack-2-sts-0:
            hostID: 6cd35862-def7-4f14-89a4-1b58b172f7eb
          k8ssandra-dc2-rack-3-sts-0:
            hostID: 21a58953-58c8-4a52-b2db-bb1eacf306f1
          k8ssandra-dc2-rack-4-sts-0:
            hostID: 5b4fac26-a4c9-4cd6-9c94-165046b70414
          k8ssandra-dc2-rack-5-sts-0:
            hostID: f9864e62-a094-4386-983f-6f3a47e8c48a
        observedGeneration: 17
        quietPeriod: "2024-02-14T07:32:23Z"
        superUserUpserted: "2024-02-14T07:32:17Z"
        usersUpserted: "2024-02-14T07:32:17Z"
      stargate:
        availableReplicas: 0
        conditions:
        - lastTransitionTime: "2024-02-14T07:32:20Z"
          status: "False"
          type: Ready
        deploymentRefs:
        - k8ssandra-dc2-rack-1-stargate-deployment
        progress: Deploying
        readyReplicas: 0
        readyReplicasRatio: 0/1
        replicas: 1
        updatedReplicas: 1

Please let me know if I can provide additional information. Thank you

┆Issue is synchronized with this Jira Story by Unito

adejanovski commented 6 months ago

It looks like the seed service isn't pointing to any of the pre-existing nodes in the cluster, creating some sort of split brain situation. As the nodes are unable to connect with each other, they're stuck trying to communicate with the old IPs that are stores in the system tables.

These events are the problematic ones:

46m         Warning   FailedToUpdateEndpoint   endpoints/k8ssandra-dc2-service                                 Failed to update endpoint k8ssandra-operator/k8ssandra-dc2-service: Operation cannot be fulfilled on endpoints "k8ssandra-dc2-service": the object has been modified; please apply your changes to the latest version and try again
46m         Warning   FailedToUpdateEndpoint   endpoints/k8ssandra-seed-service

I'm not sure what generated that situation. cass-operator is responsible for placing the seed-node labels that the seed service uses to build the seed list. Did you try to move all 3 nodes at the same time, or one by one? Are there errors in the cass-operator container logs that could help us understand what's happening here? Could you also list the pods with their labels?

networkingana commented 6 months ago

I was moving the nodes one by one, that is what is strange to me as well.

1.7079020979412708e+09  INFO    reconcile_racks::startOneNodePerRack    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "8b9a9a45-aabb-436e-bc4a-713899ff8cb6", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079020979412932e+09  INFO    calling Management API cluster health - GET /api/v0/probes/cluster      {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "8b9a9a45-aabb-436e-bc4a-713899ff8cb6", "pod": "k8ssandra-dc2-rack-1-sts-0"}
1.7079020979413517e+09  INFO    client::callNodeMgmtEndpoint    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "8b9a9a45-aabb-436e-bc4a-713899ff8cb6"}
1.7079020979510846e+09  INFO    calling Management API cluster health - GET /api/v0/probes/cluster      {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "8b9a9a45-aabb-436e-bc4a-713899ff8cb6", "pod": "k8ssandra-dc2-rack-3-sts-0"}
1.7079020979512746e+09  INFO    client::callNodeMgmtEndpoint    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "8b9a9a45-aabb-436e-bc4a-713899ff8cb6"}
1.7079020979582424e+09  INFO    calling Management API cluster health - GET /api/v0/probes/cluster      {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "8b9a9a45-aabb-436e-bc4a-713899ff8cb6", "pod": "k8ssandra-dc2-rack-5-sts-0"}
1.7079020979583006e+09  INFO    client::callNodeMgmtEndpoint    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "8b9a9a45-aabb-436e-bc4a-713899ff8cb6"}
1.7079020979648848e+09  INFO    calling Management API cluster health - GET /api/v0/probes/cluster      {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "8b9a9a45-aabb-436e-bc4a-713899ff8cb6", "pod": "k8ssandra-dc2-rack-2-sts-0"}
1.70790209796494e+09    INFO    client::callNodeMgmtEndpoint    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "8b9a9a45-aabb-436e-bc4a-713899ff8cb6"}
1.7079020979712594e+09  INFO    calling Management API cluster health - GET /api/v0/probes/cluster      {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "8b9a9a45-aabb-436e-bc4a-713899ff8cb6", "pod": "k8ssandra-dc2-rack-4-sts-0"}
1.7079020979714522e+09  INFO    client::callNodeMgmtEndpoint    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "8b9a9a45-aabb-436e-bc4a-713899ff8cb6"}
1.707902097979851e+09   INFO    reconcile_racks::startAllNodes  {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "8b9a9a45-aabb-436e-bc4a-713899ff8cb6", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079020979800844e+09  INFO    reconcile_racks::DecommissionNodes      {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "8b9a9a45-aabb-436e-bc4a-713899ff8cb6", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079020979804478e+09  INFO    starting CheckRackPodTemplate() {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "8b9a9a45-aabb-436e-bc4a-713899ff8cb6", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079020979915037e+09  INFO    waiting for upgrade to finish on statefulset    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "8b9a9a45-aabb-436e-bc4a-713899ff8cb6", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra", "statefulset": "k8ssandra-dc2-rack-1-sts", "replicas": 1, "readyReplicas": 0, "currentReplicas": 1, "updatedReplicas": 1}
1.7079020979917388e+09  INFO    controllers.CassandraDatacenter Reconcile loop completed        {"cassandradatacenter": "k8ssandra-operator/dc2", "requestNamespace": "k8ssandra-operator", "requestName": "dc2", "loopID": "d06b9d75-fa5c-4762-a2ab-45937c69142a", "duration": 0.129688863}
1.7079021079927156e+09  INFO    controllers.CassandraDatacenter ======== handler::Reconcile has been called     {"cassandradatacenter": "k8ssandra-operator/dc2", "requestNamespace": "k8ssandra-operator", "requestName": "dc2", "loopID": "7bb37329-541c-4bb5-a71c-aa43e8452723"}
1.7079021079927874e+09  INFO    handler::CreateReconciliationContext    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator"}
1.7079021079930046e+09  INFO    handler::calculateReconciliationActions {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021079930294e+09  INFO    reconcile_services::ReconcileHeadlessServices   {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021079935105e+09  INFO    reconcile_endpoints::CheckAdditionalSeedEndpoints       {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021079935327e+09  INFO    reconcile_racks::calculateRackInformation       {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.707902107993544e+09   INFO    reconciliationContext::reconcileAllRacks        {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021079935536e+09  INFO    reconcile_racks::listPods       {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021079939945e+09  INFO    requesting Cassandra metadata endpoints from Node Management API        {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "pod": "k8ssandra-dc2-rack-2-sts-0"}
1.7079021079940143e+09  INFO    client::callNodeMgmtEndpoint    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451"}
1.7079021079996274e+09  INFO    reconcile_racks::CheckConfigSecret      {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.707902107999678e+09   INFO    reconcile_racks::CheckRackCreation      {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.707902107999685e+09   INFO    reconcile_racks::getStatefulSetForRack  {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021079998097e+09  INFO    reconcile_racks::getStatefulSetForRack  {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021079999137e+09  INFO    reconcile_racks::getStatefulSetForRack  {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021080000153e+09  INFO    reconcile_racks::getStatefulSetForRack  {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.707902108000122e+09   INFO    reconcile_racks::getStatefulSetForRack  {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021080002506e+09  INFO    reconcile_racks::CheckRackLabels        {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021080005913e+09  INFO    reconcile_racks::CheckDecommissioningNodes      {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021080006142e+09  INFO    reconcile_racks::CheckSuperuserSecretCreation   {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021080006468e+09  INFO    reconcile_racks::CheckInternodeCredentialCreation       {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021080007215e+09  INFO    starting CheckRackForceUpgrade()        {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021080007484e+09  INFO    reconcile_racks::CheckRackScale {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021080007598e+09  INFO    reconcile_racks::CheckPodsReady {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021080007653e+09  INFO    reconcile_racks::findStartedNotReadyNodes       {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021080007854e+09  INFO    reconcile_racks::deleteStuckNodes       {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.707902108000797e+09   INFO    reconcile_racks::CheckSeedLabels        {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.707902108001134e+09   INFO    reconcile_racks::refreshSeeds   {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021080011535e+09  INFO    calling Management API reload seeds - POST /api/v0/ops/seeds/reload     {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "pod": "k8ssandra-dc2-rack-2-sts-0"}
1.7079021080011656e+09  INFO    client::callNodeMgmtEndpoint    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451"}
1.7079021080132968e+09  INFO    calling Management API reload seeds - POST /api/v0/ops/seeds/reload     {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "pod": "k8ssandra-dc2-rack-4-sts-0"}
1.7079021080133405e+09  INFO    client::callNodeMgmtEndpoint    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451"}
1.7079021080249355e+09  INFO    calling Management API reload seeds - POST /api/v0/ops/seeds/reload     {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "pod": "k8ssandra-dc2-rack-5-sts-0"}
1.7079021080249877e+09  INFO    client::callNodeMgmtEndpoint    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451"}
1.7079021080340521e+09  INFO    calling Management API reload seeds - POST /api/v0/ops/seeds/reload     {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "pod": "k8ssandra-dc2-rack-1-sts-0"}
1.7079021080340915e+09  INFO    client::callNodeMgmtEndpoint    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451"}
1.707902108049493e+09   INFO    calling Management API reload seeds - POST /api/v0/ops/seeds/reload     {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "pod": "k8ssandra-dc2-rack-3-sts-0"}
1.7079021080495672e+09  INFO    client::callNodeMgmtEndpoint    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451"}
1.7079021080617049e+09  INFO    reconcile_racks::findStartingNodes      {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021080617561e+09  INFO    reconcile_racks::startOneNodePerRack    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021080617948e+09  INFO    calling Management API cluster health - GET /api/v0/probes/cluster      {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "pod": "k8ssandra-dc2-rack-2-sts-0"}
1.7079021080618205e+09  INFO    client::callNodeMgmtEndpoint    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451"}
1.7079021080693684e+09  INFO    calling Management API cluster health - GET /api/v0/probes/cluster      {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "pod": "k8ssandra-dc2-rack-4-sts-0"}
1.7079021080694141e+09  INFO    client::callNodeMgmtEndpoint    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451"}
1.7079021080775194e+09  INFO    calling Management API cluster health - GET /api/v0/probes/cluster      {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "pod": "k8ssandra-dc2-rack-5-sts-0"}
1.7079021080776033e+09  INFO    client::callNodeMgmtEndpoint    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451"}
1.707902108083933e+09   INFO    calling Management API cluster health - GET /api/v0/probes/cluster      {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "pod": "k8ssandra-dc2-rack-1-sts-0"}
1.7079021080840058e+09  INFO    client::callNodeMgmtEndpoint    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451"}
1.707902108093495e+09   INFO    calling Management API cluster health - GET /api/v0/probes/cluster      {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "pod": "k8ssandra-dc2-rack-3-sts-0"}
1.7079021080935755e+09  INFO    client::callNodeMgmtEndpoint    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451"}
1.7079021081019838e+09  INFO    reconcile_racks::startAllNodes  {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021081020377e+09  INFO    reconcile_racks::DecommissionNodes      {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021081022608e+09  INFO    starting CheckRackPodTemplate() {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra"}
1.7079021081035924e+09  INFO    waiting for upgrade to finish on statefulset    {"controller": "cassandradatacenter_controller", "controllerGroup": "cassandra.datastax.com", "controllerKind": "CassandraDatacenter", "cassandraDatacenter": {"name":"dc2","namespace":"k8ssandra-operator"}, "namespace": "k8ssandra-operator", "name": "dc2", "reconcileID": "865777b7-a29d-4c22-9a5c-7931fd27c451", "namespace": "k8ssandra-operator", "datacenterName": "dc2", "clusterName": "k8ssandra", "statefulset": "k8ssandra-dc2-rack-1-sts", "replicas": 1, "readyReplicas": 0, "currentReplicas": 1, "updatedReplicas": 1}
1.7079021081036375e+09  INFO    controllers.CassandraDatacenter Reconcile loop completed        {"cassandradatacenter": "k8ssandra-operator/dc2", "requestNamespace": "k8ssandra-operator", "requestName": "dc2", "loopID": "7bb37329-541c-4bb5-a71c-aa43e8452723", "duration": 0.110950298}

I can't see any error in the logs, i tried to grep but just info is present.

This are the pods with its labels

[root@master-node ~]# kubectl get pods --show-labels -n k8ssandra-operator
NAME                                                        READY   STATUS             RESTARTS        AGE     LABELS
k8ssandra-dc2-rack-1-stargate-deployment-858d87f56f-djsxc   1/1     Running            0               104m    app.kubernetes.io/component=stargate,app.kubernetes.io/created-by=stargate-controller,app.kubernetes.io/name=k8ssandra-operator,app.kubernetes.io/part-of=k8ssandra,k8ssandra.io/cluster-name=k8ssandra,k8ssandra.io/cluster-namespace=k8ssandra-operator,k8ssandra.io/stargate-deployment=k8ssandra-dc2-rack-1-stargate-deployment,k8ssandra.io/stargate=k8ssandra-dc2-stargate,pod-template-hash=858d87f56f
k8ssandra-dc2-rack-1-sts-0                                  2/3     CrashLoopBackOff   18 (102s ago)   110m    app.kubernetes.io/created-by=cass-operator,app.kubernetes.io/instance=cassandra-k8ssandra,app.kubernetes.io/managed-by=cass-operator,app.kubernetes.io/name=cassandra,app.kubernetes.io/version=4.0.1,cassandra.datastax.com/cluster=k8ssandra,cassandra.datastax.com/datacenter=dc2,cassandra.datastax.com/node-state=Started,cassandra.datastax.com/rack=rack-1,cassandra.datastax.com/seed-node=true,controller-revision-hash=k8ssandra-dc2-rack-1-sts-6485897b4c,statefulset.kubernetes.io/pod-name=k8ssandra-dc2-rack-1-sts-0
k8ssandra-dc2-rack-2-sts-0                                  2/3     CrashLoopBackOff   18 (64s ago)    110m    app.kubernetes.io/created-by=cass-operator,app.kubernetes.io/instance=cassandra-k8ssandra,app.kubernetes.io/managed-by=cass-operator,app.kubernetes.io/name=cassandra,app.kubernetes.io/version=4.0.1,cassandra.datastax.com/cluster=k8ssandra,cassandra.datastax.com/datacenter=dc2,cassandra.datastax.com/node-state=Started,cassandra.datastax.com/rack=rack-2,cassandra.datastax.com/seed-node=true,controller-revision-hash=k8ssandra-dc2-rack-2-sts-8dc8895d9,statefulset.kubernetes.io/pod-name=k8ssandra-dc2-rack-2-sts-0
k8ssandra-dc2-rack-3-sts-0                                  3/3     Running            0               110m    app.kubernetes.io/created-by=cass-operator,app.kubernetes.io/instance=cassandra-k8ssandra,app.kubernetes.io/managed-by=cass-operator,app.kubernetes.io/name=cassandra,app.kubernetes.io/version=4.0.1,cassandra.datastax.com/cluster=k8ssandra,cassandra.datastax.com/datacenter=dc2,cassandra.datastax.com/node-state=Started,cassandra.datastax.com/rack=rack-3,cassandra.datastax.com/seed-node=true,controller-revision-hash=k8ssandra-dc2-rack-3-sts-65dd59cc55,statefulset.kubernetes.io/pod-name=k8ssandra-dc2-rack-3-sts-0
k8ssandra-dc2-rack-4-sts-0                                  2/3     CrashLoopBackOff   18 (105s ago)   110m    app.kubernetes.io/created-by=cass-operator,app.kubernetes.io/instance=cassandra-k8ssandra,app.kubernetes.io/managed-by=cass-operator,app.kubernetes.io/name=cassandra,app.kubernetes.io/version=4.0.1,cassandra.datastax.com/cluster=k8ssandra,cassandra.datastax.com/datacenter=dc2,cassandra.datastax.com/node-state=Started,cassandra.datastax.com/rack=rack-4,cassandra.datastax.com/seed-node=true,controller-revision-hash=k8ssandra-dc2-rack-4-sts-78879cd87c,statefulset.kubernetes.io/pod-name=k8ssandra-dc2-rack-4-sts-0
k8ssandra-dc2-rack-5-sts-0                                  3/3     Running            0               110m    app.kubernetes.io/created-by=cass-operator,app.kubernetes.io/instance=cassandra-k8ssandra,app.kubernetes.io/managed-by=cass-operator,app.kubernetes.io/name=cassandra,app.kubernetes.io/version=4.0.1,cassandra.datastax.com/cluster=k8ssandra,cassandra.datastax.com/datacenter=dc2,cassandra.datastax.com/node-state=Started,cassandra.datastax.com/rack=rack-5,cassandra.datastax.com/seed-node=true,controller-revision-hash=k8ssandra-dc2-rack-5-sts-d97c86bcf,statefulset.kubernetes.io/pod-name=k8ssandra-dc2-rack-5-sts-0
k8ssandra-dc2-reaper-cd7787c7d-lstzf                        1/1     Running            0               103m    app.kubernetes.io/component=reaper,app.kubernetes.io/created-by=reaper-controller,app.kubernetes.io/managed-by=k8ssandra-operator,app.kubernetes.io/name=k8ssandra-operator,app.kubernetes.io/part-of=k8ssandra,k8ssandra.io/cluster-name=k8ssandra,k8ssandra.io/cluster-namespace=k8ssandra-operator,k8ssandra.io/reaper=k8ssandra-dc2-reaper,pod-template-hash=cd7787c7d
k8ssandra-operator-5f7b4dfd94-f4n5h                         1/1     Running            0               6d19h   app.kubernetes.io/instance=k8ssandra-operator,app.kubernetes.io/managed-by=Helm,app.kubernetes.io/name=k8ssandra-operator,app.kubernetes.io/part-of=k8ssandra-k8ssandra-operator-k8ssandra-operator,control-plane=k8ssandra-operator,helm.sh/chart=k8ssandra-operator-0.38.2,pod-template-hash=5f7b4dfd94
k8ssandra-operator-cass-operator-6f7cb8ff67-b5gt5           1/1     Running            2 (6d14h ago)   6d19h   app.kubernetes.io/instance=k8ssandra-operator,app.kubernetes.io/managed-by=Helm,app.kubernetes.io/name=cass-operator,app.kubernetes.io/part-of=k8ssandra-k8ssandra-operator-k8ssandra-operator,control-plane=k8ssandra-operator-controller-manager,helm.sh/chart=cass-operator-0.37.2,pod-template-hash=6f7cb8ff67
networkingana commented 6 months ago

is there any update for this?

burmanm commented 4 months ago

Sorry for the delayed answer, but from looking at the logs, it seems it's actually a Kubernetes issue that's causing the problem. The events you posted Warning FailedToUpdateEndpoint are coming from Kubernetes' endpoint_controller:

https://github.com/kubernetes/kubernetes/blob/master/pkg/controller/endpoint/endpoints_controller.go#L548

Thus, it does not seem k8ssandra-operator can do much about it. That seed-service's IP list is updated by Kubernetes (we only provide label selector for it), but I'm not sure why it would fail to update those lists. k8ssandra-operator updates and maintains the endpoints for additional-seeds, but that does not seem to be the one that is failing.