deckhouse / deckhouse

Kubernetes platform from Flant
https://deckhouse.io
Other
1.07k stars 111 forks source link

[kube-system] Endpoints are out of sync with endpointslices #5435

Open konstantin-axenov opened 1 year ago

konstantin-axenov commented 1 year ago

Preflight Checklist

Version

v1.49

Expected Behavior

Endpoints are out of sync with endpointslices. Kube-controller-manager restart fixes the problem.

Actual Behavior

addressType: IPv4
apiVersion: discovery.k8s.io/v1
endpoints:
- addresses:
  - 10.244.31.6
  conditions:
    ready: true
    serving: true
    terminating: false
  nodeName: my-application-nodes-d421333b-5647f-gsvnf
  targetRef:
    kind: Pod
    name: application-78dbcfbff5-fdcjf
    namespace: application-stage
    resourceVersion: "2156881716"
    uid: c5865462-17a5-435f-9faf-9df6def60a35
kind: EndpointSlice
metadata:
  creationTimestamp: "2021-03-25T10:39:19Z"
  generateName: application-
  generation: 390
  labels:
    app: application
    app.kubernetes.io/managed-by: Helm
    chart: bricks-3.0.1
    endpointslice.kubernetes.io/managed-by: endpointslice-controller.k8s.io
    heritage: Helm
    kubernetes.io/service-name: application
    release: stock-availability.stock-availability.application.develo
  name: application-sfnrl
  namespace: application-stage
  ownerReferences:
  - apiVersion: v1
    blockOwnerDeletion: true
    controller: true
    kind: Service
    name: application
    uid: 911ad23c-3463-4125-9497-bd6c890584b2
  resourceVersion: "2156881719"
  uid: 2a5f0974-14e5-4e5d-9302-a24fae3796ce
ports:
- name: application
  port: 10000
  protocol: TCP
- name: http-metrics
  port: 10000
  protocol: TCP
---
apiVersion: v1
kind: Endpoints
metadata:
  annotations:
    endpoints.kubernetes.io/last-change-trigger-time: "2023-08-02T11:05:52Z"
  creationTimestamp: "2021-03-25T10:39:19Z"
  labels:
    app: application
    app.kubernetes.io/managed-by: Helm
    chart: bricks-3.0.1
    heritage: Helm
    release: stock-availability.stock-availability.application.develo
  name: application
  namespace: application-stage
  resourceVersion: "2156881717"
  uid: 2ad22ac8-c742-4902-a115-71a299667ae2
subsets:
- notReadyAddresses:
  - ip: 10.244.31.6
    nodeName: my-application-nodes-d421333b-5647f-gsvnf
    targetRef:
      kind: Pod
      name: application-78dbcfbff5-fdcjf
      namespace: application-stage
      resourceVersion: "2156881713"
      uid: c5865462-17a5-435f-9faf-9df6def60a35
  ports:
  - name: http-metrics
    port: 10000
    protocol: TCP
  - name: application
    port: 10000
    protocol: TCP
---
apiVersion: v1
kind: Pod
metadata:
  annotations:
    sha: dab30b6f0d5d23e290345024327115d4a0a1150d
  creationTimestamp: "2023-07-03T16:15:51Z"
  generateName: application-78dbcfbff5-
  labels:
    app: application
    pod-template-hash: 78dbcfbff5
    release: stock-availability.stock-availability.application.develo
  name: application-78dbcfbff5-fdcjf
  namespace: application-stage
  ownerReferences:
  - apiVersion: apps/v1
    blockOwnerDeletion: true
    controller: true
    kind: ReplicaSet
    name: application-78dbcfbff5
    uid: 713a501f-3dbe-4a17-ab94-4a8c216ba510
  resourceVersion: "2156881716"
  uid: c5865462-17a5-435f-9faf-9df6def60a35
spec:
  containers:
  - image: docker-stock-availability.art.my.xxx/stock-availability/application:develop
    imagePullPolicy: Always
    lifecycle:
      preStop:
        exec:
          command:
          - sh
          - -c
          - sleep 30
    livenessProbe:
      failureThreshold: 3
      httpGet:
        path: /actuator/health
        port: 10000
        scheme: HTTP
      initialDelaySeconds: 60
      periodSeconds: 5
      successThreshold: 1
      timeoutSeconds: 1
    name: application
    ports:
    - containerPort: 10000
      name: http-metrics
      protocol: TCP
    - containerPort: 10001
      protocol: TCP
    - containerPort: 10002
      protocol: TCP
    readinessProbe:
      failureThreshold: 3
      httpGet:
        path: /actuator/health
        port: 10000
        scheme: HTTP
      initialDelaySeconds: 60
      periodSeconds: 5
      successThreshold: 1
      timeoutSeconds: 1
    resources:
      limits:
        memory: 1000Mi
      requests:
        cpu: 200m
        memory: 700Mi
    terminationMessagePath: /dev/termination-log
    terminationMessagePolicy: File
    volumeMounts:
    - mountPath: /var/run/secrets/kubernetes.io/serviceaccount
      name: kube-api-access-7mkks
      readOnly: true
  dnsPolicy: ClusterFirst
  enableServiceLinks: true
  nodeName: my-application-nodes-d421333b-5647f-gsvnf
  preemptionPolicy: PreemptLowerPriority
  priority: 1000
  priorityClassName: develop
  restartPolicy: Always
  schedulerName: default-scheduler
  securityContext: {}
  serviceAccount: default
  serviceAccountName: default
  terminationGracePeriodSeconds: 60
status:
  conditions:
  - lastProbeTime: null
    lastTransitionTime: "2023-07-03T16:15:51Z"
    status: "True"
    type: Initialized
  - lastProbeTime: null
    lastTransitionTime: "2023-08-02T11:05:52Z"
    status: "True"
    type: Ready
  - lastProbeTime: null
    lastTransitionTime: "2023-08-02T11:05:52Z"
    status: "True"
    type: ContainersReady
  - lastProbeTime: null
    lastTransitionTime: "2023-07-03T16:15:51Z"
    status: "True"
    type: PodScheduled
  containerStatuses:
  - containerID: containerd://4fda15e1f13a2a1bcd2674c4539465ccaa5093f59a61f92d3d4e38205a31766f
    image: docker-stock-availability.art.my.xxx/stock-availability/application:develop
    imageID: docker-stock-availability.art.my.xxx/stock-availability/application@sha256:a038a244f3b183845615d78d7f00ba8ceaca571cd2d8b6c726b46af3426af4f8
    lastState:
      terminated:
        containerID: containerd://ac7e59324c3c4e2fc3d2e23b48e261da9c8ca40fadd33f64be68474cb749378c
        exitCode: 143
        finishedAt: "2023-07-14T11:32:09Z"
        reason: Error
        startedAt: "2023-07-13T13:16:46Z"
    name: application
    ready: true
    restartCount: 6
    started: true
    state:
      running:
        startedAt: "2023-07-14T11:32:09Z"
  hostIP: 172.16.1.117
  phase: Running
  podIP: 10.244.31.6
  podIPs:
  - ip: 10.244.31.6
  qosClass: Burstable
  startTime: "2023-07-03T16:15:51Z"

Steps To Reproduce

No response

Additional Information

No response

Logs

No response

deckhouse-BOaTswain commented 10 months ago

This issue has been automatically put in the triage queue because it has not had recent activity. The team will reconsider the status of this issue. Thank you for your contributions.