kyma-project / kyma

Kyma is an opinionated set of Kubernetes-based modular building blocks, including all necessary capabilities to develop and run enterprise-grade cloud-native applications.
https://kyma-project.io
Apache License 2.0

default-kne-trigger dispatcher with kafka OOMKill / CrashLoopBackOff #9698

Closed p4p4 closed 3 years ago

p4p4 commented 4 years ago

Description

The memory usage of the affected pod is always over the limit, which is most likely causing the restarts.

Expected result

Steps to reproduce

Troubleshooting

Solution so far


After changing that, the subscriber received traffic at a reasonable rate again.

k15r commented 4 years ago

If those changed limits solve the restart issues, then they need to be persisted by changing the configuration of the knative-eventing-kafka-channel-controller in the knative-eventing namespace:

> kubectl edit deployment -n knative-eventing knative-eventing-kafka-channel-controller
spec:
  ...
  template:
    ...
    spec:
      containers:
      - env:
        ...
        - name: DISPATCHER_CPU_REQUEST
          value: 300m
        - name: DISPATCHER_CPU_LIMIT
          value: 500m
        - name: DISPATCHER_MEMORY_REQUEST
          value: 50Mi
        - name: DISPATCHER_MEMORY_LIMIT
          value: 128Mi

128Mi is the current default value of DISPATCHER_MEMORY_LIMIT.

Please adapt DISPATCHER_MEMORY_LIMIT to the value that worked for your workload.
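As a non-interactive alternative to kubectl edit, the same variable could probably be set with kubectl set env. The sketch below only prints the command so it can be reviewed first. The deployment, namespace and variable name are taken from the comment above. The helper name dispatcher_limit_cmd and the 256Mi value are assumptions made for illustration, not a recommended limit.

```shell
#!/bin/sh
# Sketch: build the kubectl command that sets the dispatcher memory limit
# on the channel controller deployment. The function name is hypothetical.
dispatcher_limit_cmd() {
  limit="$1"  # e.g. 256Mi; a placeholder, use the value that worked for you
  printf 'kubectl set env deployment/%s -n %s DISPATCHER_MEMORY_LIMIT=%s\n' \
    "knative-eventing-kafka-channel-controller" "knative-eventing" "$limit"
}

# Dry run: print the command for a hypothetical 256Mi limit.
dispatcher_limit_cmd 256Mi
```

To apply the change, the printed command can be run by hand or piped to sh (dispatcher_limit_cmd 256Mi | sh). Whether already running dispatcher pods pick up the new limit depends on the controller reconciling them.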

Please keep in mind:

k15r commented 3 years ago

Workaround delivered: the channel-controller was updated with a configurable forced reconcile and with the updated limits shown above.