Xavientois opened this issue 1 year ago (status: Open)
Getting the same issue with the same version of the agent on OpenShift 4.11.
For the warning in the logs:
2023-08-21 15:17:14 UTC | SYS-PROBE | WARN | (pkg/network/tracer/connection/kprobe/tracer.go:146 in LoadTracer) | error loading CO-RE network tracer, falling back to pre-compiled: failed to init ebpf manager: couldn't load eBPF programs: map connection_protocol: map create: cannot allocate memory
This can be avoided by increasing the memory available to the system-probe container (currently 200Mi), which should allow the system-probe to load.
The error:
2023-08-21 15:17:16 UTC | SYS-PROBE | ERROR | (cmd/system-probe/api/module/loader.go:65 in Register) | error creating module network_tracer: failed to init ebpf manager: {UID:net EBPFFuncName:tracepoint__net__net_dev_queue EBPFSection:} failed the sanity check: use CloneProbe to load 2 instances of the same program
... is caused by the previous warning, and is fixed in the upcoming 7.48 release.
I was also able to reproduce the issue on GKE with the latest helm chart.
Bumping the container limits as suggested seems to solve the problem.
In the values file, under agents.containers.systemProbe.resources:
requests:
  cpu: 100m
  memory: 200Mi
limits:
  cpu: 300m
  memory: 500Mi
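For reference, here is a sketch of how that key path might look when written out in full in values.yaml. The nesting is inferred from the dotted path agents.containers.systemProbe.resources, and the numbers are the ones quoted in this comment, so treat them as a starting point rather than tuned values:

```yaml
# values.yaml (sketch; nesting inferred from the dotted key path)
agents:
  containers:
    systemProbe:
      resources:
        requests:
          cpu: 100m
          memory: 200Mi
        limits:
          cpu: 300m
          # Raised so the eBPF maps can be allocated; the warning in the
          # logs was "map create: cannot allocate memory".
          memory: 500Mi
```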
@Xavientois is this resolved?
Reporting here what Datadog Support told me:
A fix for this issue was introduced in 7.48 under this PR and has since also been backported to 7.47.1.
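If upgrading the whole chart is not an option, pinning the Agent image to a fixed release may be enough. This is a sketch only: it assumes the chart exposes the Agent image tag as agents.image.tag, which is not stated in this thread, so check it against your chart version's values reference before relying on it:

```yaml
# values.yaml (sketch; agents.image.tag is an assumed key, verify it first)
agents:
  image:
    # 7.47.1 is the backport release mentioned by Datadog Support;
    # 7.48 or later should also contain the fix.
    tag: 7.47.1
```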
Agent Environment
Describe what happened:
When installing the datadog helm chart, the system-probe container crashes with the following output:
Describe what you expected:
For the system-probe to start successfully
Steps to reproduce the issue:
Install the Datadog helm chart into a k8s 1.27 GKE cluster (I do not get this issue in EKS with the same chart, k8s version, and config).
Use the following values.yaml
Additional environment details (Operating System, Cloud provider, etc):