DataDog / helm-charts

Helm charts for Datadog products
Apache License 2.0
Disable -- Agent -- Collector -- Running Checks --- docker #1239

Open ibadullaev-inc4 opened 1 year ago

ibadullaev-inc4 commented 1 year ago

Hi, I am running into the following error.

Describe what happened: I installed a Kubernetes cluster on my own hardware using Kubespray. The cluster nodes do not run the Docker service; containerd is the only container runtime (CRI).

Describe what you expected:

Is it possible to disable the docker check?

Steps to reproduce the issue:

When I exec into the Datadog pod and run agent status, I see:

=========
Collector
=========

  Running Checks
  ==============
............................

    docker
    ------
      Instance ID: docker [ERROR]
      Configuration Source: file:/etc/datadog-agent/conf.d/docker.d/conf.yaml.default
      Total Runs: 782
      Metric Samples: Last Run: 0, Total: 0
      Events: Last Run: 0, Total: 0
      Service Checks: Last Run: 0, Total: 0
      Average Execution Time : 0s
      Last Execution Date : 2023-11-13 19:17:05 UTC (1699903025000)
      Last Successful Execution Date : Never
      Error: temporary failure in dockerutil, will retry later: Cannot connect to the Docker daemon at unix:///host/var/run/docker.sock. Is the docker daemon running?
      No traceback
      Warning: Error initialising check: temporary failure in dockerutil, will retry later: Cannot connect to the Docker daemon at unix:///host/var/run/docker.sock. Is the docker daemon running?

The pod log also contains many entries like the following:

2023-11-13 19:25:35 UTC | CORE | WARN | (pkg/collector/corechecks/checkbase.go:162 in Warnf) | Error initialising check: temporary failure in dockerutil, will retry later: Cannot connect to the Docker daemon at unix:///host/var/run/docker.sock. Is the docker daemon running?
2023-11-13 19:25:35 UTC | CORE | ERROR | (pkg/collector/worker/check_logger.go:69 in Error) | check:docker | Error running check: temporary failure in dockerutil, will retry later: Cannot connect to the Docker daemon at unix:///host/var/run/docker.sock. Is the docker daemon running?
2023-11-13 19:25:50 UTC | CORE | WARN | (pkg/collector/corechecks/checkbase.go:162 in Warnf) | Error initialising check: temporary failure in dockerutil, will retry later: Cannot connect to the Docker daemon at unix:///host/var/run/docker.sock. Is the docker daemon running?
2023-11-13 19:25:50 UTC | CORE | ERROR | (pkg/collector/worker/check_logger.go:69 in Error) | check:docker | Error running check: temporary failure in dockerutil, will retry later: Cannot connect to the Docker daemon at unix:///host/var/run/docker.sock. Is the docker daemon running?

Additional environment details (Operating System, Cloud provider, etc):

[nariman@notebook tmp]$ kubectl version
Client Version: v1.28.3
Kustomize Version: v5.0.4-0.20230601165947-6ce0bf390ce3
Server Version: v1.28.3

Helm Version

[nariman@notebook charts]$ helm version
version.BuildInfo{Version:"v3.12.0", GitCommit:"c9f554d75773799f72ceef38c51210f1842a1dea", GitTreeState:"clean", GoVersion:"go1.20.4"}

Datadog helm chart version

datadog-3.33.1.tgz

fanny-jiang commented 10 months ago

Hi @ibadullaev-inc4, you can set datadog.ignoreAutoConfig in your values.yaml file to disable the docker check from autodiscovery: https://github.com/DataDog/helm-charts/blob/main/charts/datadog/values.yaml#L821-L826

datadog:
  ignoreAutoConfig:
    - docker
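
To confirm whether the setting took effect after upgrading the release, one option is to look for the docker section in the agent status output. The following is a minimal sketch, not an official tool: check_listed is a hypothetical helper name, and it assumes the layout shown in the status excerpt earlier in this issue, where each check name is followed by a line of dashes.

```shell
# Hypothetical helper: reads `agent status` output on stdin and exits 0
# if a check with the given name is listed, 1 otherwise.
check_listed() {
  # A check section is a line whose first word is the check name,
  # immediately followed by a line made only of dashes.
  awk -v n="$1" '
    NF == 1 && $1 ~ /^-+$/ && prev == n { found = 1 }
    { prev = $1 }
    END { exit !found }
  '
}
```

For example, with <agent-pod> as a placeholder for your agent pod name, and assuming the agent container is named agent:

kubectl exec <agent-pod> -c agent -- agent status | check_listed docker && echo "docker check is still scheduled"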

ypp-unimelb commented 6 months ago

Hi @fanny-jiang, my Helm chart version is 3.61.0 and I am also getting this error.

2024-05-01 15:21:47 UTC | CORE | ERROR | (pkg/collector/worker/check_logger.go:71 in Error) | check:docker | Error running check: temporary failure in dockerutil, will retry later: Cannot connect to the Docker daemon at unix:///host/var/run/docker.sock. Is the docker daemon running?

I have set datadog.ignoreAutoConfig in the values.yaml file:

datadog:
  apiKeyExistingSecret: datadog-secret
  site: us5.datadoghq.com
  logs:
    enabled: true
    containerCollectAll: true
  ignoreAutoConfig:
    - docker

jozef-valko commented 1 month ago

Hello, following up here as I'm facing a similar issue. I would like to disable all default checks for the Datadog Agent. I've already tried ignoring all default and autoconf checks:

datadog:
  ignoreAutoConfig:
    - container
    - containerd
    - cpu
    - cri
    - disk
    - file_handle
    - io
    - kubelet
    - kubernetes_state
    - load
    - memory
    - network
    - ntp
    - uptime

but only kubernetes_state was ignored, since it is one of the Autodiscovery integrations.

Is it possible to disable the default checks in the Datadog Agent?