thaum-xyz / ankhmorpork

@paulfantom's GitOps managed kube cluster running in a cupboard. Built with fancy tools :sparkles:
https://ankhmorpork.thaum.xyz
MIT License

Alert: TargetDown in gpu-infra #234

Open paulfantom opened 1 year ago

paulfantom commented 1 year ago

Alert TargetDown firing in gpu-infra namespace

This is an automated issue created by the monitoring system. Please do not edit this message.

Alertmanager URL: https://alertmanager.ankhmorpork.thaum.xyz

Issue was last updated at 2024-10-18 13:11:23.346017703 +0000 UTC.

Common Labels

| Label | Value |
| --- | --- |
| alertname | TargetDown |
| cluster | ankhmorpork |
| job | gpu-infra/dcgm-exporter |
| namespace | gpu-infra |
| prometheus | monitoring/k8s |
| severity | warning |

Common Annotations

| Annotation | Value |
| --- | --- |
| description | 100% of the gpu-infra/dcgm-exporter/ targets in gpu-infra namespace are down. |
| runbook_url | https://runbooks.prometheus-operator.dev/runbooks/general/targetdown |
| summary | One or more targets are unreachable. |
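
The "100% of ... targets ... are down" figure in the description can be read as the share of scrape targets whose `up` metric is 0 within one job and namespace. The following is a minimal Python sketch of that arithmetic, not the rule actually deployed in this cluster. The sample instances, the `percent_down` helper, and the 10% threshold are assumptions made for illustration.

```python
from collections import defaultdict


def percent_down(samples):
    """Return {(job, namespace): percentage of targets whose `up` value is 0}.

    `samples` is a list of (labels, value) pairs, mimicking instant-vector
    results for the `up` metric. This helper is hypothetical.
    """
    total = defaultdict(int)
    down = defaultdict(int)
    for labels, value in samples:
        key = (labels["job"], labels["namespace"])
        total[key] += 1
        if value == 0:
            down[key] += 1
    return {key: 100.0 * down[key] / total[key] for key in total}


# Hypothetical samples: the instance addresses are made up for illustration.
samples = [
    ({"job": "gpu-infra/dcgm-exporter", "namespace": "gpu-infra",
      "instance": "10.0.0.1:9400"}, 0),
    ({"job": "gpu-infra/dcgm-exporter", "namespace": "gpu-infra",
      "instance": "10.0.0.2:9400"}, 0),
]

result = percent_down(samples)

# Assumed threshold: treat a group as alerting when more than 10% is down.
THRESHOLD = 10.0
firing = {key: pct for key, pct in result.items() if pct > THRESHOLD}

for (job, namespace), pct in firing.items():
    print(f"{pct:.0f}% of the {job} targets in {namespace} namespace are down.")
```

With every sample at 0, the group reports 100%, which matches the wording of the description annotation. Grouping by job and namespace only is a simplification chosen for this sketch.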

Alerts

| StartsAt | Links |
| --- | --- |
| 2024-10-18 12:00:52.417 +0000 UTC | GeneratorURL |