ccfos / nightingale

An all-in-one observability solution which aims to combine the advantages of Prometheus and Grafana. It manages alert rules and visualizes metrics, logs, traces in a beautiful web UI.
https://flashcat.cloud/docs/
Apache License 2.0
9.84k stars 1.42k forks source link

update: oom_kill alert rule #2170

Closed fangpsh closed 2 months ago

fangpsh commented 2 months ago

What type of PR is this? 更新默认报警规则中OOM 的检测语句。

What this PR does / why we need it:

kernel_vmstat_oom_kill 是一个累计值,系统启动后出现的OOM次数持续累计,原规则中的语句会导致持续报警,按increase更加合适。 间隔时间为2m,和 https://github.com/ccfos/nightingale/blob/f1259d1dffc2a6f247c99e3e8ef91d29fffe59c7/integrations/Linux/dashboards/host_generic_categraf.json#L1362 保持一致。

Which issue(s) this PR fixes:

Fixes #

Special notes for your reviewer:

zhaobzh commented 1 month ago

赞,确实如此