ccfos / nightingale

An all-in-one observability solution which aims to combine the advantages of Prometheus and Grafana. It manages alert rules and visualizes metrics, logs, traces in a beautiful web UI.
https://flashcat.cloud/docs/
Apache License 2.0
9.42k stars, 1.38k forks

Clock offset prevents alerts from firing (not host-down alerts) #2024

Closed. taoyang987 closed this issue 1 month ago

taoyang987 commented 1 month ago

Question and Steps to reproduce

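On v7.0.0-beta.4, a few servers have clocks that are off by as much as 3 minutes, and disk alerts do not fire for them. As I remember it, older versions did fire alerts in this situation.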

Relevant logs and configurations

Host-down (lost contact) alerts still fire.

Version

v7.0.0-beta.4

taoyang987 commented 1 month ago

After I synced the servers with NTP, the alert fired immediately.

UlricQin commented 1 month ago

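Earlier versions probably had the ForceUseServerTS setting enabled: https://github.com/ccfos/nightingale/blob/main/etc/config.toml#L132 With it enabled, the server-side timestamp is used as the metric timestamp, and the original timestamp from the monitored machine is no longer relied on. You also need to make sure that the clock on the machine running the n9e process is in sync with the clock on the machine running the time-series database.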
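
To illustrate what a setting like ForceUseServerTS does conceptually, here is a minimal Python sketch. It is not Nightingale's actual code (Nightingale is written in Go). The `Sample` class, the `apply_server_timestamp` function, and the `now` parameter are hypothetical names invented for this example. The sketch assumes the receiver simply overwrites each incoming sample's timestamp with its own clock when the flag is on.

```python
import time
from dataclasses import dataclass, replace
from typing import Callable, List


@dataclass(frozen=True)
class Sample:
    """Hypothetical stand-in for one pushed metric sample."""
    name: str
    value: float
    timestamp: float  # Unix seconds, as reported by the monitored host


def apply_server_timestamp(
    samples: List[Sample],
    force_use_server_ts: bool,
    now: Callable[[], float] = time.time,
) -> List[Sample]:
    """Return the samples, re-stamped with the server clock if the flag is set.

    With the flag off, the host-reported timestamps pass through unchanged,
    so a host whose clock is off by minutes writes points at the wrong time.
    """
    if not force_use_server_ts:
        return list(samples)
    server_ts = now()  # read the server clock once per batch
    return [replace(s, timestamp=server_ts) for s in samples]


if __name__ == "__main__":
    server_now = 1_000.0
    # A host whose clock runs 180 seconds (3 minutes) behind the server.
    skewed = [Sample("disk_used_percent", 95.0, server_now - 180)]

    kept = apply_server_timestamp(skewed, False, now=lambda: server_now)
    fixed = apply_server_timestamp(skewed, True, now=lambda: server_now)
    print(kept[0].timestamp, fixed[0].timestamp)
```

In this sketch the value is untouched and only the timestamp changes, which is why the server clock and the time-series database clock still need to agree with each other.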

taoyang987 commented 1 month ago

After adding ForceUseServerTS = true it works. Thank you, Mr. Qin.