kubernetes / node-problem-detector

This is a place for various problem detectors running on the Kubernetes nodes.
Apache License 2.0
2.85k stars 616 forks source link

No event generated #842

Closed 2714222609 closed 7 months ago

2714222609 commented 7 months ago

NPD is successfully started and I ran sudo sh -c "echo 'kernel: BUG: unable to handle kernel NULL pointer dereference at TESTING' >> /dev/kmsg" in the host, but didn't see any event log:

I1213 11:02:03.347660 1 log_monitor.go:78] Finish parsing log monitor config file /etc/config/checkitem/monitor-log-kernal-kmsg.json: {WatcherConfig:{Plugin:kmsg PluginConfig:map[] LogPath:/dev/kmsg Lookback:5m Delay:} BufferSize:10 Source:kernel-monitor DefaultConditions:[{Type:KernelDeadlock Status: Transition:0001-01-01 00:00:00 +0000 UTC Reason:KernelHasNoDeadlock Message:kernel has no deadlock} {Type:ReadonlyFilesystem Status: Transition:0001-01-01 00:00:00 +0000 UTC Reason:FilesystemIsNotReadOnly Message:Filesystem is not read-only}] Rules:[{Type:temporary Condition: Reason:OOMKilling Pattern:Killed process \d+ (.+) total-vm:\d+kB, anon-rss:\d+kB, file-rss:\d+kB.} {Type:temporary Condition: Reason:TaskHung Pattern:task [\S ]+:\w+ blocked for more than \w+ seconds.} {Type:temporary Condition: Reason:UnregisterNetDevice Pattern:unregister_netdevice: waiting for \w+ to become free. Usage count = \d+} {Type:temporary Condition: Reason:KernelOops Pattern:BUG: unable to handle kernel NULL pointer dereference at .} {Type:temporary Condition: Reason:KernelOops Pattern:divide error: 0000 [#\d+] SMP} {Type:temporary Condition: Reason:Ext4Error Pattern:EXT4-fs error .} {Type:temporary Condition: Reason:Ext4Warning Pattern:EXT4-fs warning .} {Type:temporary Condition: Reason:IOError Pattern:Buffer I/O error .} {Type:temporary Condition: Reason:MemoryReadError Pattern:CE memory read error .} {Type:permanent Condition:KernelDeadlock Reason:DockerHung Pattern:task docker:\w+ blocked for more than \w+ seconds.} {Type:permanent Condition:ReadonlyFilesystem Reason:FilesystemIsReadOnly Pattern:Remounting filesystem read-only}] EnableMetricsReporting:0xc000015d2a} I1213 11:02:03.347776 1 log_watchers.go:40] Use log watcher of plugin "kmsg" I1213 11:02:03.547077 1 k8s_exporter.go:55] Waiting for kube-apiserver to be ready (timeout 5m0s)... I1213 11:02:03.850491 1 node_problem_detector.go:56] K8s exporter started. I1213 11:02:03.946796 1 node_problem_detector.go:60] Prometheus exporter started. I1213 11:02:03.946830 1 log_monitor.go:110] Start log monitor /etc/config/checkitem/monitor-log-kernal-kmsg.json I1213 11:02:03.946885 1 problem_detector.go:77] Problem detector started I1213 11:02:03.946903 1 log_monitor.go:235] Initialize condition generated: [{Type:KernelDeadlock Status:False Transition:2023-12-13 11:02:03.946893155 +0800 CST m=+5.299353624 Reason:KernelHasNoDeadlock Message:kernel has no deadlock} {Type:ReadonlyFilesystem Status:False Transition:2023-12-13 11:02:03.946893241 +0800 CST m=+5.299353709 Reason:FilesystemIsNotReadOnly Message:Filesystem is not read-only}]