pulibrary / ops-catchall

Operations Catch All
0 stars 0 forks source link

Check_MK: lib-dss999 #90

Open kayiwa opened 2 months ago

kayiwa commented 2 months ago
Service PROBLEM notification
Host: [lib-dss999.princeton.edu](http://lib-dss999.princeton.edu/) (IP: 128.112.203.169)
Service: Check_MK
State: CRITICAL
Additional Info
(Service Check Timed Out)

We get notification about Check_MK (as best as we can determine) is unable to contact the agent on this endpoint. This is a frequent alert. Anecdotal evidence says the machine is slow. It also does not look like a scheduled event.

Possible solution is the new hardware may fix this.

acozine commented 1 month ago

The most common alerts for lib-dss999 this week are for DotNet Memory Management.