sown / tasks

Tasks for sown projects
0 stars 0 forks source link

Investigate why node owner (up/down) emails not being received #92

Closed drn05r closed 1 year ago

drn05r commented 1 year ago

A node owner reported recently that they did not receive any notifications about a node going down. I am not sure if this was not sent of whether something blocks it being received either by our relay host or their mail server because something looks dodgy.

drn05r commented 1 year ago

I think these emails should go out from sown-monitor.ecs.soton.ac.uk. I can see that monitoring email is still being received from this server as I am on the monitoring alias and have received this email. Postfix is configured on sown-monitor but I cannot find a mail log (/var/log/mail.log is empty nothing is journalctl for postfix) to see what is going wrong.

drn05r commented 1 year ago

This may be due to the node owner's email address being broken as it is an alias. They are looking into getting this fix but I have setup a secondary email address that gets notifications for this node. If it goes down again, the node owner should be able to confirm if email notfications are broken or not.

drn05r commented 1 year ago

This looks to be down to a to address issue. This has been fixed but will require Ansible changes to ensure local changes are not overwritten. https://github.com/sown/ansible/issues/50