FOSSRIT / infrastructure

Set of scripts, Ansible playbooks/roles, and other tools to automate and manage FOSS@MAGIC infrastructure
BSD 3-Clause "New" or "Revised" License
5 stars 3 forks source link

Service disruption: Slack bridge down #45

Closed ct-martin closed 5 years ago

ct-martin commented 5 years ago

Summary

bridge to slack has stopped working. No messages going either way Expected results

Messages bring bridges both ways on relevant channels Actual results

messages not being sent to either side, unknown if bot errored or died Priority requested

Urgency: high/medium
Requested deadline: reasonably soon

Other details

Can we get auto-restarts, monitoring, or something for the bot?

jwflory commented 5 years ago

As with #42, a restart fixed it again. This time I had logs though:

-- Logs begin at Tue 2019-04-09 16:59:40 EDT, end at Fri 2019-04-12 10:47:26 EDT. --
Apr 11 11:24:33 ritlug-irc matterbridge[11525]: time="2019-04-11T11:24:33-04:00" level=error msg="Not connected to server, dropping message" prefix=irc
Apr 11 20:43:42 ritlug-irc matterbridge[11525]: time="2019-04-11T20:43:42-04:00" level=error msg="Not connected to server, dropping message" prefix=irc
Apr 12 10:23:53 ritlug-irc matterbridge[11525]: time="2019-04-12T10:23:53-04:00" level=error msg="Not connected to server, dropping message" prefix=irc

It's not clear to me why Matterbridge does not try to auto-reconnect. I'll try to chase this one back upstream.