fermi-ad / controls

Central repo for reporting bugs, making feature requests, managing RFCs, and requesting seminar topics.
https://www-bd.fnal.gov/controls/
2 stars 0 forks source link

DSE10 outage #42

Closed awattsFNAL closed 7 months ago

awattsFNAL commented 7 months ago

DSE10 hardware failure took down RabbitMQ OAC BUNNY1.

awattsFNAL commented 7 months ago

No degradation of service, just loss of redundancy.

finstrom commented 7 months ago

@lgmillsx recovered the node. No clues as to what the problem was.

awattsFNAL commented 7 months ago

Reopening due to another failure over the weekend. https://www-bd.fnal.gov/Elog/?orEntryId=250275

lgmillsx commented 7 months ago

Again, another lock-up with no traces - the third this year for this host. Re-seated all memory and internal MB connectors. If it happens again, will probably replace power supply.