IDR / deployment

Deployment infrastructure for the Image Data Resource
https://idr.openmicroscopy.org/about/deployment.html
BSD 2-Clause "Simplified" License

First reboot of omero* instances after updates may fail #71

Open manics opened 6 years ago

manics commented 6 years ago

As part of the standard deployment, all servers are patched the first time and automatically rebooted if there was a kernel upgrade. If omeroreadwrite and omeroreadonly-N are rebooted simultaneously, the omeroreadonly-N instances may hang: they mount omeroreadwrite over NFS, and when omeroreadwrite goes down the mounts are left hanging and never seem to recover, even after the server comes back up. The current solution is to force a hard reboot: openstack server reboot --hard omeroreadonly-N
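
A minimal sketch of applying that workaround to several read-only instances in one go. The helper names (hard_reboot, reboot_readonly), the DRY_RUN switch, and the instance names in the usage line are assumptions for illustration, not part of the deployment playbooks. Only the openstack server reboot --hard command itself comes from this issue.

```shell
#!/bin/sh
# Sketch: hard-reboot each hung read-only instance (the workaround described above).
# DRY_RUN is a hypothetical safety switch: 1 (default) only prints the commands,
# 0 actually runs them against OpenStack.
DRY_RUN="${DRY_RUN:-1}"

hard_reboot() {
    # $1 is an OpenStack server name, e.g. omeroreadonly-1 (placeholder name)
    if [ "$DRY_RUN" = "1" ]; then
        echo "openstack server reboot --hard $1"
    else
        openstack server reboot --hard "$1"
    fi
}

reboot_readonly() {
    # Hard-reboot every server name passed as an argument, one at a time.
    for server in "$@"; do
        hard_reboot "$server"
    done
}
```

For example, reboot_readonly omeroreadonly-1 omeroreadonly-2 prints the two commands it would run. Setting DRY_RUN=0 before the call executes them, and should only be done once omeroreadwrite is back up so that the NFS mounts can be re-established on boot.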

This is related to https://github.com/IDR/deployment/issues/68

joshmoore commented 6 years ago

Also seen in the (rather strewn out) deployment of prod47.

manics commented 6 years ago

Occurrences of this should be reduced by https://github.com/IDR/deployment/pull/108