Open alanbchristie opened 5 months ago
After analysis a number of adjustments have been made to the code: -
With these in place an experimental deployment appears to "mask" 14 out of 16 connection problems (approximately 87% reduction in false alarms).
The improved ssh-tunnel/mysql handler is present in the latest backend and 2024.04.1
. It now logs MySQL connection errors - the source of the majority of our "connection issues".
Discovered while investigating #1403.
A large number of ISPyB connection failures seem to be caused by OperationalError exceptions: -
The majority of intermittent connection failures seem to be "handled" by simply retrying a number of times, along with additional timeout specifications (adding a read timeout for example)