Closed davidhwyllie closed 3 years ago
The frequency with which this happens is very low - perhaps once in 400,000 inserts, using SARS-CoV-2 test data. However, a robust mechanism for auto-restarting and for ensuring the database is left in a consistent state is required.
The proposed changes are as follows:
When running fn4 with gunicorn, multiple web workers are instantiated. each has a timeout period (default 30 sec; increased to 90sec at present).
if a process, including a process inserting samples, crashes or does not respond, gunicorn will kill it and restart another. if the process is inserting, this can result in