Open davidas1 opened 4 years ago
MMS just returns a 507 error for synchronous API calls, if that API caused an OOM. You would have to rely on HTTP error codes to handle the errors and not the error messages.
I didn't understand your first question fully. If there was an OOM, then the model would fail to load or request would fail, so you would need to handle this error code by reducing the number of models loaded on MMS . Otherwise you might see subsequent requests also fail.
Please let us know if this answers your question.
In this test it is mentioned that MMS expects the handler to raise MemoryError when he can no longer allocate memory for workers. I didn't find anything in the docs about the effects of this or how exactly MMS treats this error differently.