MemoryError handling - Githubissues

awslabs / multi-model-server

Multi Model Server is a tool for serving neural net models for inference

Apache License 2.0

998 stars 230 forks source link

In this test it is mentioned that MMS expects the handler to raise MemoryError when he can no longer allocate memory for workers. I didn't find anything in the docs about the effects of this or how exactly MMS treats this error differently.

Does this affect MMS behaviour / subsequent requests to register models?
Is this observable from outside MMS (i.e. REST API, without parsing the server logs)? I get 507 error with general message about "Internal Server Error", so I can't separate when the error is due to OOM and when it's a failure to load the model (I want my application to behave differently in each of those cases)

awslabs / multi-model-server

MemoryError handling #896