awslabs / multi-model-server

Multi Model Server is a tool for serving neural net models for inference
Apache License 2.0
984 stars 230 forks source link

Provide more information in job queue error and docs #995

Closed nskool closed 2 years ago

nskool commented 2 years ago

Before or while filing an issue please feel free to join our slack channel to get in touch with development team, ask questions, find out what's cooking and more!

Issue #, if available:

Description of changes:

  1. Improve error when there are no more workers to serve request for a model.
  2. Improve documentation for the usage of job_queue_size.

Testing done:

To run CI tests on your changes refer README.md

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.