ucbrise / clipper

A low-latency prediction-serving system
http://clipper.ai
Apache License 2.0
1.4k stars 280 forks source link

First query fails when model reconnects #361

Open Corey-Zumar opened 6 years ago

Corey-Zumar commented 6 years ago

Steps to reproduce:

  1. Start Clipper locally (via bin/start_clipper.sh)
  2. Connect the no-op container by executing the container script in a python shell
  3. Register a model associated with the no-op container, create an application, and link the model to the application
  4. Query the application, observe that queries are routed normally
  5. Restart the no-op container by killing its associated process and re-launching the container script.
  6. Send a new query to the application. Observe that it times out.
  7. Send the new query again. Observe that it is routed to the model and completes on time.

Cause:

rkooo567 commented 5 years ago

@withsmilo @simon-mo Not sure if this problem still persists? I will add it to the task first.

withsmilo commented 5 years ago

@rkooo567 Let me check it, but I think the problem still happens.

rkooo567 commented 5 years ago

@withsmilo Thanks a lot! 👍