bananaml / potassium

An HTTP serving framework by Banana
Apache License 2.0
97 stars 9 forks source link

Facing issues while deploying large models. #25

Open sanjay-nit opened 1 year ago

sanjay-nit commented 1 year ago

Facing issues while deploying LLM models. After an hour of starting, It was able to build the Docker image and also push it.

Getting timed out Error while deploying

Is there any resource constraint or anything else that I should follow while deploying large models? Below is the last few lines from build logs

Image pushed.
SUCCESS: Model Registered
Deploying...
ERROR: timed out waiting for deployment to be healthy

FYI: I'm hosting Falcon-40b LLM hugging face model.

cc: @sanjayk1801