Open sanjay-nit opened 1 year ago
Facing issues while deploying LLM models. After an hour of starting, It was able to build the Docker image and also push it.
Getting timed out Error while deploying
Is there any resource constraint or anything else that I should follow while deploying large models? Below is the last few lines from build logs
Image pushed. SUCCESS: Model Registered Deploying... ERROR: timed out waiting for deployment to be healthy
FYI: I'm hosting Falcon-40b LLM hugging face model.
cc: @sanjayk1801
Facing issues while deploying LLM models. After an hour of starting, It was able to build the Docker image and also push it.
Getting timed out Error while deploying
Is there any resource constraint or anything else that I should follow while deploying large models? Below is the last few lines from build logs
FYI: I'm hosting Falcon-40b LLM hugging face model.
cc: @sanjayk1801