GoogleCloudPlatform / cloudml-samples

Cloud ML Engine repo. Please visit the new Vertex AI samples repo at https://github.com/GoogleCloudPlatform/vertex-ai-samples
https://cloud.google.com/ai-platform/docs/
Apache License 2.0

Example/Template for Custom Container Online Prediction #495

Closed. adrianpgob closed this 2 years ago

adrianpgob commented 3 years ago

A lot of people are hitting the same problem (I am one of them): ERROR: (gcloud.beta.ai-platform.versions.create) Create Version failed. Bad model detected with error: Model requires more memory than allowed. Please try to decrease the model size and re-deploy. If you continue to experience errors, please contact support.

The recommended fix for this problem is to create a Custom Container. However, the process is tedious and there isn't any basic example or boilerplate (PyTorch Serve is overkill if you don't use Torch).

I think it would be very useful for the community if the Google team provided a basic, preconfigured Docker container with the best libraries for each requirement.

Thank you

kweinmeister commented 2 years ago

Thank you for the suggestion. Pre-built containers for prediction are now available in Vertex AI.
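
For anyone who still needs a custom container, the kind of basic boilerplate requested above might look roughly like the sketch below. It is not an official template. It assumes the container is expected to serve HTTP health and prediction routes taken from the `AIP_HTTP_PORT`, `AIP_HEALTH_ROUTE`, and `AIP_PREDICT_ROUTE` environment variables, and that requests and responses use the `{"instances": [...]}` / `{"predictions": [...]}` JSON shape; please check both against the current custom container requirements. The default port and routes, the `predict` function, and `build_response` are hypothetical placeholders, and the "model" just sums each instance's features.

```python
import json
import os
from http.server import BaseHTTPRequestHandler, HTTPServer

# Assumption: the platform injects these variables into the container.
# The fallback values are placeholders for running the server locally.
PORT = int(os.environ.get("AIP_HTTP_PORT", "8080"))
HEALTH_ROUTE = os.environ.get("AIP_HEALTH_ROUTE", "/health")
PREDICT_ROUTE = os.environ.get("AIP_PREDICT_ROUTE", "/predict")


def predict(instances):
    """Hypothetical stand-in model: sums the numeric features of each instance.

    Replace this with a call into your own model, loaded once at startup.
    """
    return [sum(instance) for instance in instances]


def build_response(raw_body):
    """Parse a JSON request body and return the JSON response body as bytes."""
    payload = json.loads(raw_body)
    predictions = predict(payload["instances"])
    return json.dumps({"predictions": predictions}).encode("utf-8")


class PredictionHandler(BaseHTTPRequestHandler):
    def _send(self, status, body=b""):
        # Write a complete JSON response with the given status code.
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def do_GET(self):
        # Health check: report ready as soon as the server is up.
        if self.path == HEALTH_ROUTE:
            self._send(200, b"{}")
        else:
            self._send(404)

    def do_POST(self):
        if self.path != PREDICT_ROUTE:
            self._send(404)
            return
        length = int(self.headers.get("Content-Length", 0))
        try:
            body = build_response(self.rfile.read(length))
        except (ValueError, KeyError, TypeError) as err:
            # Malformed JSON or a missing/invalid "instances" field.
            self._send(400, json.dumps({"error": str(err)}).encode("utf-8"))
            return
        self._send(200, body)


if __name__ == "__main__":
    HTTPServer(("0.0.0.0", PORT), PredictionHandler).serve_forever()
```

Running the file starts the server locally; posting `{"instances": [[1, 2], [3, 4]]}` to the predict route should return one prediction per instance. The standard-library server keeps the image small and dependency-free, but it handles one request at a time, so a production container would more likely put a framework and a proper WSGI/ASGI server in front of the model.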