Open pjchungmd opened 1 year ago
I guess the sweet spot is something like
boot_disk {
initialize_params {
image = module.gce-container.source_image
# default disk size (in GBs)
size=10
}
}
And document this in the readme. This means users know exactly were to modify the main.tf directly in case they run into the same disk full issues. Something like https://github.com/bentoml/aws-ec2-deploy#troubleshooting could help too.
What do you think @pjchungmd ?
Issue
When deploying a pretrained model from Huggingface, in particular a model which is downloaded from Huggingface after the container has been setup, the default disk size of 10 Gi is too small and causes unexpected errors.
Possible Solution
In the
terraform_default.tf
file, we can change theboot_disk
section from:to something like:
However this may be a waste for most use cases. Another possibility is adding
size
toOPERATOR_SCHEMA
inoperator_config.py
or just updating the README to instruct the user to set the size if they are encountering problems.