awslabs / data-on-eks

DoEKS is a tool to build, deploy and scale Data & ML Platforms on Amazon EKS
https://awslabs.github.io/data-on-eks/
Apache License 2.0
592 stars 200 forks

Add model caching for Stable Diffusion #448

Open ratnopamc opened 6 months ago

ratnopamc commented 6 months ago

What is the outcome that you are trying to reach?

Currently, the Stable Diffusion model is fetched directly from Hugging Face, which adds to inference latency. It would be better to download the model once and load it from storage such as S3.

Describe the solution you would like

Update the instructions to download the model from Hugging Face. Store the model in S3 or any other storage. Load the model from the storage during inference.
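The download-once, load-from-storage flow described above could look roughly like the sketch below. It is not taken from the DoEKS blueprints: `ensure_model_cached`, `cache_dir_for`, the `.complete` marker file, and the `download_fn` callback are hypothetical names chosen for illustration, and `cache_root` is assumed to be any directory backed by the chosen storage (for example, a path where an S3 bucket is mounted).

```python
from pathlib import Path
from typing import Callable


def cache_dir_for(model_id: str, cache_root: Path) -> Path:
    """Map a model id such as "org/name" to a single directory under cache_root."""
    return cache_root / model_id.replace("/", "--")


def ensure_model_cached(
    model_id: str,
    cache_root: Path,
    download_fn: Callable[[str, Path], None],
) -> Path:
    """Return a directory holding the model, downloading it only on a cache miss.

    download_fn is a caller-supplied (hypothetical) callback that writes the
    model files for model_id into the given directory.
    """
    target = cache_dir_for(model_id, cache_root)
    marker = target / ".complete"  # written only after a successful download

    if marker.exists():
        # Cache hit: skip the model hub entirely.
        return target

    # Cache miss (or an earlier download was interrupted): fetch the files.
    target.mkdir(parents=True, exist_ok=True)
    download_fn(model_id, target)
    marker.touch()
    return target
```

In a real deployment, `download_fn` might wrap `huggingface_hub.snapshot_download(repo_id=model_id, local_dir=target)`, and the inference code would load the pipeline from the returned directory instead of from the hub. Both of those choices are assumptions, not something this issue specifies.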

Describe alternatives you have considered

Additional context

lindarr915 commented 1 month ago

I would like to work on this

lindarr915 commented 1 month ago

Storing ML models on S3 mountpoints and local NVMe storage can reduce cold start times.
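One way to combine the two tiers is to treat the S3 mount as the source of truth and copy the model to NVMe the first time a node needs it. The sketch below assumes the bucket is already mounted as a regular directory. `stage_model`, `s3_mount`, and `nvme_root` are hypothetical names, not part of any existing blueprint.

```python
import shutil
from pathlib import Path


def stage_model(model_name: str, s3_mount: Path, nvme_root: Path) -> Path:
    """Return a local NVMe path for the model, copying from the S3 mount if needed."""
    local = nvme_root / model_name
    if local.is_dir():
        # Already staged on this node: load straight from NVMe.
        return local

    source = s3_mount / model_name
    if not source.is_dir():
        raise FileNotFoundError(f"model not found on S3 mount: {source}")

    # Copy into a temporary directory first, then rename, so an interrupted
    # copy never looks like a complete model.
    partial = nvme_root / (model_name + ".partial")
    if partial.exists():
        shutil.rmtree(partial)
    shutil.copytree(source, partial)
    partial.rename(local)
    return local
```

Only the first pod on a node pays the copy cost. Later cold starts on the same node read the weights from local disk.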

ratnopamc commented 1 month ago

Thanks @lindarr915; assigning to you. Looking forward to the PR!