deepgram / self-hosted-resources

Official Deepgram resources for deploying Deepgram services in a self-hosted environment
https://developers.deepgram.com
ISC License
Helm chart - copy models from NFS storage to attached storage #10

Open bd-g opened 3 months ago

bd-g commented 3 months ago

Proposed changes

The AWS and GCP default configurations have the Engine Pods read models from shared network-attached storage (NFS). This can increase disk read latency and, in turn, the latency of any request that triggers a model load. Streaming requests are particularly sensitive to this added latency.

There should be an option to copy models from NFS onto the Pod's attached host storage to reduce model read latency. The copy could run once on startup, and could optionally poll NFS afterwards for updated models.
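
To make the proposal concrete, here is a minimal sketch of the copy-then-poll behaviour in Python. It is not part of the Helm chart and does not reflect any existing Deepgram tooling. The function names, the polling interval, and the idea of running it as an init container or sidecar are all assumptions for illustration. It copies any file that is missing locally or whose size or modification time differs, and writes through a temporary `.partial` file so a reader never sees a half-copied model.

```python
import shutil
import time
from pathlib import Path
from typing import Optional


def sync_models(nfs_dir: Path, local_dir: Path) -> list[str]:
    """Copy new or changed files from nfs_dir to local_dir.

    Returns the names of the files that were copied. Hypothetical helper,
    not an existing Deepgram or Helm API.
    """
    local_dir.mkdir(parents=True, exist_ok=True)
    copied = []
    for src in sorted(nfs_dir.iterdir()):
        if not src.is_file():
            continue
        dst = local_dir / src.name
        src_stat = src.stat()
        if dst.exists():
            dst_stat = dst.stat()
            # copy2 preserves mtime, so an up-to-date copy matches on both.
            same_size = dst_stat.st_size == src_stat.st_size
            not_older = dst_stat.st_mtime >= src_stat.st_mtime
            if same_size and not_older:
                continue
        # Copy to a temporary name, then rename, so the final path only
        # ever holds a complete file.
        tmp = dst.with_name(dst.name + ".partial")
        shutil.copy2(src, tmp)
        tmp.replace(dst)
        copied.append(src.name)
    return copied


def run(nfs_dir: Path, local_dir: Path,
        poll_seconds: float = 300.0,
        max_polls: Optional[int] = None) -> None:
    """Sync once on startup, then poll NFS for updated models.

    max_polls=None polls forever; max_polls=1 gives the one-shot
    startup copy on its own.
    """
    polls = 0
    while True:
        sync_models(nfs_dir, local_dir)
        polls += 1
        if max_polls is not None and polls >= max_polls:
            return
        time.sleep(poll_seconds)
```

With a one-shot call such as `run(Path("/nfs/models"), Path("/local/models"), max_polls=1)` (both paths hypothetical), the Engine would then be pointed at the local directory instead of the NFS mount. Comparing size and modification time is cheap but not a content check; a checksum comparison would be stricter at the cost of reading every file over NFS on each poll.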