InftyAI / llmaz-dashboard

A web console for llmaz.
https://github.com/InftyAI/llmaz
Apache License 2.0
3 stars 5 forks source link

Accelerate the model weight loading in kubernetes #15

Open kerthcet opened 12 months ago

kerthcet commented 12 months ago

Because models are quite big with GB size, we should think of an efficient way to load models, e.g.

kerthcet commented 12 months ago

For the first trial, let's use localPath with a pvc mounted. :)