InftyAI / llmaz

☸️ Easy, advanced inference platform for large language models on Kubernetes
Apache License 2.0
13 stars 5 forks source link

Failover policy for various backends #86

Open kerthcet opened 4 weeks ago

kerthcet commented 4 weeks ago

What would you like to be added:

Different backends support a large range of popular models, but not all models are supported, so we should have a failover policy for them, at least this policy should be supported in Playground.

Why is this needed:

Completion requirements:

This enhancement requires the following artifacts:

The artifacts should be linked in subsequent comments.

kerthcet commented 4 weeks ago

/priority important-longterm