cncf / tag-runtime

🏃🏿‍♀️🏃🏽‍♀️🏃🏻‍♂️🕒CNCF Technical Advisory Group for Runtime
https://tag-runtime.cncf.io
Apache License 2.0

Cloud Native LLM runtime proposal #164

Open daixiang0 opened 5 months ago

daixiang0 commented 5 months ago

For Python developers, litellm may be a good choice. It would be great if we could provide the same thing in a runtime, so that developers in any language can use LLMs idiomatically rather than wrapping HTTP/gRPC calls themselves.

We now have many LLM APIs, such as OpenAI, Azure AI, Cohere, LLaMA, AWS Bedrock, KServe, and OpenVINO, and migrating from one to another still requires many code changes.

I propose that we solve this with a Cloud Native LLM runtime, so that developers can migrate from one provider to another by changing configuration only.
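
To make the "migrate by config" idea concrete, here is a minimal sketch of what the application side could look like. Everything in it is an assumption for illustration: `RuntimeConfig`, `register`, `complete`, and the `provider-a` / `provider-b` adapters are hypothetical names, not part of any existing runtime or of litellm, and the adapters return canned strings instead of calling a real backend.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass(frozen=True)
class RuntimeConfig:
    """Hypothetical runtime config; only these values change on migration."""
    provider: str
    model: str


# Registry of provider adapters. Each adapter takes (model, prompt) and returns text.
_PROVIDERS: Dict[str, Callable[[str, str], str]] = {}


def register(name: str):
    """Decorator that registers an adapter under a provider name."""
    def wrap(fn: Callable[[str, str], str]) -> Callable[[str, str], str]:
        _PROVIDERS[name] = fn
        return fn
    return wrap


@register("provider-a")
def _provider_a(model: str, prompt: str) -> str:
    # Stand-in for a real backend call (HTTP/gRPC); no network access here.
    return f"[provider-a/{model}] {prompt}"


@register("provider-b")
def _provider_b(model: str, prompt: str) -> str:
    # A second stand-in backend with the same adapter signature.
    return f"[provider-b/{model}] {prompt}"


def complete(config: RuntimeConfig, prompt: str) -> str:
    """Route a prompt to whichever provider the config names."""
    try:
        adapter = _PROVIDERS[config.provider]
    except KeyError:
        raise ValueError(f"unknown provider: {config.provider!r}") from None
    return adapter(config.model, prompt)


if __name__ == "__main__":
    # Application code stays the same; only the config value differs.
    for cfg in (RuntimeConfig("provider-a", "m1"), RuntimeConfig("provider-b", "m2")):
        print(complete(cfg, "hello"))
```

The point of the sketch is that the call site (`complete(cfg, "hello")`) never changes; switching backends means editing the config. In a real runtime the adapters would presumably live outside the application process, which is what would let developers in other languages share them.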

zanetworker commented 5 months ago

Related doc here: https://docs.google.com/document/d/1FQN_hGhTNeoTgV5Jj16ialzaSiAxC0ozxH1D9ngCVew/edit

daixiang0 commented 4 months ago

The whole proposal is here; it also points out the difference between a gateway and a runtime.