Open jaffe-fly opened 3 weeks ago
+1
cc @qinxuye
Is it supported in HuggingFace?
Oh, hi @terrytangyuan , xinference is an inference platform which integrated transformers, vllm, and llama.cpp as engines, it’s not directly supported by huggingface.
I see. @jaffe-fly Could you update the title and description to reflect that?
/kind feature
Describe the solution you'd like
Hope add https://github.com/xorbitsai/inference as the kserve huggingface LLMs serving runtime
Xorbits Inference(Xinference) is a powerful and versatile library designed to serve language, speech recognition, and multimodal models. With Xorbits Inference, you can effortlessly deploy and serve your or state-of-the-art built-in models using just a single command. Whether you are a researcher, developer, or data scientist, Xorbits Inference empowers you to unleash the full potential of cutting-edge AI models.
xinference is an inference platform which integrated transformers, vllm, and llama.cpp as engines, it’s not directly supported by huggingface.