Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
Feature request / 功能建议
支持minicpm-Embedding https://huggingface.co/openbmb/MiniCPM-Embedding
Motivation / 动机
minicpm-Embedding + minicpm --Reranker 发挥性能最佳,希望支持。
Your contribution / 您的贡献
https://huggingface.co/openbmb/MiniCPM-Embedding