Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
各位老师好,我们现在试用xinference部署Embbeding, 发现部署multilingual-e5-large 向量后,用 Qdrant + OpenAIEmbedding (通过 http://ip:9997 方式) 方式查询应用时查询的数据不准确,但用Qdrant + HuggingFaceEmbeddings (调用本地目录的 multilingual-e5-large 模型文件)查询应用时查询的数据是准确的。后来又换用GTE 向量模型,也是一样的效果,请问这可能是哪方面的影响,或者是调用xinference时有哪些特殊注意点的地方, 谢谢!