[Request] 自定义嵌入模型 Custom Embedding model

lobehub / lobe-chat

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS/Plugins/Artifacts). One-click FREE deployment of your private ChatGPT/ Claude application.

https://chat-preview.lobehub.com

Other

46.71k stars 10.3k forks source link

[Request] 自定义嵌入模型 Custom Embedding model #3785

Open ProAlexUSC opened 2 months ago

ProAlexUSC commented 2 months ago

🥰 需求描述

目前嵌入模型仅支持 text-embedding-3-small，能否支持其他模型，或者提供模型 remap 的能力?

At present, the embedded model only supports text-embedding-3-small. Can it support other models, or provide the ability for model remapping?

🧐 解决方案

我在 OpenAI 模型配置下，使用其他兼容 OpenAI 接口的大模型提供商，它提供了 netease-youdao/bce-embedding-base_v1 模型。希望能使用这个模型作为嵌入模型，或者直接重命名 netease-youdao/bce-embedding-base_v1 为 text-embedding-3-small

Under the OpenAI model configuration, I am using a large model provider that is compatible with the OpenAI API, which offers the netease-youdao/bce-embedding-base_v1 model. I hope to use this model as the embedding model, or directly rename netease-youdao/bce-embedding-base_v1 to text-embedding-3-small.

📝 补充信息

No response

lobehubbot commented 2 months ago

👀 @ProAlexUSC

Thank you for raising an issue. We will investigate into the matter and get back to you as soon as possible. Please make sure you have given us as much context as possible.\ 非常感谢您提交 issue。我们会尽快调查此事，并尽快回复您。请确保您已经提供了尽可能多的背景信息。

dosubot[bot] commented 2 months ago

@ProAlexUSC 你好！我是Dosu，我是一个可以帮助你解决 bug、回答问题，并成为贡献者的机器人。如果你有任何问题，我会很乐意帮助你。

You can add support for other embedding models or remap existing models within the system. The models.json file in the src/libs/agent-runtime/togetherai/fixtures/ directory allows you to specify model remapping. For example, you can rename netease-youdao/bce-embedding-base_v1 to text-embedding-3-small as follows:

{
  "name": "netease-youdao/bce-embedding-base_v1",
  "renamed": "text-embedding-3-small",
  ...
}

This will allow the system to use the netease-youdao/bce-embedding-base_v1 model under the name text-embedding-3-small ^[1].

To continue talking to Dosu, mention @dosu.

arvinxx commented 2 months ago

后续会支持的，但是这个模型的向量必须要能设定 1024 维才能用

lobehubbot commented 2 months ago

Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑‍🤝‍🧑👫🧑🏿‍🤝‍🧑🏻👩🏾‍🤝‍👨🏿👬🏿

It will be supported in the future, but the vector of this model must be able to set 1024 dimensions before it can be used.

Lostsite commented 2 months ago

后续会支持的，但是这个模型的向量必须要能设定 1024 维才能用

Why must be 1024 dimensions? Also, based on OpenAI, "text-embedding-3-small" is 1,536 dimensions.

ref: https://platform.openai.com/docs/models/embeddings

arvinxx commented 2 months ago

@Lostsite Just read the offical docuement.

https://openai.com/index/new-embedding-models-and-api-updates/

https://platform.openai.com/docs/guides/embeddings/use-cases

exqmjmz commented 2 months ago

后续会支持的，但是这个模型的向量必须要能设定 1024 维才能用

BAAI/bge-m3 这个向量支持1024纬度在siliconflow有这个模型，目前是免费 https://docs.siliconflow.cn/reference/createembedding-1

CaiJingLong commented 2 months ago

So, must have OpenAI API to use it? Why not support ollama embedding. Or, if it support ollama embedding, how to set it?

Btw, I use self host server mode to deploy with docker-compose.

exqmjmz commented 2 months ago

智谱的embedding-3也支持1024纬度以上的向量 https://open.bigmodel.cn/dev/api/vector/embedding-3

alwqx commented 2 months ago

后续会支持的，但是这个模型的向量必须要能设定 1024 维才能用

赞，希望支持 ollama 运行的嵌入模型

lobehubbot commented 2 months ago

Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑‍🤝‍🧑👫🧑🏿‍🤝‍🧑🏻👩🏾‍🤝‍👨🏿👬🏿

It will be supported in the future, but the vector of this model must be able to set 1024 dimensions before it can be used.

Like, I hope to support the embedded model run by ollama

JunerLee commented 2 months ago

后续会支持的，但是这个模型的向量必须要能设定 1024 维才能用

目前也是遇到了这个问题，服务端只能用openai的词嵌入向量化，国内很多模型也支持价格也不贵，希望可以支持更多家的向量化模型，比如ChatGLM的

lobehubbot commented 2 months ago

Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑‍🤝‍🧑👫🧑🏿‍🤝‍🧑🏻👩🏾‍🤝‍👨🏿👬🏿

It will be supported in the future, but the vector of this model must be able to set 1024 dimensions before it can be used.

We are currently encountering this problem as well. The server can only use openai's word embedding vectorization. Many domestic models also support it and the price is not expensive. We hope to support more vectorization models, such as ChatGLM's.

Salmon-Bard commented 2 months ago

How can I get lobe-chat to a call a Rest API that exposes my local hosted embedding model ?? My be there should be some webhook or plugin to pass embedding created via any model

AlexBlack2202 commented 1 month ago

hello, is there any plain for support my own custom embedding, i think this is not a very hard problem

arvinxx commented 1 month ago

@AlexBlack2202 No. It's a hard work for the architecture design to make it easy to intergate more embedding providers. Here is PR for it: https://github.com/lobehub/lobe-chat/pull/4370. Stay tuned