About text embedding from text tokens.

BAAI-Agents / Cradle

The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.

MIT License

1.56k stars 141 forks source link

Our OpenAI LLM provider primarily refers to LangChain's implementation. The specific reference code and reasons are as follows:

The LangChain code is https://github.com/langchain-ai/langchain/blob/master/libs/community/langchain_community/embeddings/openai.py. At line 397, LangChain states that it primarily refers to OpenAI's codebook.
The openai codebook link is https://github.com/openai/openai-cookbook/blob/main/examples/Embedding_long_inputs.ipynb. They wrote it this way primarily to address embedding texts that are longer than the model's maximum context length.

Please refer to the above code. If you have any questions, feel free to contact us. Thanks.

BAAI-Agents / Cradle

About text embedding from text tokens. #48