Open · nasirus opened 1 year ago
In response to your issue, I have looked into the code and found a possible way to share the same client between the LlamaCpp LLM and LlamaCppEmbeddings. The sketch below shows the idea; the model path is a placeholder, and it assumes both wrappers expose their underlying llama_cpp.Llama instance through their client attribute.
```python
# Import the LlamaCpp LLM and embeddings wrappers
from langchain.embeddings import LlamaCppEmbeddings
from langchain.llms import LlamaCpp

# Build both wrappers as usual; each one creates its own llama_cpp.Llama client
llm = LlamaCpp(model_path="./models/ggml-model.bin")
embeddings = LlamaCppEmbeddings(model_path="./models/ggml-model.bin")

# Point the embeddings wrapper at the LLM's client so both use one model.
# Caveat: both clients are still created at construction time, and a client
# built for text generation may not have embedding support enabled, so this
# is a sketch of the idea rather than a tested recipe.
embeddings.client = llm.client
```
Once both wrappers point at the same client, the LLM and the embeddings run against a single loaded model, which should reduce steady-state memory usage. Both models are still loaded briefly during construction, though, so the cleaner fix is the change you describe: have the root_validator create a client only when one has not already been supplied.
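For completeness, here is a quick check (assuming the assignment above) that the two wrappers really do share one client; whether the embedding call succeeds depends on how the shared client was initialised:

```python
# Both attributes should now reference the same llama_cpp.Llama instance
assert embeddings.client is llm.client

# Calls on either wrapper now go through that single shared model
vector = embeddings.embed_query("hello world")
text = llm("Say hello")
```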
I hope this helps. Please let me know if you have any further questions.
Currently, when using any chain that has LlamaCpp as its LLM together with a vector store built with LlamaCppEmbeddings, two models have to be kept in memory, because each object creates its own client when it is constructed. I was wondering whether anything is in progress to change this and reuse the same client for both objects, since it seems to be just a matter of changing how the client is set up. For example, the root_validator could initialise the client only when it has not already been set, and the client could be accepted as a parameter when constructing the object; a rough sketch of what I mean is below.
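To illustrate the idea, here is a simplified sketch using a toy pydantic (v1) model rather than the real LlamaCpp class; the class name, fields, and Llama constructor arguments are illustrative only:

```python
from typing import Any, Dict

from pydantic import BaseModel, root_validator


class SharedClientLlamaWrapper(BaseModel):
    """Toy stand-in for LlamaCpp / LlamaCppEmbeddings showing the proposed pattern."""

    model_path: str
    client: Any = None  # underlying llama_cpp.Llama instance, optionally supplied

    @root_validator()
    def validate_environment(cls, values: Dict[str, Any]) -> Dict[str, Any]:
        # Only build a new client when the caller has not supplied one,
        # so a single llama_cpp.Llama instance can be shared between objects.
        if values.get("client") is None:
            from llama_cpp import Llama  # imported lazily, as langchain does

            values["client"] = Llama(model_path=values["model_path"])
        return values
```

With a change along these lines, the embeddings object could simply be constructed with the LLM's existing client, for example:

```python
llm = SharedClientLlamaWrapper(model_path="./models/ggml-model.bin")
embeddings = SharedClientLlamaWrapper(model_path="./models/ggml-model.bin", client=llm.client)
assert embeddings.client is llm.client  # only one model is loaded
```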