Closed CharlesBdg closed 5 months ago
Agents do not run unless you invoke them, they have no impact on regular RAG chats
Thanks for the information.
Here what I've found, I have embed the Wikipedia page of OpenAI Default settings on the workspace. Question : "How much Microsoft invested ?" Answer : "Microsoft invested $1 billion in OpenAI, a non-profit artificial intelligence research organization, in 2019. This investment was part of a multi-year partnership aimed at advancing AI capabilities and promoting responsible AI development." Real answer should be : "Microsoft provided OpenAI Global LLC with a $1 billion investment in 2019 and a $10 billion investment in 2023" When I look at the "Citation", I don't even see the answer the LLM gave to me. And every chunk of text have at least "60% match" but in reality, it's totally wrong (see pictures)
I am using gpt-4o and text-embedding-ada-002 in LLM and embedding respectively. Below is my result after adding the Wikipedia page of OpenAI:
Q: How much has Microsoft invested in OpenAI? A: Microsoft has invested a total of $11 billion in OpenAI, with an initial $1 billion investment in 2019 and an additional $10 billion investment in 2023.
It seems I had two issues :
Now everything is back to normal and work has expected. Thank you for your help.
How are you running AnythingLLM?
Docker (local)
What happened?
Hello, Since 3 weeks, I get really bad result at RAG. Before around 18th April, I was able to get accurate information from my files (around 150 txt files). Now, the LLM always said he didn't find any relevant context. When I click on "Show citation", the file where the information is, is not showing up. For example : I have a file called "How to enable Warp / Zero Trust" (from Cloudflare). Inside the file it is written "How to enable Warp / Zero Trust". I ask the question to the LLM "How to enable Warp / Zero Trust", Output : "Sorry I didn't find any relevant context" and the file is not in "Show citation".
LLM : Ollama local / llama3, phi3, openchat, mistral, same output Embedding : Ollama / mxbai-embed-large Vector database : LanceDB or Milvus (I've already tried a hard reset of the DB).
I even tested with the Desktop version (v1.5.4) (in case it was a Docker issue) but same issue.
I've already tried tweaking in "Vector Database" section of the Workspace the "Max Context Snippets" and "Document similarity threshold" but no result.
I don't know if something break with the merged of Agent or the bump version in lancedb deps or bump langchain deps or the code that do the RAG.
Maybe add the ability to completely disable the Agent and get back the old RAG.
PS : Great project, thank you
Are there known steps to reproduce?
No response