digitalfabrik / integreat-chat

Interface to self-hosted large language models and vector databases to provide improved Integreat Chat functionality
https://integreat-app.de
MIT License

Use entire pages as context to LLM after retrieving chunks #64

Closed dasgoutam closed 2 weeks ago

dasgoutam commented 1 month ago

Implement a new retrieval strategy and evaluate its performance:

  1. Retrieve smaller chunks from the vector database (VDB) using the retriever
  2. Reduce the ranked results to the first 3 unique Integreat pages
  3. Retrieve those pages from the Page Children API
  4. Use the pages as context for the LLM to extract the answer
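
The steps above can be sketched roughly as follows. This is a minimal illustration, not the code in this repository: it assumes the retriever returns chunks already ranked best-first, and that each chunk carries a `page_id` field. The names `top_unique_pages`, `build_context`, and `fetch_page` are hypothetical, and `fetch_page` is only a stand-in for a call to the Page Children API.

```python
from typing import Callable, Dict, Iterable, List


def top_unique_pages(chunks: Iterable[Dict], limit: int = 3) -> List[str]:
    """Return the first `limit` distinct page ids, preserving rank order.

    Assumption: `chunks` is already sorted best-first by the retriever and
    every chunk has a "page_id" key (hypothetical field name).
    """
    page_ids: List[str] = []
    for chunk in chunks:
        page_id = chunk["page_id"]
        if page_id not in page_ids:
            page_ids.append(page_id)
            if len(page_ids) == limit:
                break
    return page_ids


def build_context(page_ids: List[str], fetch_page: Callable[[str], str]) -> str:
    """Fetch the full text of each page and join it into one context string.

    `fetch_page` is a hypothetical callable standing in for the
    Page Children API request.
    """
    return "\n\n".join(fetch_page(page_id) for page_id in page_ids)


if __name__ == "__main__":
    # Made-up retriever output: several chunks may point at the same page.
    ranked_chunks = [
        {"page_id": "german-courses", "text": "chunk 1"},
        {"page_id": "german-courses", "text": "chunk 2"},
        {"page_id": "job-center", "text": "chunk 3"},
        {"page_id": "housing", "text": "chunk 4"},
        {"page_id": "health", "text": "chunk 5"},
    ]
    # In-memory stand-in for the pages the API would return.
    fake_pages = {
        "german-courses": "Full page about German courses.",
        "job-center": "Full page about the job center.",
        "housing": "Full page about housing.",
    }
    pages = top_unique_pages(ranked_chunks)
    context = build_context(pages, fake_pages.__getitem__)
    print(pages)
    print(context)
```

Injecting `fetch_page` keeps the deduplication logic testable without network access; the resulting context string would then be placed into the LLM prompt.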
dasgoutam commented 3 weeks ago

Results from using pages as context for the LLM:

[Three screenshots of the results, dated 2024-11-06]
dasgoutam commented 3 weeks ago

"How can I learn French" still provides how to learn German. This is expected with the relevance check OFF; maybe lowering the threshold a bit more can help. But the results are consistent otherwise so far