danswer-ai / danswer

Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.
https://docs.danswer.dev/
Other
10.36k stars 1.24k forks source link

Customize Chunk Size Per Connector #1632

Open stianrincon opened 3 months ago

stianrincon commented 3 months ago

After using Danswer for a while at BrightInsight, we propose adding a feature to customize chunk sizes when creating a connector.

Main Goals:

  1. Customize Chunk Size: Allow increasing or decreasing the vector database chunk size. Currently, this is set by DOC_EMBEDDING_CONTEXT_SIZE.
  2. Customize Chunk Overlap: Allow increasing or decreasing the vector database chunk overlap. Currently, this is set by CHUNK_OVERLAP.

Specific Details: