Open JCTaz9 opened 1 month ago
Hi, this is definitely something we are looking at refactoring as running the ingestion inside the endpoint doesn't match with background processing system. Have you looked into using the file connector?
Tracking refactoring of this as DAN-867.
We use the indexing API to embed a lot of documents with a structure like in the documentation (by section). https://docs.danswer.dev/backend_apis/ingestion It takes a very long time to embed all of our documents because for every document, it's making one api call. Would it be possible to embed in batch when using the ingestion api?
Right now it takes 13 days embedding what would take a few hours with the files connectors.