aws-samples / Serverless-Retrieval-Augmented-Generation-RAG-on-AWS

A full-stack serverless RAG workflow. This is thought for running PoCs, prototypes and bootstrap your MVP.
MIT No Attribution
44 stars 17 forks source link

Load Test: concurrent usage #10

Open giusedroid opened 3 months ago

giusedroid commented 3 months ago

right now we have limited the execution of the ingestion function to 1. LanceDB has a native lock system based on DynamoDB locks currently in beta. We should experiment with this and remove the artificial limitation of 1 max concurrent execution for the writer.

Once this is implemented, we should run load tests to understand impact on retrieval performance against the same (user+knowledge-base).