almutareb / insurance-advisor-agent

0 stars 0 forks source link

Add prompt compression to reduce token #43

Open almutareb opened 2 months ago

almutareb commented 2 months ago

Is your feature request related to a problem? Please describe. Implement prompt compression, to reduce the number of tokens of the query and retrieved documents before sending them to the LLM for generation

Describe the solution you'd like A clear and concise description of what you want to happen.

Describe alternatives you've considered A clear and concise description of any alternative solutions or features you've considered.

Additional context check selective_context or LLMlingua