Is your feature request related to a problem? Please describe.
Implement prompt compression, to reduce the number of tokens of the query and retrieved documents before sending them to the LLM for generation
Describe the solution you'd like
A clear and concise description of what you want to happen.
Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.
Additional context
check selective_context or LLMlingua
Is your feature request related to a problem? Please describe. Implement prompt compression, to reduce the number of tokens of the query and retrieved documents before sending them to the LLM for generation
Describe the solution you'd like A clear and concise description of what you want to happen.
Describe alternatives you've considered A clear and concise description of any alternative solutions or features you've considered.
Additional context check selective_context or LLMlingua