This spike won't be merged; it is for experimental purposes only. We are checking whether reranking with small models (running on CPU) can provide good results, and evaluating the RAM usage cost of this method.
For this experiment, we target only the Gen AI Orchestrator. We will not alter the Bot API, so the change can be tested by deploying only the Gen AI Orchestrator, with configuration based on environment variables.
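
To illustrate the environment-variable-driven approach, here is a minimal sketch, not the code in this PR. The variable names (`RERANKER_ENABLED`, `RERANKER_MODEL_NAME`, `RERANKER_TOP_K`), the `RerankerSettings` class and the helper functions are all hypothetical, and the token-overlap scorer is only a stand-in for a small CPU model so that the example runs without any model download.

```python
import os
from dataclasses import dataclass
from typing import Callable, List, Mapping, Optional, Sequence


@dataclass(frozen=True)
class RerankerSettings:
    """Hypothetical settings; the names are assumptions, not the PR's actual variables."""
    enabled: bool
    model_name: str
    top_k: int


def load_reranker_settings(env: Optional[Mapping[str, str]] = None) -> RerankerSettings:
    """Read the reranker configuration from environment variables."""
    env = os.environ if env is None else env
    enabled_raw = env.get("RERANKER_ENABLED", "false").strip().lower()
    return RerankerSettings(
        enabled=enabled_raw in ("1", "true", "yes"),
        # Placeholder default, not a real model identifier.
        model_name=env.get("RERANKER_MODEL_NAME", "placeholder-small-model"),
        top_k=int(env.get("RERANKER_TOP_K", "3")),
    )


def overlap_score(query: str, document: str) -> float:
    """Stand-in scorer: fraction of query tokens found in the document.

    A real experiment would call the small model here instead.
    """
    query_tokens = set(query.lower().split())
    if not query_tokens:
        return 0.0
    return len(query_tokens & set(document.lower().split())) / len(query_tokens)


def rerank(
    query: str,
    documents: Sequence[str],
    settings: RerankerSettings,
    score: Callable[[str, str], float] = overlap_score,
) -> List[str]:
    """Return the top_k documents by score, or the input unchanged when disabled."""
    if not settings.enabled:
        return list(documents)
    ranked = sorted(documents, key=lambda doc: score(query, doc), reverse=True)
    return ranked[: settings.top_k]
```

Because the settings are read from the environment, reranking in this sketch can be switched on or off at deployment time without touching any other component. Note that the stand-in scorer says nothing about RAM usage; that has to be measured with the actual model loaded.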
dercbot 1047