Closed Torhamilton closed 1 year ago
I added the feature to perform summarization in the background for messages larger than 512 tokens. When summarization is finished, the result is added to the MessageHistory, and when sent to llm, the summarized text is sent instead of the original text. The token threshold can be changed in ChatConfig.
5b2d56f0ba18ac65cc3b453bb4830096bc7a6187
Perfect!
I propose we use a two LLMs approach to cut them on the cost of using gpt4 and all expensive future variants.
This mostly applies if you are using GTP4, but why use anything else :)
You have:
This may even get gpt4 to be more focus and on point