JasonMaggard closed this issue 1 year ago
@JasonMaggard This is by design. In general, the summarizer only summarizes messages beyond the message window. More about how it operates may be found here: https://docs.getzep.com/sdk/extractors/#summarizer-extractor
Rather than limiting the number of messages returned by passing the limit
parameter, you may want to reduce the message window itself.
Configuration details may be found here: https://docs.getzep.com/deployment/config/
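To make the behaviour described above concrete, here is a minimal toy model in Python. It is not Zep's implementation and does not call the Zep API: `get_memory`, `MemoryView`, and the `message_window` and `limit` arguments are hypothetical names used only for illustration. It assumes the summary covers only messages that have fallen out of the message window, and that `limit` merely trims what is returned.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class MemoryView:
    summary: Optional[str]  # None when no message has left the window yet
    messages: List[str]     # most recent messages, oldest first


def get_memory(history: List[str], message_window: int,
               limit: Optional[int] = None) -> MemoryView:
    """Toy model (assumption): only messages older than the window are summarized."""
    # Index of the first message that is still inside the window.
    cut = max(len(history) - message_window, 0)
    overflow, recent = history[:cut], history[cut:]

    # A summary exists only if something has overflowed the window.
    summary = f"summary of {len(overflow)} message(s)" if overflow else None

    # `limit` only trims the returned messages; it does not change the summary.
    if limit is not None:
        recent = recent[max(len(recent) - limit, 0):]
    return MemoryView(summary=summary, messages=recent)


history = [f"message {i}" for i in range(1, 11)]  # 10 messages

# Window of 12, last 4 requested: nothing has overflowed, so no summary.
print(get_memory(history, message_window=12, limit=4))

# Window reduced to 4: the 6 older messages are now covered by the summary.
print(get_memory(history, message_window=4))
```

Under these assumptions, lowering the limit alone leaves the messages between the limit and the window unrepresented, while shrinking the window moves them into the summary.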
I have noticed the same behaviour using both the env variable ZEP_MEMORY_MESSAGE_WINDOW and the limit parameter.
@JasonMaggard Looks like you closed the issue before I could respond. Did you successfully set the message window?
The size of our prompt was making our AI slow. With doc stores and history, our prompts were getting into the 1,000 token range and prompt eval time got up to 2 minutes.
When I tried to limit the history to the last 5 messages, I noticed that the summary was never returned until there were 12 messages in history. When the summary is returned, it only covers the messages beyond those 12.
In this instance, when asking for only the last three memory messages, the summary reflects neither the questions about whether Henry Cavill or Freya Allan are good actors nor the answers to them.
This is easily replicated. Just load 10 messages into history and request the last 4. There will be no summary. Thanks.