If you naively trim the oldest messages from context, you'll have to rebuild the cache every message once you hit the context limit. This extension instead trims N messages at a time, so you only have to rebuild the cache every N messages.
There are two settings you can configure for this extension:
max context length - character card length
. I'd like to automate this in the future.