Closed milanglacier closed 5 months ago
I connot reproduce the problem.
Your description is very messy. There is a lot of false information.
compress_threshold
is 2000, other than 1000.compress_threshold < 1000
, the compress session will be disabled. The only exception is that session has its own compress_session
set separately.If you can add more information, I will open this issue again.
I realized that compress_session
is renamed to compress_threshold
. Changed compress_session
to compress_threshold
solves the problem.
config:
compress_session: 0
Start with the following session:
Side Note
I personally think
1000
is too small as a default value, literally 1000 tokens restricted the conversation within 1-2 replies.Either turned off by default (the user who uses the expensive LLMs (gpt4, claude3-opus) already takes their own full responsibility for the potential expenses that may yield, just like people who uses free software takes all the responsibility that how potential bugs will influence their apps / computational results)
or at least 8K as the default value.