Make messages expiration time lower, in hours not days
With gpt requests consider not sending the entire conversation if it's more than N messages take last N only, as some conversations can have 300+ messages sent with the request, study this performance.
Make messages expiration time lower, in hours not days
With gpt requests consider not sending the entire conversation if it's more than N messages take last N only, as some conversations can have 300+ messages sent with the request, study this performance.