Open krrishdholakia opened 9 months ago
Allow setting duration for max parallel requests
This duration could then be set as the TTL on the cache: https://github.com/BerriAI/litellm/blob/3026e5aa580c1a7431ffd35b4d80b300e771e29b/litellm/proxy/hooks/parallel_request_limiter.py#L44
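To make the idea concrete, here is a minimal sketch of a request counter whose expiry (the TTL) is configurable per user. It is not the litellm implementation: `DurationLimiter`, `set_limit`, `try_acquire`, and the in-memory dicts are all hypothetical names invented for illustration, and it assumes the counter simply resets once the configured duration has elapsed.

```python
import time
from typing import Callable, Dict, Optional, Tuple


class DurationLimiter:
    """Hypothetical sketch: per-user request limit with a per-user duration (TTL)."""

    def __init__(self, default_duration: float = 60.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._default_duration = default_duration
        self._clock = clock  # injectable so the expiry logic can be tested
        self._limits: Dict[str, Tuple[int, float]] = {}    # user -> (max_requests, duration)
        self._counters: Dict[str, Tuple[int, float]] = {}  # user -> (count, expires_at)

    def set_limit(self, user_id: str, max_requests: int,
                  duration: Optional[float] = None) -> None:
        # The duration is stored per user, so different users can have different windows.
        if duration is None:
            duration = self._default_duration
        self._limits[user_id] = (max_requests, duration)

    def try_acquire(self, user_id: str) -> bool:
        # Users without a configured limit are never blocked (an assumption of this sketch).
        if user_id not in self._limits:
            return True
        max_requests, duration = self._limits[user_id]
        now = self._clock()
        count, expires_at = self._counters.get(user_id, (0, now + duration))
        if expires_at <= now:
            # The cache entry has outlived its TTL: start a fresh window.
            count, expires_at = 0, now + duration
        if count >= max_requests:
            return False
        self._counters[user_id] = (count + 1, expires_at)
        return True
```

With something like this, one user could be given `duration=12 * 3600` and another `duration=60`, which is roughly the per-user behaviour described in the motivation. A real version would presumably read the duration from the key or user settings and pass it as the TTL to the existing cache rather than keeping its own dicts.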
The Feature
Allow setting duration for max parallel requests
Motivation, pitch
A user built their own version of this, setting the duration on a per day (12 hrs) basis, with a different duration for different users.
Twitter / LinkedIn details
No response