Configuring rate limiter (throttler)

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—language models

https://dspy.ai

MIT License

19.37k stars 1.47k forks source link

Closed denisergashbaev closed 1 month ago

denisergashbaev commented 1 month ago

LiteLLM provides rate limit aware routing strategy that routes the call to the deployment with the lowest tokens per minute value (see https://github.com/BerriAI/litellm/discussions/4510, https://docs.litellm.ai/docs/routing#advanced---routing-strategies-%EF%B8%8F).

How could we configure it in DSPy?

denisergashbaev commented 1 month ago