Closed kmyczkowska-hypatos closed 3 months ago
Good point - will work on this now
PR here @kmyczkowska-hypatos https://github.com/BerriAI/litellm/pull/2493
@kmyczkowska-hypatos can we setup a direct Slack connect with your team to address your issues faster ?
What's the best email to send a slack connect to?
Here's my linkedin if you want to dm me there https://www.linkedin.com/in/reffajnaahsi/
What happened?
The problem happened while performing load tests (1100 prompts) on the same model, with 0 retries. The following HTTP status codes were returned by LiteLLM[proxy]: 200: 864 occurrences 429: 219 occurrences 500: 17 occurrences
While 200 and 429 are expected, the 500 isn't. The error message doesn't tell why models aren't available. If it's because of quota, why not return 429? It would be good to have information if the error is recoverable. The error is repeatable.
Relevant log output
Twitter / LinkedIn details
No response