aliensouls opened 3 months ago
I see something like this already implemented (`class RPMController(BaseModel):`), but it doesn't seem to work in my case. Where is the limit specified? The limits differ per model and per account tier, and my current low tier makes my limits lower than others' — is the value hardcoded, or fetched from OpenAI for my specific account? In any case, it doesn't work: I crash with a "tokens per minute limit reached, please wait" error from the OpenAI API during one of the tasks, and everything goes kaboom 💥
It looks like the agents cannot retry, or retry with exponential backoff, when they receive a rate-limit response from the OpenAI API. The tokens-per-minute limit is easy to hit when many agents work with large contexts, so the agents should simply wait and retry — but at the moment this halts the crew and it never resumes. Would it be possible to add a timeout and retry instead of halting all operation when OpenAI (or any other API) throttles a request?
example of implementation: https://github.com/openai/openai-cookbook/blob/main/examples/api_request_parallel_processor.py#L123
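The linked cookbook example is fairly elaborate; the core idea can be sketched much more simply. This is only an illustration of the retry-with-exponential-backoff pattern I'm asking for — `retry_with_backoff` is a hypothetical helper, not an existing crewAI function:

```python
import random
import time


def retry_with_backoff(fn, max_retries=5, base_delay=1.0, max_delay=60.0,
                       retry_on=(Exception,)):
    """Call fn(); on a retryable error, wait base_delay * 2**attempt seconds
    (plus jitter, capped at max_delay) and try again instead of crashing."""
    for attempt in range(max_retries + 1):
        try:
            return fn()
        except retry_on:
            if attempt == max_retries:
                raise  # out of retries: surface the error to the caller
            delay = min(base_delay * (2 ** attempt), max_delay)
            # small random jitter so many agents don't all retry in lockstep
            time.sleep(delay + random.uniform(0, delay * 0.1))
```

In the crew's LLM-call path, something like this could wrap the completion request with `retry_on=(openai.RateLimitError,)` (assuming the agents go through a single call site), so a throttled request pauses that one agent rather than killing the whole crew.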
Should I submit a PR? I'm new to this project, but if you could point me to the place where this request handling happens, I might be able to help.