Current rate limits are 200 messages or 40,000 tokens per minute, which we would likely hit within 5-6 questions if we use the full context window on every chat. That could be a problem.
We need a system that handles GPT-4 rate limits by falling back to ChatGPT when they occur. Additionally, we could curb spamming by slowing down streaming when a single user sends queries at an implausibly fast rate.
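The fallback idea could be sketched roughly as follows. This is a minimal illustration, not the real OpenAI client: `RateLimitError` and `send_chat` are hypothetical stand-ins for whatever error type and call the actual SDK exposes.

```python
# Hypothetical sketch: fall back from GPT-4 to a cheaper model when the
# primary model's rate limit is hit. RateLimitError and send_chat are
# stand-ins for the real SDK's error type and chat call.
class RateLimitError(Exception):
    pass

def send_chat(model, messages):
    # Placeholder for a real API call; pretend GPT-4 is currently throttled.
    if model == "gpt-4":
        raise RateLimitError("40,000 tokens/minute exceeded")
    return f"[{model}] reply to: {messages[-1]}"

def chat_with_fallback(messages, primary="gpt-4", fallback="gpt-3.5-turbo"):
    try:
        return send_chat(primary, messages)
    except RateLimitError:
        # Degrade gracefully to the fallback model instead of failing
        # the user's request outright.
        return send_chat(fallback, messages)

print(chat_with_fallback(["Hello"]))
```

In a real deployment the except clause would catch the SDK's actual rate-limit exception, and the fallback call could additionally trim the context to stay under the cheaper model's window.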