GPT-4 1106-Preview often gives error ".. exceeded token rate limit of your current OpenAI S0 pricing tier".

Been using this for some time, all runs good. Just clean chat, no history etc.

After updating my application to use GPT-4 Turbo, users often get this error:

Requests to the Creates a completion for the chat message Operation under Azure OpenAI API version 2023-03-15-preview have exceeded token rate limit of your current OpenAI S0 pricing tier. Please retry after 9 seconds. Please go here: https://aka.ms/oai/quotaincrease if you would like to further increase the default rate limit.

In my application settings I correctly put

AZURE_OPENAI_PREVIEW_API_VERSION=2023-07-01-preview

So wondering if this API version "2023-03-15-preview" you reference in the error message is hardcoded, or there is some other issue I dont see?

Btw im pretty sure the token rate limit is actually NOT exceeded, this happens also during evenings where employees are not using it.

microsoft / sample-app-aoai-chatGPT

GPT-4 1106-Preview often gives error ".. exceeded token rate limit of your current OpenAI S0 pricing tier". #406