Description of issue
Instructions read "In Azure OpenAI Studio, create a new deployment with the following settings:
Model: gpt-35-turbo"
I was unable to complete the instructions because I received an error message when I tried to create a deployment for the gpt-35-turbo model. Please see screen image 1.
When I selected the command to request more quota, this message was displayed:
Azure OpenAI Service: Request for Quota Increase
This form is used to submit requests for increased Azure OpenAI TPM (tokens-per-minute) quota.
Quota increases for text-davinci-003, gpt-35-turbo, and gpt-4 are temporarily paused due to substantial demand for these models. However, we are accepting requests and will fill them in the order they are received once our process resumes. Priority will be given to customers who generate traffic that consumes the existing quota allocation, and your request may be denied if this condition is not met.
we discussed this on Teams, but it's due to you having other deployments taking up quota. Try deleting those deployments or other Azure OpenAI resources to regain quota
Module: 03
Lab/Demo: 03
Task: Deploy a model, https://github.com/MicrosoftLearning/mslearn-openai/blob/main/Instructions/Labs/03-prompt-engineering.md#deploy-a-model
Step: 02
Description of issue Instructions read "In Azure OpenAI Studio, create a new deployment with the following settings: Model: gpt-35-turbo"
I was unable to complete the instructions because I received an error message when I tried to create a deployment for the gpt-35-turbo model. Please see screen image 1.
When I selected the command to request more quota, this message was displayed: Azure OpenAI Service: Request for Quota Increase This form is used to submit requests for increased Azure OpenAI TPM (tokens-per-minute) quota.
Quota increases for text-davinci-003, gpt-35-turbo, and gpt-4 are temporarily paused due to substantial demand for these models. However, we are accepting requests and will fill them in the order they are received once our process resumes. Priority will be given to customers who generate traffic that consumes the existing quota allocation, and your request may be denied if this condition is not met.
Please see screen image 2. link->Azure OpenAI Service: Request for Quota Increase (microsoft.com)
I was able to complete the steps by using the gpt-35-turbo-16k model instead.
Repro steps:
Followed instructions as written.