suryakurapati closed this 10 months ago
Utilization stats are only available for managed-provisioned deployment endpoints. If you do not see the utilization header in the responses, your endpoint is most likely not managed-provisioned.
Closing this as an Azure support issue. Please reopen if that is not the case.
Whenever the code is run with the set of arguments below, all output fields are extracted except util_avg and util_95th.
Run command: python -m benchmark.bench load --api-version --api-key-env --clients 10 --shape-profile custom --context-tokens 4000 --max-tokens 5500 --deployment gpt-4-8k-0613 --duration 60 --requests 60 --output-format human
I see the logic for util_avg and util_95th in the code as below:
step1: Get the response of the input request
step2: Transformation and logic - part 1
step3: Transformation and logic - part 2
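To make the three steps above concrete, here is a minimal sketch of how the two fields could be derived. It is not the benchmark's actual code: the header name `azure-openai-deployment-utilization`, the `"12.5%"` value format, and the helper names `parse_utilization`, `percentile`, and `summarize_utilization` are all assumptions made for illustration. The point it shows is that when no response carries the header, both fields stay empty, which matches the behavior reported here.

```python
import math
from typing import Dict, List, Mapping, Optional, Sequence

# Assumed header name; this thread does not confirm it.
UTILIZATION_HEADER = "azure-openai-deployment-utilization"


def parse_utilization(headers: Mapping[str, str]) -> Optional[float]:
    """Step 1: read the utilization value (assumed format "12.5%") from one response."""
    raw = headers.get(UTILIZATION_HEADER)
    if raw is None:
        return None  # header absent, e.g. endpoint is not managed-provisioned
    raw = raw.strip()
    if raw.endswith("%"):
        raw = raw[:-1]
    try:
        return float(raw)
    except ValueError:
        return None  # malformed value, treat as missing


def percentile(values: Sequence[float], pct: float) -> float:
    """Nearest-rank percentile of a non-empty sequence."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def summarize_utilization(
    responses: List[Mapping[str, str]]
) -> Dict[str, Optional[float]]:
    """Steps 2 and 3: collect the samples, then aggregate them."""
    samples = [
        u for u in (parse_utilization(h) for h in responses) if u is not None
    ]
    if not samples:
        # No response carried the header, so there is nothing to report.
        return {"util_avg": None, "util_95th": None}
    return {
        "util_avg": sum(samples) / len(samples),
        "util_95th": percentile(samples, 95),
    }
```

Note that a plain `dict` lookup is case-sensitive, while HTTP header names are not; a real client's header object usually handles that, but a hand-built dictionary would need its keys lowercased first.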
Could you take a look and confirm whether any update to the code is required in order to extract the util_avg and util_95th fields?