Closed · FrancescoMasaia closed this issue 1 year ago
@FrancescoMasaia Thanks for reaching out to us and reporting this issue. Could you please share your requirement and your use case? This will help us provide you a concrete answer or share possible alternatives. Awaiting your reply.
Sure. Right now, with the OpenAI chat completions API, usage is present at the end of the response (https://github.com/Azure/azure-rest-api-specs/blob/main/specification/cognitiveservices/OpenAI.Inference/examples/2023-07-01-preview/chat_completions.json):
"usage": {
"completion_tokens": 557,
"prompt_tokens": 33,
"total_tokens": 590
}
but this seems to work only for non-streamed responses; with stream=true, this usage structure is not populated. It would be very useful to have it working correctly. Right now I've implemented an approximation of this count on my side, but I don't know how accurate it is.
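For illustration, a minimal sketch of the difference against the data-plane REST API (the resource and deployment names are placeholders; the api-version is the one from the spec linked above):

```python
import json
import os

import requests

# Placeholder resource/deployment names; api-version from the linked spec.
ENDPOINT = "https://my-resource.openai.azure.com/openai/deployments/my-deployment/chat/completions"
PARAMS = {"api-version": "2023-07-01-preview"}
HEADERS = {"api-key": os.environ["AZURE_OPENAI_KEY"]}
BODY = {"messages": [{"role": "user", "content": "Hello"}]}

# Non-streamed: the response JSON carries the usage block shown above.
resp = requests.post(ENDPOINT, params=PARAMS, headers=HEADERS, json=BODY)
print(resp.json()["usage"])

# Streamed: each SSE chunk carries only deltas; at this API version,
# no chunk carries a populated usage block.
resp = requests.post(ENDPOINT, params=PARAMS, headers=HEADERS,
                     json={**BODY, "stream": True}, stream=True)
for line in resp.iter_lines():
    if line.startswith(b"data: ") and line != b"data: [DONE]":
        chunk = json.loads(line[len(b"data: "):])
        print(chunk.get("usage"))  # prints None for every chunk
```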
@FrancescoMasaia Thanks for getting back. I will check this and get back to you.
@FrancescoMasaia This is not a supported feature at this time: stream does NOT support usage tokens. We are discussing if this is something we will create in the future. No ETA as of now, so we will update this thread once there is any update about this.
I am currently in need of the functionality for precise token calculation in the streaming version of the Azure OpenAI API. This feature is critical for accurately calculating user tokens to ensure proper billing. Additionally, it would enable me to implement appropriate throttling mechanisms on my side. At the moment, I'm using a custom token calculator based on tiktoken, but managing a separate token calculation process for billing purposes is complex and prone to inaccuracies. The availability of an official token count for streamed responses would greatly streamline this process and enhance the reliability of billing and service management.
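For reference, a minimal sketch of this kind of tiktoken-based approximation (the +3 per-message and +3 reply-priming overheads are assumptions taken from OpenAI's published counting guidance for gpt-3.5/gpt-4-style chat models, and may drift from what the service actually bills):

```python
import tiktoken

# Approximation only: the per-message overheads below are heuristics and
# may not match the service's billed counts exactly.
def approx_prompt_tokens(messages, model="gpt-4"):
    enc = tiktoken.encoding_for_model(model)
    total = 3  # every reply is primed with <|start|>assistant<|message|>
    for message in messages:
        total += 3  # per-message wrapper tokens
        for value in message.values():
            total += len(enc.encode(value))
    return total

def approx_completion_tokens(streamed_text, model="gpt-4"):
    enc = tiktoken.encoding_for_model(model)
    return len(enc.encode(streamed_text))

# Example:
msgs = [{"role": "user", "content": "Hello, how are you?"}]
print(approx_prompt_tokens(msgs), approx_completion_tokens("Fine, thanks!"))
```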
Hi there! I think this is really important. Claude from Anthropic provides the token counts when stream is set to true, so that would be something expected from OpenAI, a leader in the field.
Still no update about this?
I just updated my feature request issue about this...
This option seems to be switched on by default in the new OpenAI client (pulled in by Azure OpenAI 2.0.*). However, it doesn't seem to work.
Issue on the OpenAI repo: https://github.com/openai/openai-dotnet/issues/103
I need this on the API; we are unnecessarily counting tokens on streamed responses for analytics purposes. Any chance soon?
Try the 2.0-beta version of the Azure.AI.OpenAI package. They dropped this one.
> Try the 2.0-beta version of the Azure.AI.OpenAI package. They dropped this one.

Actually, I am a REST API consumer, not using any packages, and it seems the relevant property is not supported on a direct API call.
@BlackGad are you saying you have usage working with the Azure 2.0 package? I cannot get it to work. And on the OpenAI package side they are saying it's not supported yet (comment here: https://github.com/openai/openai-dotnet/issues/103).
@laygir Hi, is this working for you now with the latest version of the REST API?
Azure OpenAI still does not support stream_options at the API level, even with the latest preview API. So there has still been no progress since May.
@BlackGad have you tried recently with a direct REST call?
> We are discussing if this is something we will create in the future.
This comment from @navba-MSFT makes no sense to me; I'm assuming it's just bad wording. In my mind it's not a case of "if". My expectation is that if OpenAI themselves release a feature, then it will be implemented by Azure OpenAI, AND within a reasonable amount of time. A year later and still nothing is not a reasonable amount of time!
> Azure OpenAI still does not support stream_options at the API level, even with the latest preview API. So there has still been no progress since May.
@BlackGad there are reports like this that this is working in some regions
> @BlackGad have you tried recently with a direct REST call?
Yes. Direct OpenAI returns proper token usage in stream mode when the request contains stream_options. The Azure counterpart returns a validation error saying this option is not known. Tomorrow I will upload the direct requests and responses. I will also test different regions.
I ran some basic REST calls against the inference API (latest gpt-4o and API version) and it seems usage is on by default (if not specified explicitly) and the usage data is returned. Tested against eastus and sweden global deployments.
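For anyone who wants to reproduce this, a minimal sketch of such a streaming call with stream_options (hypothetical resource/deployment names; the api-version that accepts the option appears to vary by region and model, per the reports in this thread):

```python
import json
import os

import requests

# Hypothetical resource/deployment; treat the api-version as a placeholder.
url = "https://my-resource.openai.azure.com/openai/deployments/gpt-4o/chat/completions"
resp = requests.post(
    url,
    params={"api-version": "2024-06-01"},
    headers={"api-key": os.environ["AZURE_OPENAI_KEY"]},
    json={
        "messages": [{"role": "user", "content": "Hello"}],
        "stream": True,
        # Asks the server to append one final chunk with an empty
        # "choices" array and a populated "usage" block.
        "stream_options": {"include_usage": True},
    },
    stream=True,
)

usage = None
for line in resp.iter_lines():
    if not line.startswith(b"data: ") or line == b"data: [DONE]":
        continue
    chunk = json.loads(line[len(b"data: "):])
    usage = chunk.get("usage") or usage  # only the last chunk carries usage
print(usage)
```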
I also just tried a few REST calls in Switzerland North using gpt-4-32k and gpt-3.5-16k; both return usage when streaming with "stream_options": { "include_usage": true } in the body.
When using a gpt-4 vision deployment I received the error "1 validation error for Request\nbody -> stream_options\n extra fields not permitted (type=value_error.extra)".
For my use case, where I simply didn't want to count tokens myself when streaming, this is still not enough: if even one model still requires manual counting, it isn't really worth relying on the response usage counts.
Update: Tested with deployments in Sweden; gpt-4o-mini works when streaming with stream_options added in the body to include usage, while gpt-4o errored out just like gpt-4 vision in Switzerland.
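Since support seems to vary per model and region, one defensive pattern is to ask for include_usage and fall back to local counting only when the deployment rejects the option. A sketch under those assumptions (hypothetical names; the error check keys off the validation message quoted above, and the tiktoken encoding choice is an assumption to be picked per model):

```python
import json
import os

import requests
import tiktoken

HEADERS = {"api-key": os.environ["AZURE_OPENAI_KEY"]}
# Hypothetical resource; the api-version accepting stream_options varies.
URL = "https://my-resource.openai.azure.com/openai/deployments/{dep}/chat/completions"


def read_stream(resp):
    """Collect streamed delta text and the final usage chunk, if any."""
    text, usage = [], None
    for line in resp.iter_lines():
        if not line.startswith(b"data: ") or line == b"data: [DONE]":
            continue
        chunk = json.loads(line[len(b"data: "):])
        for choice in chunk.get("choices", []):
            text.append(choice.get("delta", {}).get("content") or "")
        usage = chunk.get("usage") or usage
    return "".join(text), usage


def stream_with_usage(dep, messages, api_version="2024-06-01"):
    body = {"messages": messages, "stream": True,
            "stream_options": {"include_usage": True}}
    url = URL.format(dep=dep)
    params = {"api-version": api_version}
    resp = requests.post(url, params=params, headers=HEADERS, json=body, stream=True)
    if resp.status_code == 400 and "stream_options" in resp.text:
        # Deployment rejects the option (the gpt-4 vision error above):
        # retry without it and approximate the count locally instead.
        body.pop("stream_options")
        resp = requests.post(url, params=params, headers=HEADERS, json=body, stream=True)
        text, _ = read_stream(resp)
        enc = tiktoken.get_encoding("cl100k_base")  # assumption; pick per model
        return text, {"completion_tokens": len(enc.encode(text)), "estimated": True}
    return read_stream(resp)
```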
I was wondering when an official token count will become available for the streaming version of the chat completions API.