Open GrenardAntoine opened 2 months ago
This item has been open without activity for 19 days. Provide a comment on status and remove "update needed" label.
Since the offset drifts further from the correct value over time, I decided to restart recognition with startContinuousRecognitionAsync()
every minute. This clearly doesn't fix the problem, but it mitigates it.
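If recognition is restarted periodically like this, the offsets reported after each restart have to be shifted by the amount of audio that was already sent. Here is a minimal sketch of that bookkeeping. It assumes 16 kHz, 16-bit, mono PCM and assumes that each restarted session counts its offsets from the first byte sent after the restart; `bytesToTicks` and `StreamClock` are hypothetical helpers, not part of the SDK. Offsets are in ticks of 100 ns, as the SDK reports them.

```javascript
// Hypothetical helper: convert a PCM byte count to 100 ns ticks.
// The defaults (16 kHz, 16-bit, mono) are an assumption about the input audio.
function bytesToTicks(byteCount, sampleRate = 16000, bytesPerSample = 2, channels = 1) {
  const bytesPerSecond = sampleRate * bytesPerSample * channels;
  return Math.round((byteCount / bytesPerSecond) * 10000000);
}

// Hypothetical helper: remembers how much audio was pushed before the
// latest restart so that per-session offsets can be mapped back to the
// position in the whole stream.
class StreamClock {
  constructor() {
    this.bytesSent = 0;        // total bytes pushed to the recognizer so far
    this.sessionBaseTicks = 0; // stream position at which the current session began
  }

  // Call for every chunk written to the push stream.
  onChunkSent(chunk) {
    this.bytesSent += chunk.byteLength;
  }

  // Call right after recognition has been restarted.
  onRecognizerRestarted() {
    this.sessionBaseTicks = bytesToTicks(this.bytesSent);
  }

  // Map an offset reported by the current session to a stream-wide offset.
  toStreamTicks(rawOffsetTicks) {
    return this.sessionBaseTicks + rawOffsetTicks;
  }
}
```

In a real application `onChunkSent` would be called next to the code that writes to the push stream, and `toStreamTicks` would be applied to the offset of each recognized result. Audio that was sent but not yet processed when the restart happened is not accounted for here.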
We are also seeing the word offsets reset roughly every 10 minutes: the SDK creates a new connection and the word offsets start over.
@Azure Speech team: how are we supposed to keep track of word and phrase timestamps for real-time speech to text that runs longer than 10 minutes?
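One possible client-side workaround for the reset is to detect when a raw offset jumps backwards and carry the end of the previous result forward as a new base. The sketch below is only an approximation: `createOffsetTracker` is a hypothetical helper, not an SDK API, and it assumes final results arrive in order and that a backwards jump always means a new connection. Any silence between the last result and the reconnect is lost.

```javascript
// Offsets and durations are in ticks of 100 ns.
const TICKS_PER_SECOND = 10000000;

// Hypothetical helper: returns a function that turns raw per-connection
// offsets into offsets that keep increasing across resets.
function createOffsetTracker() {
  let base = 0;      // ticks carried over from earlier connections
  let lastStart = 0; // corrected start of the previous result
  let lastEnd = 0;   // corrected end (start + duration) of the previous result

  return function correct(rawOffset, duration) {
    // A result that appears to start before the previous one is treated
    // as the first result of a new connection (assumption).
    if (base + rawOffset < lastStart) {
      base = lastEnd;
    }
    const start = base + rawOffset;
    lastStart = start;
    lastEnd = start + duration;
    return { offsetTicks: start, offsetSeconds: start / TICKS_PER_SECOND };
  };
}
```

The returned function would be called once per final result with that result's offset and duration, for example from the handler that receives recognized phrases.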
@CDSFounder are you looking at the JSON or the .offset property on the result?
The .offset property should be fixed up to produce an increasing offset. The JSON was only being partially corrected, which is something we'll look at for a future release.
Hello,
I use microsoft-cognitiveservices-speech-sdk (1.38.0) to do real-time speech to text. The offset seems right when I send the full audio in one piece, but it is wrong when I send the audio split into many chunks.
The more audio chunks there are, the more inaccurate the offset is:
To reproduce, here is some code:
To create the audio chunks:
Here is the audio link: https://drive.google.com/file/d/1H_RJuqMiBaVkpo9XHrgp1bpuFdgQl64O/view?usp=sharing
Thanks for your help