Open coder-kl opened 3 months ago
@Kerry-LinZhang could you help to triage?
Hi @coder-kl, thanks for the feedback. We have received it, and I will track it and keep you updated on the progress.
This is under investigation.
The investigation is ongoing.
This item has been open without activity for 19 days. Provide a comment on status and remove "update needed" label.
@Kerry-LinZhang - Another behavior I have noticed recently with the latest SDK is that the speech marks are sometimes not generated at all, and sometimes only a partial list is returned. However, if we run the same function multiple times, it eventually returns all speech marks correctly. Are you aware of such an issue? The behavior is random, so I cannot pinpoint the exact cause. I am sharing it in case anyone else has noticed something similar.
Hi @coder-kl, thanks for your feedback. We are investigating it, and I will continue tracking this feedback.
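For anyone trying to reproduce the partial-list behavior, here is a minimal sketch of one way to detect it: compare the words received in the speech marks against the words expected from the plain input text. The helper name `missing_words` and the idea that each mark carries the spoken word as a string are assumptions for illustration, not part of the SDK, and the service may tokenize some text differently from the simple `\w+` split used here.

```python
import re
from typing import List


def missing_words(plain_text: str, mark_words: List[str]) -> List[str]:
    """Return the expected words that have no matching speech mark.

    plain_text: the text sent for synthesis, without SSML tags.
    mark_words: the word strings collected from the speech marks, in order
                (hypothetical shape; adapt to however the marks are stored).
    """
    expected = re.findall(r"\w+", plain_text)
    missing = []
    i = 0  # index of the next unmatched mark
    for word in expected:
        if i < len(mark_words) and mark_words[i] == word:
            i += 1  # this expected word has a mark
        else:
            missing.append(word)  # no mark arrived for this word
    return missing


text = "My name is Ramesh. My name is Ramesh."
partial = ["My", "name", "is", "Ramesh"]  # marks for the first sentence only
print(missing_words(text, partial))
```

An empty result suggests the list is complete, while a non-empty result could be logged together with the request so that a failing run can be attached to this issue.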
Thank you @Kerry-LinZhang
I accidentally clicked on the wrong button.
Assigning @yanchang-gyc to continue following up on this feedback.
I am experiencing an interesting issue.
My text highlighting feature, which uses speech marks created from word boundary events, works perfectly for one or more sentences. However, when I add a <break> tag between two sentences to lengthen the pause between them, the same logic still works for the first sentence, but the audio and text highlighting fall out of sync for the second sentence.
Has anyone encountered this issue?
This SSML generates correct speech marks for text highlighting:
<speak xmlns="http://www.w3.org/2001/10/synthesis" xmlns:mstts="http://www.w3.org/2001/mstts" xmlns:emo="http://www.w3.org/2009/10/emotionml" version="1.0" xml:lang="en-US"><voice name="hi-IN-SwaraNeural"><prosody rate='-30%'>My name is Ramesh. My name is Ramesh.</prosody><mstts:silence type="Tailing" value="0"/></voice></speak>
This SSML produces correct speech marks for only the first sentence; the speech marks seem to be off for the second sentence.
<speak xmlns="http://www.w3.org/2001/10/synthesis" xmlns:mstts="http://www.w3.org/2001/mstts" xmlns:emo="http://www.w3.org/2009/10/emotionml" version="1.0" xml:lang="en-US"><voice name="hi-IN-SwaraNeural"><prosody rate='-30%'>My name is Ramesh.<break time='1s'/>My name is Ramesh.<break time='1s'/></prosody><mstts:silence type="Tailing" value="0"/></voice></speak>
Any feedback would be helpful.
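One possible thing to check, offered as an assumption rather than a confirmed cause: if the word boundary text offsets are reported relative to the full SSML string, then every inserted `<break time='1s'/>` shifts the offsets of the words that follow it, which would match the first sentence being correct and the second being off. A minimal sketch of a highlighting approach that does not rely on those offsets is to match each reported word sequentially against the displayed text. The mark shape used here (a dict with `text` and `audio_ms` keys) is hypothetical and would need to be adapted to the data collected from the SDK.

```python
from typing import Dict, List, Tuple


def highlight_ranges(display_text: str, marks: List[Dict]) -> List[Tuple[int, int, float]]:
    """Map each mark to a (start, end, audio_ms) range in display_text.

    Words are located by searching forward from the end of the previous
    match, so SSML tag lengths never enter the calculation.
    """
    ranges = []
    cursor = 0
    for mark in marks:
        word = mark["text"]
        start = display_text.find(word, cursor)
        if start == -1:
            continue  # skip tokens that do not appear in the displayed text
        end = start + len(word)
        ranges.append((start, end, mark["audio_ms"]))
        cursor = end  # never match earlier than the previous word
    return ranges


display = "My name is Ramesh. My name is Ramesh."
words = ["My", "name", "is", "Ramesh", "My", "name", "is", "Ramesh"]
# Illustrative timings only; real values would come from the speech marks.
marks = [{"text": w, "audio_ms": 500.0 * i} for i, w in enumerate(words)]
for start, end, ms in highlight_ranges(display, marks):
    print(ms, display[start:end])
```

This is a plain substring search, so a short word could match inside a longer one in other texts; matching on word boundaries would be a more careful variant.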