Closed wilsonchua20 closed 5 years ago
Should i upload acoustic data set for acronyms or pronunciation data set would suffice? thank you
@wilsonchua20 Thanks for the feedback. We are investigating into the issue and will update you shortly.
Hi, may i know why when uploading acoustic data set, the status is failed?
@wilsonchua20 If you are using REST the audio for the first 15 seconds is recognized and the text is returned. I have tested the same using a audio file which was more than 1 minute and it returned only the first 15 seconds text. If you plan to use longer audio files please use any of the SDKs.
You can try mode 'dictation' and add enable custom pronunciation if your use case is of using acronyms.
Could you please elaborate on the error seen while import of acoustic data? Could you please check if the audio file and transcripts are according to the guidelines requested?
Thank you for that. Should i also import for acoustic data set, even if i already imported the words in language data (i just pronounce the words in acoustic data set since they are not all in english)?
For language model, acoustic data set is not required but you need to add language data first.
@wilsonchua20 If you do not have any other queries we will proceed to close this thread. If there are further questions regarding this matter, please tag @RohitMungi-MSFT in your reply. We will gladly continue the discussion and we will reopen the issue.
should i do custom speech for rest api, because when i speak acronyms, the converted text tend to spell it out (e.g. BDO -> be dio) and which recognition mode (interactive, conversation) can audio file be more than 15seconds?
Document Details
⚠ Do not edit this section. It is required for docs.microsoft.com ➟ GitHub issue linking.