Open bpasero opened 6 months ago
This is generally the same with all SDK file access at the moment, including log files.
This item has been open without activity for 19 days. Provide a comment on status and remove "update needed" label.
This seems to be an acknowledged bug!
@bpasero File paths with non-ASCII characters are not supported in the current SDK implementation, whether it's about embedded speech models, KWS models or log files. This is for future enchancement, no ETA yet.
@pankopon understood. note that VS Code users can be impacted as soon as their usernames in the OS contains non-ASCII characters because we store VS Code extensions in the user-home directory, which typically includes the username verbatim. The Azure Speech model will be loaded directly from said extensions folder.
Would be great to see a fix for this eventually as there is no real workaround for people impacted 👍
Hm, I wonder if a workaround could be to load the model with a relative file path? I have not tried that so far, but that could certainly help address this issue for our users.
@bpasero It depends on what the working directory of the process that uses the Speech SDK is. The model path would need to be relative to this. If both the process working directory and the model location are under the user's home directory then a relative path could work.
This item has been open without activity for 19 days. Provide a comment on status and remove "update needed" label.
Please fix it :-)
This item has been open without activity for 19 days. Provide a comment on status and remove "update needed" label.
Still an issue.
This item has been open without activity for 19 days. Provide a comment on status and remove "update needed" label.
Still an issue.
IN ORDER TO ASSIST YOU, PLEASE PROVIDE THE FOLLOWING:
log.txt
Describe the bug
The embedded speech model fails to load STT models when the path contains non-ASCII characters and spaces.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
Models can load from paths with spaces and non-ASCII characters.
Version of the Cognitive Services Speech SDK
1.35.0
Platform, Operating System, and Programming Language
Additional context
Reported in https://github.com/microsoft/vscode/issues/206512