MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi
https://montrealcorpustools.github.io/Montreal-Forced-Aligner/
MIT License
1.29k stars 242 forks source link

Unicode decode error. #662

Open nonusx opened 1 year ago

nonusx commented 1 year ago

Debugging checklist

Describe the issue While I was running "mfa validate", I've encountered the error: UnicodeDecodeError: 'utf-8' codec can't decode byte 0xd1 in position 0: unexpected end of data.

For Reproducing your issue

  1. Corpus structure
  2. Dictionary
    • Are you using a dictionary from MFA? If so, which one?
    • Yes.
  3. Acoustic model
    • If you're using an acoustic model, is it one download through MFA? If so, which one? Yes.

Desktop (please complete the following information):

mmcauliffe commented 1 year ago

Can you try calling:

set PYTHONUTF8=1

in the command line before running MFA and see if that resolves it? This is likely an issue with how the the windows command and python interface having issues with paths with unicode symbols in them. If that still doesn't fix it, can you run the command with --verbose and paste the full trace to see where the error is being thrown?