Closed ninackjeong closed 11 months ago
Can you try switching the order of the corpus directory and model? So something like this should work:
mfa g2p /Volumes/ssd/dissertation/scripts/tono-init-spon/sample-test/data/sound/SDRW2100000003_pcm/ /Volumes/ssd/dissertation/scripts/tono-init-spon/sample-test/data/sound/SDRW2100000003_pcm/korean.zip /Volumes/ssd/dissertation/scripts/tono-init-spon/sample-test/data/sound/SDRW2100000003_pcm/korean.txt
The argument order in MFA is [input_files] [models] [output_files]. See the usage line mfa g2p [OPTIONS] INPUT_PATH G2P_MODEL_PATH OUTPUT_PATH
from https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/workflows/dictionary_generating.html
Ah, my bad! It worked!
Debugging checklist
[O] Have you updated to latest MFA version?
[X] Have you tried rerunning the command with the --clean flag?
Describe the issue
When running "mfa g2p [zip file created by mfa train_g2p] [my dataset] [output directory]", I got the following error:
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xc9 in position 10: invalid continuation byte
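The command in this report passes the model zip first, while the documented usage puts the input path first and the model second. One plausible reading of the error, not confirmed in this thread, is that the zip archive was being read as UTF-8 text. A minimal sketch of a pre-flight check for the argument order follows, using only the Python standard library. The function name check_g2p_argument_order and the example paths are hypothetical and not part of MFA.

```python
import zipfile


def check_g2p_argument_order(input_path, model_path):
    """Return a list of warnings if the two arguments look swapped.

    Hypothetical helper, not part of MFA. It only checks whether each
    path is a zip archive, which is what `mfa train_g2p` produced in
    this report.
    """
    warnings = []
    # The first positional argument should be the input, not the model.
    if zipfile.is_zipfile(input_path):
        warnings.append(f"INPUT_PATH looks like a zip archive: {input_path}")
    # The second positional argument should be the G2P model archive.
    if not zipfile.is_zipfile(model_path):
        warnings.append(f"G2P_MODEL_PATH is not a zip archive: {model_path}")
    return warnings


if __name__ == "__main__":
    # Placeholder paths for illustration only.
    for message in check_g2p_argument_order("my_corpus", "korean.zip"):
        print(message)
```

An empty list means the order matches mfa g2p [OPTIONS] INPUT_PATH G2P_MODEL_PATH OUTPUT_PATH as far as this check can tell. It does not validate the contents of the model or the corpus.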
For Reproducing your issue
Please fill out the following:
Log file
Please attach the log file for the run that encountered an error (by default these will be stored in ~/Documents/MFA).
Desktop (please complete the following information):
Additional context
Add any other context about the problem here.
Q1. (This may not be relevant to this issue) Do I need to transform PCM data to wav format?
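Q1 is not answered in this thread, so whether conversion is required is left open here. If it does turn out to be needed, the following is a minimal sketch of wrapping headerless PCM in a WAV container with Python's standard wave module. It assumes the .pcm files are raw, signed 16-bit little-endian samples. The function name pcm_to_wav and the defaults (16000 Hz, mono, 2 bytes per sample) are assumptions, not values taken from this dataset, and must be replaced with the real recording parameters.

```python
import wave


def pcm_to_wav(pcm_path, wav_path, sample_rate=16000, channels=1, sample_width=2):
    """Wrap raw PCM samples in a WAV header without resampling.

    Hypothetical helper. The defaults are assumptions and must match
    how the audio was actually recorded, or playback will be distorted.
    """
    # Read the headerless sample data as-is.
    with open(pcm_path, "rb") as pcm_file:
        frames = pcm_file.read()

    # Write the same bytes behind a WAV header describing their layout.
    with wave.open(str(wav_path), "wb") as wav_file:
        wav_file.setnchannels(channels)
        wav_file.setsampwidth(sample_width)  # bytes per sample
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(frames)
```

A call such as pcm_to_wav("utterance.pcm", "utterance.wav") writes a new file and leaves the original untouched. The file names here are placeholders.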