Open massimolipari opened 1 year ago
What were the errors when --single_speaker
and --no_textrgrid_cleanup
were omitted? I would recommend for something like this definitely not specifying --single_speaker
, as it disables speaker adaptation.
i am getting the same, without using --single_speaker and --no_textgrid_cleanup
I'm having the exact same error when trying to validate the data. I used mfa validate --clean my_corpus/ my_dictionary.txt
, got this error and tried with --no_use_mp
and --num_jobs 1
without success.
this works ok with my data/config on 2.0.6
Can you try upgrading to the latest version (2.2.13 or 2.2.14) and trying the command with --clean
and see if that completes? I'm having trouble reproducing the error locally. Also try running with --no_use_postgres
and see if that makes a difference?
I use the version of 2.2.14, and with --clean --no_use_postgres options, get the same error
I use the version of 2.2.14, and with --clean --no_use_postgres options, get the same error
My issue is due to missing the .lab file.
I'm having this issue too. I've tried latest stable (2.2.15 & 2.2.17) and latest v3 (3.0.0a8).
I believe the issue is that in the annotations (textgrid files) there are annotated phones that are silent and/or very short. No features can be extracted from them. The error seems to stem from the "get_feat_dim" call and results in an IndexError: list index out of range.
I'm attaching the console output (no log file gets written)
mfa-index-error.txt
UPDATE: I've attempted running adapt on my corpus locally using multiple different conda installs and they all break. Attempting exactly the same process in a google notebook runs successfully (same commands, setup and corpus). Points to either an issue with some files locally, or some system library getting in the way. I have cleared out the MFA folder in the home directory and retried - errors still persist.
Are there other directories / files that need to be removed to create a fresh clean environment for MFA? Any other throughts on possible libraries I should check (each install is in conda so I was hoping that would prevent this type of issue)?
Debugging checklist
[X] Have you updated to latest MFA version? [X] Have you tried rerunning the command with the
--clean
flag?Describe the issue When training an acoustic model with the command
mfa train --clean --no_textgrid_cleanup --single_speaker --no_debug --no_verbose --no_use_mp <dict_path> <output_path>
, an IndexError is thrown at the very beginning of theInitializing training for monophone...
step.For Reproducing your issue Please fill out the following:
Log file Please attach the log file for the run that encountered an error (by default these will be stored in
~/Documents/MFA
). pg_log_global.txtDesktop (please complete the following information):
Additional context Here is the error message:
I'm attempting to train a new acoustic model with the full phoneset I need by using an existing (pre-aligned) corpus in order to align additional data of the same type: every utterance therefore contains a single word. This error also occurs when multiprocessing is used (which is what I originally trued), regardless of the number of jobs specified. (The
--single_speaker
and--no_textgrid_cleanup
flags were used since different errors resulted on other runs which did not include them.)