MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi
https://montrealcorpustools.github.io/Montreal-Forced-Aligner/
MIT License
1.29k stars 242 forks source link

[BUG] #667

Closed AndHofma closed 1 year ago

AndHofma commented 1 year ago

Debugging checklist

[x] updated to latest MFA version [x] running the command with the --clean flag

Describe the issue I installed the newest version of the MFA with conda and also the newest version of SQlite (3.42.0). I have downloaded the german_mfa dictionary and acoustic model in the corresponding folders under MFA. I activated the aligner environment, set up the paths to my input and output and used the correct paths to all relevant parameters. In the beginning it seems everything is working fine but the process always stopps with this error message: subprocess.CalledProcessError: Command '['sqlite3', 'C:/Users/Andrea Hofmann/Documents/MFA/input/input.db', '--cmd', '.mode csv', '.import C:/Users/Andrea Hofmann/Documents/MFA/input/alignment/word_intervals.csv word_interval_temp']' returned non-zero exit status 1. Somehow subprocess call to SQLite is not completing successfully, and it seems to be due to an extra argument "word_interval_temp" which is not recognized by SQLite. I have no idea what to do.

For Reproducing your issue Please fill out the following:

  1. Corpus structure
    • What language is the corpus in?
      • German
    • How many files/speakers?
      • 20 files, 1 speaker
    • Are you using lab files or TextGrid files for input?
      • each of the 20 wav files has a TextGrid with the same name
  2. Dictionary
    • Are you using a dictionary from MFA? If so, which one?
      • I downloaded and use: german_mfa
    • If it's a custom dictionary, what is the phoneset?
  3. Acoustic model
    • If you're using an acoustic model, is it one download through MFA? If so, which one?
      • german_mfa as well

Log file Please attach the log file for the run that encountered an error (by default these will be stored in ~/Documents/MFA).

Desktop (please complete the following information):

Additional context I can see that most of the data is there in the input folder but I can't get the process to finish. This is the miniconda console output, if it helps: compile_train_graphs.1.log get_phone_ctm.1.log align.1.fmllr.log align.1.log calc_fmllr.1.log This is the miniconda console output, if it helps: (aligner) C:\Users\Andrea Hofmann>mfa align --clean "C:\Users\Andrea Hofmann\OneDrive\PhD\exp_dualtask\data_pre-processing\MFA\input" "C:\Users\Andrea Hofmann\Documents\MFA\pretrained_models\dictionary\german_mfa.dict" "C:\Users\Andrea Hofmann\Documents\MFA\pretrained_models\acoustic\german_mfa.zip" "C:\Users\Andrea Hofmann\OneDrive\PhD\exp_dualtask\data_pre-processing\MFA\ouput" DEBUG Beginning run for input DEBUG Using "global" profile DEBUG Using multiprocessing with 3 DEBUG Set up logger for MFA version: 2.2.15 DEBUG Cleaned previous run DEBUG There were some differences in the current run compared to the last one. This may cause issues, run with --clean, if you hit an error. DEBUG Using IPA DEBUG Loaded dictionary in 44.509 seconds INFO Setting up corpus information... DEBUG Could not load from temp INFO Loading corpus from source files... 1% ╸━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1/100 [ 0:00:05 < -:--:-- , ? it/s ] DEBUG Processing queue: 0.140625 1% ╸━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1/100 [ 0:00:05 < -:--:-- , ? it/s ] DEBUG Parsed corpus directory with 3 jobs in 0.15625 seconds INFO Found 1 speaker across 20 files, average number of utterances per speaker: 20.0 DEBUG Loaded corpus in 5.839 seconds INFO Initializing multiprocessing jobs... WARNING Number of jobs was specified as 3, but due to only having 1 speakers, MFA will only use 1 jobs. Use the --single_speaker flag if you would like to split utterances across jobs regardless of their speaker. DEBUG Initialized jobs in 0.029 seconds INFO Normalizing text... 5% ━━━╸━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1/20 [ 0:00:11 < -:--:-- , ? it/s ] DEBUG Wrote lexicon information in 0.376 seconds INFO Creating corpus split for feature generation... 2% ━╸━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1/40 [ 0:00:04 < -:--:-- , ? it/s ] DEBUG Created corpus split directory in 4.566 seconds INFO Generating MFCCs... 100% ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 20/20 [ 0:00:05 < 0:00:00 , 61 it/s ] DEBUG Generating MFCCs took 6.781 seconds INFO Calculating CMVN... INFO Generating final features... 5% ━━━╸━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1/20 [ 0:00:05 < -:--:-- , ? it/s ] DEBUG Generating final features took 5.755 seconds INFO Creating corpus split with features... 5% ━━━╸━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1/20 [ 0:00:04 < -:--:-- , ? it/s ] DEBUG Generated features in 17.039 seconds DEBUG Setting up corpus took 84.110 seconds DEBUG DEBUG ====ACOUSTIC MODEL INFO==== DEBUG Acoustic model root directory: C:\Users\Andrea Hofmann\Documents\MFA\extracted_models\acoustic DEBUG Acoustic model dirname: C:\Users\Andrea Hofmann\Documents\MFA\extracted_models\acoustic\german_mfa_acoustic DEBUG Acoustic model meta path: C:\Users\Andrea Hofmann\Documents\MFA\extracted_models\acoustic\german_mfa_acoustic\meta.json DEBUG Acoustic model meta information: DEBUG architecture: gmm-hmm dictionaries: bracketed_word: '[bracketed]' clitic_marker: '''' default: german_mfa laughter_word: '[laughter]' names:

AndHofma commented 1 year ago

input.log generate_final_features.1.log make_mfcc.1.log cmvn.log

AndHofma commented 1 year ago

Okay, I uninstalled everything again and this time I set up 'MFA_ROOT_DIR' with a path that did not include spaces and ran the aligner again and it worked. Uff.