MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi
https://montrealcorpustools.github.io/Montreal-Forced-Aligner/
MIT License
1.35k stars 249 forks source link

[BUG] logs apparently not being created #834

Open jeffmielke opened 2 months ago

jeffmielke commented 2 months ago

Debugging checklist

[x] Have you read the troubleshooting page (https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/troubleshooting.html) and searched the documentation to ensure that your issue is not addressed there? [x] Have you updated to latest MFA version (check https://montreal-forced-aligner.readthedocs.io/en/latest/changelog/changelog_3.0.html)? What is the output of mfa version? [x] Have you tried rerunning the command with the --clean flag?

Describe the issue I am not finding some of the expected log files after running mfa align. The output indicates that there are utterances missing features and out of vocabulary items, but in the ~/Documents/MFA/ subfolder I find no missing_features.csv or any oovs files.

For Reproducing your issue Please fill out the following:

  1. Corpus structure
    • What language is the corpus in? English
    • How many files/speakers? 248 files, 300+ speakers
    • Are you using lab files or TextGrid files for input? TextGrid
  2. Dictionary
    • Are you using a dictionary from MFA? If so, which one? no
    • If it's a custom dictionary, what is the phoneset? arpabet
  3. Acoustic model
    • If you're using an acoustic model, is it one download through MFA? If so, which one? english_us_arpa
    • If it's a model you've trained, what data was it trained on? n/a

Log file Please attach the log file for the run that encountered an error (by default these will be stored in ~/Documents/MFA).

There is no basic log file. The subfoder of ~/Documents/MFA contains three subfolders and two files: alignment dictionary raleigh_2024_input2 raleigh_2024_input2.db raleigh_2024_input2.yaml

Desktop (please complete the following information):

Additional context I'm realigning the Raleigh corpus to track down a few previously unaligned utterances. The alignment goes fine, as it has with previous MFA versions, just no logs to direct me to the problems I'm looking for.