MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi
https://montrealcorpustools.github.io/Montreal-Forced-Aligner/
MIT License
1.31k stars 243 forks source link

Feature Generation Error:There were 64106 utterances missing features. #820

Open aaditi0722 opened 3 months ago

aaditi0722 commented 3 months ago

Debugging checklist

[ ] Have you read the troubleshooting page (https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/troubleshooting.html) and searched the documentation to ensure that your issue is not addressed there? Yes [ ] Have you updated to latest MFA version (check https://montreal-forced-aligner.readthedocs.io/en/latest/changelog/changelog_3.0.html)? What is the output of mfa version? Version 2.2.17 [ ] Have you tried rerunning the command with the --clean flag?Yes

Describe the issue A clear and concise description of what the bug is. I used the mfa validate command and there were errors in feature generation-(missing features) and I had received warnings for -"Out of vocabulary words" and there was error in the run of " atexit._run_exitfuncs"

For Reproducing your issue Please fill out the following:

  1. Corpus structure
    • What language is the corpus in?Hindi
    • How many files/speakers?3238/2
    • Are you using lab files or TextGrid files for input?TextGrid
  2. Dictionary
    • Are you using a dictionary from MFA? If so, which one?No
    • If it's a custom dictionary, what is the phoneset? What is a phonest?
  3. Acoustic model
    • If you're using an acoustic model, is it one download through MFA? If so, which one?No
    • If it's a model you've trained, what data was it trained on?I am training a acoustic model on MFA

Log file Please attach the log file for the run that encountered an error (by default these will be stored in ~/Documents/MFA). log.zip

Desktop (please complete the following information):

Additional context Add any other context about the problem here.