MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi
https://montrealcorpustools.github.io/Montreal-Forced-Aligner/
MIT License
1.26k stars 242 forks

[BUG] Kaldi Processing Error #677

Open al3ka opened 10 months ago

al3ka commented 10 months ago

I'm brand new to using MFA so sorry if I've missed anything!

Debugging checklist

[x] Have you updated to latest MFA version?
[x] Have you tried rerunning the command with the --clean flag?

Describe the issue

Validation and alignment won't run; MFA reports either a multiprocessing error or a Kaldi processing error.

For reproducing your issue, please fill out the following:

  1. Corpus structure
    • What language is the corpus in? English
    • How many files/speakers? 12 files, 2 speakers in each
    • Are you using lab files or TextGrid files for input? .txt
  2. Dictionary
    • Are you using a dictionary from MFA? If so, which one? english_mfa
    • If it's a custom dictionary, what is the phoneset? n/a
  3. Acoustic model
    • If you're using an acoustic model, is it one downloaded through MFA? If so, which one? yes, english_mfa
    • If it's a model you've trained, what data was it trained on? n/a

Log file

Please attach the log file for the run that encountered an error (by default these will be stored in ~/Documents/MFA).

/Users/alicepaver/miniconda3/envs/aligner/bin/gmm-boost-silence --boost=1.25 1 /Users/alicepaver/Documents/MFA/mfa/monophone/1.mdl -
LOG (gmm-boost-silence[5.5.1068]:main():gmmbin/gmm-boost-silence.cc:93) Boosted weights for 5 pdfs, by factor of 1.25
/Users/alicepaver/miniconda3/envs/aligner/bin/gmm-align-compiled --transition-scale=1.0 --acoustic-scale=0.1 --self-loop-scale=0.1 --beam=6 --retry-beam=40 --careful=false --write-per-frame-acoustic-loglikes=ark:/Users/alicepaver/Documents/MFA/mfa/monophone/like.1.1.ark - ark,s,cs:/Users/alicepaver/Documents/MFA/mfa/monophone/fsts.1.1.ark 'ark,s,cs:add-deltas scp,s,cs:"/Users/alicepaver/Documents/MFA/mfa/mfa/split3/feats.1.1.scp" ark:- |' ark:/Users/alicepaver/Documents/MFA/mfa/monophone/ali.1.1.ark ark,t:-
LOG (gmm-boost-silence[5.5.1068]:main():gmmbin/gmm-boost-silence.cc:103) Wrote model to -
add-deltas scp,s,cs:/Users/alicepaver/Documents/MFA/mfa/mfa/split3/feats.1.1.scp ark:-
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:127) 1-1
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:617) Retrying utterance 1-1 with beam 40
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:626) Did not successfully decode file 1-1, len = 26759
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:127) 1-10
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:617) Retrying utterance 1-10 with beam 40
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:626) Did not successfully decode file 1-10, len = 51995
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:127) 1-11
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:617) Retrying utterance 1-11 with beam 40
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:626) Did not successfully decode file 1-11, len = 12993
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:127) 1-12
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:617) Retrying utterance 1-12 with beam 40
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:626) Did not successfully decode file 1-12, len = 10749
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:127) 1-2
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:617) Retrying utterance 1-2 with beam 40
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:626) Did not successfully decode file 1-2, len = 4400
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:127) 1-3
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:617) Retrying utterance 1-3 with beam 40
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:626) Did not successfully decode file 1-3, len = 13653
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:127) 1-4
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:617) Retrying utterance 1-4 with beam 40
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:626) Did not successfully decode file 1-4, len = 71065
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:127) 1-5
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:617) Retrying utterance 1-5 with beam 40
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:626) Did not successfully decode file 1-5, len = 70607
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:127) 1-6
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:617) Retrying utterance 1-6 with beam 40
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:626) Did not successfully decode file 1-6, len = 12328
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:127) 1-7
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:617) Retrying utterance 1-7 with beam 40
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:626) Did not successfully decode file 1-7, len = 55925
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:127) 1-8
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:617) Retrying utterance 1-8 with beam 40
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:626) Did not successfully decode file 1-8, len = 67029
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:127) 1-9
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:617) Retrying utterance 1-9 with beam 40
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:626) Did not successfully decode file 1-9, len = 32727
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:135) Overall log-likelihood per frame is nan over 0 frames.
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:137) Retried 12 out of 12 utterances.
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:139) Done 0, errors on 12
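A log like the one above is easier to triage once the failures are tabulated. The following is a minimal sketch, not part of MFA or Kaldi, that pulls the failed utterance IDs, their frame counts, and the final "Done N, errors on M" tally out of the text. The regular expressions are based only on the message shapes visible in this log. The function names are hypothetical, and the conversion from frames to minutes assumes a 10 ms frame shift, which may not match the feature configuration actually used.

```python
import re

# Patterns derived only from the message shapes visible in the log above.
FAILED = re.compile(r"Did not successfully decode file (\S+), len = (\d+)")
SUMMARY = re.compile(r"Done (\d+), errors on (\d+)")

# Assumption: 10 ms frame shift. Adjust if the feature config differs.
FRAME_SHIFT_SECONDS = 0.01


def summarize_alignment_log(text):
    """Return failed utterances (id -> frame count) and the final tally."""
    # Collapse whitespace so messages wrapped across lines still match.
    flat = " ".join(text.split())
    failed = {utt: int(frames) for utt, frames in FAILED.findall(flat)}
    match = SUMMARY.search(flat)
    done = int(match.group(1)) if match else None
    errors = int(match.group(2)) if match else None
    return {"failed": failed, "done": done, "errors": errors}


def frames_to_minutes(frames, shift=FRAME_SHIFT_SECONDS):
    """Convert a frame count to minutes under the assumed frame shift."""
    return frames * shift / 60.0


# Two messages copied from the log above, one of them wrapped mid-message.
SAMPLE = """
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:626) Did not successfully decode file 1-1, len = 26759
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:626) Did not successfully decode file 1-11,
len = 12993
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:139) Done 0, errors on 12
"""

if __name__ == "__main__":
    report = summarize_alignment_log(SAMPLE)
    for utt, frames in sorted(report["failed"].items()):
        print(f"{utt}: {frames} frames, about {frames_to_minutes(frames):.1f} min")
    print(f"done={report['done']} errors={report['errors']}")
```

Under the 10 ms assumption, the longest utterance in this log (71065 frames) would be roughly 12 minutes of audio treated as a single utterance.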

Desktop (please complete the following information):

Additional context

Files range in length from 7 to 30 minutes, with the majority about 7-10 minutes. I have run alignment with --beam 100, as I saw suggested in some other posts here, but no luck. Any help welcome, thank you!
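For anyone reproducing the attempt described above, here is a small sketch of how such a command line could be assembled from Python. It assumes the MFA 2.x positional order (corpus, dictionary, acoustic model, output directory) and that `--retry_beam` is accepted alongside `--beam`; both should be checked against `mfa align --help` for the installed version. The helper name, the directory paths, and the default retry beam of 400 are illustrative only and do not come from this thread.

```python
import shlex


def build_align_command(corpus_dir, dictionary, acoustic_model, output_dir,
                        beam=100, retry_beam=400, clean=True):
    """Assemble an `mfa align` argument list (hypothetical helper).

    Assumption: the `--beam` and `--retry_beam` option names and the
    positional argument order match the installed MFA version.
    """
    cmd = [
        "mfa", "align",
        corpus_dir, dictionary, acoustic_model, output_dir,
        "--beam", str(beam),
        "--retry_beam", str(retry_beam),
    ]
    if clean:
        # --clean is the flag mentioned in the debugging checklist.
        cmd.append("--clean")
    return cmd


if __name__ == "__main__":
    # Hypothetical paths; english_mfa is the dictionary and model named in the report.
    cmd = build_align_command("my_corpus", "english_mfa", "english_mfa", "aligned_out")
    # Print the command for inspection instead of running it.
    print(shlex.join(cmd))
```

Printing the command rather than executing it keeps the sketch side-effect free; passing the list to `subprocess.run` would be one way to launch it once the options are confirmed.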

al3ka commented 10 months ago

bump

Nghiauet commented 5 months ago

I have the same issue when processing Vietnamese:

WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:617) Retrying utterance 1-71 with beam 40
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:626) Did not successfully decode file 1-71, len = 501
LOG (gmm-align-compiled[5.5.1068]:main():gmmbin/gmm-align-compiled.cc:127) 1-72
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:617) Retrying utterance 1-72 with beam 40
WARNING (gmm-align-compiled[5.5.1068]:AlignUtteranceWrapper():decoder/decoder-wrappers.cc:626) Did not successfully decode file 1-72, len = 540