MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi
https://montrealcorpustools.github.io/Montreal-Forced-Aligner/
MIT License
1.3k stars 243 forks source link

[BUG] No output produce, no error, MFCC generation failed?? #503

Open awindsor opened 2 years ago

awindsor commented 2 years ago

Debugging checklist

[x ] Have you updated to latest MFA version?

mfa version 2.0.6

[x] Have you tried rerunning the command with the --clean flag?

mfa align Corpus english_us_arpa english_us_arpa TextGrids --clean --beam=1000

Describe the issue Run finishes and there are no errors. Log indicates that everything is aligned but no TextGrid output is generated.

It seems that possibly mfcc generation fails since

INFO - Generating MFCCs... 0%| | 0/1 [00:02<?, ?it/s]

during the run but everything proceeds.

I have tried with and without an existing output directory. If there is no existing output directory it creates one but the directory is empty.

For Reproducing your issue Test corpus consists of one mono .wav file and one .lab text transcript. The .wav file in 1 minute in duration.

The dictionary and the acoustic files are MFA downloaded english_us_arpa.

I have tried with a beam=1000.

This is on a Mac running macOS 12.5

Model Name: MacBook Pro Model Identifier: MacBookPro18,2 Chip: Apple M1 Max Total Number of Cores: 10 (8 performance and 2 efficiency) Memory: 64 GB

pretrained_aligner.log

lszhuhaichao commented 2 years ago

I also have the same problem on my macOS 12.5

Model Name: MacBook Pro 2019 Chip: 2.6 GHz 6-Core Intel Core i7 Memory: 16 GB

awindsor commented 2 years ago

I will get my developers to check whether this is an os x issue and report back.

awindsor commented 1 year ago

This is not an os x issue and was resolved by editing the transcript to exclude a short extraneous passage and the beginning. There was a long section afterward, which seems to break MFA though Gentle handled it.