MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi
https://montrealcorpustools.github.io/Montreal-Forced-Aligner/
MIT License

Getting phones out of diarized TextGrid input files #734

Open stcoats opened 5 months ago

stcoats commented 5 months ago

I am trying to align two-speaker audio files and get the phones as segments. With the audio file and a non-diarized transcript (a .txt file in the corpus folder), the output TextGrid contains every word in the .txt file along with the corresponding phones.

If, however, I use the same audio file with a .TextGrid file that has two tiers, one per speaker, the output .TextGrid is missing many words. During alignment, the following warning is printed: "WARNING There were 24 utterances ignored due to an issue in feature generation, see the log file for full details or run mfa validate on the corpus."

I have tried --beam 400 --retry_beam 1000, to no avail. Is there a better way to make the aligner align all the words in the input file?

mmcauliffe commented 4 months ago

I would double-check that your tiers actually contain text corresponding to the transcript. The log file should list every utterance that was ignored; the cause is either a very short duration or no text.
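
One way to run that check outside of MFA is to summarize each tier of the input TextGrid and compare the word counts against the .txt transcript. The following is only a minimal sketch, not part of MFA: it assumes the TextGrid is in Praat's long text format, uses a simple regex rather than a full parser, and the names `summarize_tiers` and `MIN_DURATION` as well as the 0.1 s threshold are hypothetical choices for illustration, not a documented MFA cutoff.

```python
import re

# Hypothetical threshold (seconds) for flagging "very short" intervals.
# This value is an assumption for illustration, not MFA's actual cutoff.
MIN_DURATION = 0.1

# Matches one interval block in a long-format Praat TextGrid.
INTERVAL_RE = re.compile(
    r'intervals \[\d+\]:\s*'
    r'xmin = ([\d.]+)\s*'
    r'xmax = ([\d.]+)\s*'
    r'text = "(.*)"'
)


def summarize_tiers(textgrid_text, min_duration=MIN_DURATION):
    """Return per-tier counts of non-empty intervals and words,
    plus a list of non-empty intervals shorter than min_duration."""
    summary = {}
    # Each tier starts with "item [N]:"; the first chunk is the file header.
    for chunk in re.split(r'item \[\d+\]:', textgrid_text)[1:]:
        name_match = re.search(r'name = "(.*)"', chunk)
        name = name_match.group(1) if name_match else "(unnamed)"
        info = {"nonempty": 0, "words": 0, "short": []}
        for xmin, xmax, text in INTERVAL_RE.findall(chunk):
            text = text.strip()
            if not text:
                continue  # empty intervals carry no transcript text
            start, end = float(xmin), float(xmax)
            info["nonempty"] += 1
            info["words"] += len(text.split())
            if end - start < min_duration:
                info["short"].append((start, end, text))
        summary[name] = info
    return summary
```

To use it, read the file (for example with `open(path, encoding="utf-8").read()`, where `path` is your own TextGrid) and pass the contents to `summarize_tiers`. A tier that reports zero words, or a total word count well below that of the .txt transcript, points to missing text in the tiers; entries under `"short"` are candidates for the utterances being ignored.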