Word Alignment Between ASR Output and Expected Text with Support for Discrepancies in Matching

MahmoudAshraf97 / ctc-forced-aligner

Text to speech alignment using CTC forced alignment


I liked the forced alignment. I ran into an issue and would like to know whether your code can help. I have the output of an ASR model and the text I expected, but in most cases the ASR output doesn't cover even half of the expected text, and sometimes it diverges from it considerably. I would like to align as many of the ASR words as possible against the reference, and to assign a very low score to the reference words that don't appear in the ASR output. From my initial tests, it seems the aligner tries to align the entire text within the audio.
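To make the request concrete, here is a minimal sketch of the behavior described above. It does not use ctc-forced-aligner's API. It relies only on Python's standard `difflib.SequenceMatcher` to match reference words to ASR words. The function name `align_words`, the `LOW_SCORE` value, and the sample words and scores are all assumptions made for illustration.

```python
from difflib import SequenceMatcher

# Assumed placeholder score for reference words that the ASR output lacks.
LOW_SCORE = 0.0


def align_words(reference, asr_words, asr_scores):
    """Return (reference_word, asr_index_or_None, score) for each reference word."""
    # Compare case-insensitively; real text would likely need more normalization.
    ref_norm = [w.lower() for w in reference]
    asr_norm = [w.lower() for w in asr_words]
    matcher = SequenceMatcher(a=ref_norm, b=asr_norm, autojunk=False)

    # Start with every reference word unmatched and scored low.
    result = [(word, None, LOW_SCORE) for word in reference]

    # Overwrite the entries that fall inside an exact, in-order matching block.
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            i, j = block.a + k, block.b + k
            result[i] = (reference[i], j, asr_scores[j])
    return result


# Hypothetical data: the ASR output covers only part of the reference.
reference = "the quick brown fox jumps over the lazy dog".split()
asr_words = "quick brown box jumps the dog".split()
asr_scores = [0.9, 0.8, 0.4, 0.95, 0.7, 0.85]

result = align_words(reference, asr_words, asr_scores)
for word, idx, score in result:
    print(word, idx, score)
```

Only exact word matches in the same order are kept, so a misrecognized word such as "box" for "fox" leaves the reference word unmatched with the low score. A fuzzier comparison would be needed to pair near misses.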

Word Alignment Between ASR Output and Expected Text with Support for Discrepancies in Matching #12