Wav2vec 2.0 makes it possible to build high-quality acoustic models for low-resource languages using only unlabelled audio. Fine-tuning such a model with a couple of hours of labelled data gives a solid starting point for ASR.
What sort of interface would be required to leverage a wav2vec2 model (fine-tuned or not) with MFA instead of Kaldi?
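One way to picture that interface: the aligner needs frame-level log-posteriors over some symbol set (e.g. phones plus a CTC blank) together with the model's frame shift, which it can then align against a transcript. Below is a minimal sketch of that contract under those assumptions — the class and function names are invented for illustration, the scoring is a deterministic dummy, and a real version would run a wav2vec2 CTC forward pass (MFA itself uses Viterbi decoding over a Kaldi HMM graph, not the greedy collapse shown here):

```python
import numpy as np

class AcousticModel:
    """Hypothetical interface a neural acoustic model (e.g. wav2vec2 with a
    CTC head) could expose so an aligner might consume its output in place
    of Kaldi likelihoods. Not an MFA or HuggingFace API."""
    frame_shift = 0.02          # wav2vec2 emits roughly one frame per 20 ms
    symbols = ["<blank>", "a", "b"]  # CTC blank plus a toy phone set

    def log_posteriors(self, audio: np.ndarray) -> np.ndarray:
        """Return a (num_frames, num_symbols) matrix of log-probabilities.
        A real implementation would run the wav2vec2 forward pass; here we
        fake deterministic scores so the alignment step below is testable."""
        num_frames = len(audio) // 320   # 320-sample stride at 16 kHz
        logits = np.full((num_frames, len(self.symbols)), -5.0)
        # Pretend the first half of the utterance is "a", the second half "b".
        logits[: num_frames // 2, 1] = 0.0
        logits[num_frames // 2 :, 2] = 0.0
        # Normalize rows into log-posteriors.
        return logits - np.logaddexp.reduce(logits, axis=1, keepdims=True)

def align(model, audio, transcript):
    """Greedy frame labelling restricted to the transcript's symbols, then
    merging runs of identical labels into (symbol, start, end) intervals."""
    post = model.log_posteriors(audio)
    allowed = [model.symbols.index(p) for p in transcript]
    best = [allowed[int(np.argmax(post[t, allowed]))] for t in range(len(post))]
    intervals, start = [], 0
    for t in range(1, len(best) + 1):
        if t == len(best) or best[t] != best[start]:
            intervals.append((model.symbols[best[start]],
                              start * model.frame_shift,
                              t * model.frame_shift))
            start = t
    return intervals

audio = np.zeros(16000)  # one second of (fake) 16 kHz audio
print(align(AcousticModel(), audio, ["a", "b"]))
```

The point of the sketch is the contract, not the dummy math: as long as a wav2vec2 model can be wrapped to yield per-frame symbol posteriors at a known frame rate, the aligner side never has to know whether a Kaldi model or a fine-tuned transformer produced them.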