MontrealCorpusTools / mfa-models

Collection of pretrained models for the Montreal Forced Aligner
Creative Commons Attribution 4.0 International
103 stars 19 forks source link

new acoustic models with IPA dictionnaries #1

Open noetits opened 2 years ago

noetits commented 2 years ago

Hello,

I saw that new IPA dictionaries for many languages were added, which is great.

I was wondering if there were plans to train acoustic models in IPA as well. Maybe a multilingual model, but with not too many languages at first, that share characteristics? (e.g. english, french, spanish)

In fact, I just stumbled upon your blog post that is talking about that. Very interesting by the way ! https://mmcauliffe.medium.com/creating-english-ipa-dictionary-using-montreal-forced-aligner-2-0-242415dfee32

mmcauliffe commented 2 years ago

I've finished my pass of updating all the acoustic models for the new IPA-based phone set, so those are available here: https://mfa-models.readthedocs.io/en/latest/acoustic/index.html. I have a couple more languages to train with data that I have currently (Japanese, Tamil, Arabic, maybe Wu), and then I'd like to try generating a multilingual one and see how well it does compared to the language-specific models.