Why are the results of "mfa align" text, not phonemes?

Xwmiss commented 8 months ago

First, thanks for your work!

When I use MFA to align Japanese data, the results in the TextGrid are text rather than phonemes, like below:

What I need is alignment results for the phonemes in the dictionary, like below:

Could you tell me how to fix this?

Thank you again.

Xwmiss commented 8 months ago

Also, part 2 contains only "spn".

Daisyqk commented 8 months ago

Have you solved it? I've come across the same problem.

Xwmiss commented 8 months ago

> Have you solved it? I come across the same problem.

I fixed it by running G2P first to generate a new dictionary, and then using that new dictionary to align all the data.
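
For anyone following along, here is a rough sketch of that two-step workflow as a small Python wrapper. It only builds and prints the command lines unless `dry_run` is turned off. All paths are hypothetical placeholders, and the positional argument order for `mfa g2p` is an assumption based on the MFA 3.x command-line help (earlier releases ordered the arguments differently), so check `mfa g2p --help` on your installed version first.

```python
import shlex
import subprocess


def build_g2p_command(input_path, g2p_model, output_dictionary):
    """Build the command that generates a pronunciation dictionary with G2P.

    ASSUMPTION: argument order is input path, G2P model, output dictionary
    (as in MFA 3.x). Verify with `mfa g2p --help` for your version.
    """
    return ["mfa", "g2p", str(input_path), str(g2p_model), str(output_dictionary)]


def build_align_command(corpus_dir, dictionary, acoustic_model, output_dir):
    """Build the align command: corpus, dictionary, acoustic model, output."""
    return ["mfa", "align", str(corpus_dir), str(dictionary),
            str(acoustic_model), str(output_dir)]


def run_pipeline(corpus_dir, g2p_model, acoustic_model, dictionary_out,
                 aligned_out, dry_run=True):
    """Run G2P first, then align using the dictionary it produced."""
    commands = [
        # Step 1: derive a dictionary covering the words in the corpus.
        build_g2p_command(corpus_dir, g2p_model, dictionary_out),
        # Step 2: align the corpus with that generated dictionary.
        build_align_command(corpus_dir, dictionary_out, acoustic_model, aligned_out),
    ]
    for command in commands:
        print(shlex.join(command))
        if not dry_run:
            # check=True stops the pipeline if G2P fails before aligning.
            subprocess.run(command, check=True)
    return commands
```

With the default `dry_run=True`, a call such as `run_pipeline("corpus", "g2p_model.zip", "acoustic_model.zip", "generated_dict.txt", "aligned")` only prints the two commands, so they can be inspected before anything is executed.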

mmcauliffe commented 7 months ago

If you update to MFA 3.0 and download the latest Japanese models, they use third-party tokenizers automatically.
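
As a rough illustration of the download step, the sketch below builds the `mfa model download` commands for the Japanese acoustic model and dictionary. The model name `japanese_mfa` is an assumption about what the current pretrained models are called, so confirm it against the output of `mfa model download acoustic` (which lists available models) before relying on it.

```python
import shlex
import subprocess

# ASSUMPTION: name of the pretrained Japanese models; verify before use.
JAPANESE_MODEL_NAME = "japanese_mfa"


def build_download_commands(model_name=JAPANESE_MODEL_NAME):
    """Build download commands for the acoustic model and the dictionary."""
    return [
        ["mfa", "model", "download", model_type, model_name]
        for model_type in ("acoustic", "dictionary")
    ]


def download_models(model_name=JAPANESE_MODEL_NAME, dry_run=True):
    """Print each download command, and execute it unless dry_run is set."""
    commands = build_download_commands(model_name)
    for command in commands:
        print(shlex.join(command))
        if not dry_run:
            subprocess.run(command, check=True)
    return commands
```

Once both models are downloaded, they can be passed by name to `mfa align` in place of file paths.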

MontrealCorpusTools / Montreal-Forced-Aligner