Closed (Xwmiss closed this 7 months ago)
And in part2 there is only "spn".
Have you solved it? I've come across the same problem.
I fixed it by running g2p first to generate a new dictionary, and then using this new dictionary to align all the data.
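For anyone wanting to try the same workaround, here is a rough sketch of those two steps. The paths and the `japanese_mfa` G2P model name are assumptions for illustration, and the positional argument order shown (input, model, output) is the one used by recent MFA releases; check `mfa g2p --help` and `mfa align --help` for your installed version.

```shell
#!/bin/sh
# Hypothetical paths and model names; adjust to your own setup.
CORPUS_DIR="${CORPUS_DIR:-./corpus}"
NEW_DICT="${NEW_DICT:-./g2p_dictionary.txt}"
OUTPUT_DIR="${OUTPUT_DIR:-./aligned}"
G2P_MODEL="${G2P_MODEL:-japanese_mfa}"            # assumed G2P model name
ACOUSTIC_MODEL="${ACOUSTIC_MODEL:-japanese_mfa}"

# Print the command when DRY_RUN=1, otherwise execute it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$@"
    else
        "$@"
    fi
}

# Step 1: generate pronunciations for the words in the corpus.
# Step 2: align the corpus using the newly generated dictionary.
g2p_then_align() {
    run mfa g2p "$CORPUS_DIR" "$G2P_MODEL" "$NEW_DICT" &&
    run mfa align "$CORPUS_DIR" "$NEW_DICT" "$ACOUSTIC_MODEL" "$OUTPUT_DIR"
}
```

Calling `DRY_RUN=1 g2p_then_align` only prints the two commands, which is a cheap way to check the paths before running the real thing.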
If you're using a model older than 3.0, you'll have to run tokenization on the corpus first, since Japanese text typically does not contain spaces. See https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/corpus_creation/tokenize.html for the tokenize command, and https://mfa-models.readthedocs.io/en/latest/tokenizer/Japanese/Japanese%20tokenizer%20v2_2_1.html#Japanese%20tokenizer%20v2_2_1 for the Japanese tokenizer model.
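As a sketch of what that pre-3.0 tokenization step might look like: the tokenizer model name `japanese_mfa` and the directory names below are assumptions, not something confirmed in this thread, so check the linked pages and `mfa tokenize --help` before relying on them.

```shell
#!/bin/sh
# Hypothetical paths and model name; adjust to your own setup.
CORPUS_DIR="${CORPUS_DIR:-./corpus}"
TOKENIZED_DIR="${TOKENIZED_DIR:-./corpus_tokenized}"
TOKENIZER_MODEL="${TOKENIZER_MODEL:-japanese_mfa}"   # assumed tokenizer model name

# Print the command when DRY_RUN=1, otherwise execute it.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$@"
    else
        "$@"
    fi
}

# Download the tokenizer model, then write a space-separated copy of the corpus.
tokenize_corpus() {
    run mfa model download tokenizer "$TOKENIZER_MODEL" &&
    run mfa tokenize "$CORPUS_DIR" "$TOKENIZER_MODEL" "$TOKENIZED_DIR"
}
```

The tokenized output directory, rather than the original corpus, would then be what you pass to alignment.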
If you update to MFA 3.0 and download the latest Japanese models, they use third-party tokenizers automatically.
mfa model download acoustic japanese_mfa --ignore_cache
mfa model download dictionary japanese_mfa --ignore_cache
conda install -c conda-forge spacy sudachipy sudachidict-core
First of all, thanks for your work!
When I use MFA to align Japanese, I find that the results in the TextGrid are text, like below:
What I need is alignment results for the phonemes in the dictionary, like below:
Could you tell me how to fix this?
Thank you again.