RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
MIT License
33.45k stars 3.84k forks source link

Add LLM translated CMUDICT-KATAKANA dictionary #1660

Closed Akito-UzukiP closed 3 days ago

Akito-UzukiP commented 3 days ago

This pull request introduces a CMUDICT-to-Katakana dictionary, translated using ChatGPT and DeepSeek. This conversion enables the Japanese G2P mode to directly pronounce most common English words in Katakana, improving the compatibility of English terms in Japanese tts applications.

Since the translation was performed by an LLM the accuracy might not be perfect and could benefit from further refinement. However, initial testing indicates that it is currently functional and applicable in most cases.

Additionally, this update restores the "UNK" token's phoneme. During training, this token was learned as a pause, which has shown to be helpful in the Japanese G2P process.