yl4579 / AuxiliaryASR

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)
MIT License
108 stars 30 forks

How to train a ZH-EN bilingual aligner? #12

Open Stardust-minus opened 4 months ago

Stardust-minus commented 4 months ago

Hi there. I saw that the repo's code only supports English aligner training.

WoBuChiTang commented 2 months ago

Hi there. I saw that the repo's code only supports English aligner training.

Expand the vocabulary so that it includes all Chinese and English phonemes, then prepare your dataset as train.txt.
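As a rough illustration of the vocabulary step, here is a minimal sketch of merging two phoneme inventories into one symbol table. The phoneme lists, the special tokens, and the names `build_vocab`, `encode`, and `n_token` are all assumptions made for this example, not the repo's actual code. Substitute the symbols your own English and Chinese phonemizers emit.

```python
# Hypothetical, deliberately tiny phoneme inventories for illustration only.
EN_PHONEMES = ["AH", "B", "K", "T", "sp"]   # ARPAbet-style symbols plus a shared pause
ZH_PHONEMES = ["a1", "b", "k", "zh", "sp"]  # pinyin-style symbols plus the same pause

# Assumed special tokens; the real model may use a different set.
SPECIAL = ["<pad>", "<sos>", "<eos>", "<unk>"]


def build_vocab(*inventories):
    """Merge symbol lists into one symbol-to-index map, skipping duplicates."""
    symbols = list(SPECIAL)
    seen = set(symbols)
    for inventory in inventories:
        for symbol in inventory:
            if symbol not in seen:
                seen.add(symbol)
                symbols.append(symbol)
    return {symbol: index for index, symbol in enumerate(symbols)}


vocab = build_vocab(EN_PHONEMES, ZH_PHONEMES)

# The model's vocabulary-size setting has to be at least this large.
n_token = len(vocab)


def encode(phonemes):
    """Map a phoneme sequence to indices, falling back to <unk>."""
    unk = vocab["<unk>"]
    return [vocab.get(p, unk) for p in phonemes]


print(n_token)
print(encode(["B", "zh", "not-a-phoneme"]))
```

The point is that both languages share one index space, so the size of the merged table is what the model's output layer and embedding need to cover. Symbols that appear in both inventories (here the pause `sp`) get a single index.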