modelscope / FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
https://funcodec.github.io/
MIT License
360 stars 30 forks source link

Required features in Jan. 2024 #10

Closed ZhihaoDU closed 6 months ago

ZhihaoDU commented 9 months ago

Hi, all. I'm collecting the required features which will be considered implementing in Jan. 2024. Please let me know your concern and feel free to comment below. Thanks. To make FunCodec better!

duj12 commented 9 months ago

Hi, all. I'm collecting the required features which will be considered implementing in Jan. 2024. Please let me know your concern and feel free to comment below. Thanks. To make FunCodec better!

Support Chinese. Maybe use your 60k hour Mandarin speech data to train a LauraTTS model, and this will be a Paraformer-like TTS model in Chinese Community.

ZhihaoDU commented 9 months ago

Hi, all. I'm collecting the required features which will be considered implementing in Jan. 2024. Please let me know your concern and feel free to comment below. Thanks. To make FunCodec better!

Support Chinese. Maybe use your 60k hour Mandarin speech data to train a LauraTTS model, and this will be a Paraformer-like TTS model in Chinese Community.

Thanks for your suggestion. We are considering to train a data-scale-up model.

didadida-r commented 8 months ago

Support Chinese.