AIGC-Audio / AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
https://huggingface.co/spaces/AIGC-Audio/AudioGPT

Add WIP items for the project #3

Closed ftshijt closed 1 year ago

ftshijt commented 1 year ago

One question: how do you plan to handle language coverage across the different systems? This could be critical for TTS, ASR, and ST.

Rongjiehuang commented 1 year ago

Good question. At this time we have a monolingual model for each task. For multilingual support, we may add a language argument with a default value (English) to each class (i.e., each tool for langchain), so that the language to synthesize, transcribe, or translate can be specified in the chat.
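
To make the proposal concrete, here is a rough sketch of a tool class with a language argument that defaults to English. It is an illustration only, not AudioGPT's code: `TextToSpeechTool`, `select_model`, `run`, and the model names are all hypothetical, and the class is plain Python rather than an actual langchain tool.

```python
class TextToSpeechTool:
    """Hypothetical tool wrapper; the names here are assumptions, not AudioGPT's API."""

    def __init__(self, models, default_language="en"):
        # models maps a language code to a (placeholder) monolingual model name.
        if default_language not in models:
            raise ValueError(f"No model registered for default language {default_language!r}")
        self.models = dict(models)
        self.default_language = default_language

    def select_model(self, language=None):
        # Fall back to the default (English) when the chat does not name a language.
        lang = (language or self.default_language).lower()
        if lang not in self.models:
            raise ValueError(
                f"No model for language {lang!r}; available: {sorted(self.models)}"
            )
        return lang, self.models[lang]

    def run(self, text, language=None):
        lang, model = self.select_model(language)
        # A real tool would run the model here; this sketch only reports the routing.
        return {"language": lang, "model": model, "text": text}


# Placeholder model names, one monolingual model per language.
tool = TextToSpeechTool({"en": "tts-en-placeholder", "zh": "tts-zh-placeholder"})
default_request = tool.run("Hello there")
chinese_request = tool.run("ni hao", language="zh")
```

With only an English entry registered, this reduces to the current monolingual setup; adding a language means registering one more model rather than changing the tool's interface.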