AIGC-Audio / AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
https://huggingface.co/spaces/AIGC-Audio/AudioGPT
Other
9.89k stars, 844 forks

Bark audio model and talking head additions #24

Open sarutobiumon opened 1 year ago

sarutobiumon commented 1 year ago

It would be amazing if you could:
  1. Turn the "talking head" images into animated GIFs lip-synced to the WAV audio generated by TTS using Bark (Bark is currently the best and most realistic, emotion-driven audio model that is free to use, even better than the best commercial closed-source model, Eleven Labs).
  2. Then generate an MP4 from the combination of the animated GIF and the WAV audio on the fly, replacing the starting-point animated GIF on the screen.

This can be done by integrating code from one of the following choices:

Bark oobabooga TTS extension: https://github.com/wsippel/bark_tts

yangdongchao commented 1 year ago

Hi, thanks for your suggestions. We will try to add these models to AudioGPT as soon as possible.
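
For reference, the audio half of the request (Bark TTS to a WAV file, then muxing an already lip-synced animated GIF with that WAV into an MP4) could be sketched roughly as below. This is only an illustration under assumptions, not how AudioGPT or the linked bark_tts extension does it: it assumes the `bark` and `scipy` packages are installed and that `ffmpeg` is on the PATH, and the function names and file names (`head.gif`, `speech.wav`, `out.mp4`) are hypothetical. The lip-sync step itself is not covered here.

```python
from __future__ import annotations

import subprocess
from pathlib import Path


def synthesize_wav(text: str, wav_path: str) -> None:
    """Generate speech with Bark and save it as a WAV file."""
    # Imported lazily so the rest of this sketch works without Bark installed.
    from bark import SAMPLE_RATE, generate_audio, preload_models
    from scipy.io.wavfile import write as write_wav

    preload_models()  # downloads and caches the model weights on first use
    audio_array = generate_audio(text)
    write_wav(wav_path, SAMPLE_RATE, audio_array)


def build_mux_command(gif_path: str, wav_path: str, mp4_path: str) -> list[str]:
    """Build an ffmpeg command that combines a GIF and a WAV into an MP4."""
    return [
        "ffmpeg",
        "-y",                      # overwrite the output file if it exists
        "-i", gif_path,            # video input: the (lip-synced) animated GIF
        "-i", wav_path,            # audio input: the Bark-generated speech
        # H.264 with yuv420p needs even frame dimensions, so round them down.
        "-vf", "scale=trunc(iw/2)*2:trunc(ih/2)*2",
        "-c:v", "libx264",
        "-pix_fmt", "yuv420p",     # widely supported pixel format for players
        "-c:a", "aac",
        "-shortest",               # stop when the shorter of the two inputs ends
        mp4_path,
    ]


def mux_gif_and_wav(gif_path: str, wav_path: str, mp4_path: str) -> Path:
    """Run ffmpeg and return the path of the generated MP4."""
    subprocess.run(build_mux_command(gif_path, wav_path, mp4_path), check=True)
    return Path(mp4_path)
```

Usage would look something like `synthesize_wav("Hello there!", "speech.wav")` followed by `mux_gif_and_wav("head.gif", "speech.wav", "out.mp4")`, after which the UI could swap the placeholder GIF for the resulting MP4. Keeping the command construction separate from the `subprocess` call makes it easy to inspect or adjust the ffmpeg flags.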