Camb-ai / MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI
https://www.camb.ai
GNU Affero General Public License v3.0
2.53k stars 206 forks source link

Takes too long to process. Optimizations needed. #80

Closed bigbrotherr closed 3 months ago

bigbrotherr commented 3 months ago

Takes too long for deep clone to finish processing even for a simple text (i9+ 32gb, RTX 3080 12gb) and the result is also not that good. We need optimizations. Shallow clone does not work. The output is something else other than the text provided. output_audio.zip

NourMerey commented 3 months ago

MARS5 is not optimized for speed. Try running our Colab notebook, that might be faster. We're also releasing MARS6 soon, an optimized model in terms of speed and output quality; so keep an eye on it!