Open nortekax opened 7 months ago
Can't really agree with the 'high quality'... all my tests on that hugging space ended up in the model repeating itself over and over again, the voices do not sound good either.
@Alumniminium , I did some more testing and also noticed some problems with SeamlessM4T. I was trying to have better Text-to-Speech locally, and I found this very good solution, please try it to see what you think:
@Alumniminium and with v2?
It seems official ggml implementation. https://github.com/facebookresearch/seamless_communication/tree/main/ggml
The following model is a great high quality model supporting:
It also allows multilingual translation in all these modes. Would it be possible to make a gguf model and add support to whisper.cpp for it? You can try it here: https://huggingface.co/spaces/facebook/seamless_m4t