There are MLX models for text generation (Llama 3) and for speech recognition (Whisper), but I think a complete NLP environment would also need a text-to-speech MLX model. How would it be possible to create, for example, an MLX version of facebook/fastspeech2-en-ljspeech?