metavoiceio / metavoice-src

Foundational model for human-like, expressive TTS
https://themetavoice.xyz/
Apache License 2.0
3.78k stars 650 forks source link

Does it support Arabic #10

Open Qt4arab opened 8 months ago

Qt4arab commented 8 months ago

I have 50k high quality Arabic dataset,is possible to train the model on Arabic?

sidroopdaska commented 8 months ago

See comment here #6

vatsalaggarwal commented 7 months ago

I've added some initial pointers to this here: https://github.com/metavoiceio/metavoice-src/issues/70#issuecomment-1957337895

lucapericlp commented 6 months ago

Hey @Qt4arab , we've just published an initial approach for finetuning the last N transformer blocks of the first stage LLM. Best to play around with the hyperparams in finetune_params.py as we didn't determine the optimal set. Let us know if you have any issues or if you're up for contributing any improvements (via param sweep or otherwise!)

Next step to improve finetuning effectiveness is to have LoRA adapters for the first stage LLM which is being worked on here.