-
Thanks for your great work. I'v trained a VITS model, and it can synthesize very fluently, and inference very fast. However is it possible to export trained model into onnx format, so as to inference …
-
I have been using `mfa align` to generate the alignments of audio with input IPA phonemes directly instead of text. This was done by using a handmade dictionary that simply maps IPA phonemes to themse…
-
I just learned that Openface, which this project depends on, cannot be used for commercial purposes without purchasing a license.
Important points from the license:
The non-exclusive commercia…
-
These are the main dev plans for :frog: TTS.
If you want to contribute to :frog: TTS and don't know where to start you can pick one here and start with our [Contribution Guideline](https://github.…
-
Hello,
First, This is an excellent library, very useful. Thanks!!!
This is really not an issue more of a question.
But I don't no how to contact you otherwise.
How do I add/create my own words, or con…
-
I don't know the inner workings of vosk or the language models, but for Japanese I think the logical process would be audio --> kana (phonetic representation) --> kanji character(s).
While kanji c…
-
Hello,
I have observed an issue where digits remain unnormalized in the output text when using the Nemo text normalization library, specifically with European languages such as German (de), Italia…
-
Comment below with questions or thoughts about the reading for this week's workshop.
Please make your comments by Wednesday 11:59 PM, and upvote at least five of your peers' comments on Thursday pr…
-
Hi, I reached out to you via email just now FYI, just introducing myself basically, but wanted to follow up here.
### What is required to support all IPA characteristics across all languages?
I …
-
Have trained `update_v2` branch on :
* Extracted Semantic token from HuBert Large layer 16 with 1024 cluster Kmean. (`50 tok/sec`)
* Extracted Acoustic token from Encodec 24 khz sample rate, 240 ho…