-
**Submitting author:** @fxjung (Felix Jung)
**Repository:** https://github.com/PhysicsOfMobility/ridepy
**Branch with paper.md** (empty if default branch): joss
**Version:** v2.7
**Editor:** @diehlpk
…
-
The comparison of Grad-TTS in the paper is unfair. The authors of Grad-TTS have published a follow-up paper which includes a maximum likelihood based SDE Solver (from Diffusion-Based Voice Conversion …
-
### Description
_No response_
### Additional Information
_No response_
-
Is we really need to tokenizer before feeding into your library ? Because as I can see, every 2, 3 ,... syllables word in vietnamese have phoneme is the combine of all 1 syllables word
Example:
cái…
-
Greetings!
If I remove all the parts about style in the entire pipeline ( everything about phase and style encoder ), will the quality of generated motions degrade?
-
### What is the issue?
Hey amazing team! I’m experiencing an issue with the context window size when using the new Mistral Nemo model on Ollama version 0.2.8-rc2 on my Apple Mac Silicon M2 Pro. Accor…
-
Output from TransformerTTS (Fastpitch/fastspeech2 based) :
text: "नमस्ते, मैं बजाज आलियांज़ जनरल इंश्योरेंस की ओर से स्वाति बोल रही हूँ, क्या आप से बात करने के लिए ये समय सही है?"
https://gith…
-
Hi,
I have been playing around with Toucan TTS for some times and it is really easy to use and training is fast. I finetuned the provided Meta pretrained model with a 8 hour dataset and the result…
-
Bonjour à tous,
Je suis en train de tester la mise à jour vers de la version 2.11 vers la 2.13.
Tout se passe correctement jusqu'à la mise à jour de BD ou je rencontre une erreur avec une violatio…
-
good work.
I see the following upsampling f0 operation in [dataset.py](https://github.com/zhangyongmao/VISinger2/blob/54a00d6d7da4de4f4037ab182e92b3b9102c2f11/egs/visinger2/dataset.py#L169):
```…