-
Thanks for the codebase. Good work!
In the paper, Speech is split into -- timbre (using speaker embedding), pitch, rhythm, content. If I am not wrong, the accent information of the speaker is not c…
-
Hi. I tested the model with the inference jupyter file your provided. It's amazing that the model can still generate good voice even if a Mandarin source file is fed as input.
However, I notice that…
-
This issue summarizes our wishlist of music descriptors and algorithms to be added in Essenita. You can post your suggestions in comments.
Essentia 2.1
- [x] Implement standard mode and finish te…
-
# Trending repositories for C#
1. [**BeyondDimension / SteamTools**](https://github.com/BeyondDimension/SteamTools)
__🛠「Watt Toolkit」是一个开源跨平台的多功能 Steam 工具箱。__
19 stars to…
-
Hi! Could you please give some clarifications on usage of raw/diagonal/forward attentions during inference?
-
Hello,
First, I apologize if this is not a proper channel to ask about your paper "MELLOTRON: MULTISPEAKER EXPRESSIVE VOICE SYNTHESIS BY CONDITIONING ON RHYTHM, PITCH AND GLOBAL STYLE TOKENS Rafael …
-
To enhance the user experience of 'Say, Pi', we propose implementing a "Dynamic Submission Delay" feature. This feature aims to dynamically adjust the delay before automatic prompt submission, making …
-
To go beyond bioacoustics and to actually serve as a core package for acoustic communication researchers, we need a `models` module that contains reference implementations of *models* of animal acoust…
-
Hi
I have a Samsung S7, with Android 8.0.0
Runner up ver. 2.2.4.1.
When start training appear window with message "Text to Speech is not available..."
Into phone there is Samsung TTS and Google TT…
-
Just wanted to share some feedback..
It seems like the voices are trained to pause somewhat awkwardly when using punctuation. I noticed it especially using hyphenated words. I also noticed it with …