-
Expression Transfer:
"GANimation: Anatomically-aware Facial Animation from a Single Image" (Pumarola et al., 2018)
"MeshTalk: 3D Face Animation from Speech using Cross-Modal Disentanglement" (Rich…
-
**Describe the bug**
Audio generated for the `gu-IN` locale using the voice `gu-IN-DhwaniNeural` contains about 3 seconds of silence at the end of the audio file. The same generation, performed using `gu-IN-NiranjanNe…
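As a possible client-side workaround while the trailing silence is investigated, the padding could be trimmed after synthesis. The sketch below is not part of the Speech SDK: `trim_trailing_silence` is a hypothetical helper, and it assumes the audio is raw 16-bit signed little-endian mono PCM and that an amplitude threshold of 200 is a reasonable definition of "silence". Both assumptions would need checking against the actual output format.

```python
import struct


def trim_trailing_silence(pcm: bytes, threshold: int = 200) -> bytes:
    """Drop trailing samples whose absolute amplitude is below `threshold`.

    Assumes 16-bit signed little-endian mono PCM (an assumption, not
    something stated in the report).
    """
    n = len(pcm) // 2  # number of whole 16-bit samples
    samples = struct.unpack(f"<{n}h", pcm[: n * 2])
    end = n
    # Walk backwards until a sample at or above the threshold is found.
    while end > 0 and abs(samples[end - 1]) < threshold:
        end -= 1
    return pcm[: end * 2]
```

Trimming only from the end leaves any leading or mid-utterance pauses untouched, which keeps the voice's natural pacing.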
-
At the moment, it seems that no matter how many websocket requests there are, they all block in the following code:
    while True:
        try:
            tts_results = next(wav_generator)
            resp = {"status": 1, "audi…
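One common cause of this symptom is calling a synchronous generator directly inside an async websocket handler, which stalls the event loop for every connection. The sketch below assumes that is the situation here (the snippet does not confirm it) and that `wav_generator` is an ordinary blocking generator; `stream_results` is a hypothetical helper name. Each `next()` call is moved to a worker thread so other connections can keep running.

```python
import asyncio

_DONE = object()  # sentinel returned when the generator is exhausted


async def stream_results(wav_generator):
    """Yield items from a blocking generator without stalling the event loop."""
    while True:
        # Run the blocking next() in a worker thread (Python 3.9+).
        item = await asyncio.to_thread(next, wav_generator, _DONE)
        if item is _DONE:
            break
        yield item
```

A handler could then iterate with `async for tts_results in stream_results(wav_generator):` and send each response as it arrives. If the synthesis itself holds the GIL for long stretches, a thread may not be enough and a separate process would be the next thing to try.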
-
### Describe the bug
1. The TTS Speech service seems to limit audio files to a maximum length of 10 minutes, regardless of whether the account is free or paid - https://learn.microsoft.com/en-us/azure/ai…
-
*Describe the bug*
Certain TTS voices are not providing speechmarks with viseme timings. For example, all the Urdu Azure TTS voices provide word timings but do not provide viseme timings which is w…
-
In `addVisemeReceivedEventHandler`, I receive `event.animation`. I want to use Viseme 3D Blend Shapes to drive my 3D Avatar.
Here is an example JSON:
{
"FrameIndex": 0,
"BlendShapes": [
…
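A minimal sketch of how such a chunk might be unpacked, for illustration only. It assumes `FrameIndex` is the index of the first frame in the chunk, that `BlendShapes` is a list of per-frame weight lists, and that frames are produced at 60 frames per second; these should be verified against the Speech service documentation. `frames_with_time` is a hypothetical helper, and the sample values in the usage note are made up.

```python
import json

FPS = 60  # assumed frame rate; verify against the service documentation


def frames_with_time(animation_json: str, fps: int = FPS):
    """Return (frame_number, seconds, weights) for each frame in one chunk."""
    chunk = json.loads(animation_json)
    start = chunk["FrameIndex"]  # assumed: index of the chunk's first frame
    return [
        (start + i, (start + i) / fps, weights)
        for i, weights in enumerate(chunk["BlendShapes"])
    ]
```

For example, `frames_with_time('{"FrameIndex": 0, "BlendShapes": [[0.0, 0.5]]}')` yields one frame at time 0.0 with two weights. Each weight list would then be mapped onto the avatar's own blend shape names, which depends on the rig and is not covered here.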
-
Could you add native speech-to-speech / audio-to-audio support with an encoder (tokenizer) and a decoder (back to audio waveforms)?
I was able to implement a decoder-only model. I first used an audio codec to…
-
### Operating System Info
Windows 10
### Other OS
_No response_
### OBS Studio Version
29.0.2
### OBS Studio Version (Other)
_No response_
### OBS Studio Log URL
https://obsproject.com/logs/O…
-
# Task Name
Singing voice synthesis
## Task Objective
Singing voice synthesis (SVS) is a task that transforms music scores into singing waveforms. This involves generating realistic and intel…
-
### Brief Description
I noticed some issues with the GoogleSynthesizer class during development. It seems to be importing the old version of the library, causing a compile error. The sample rate seems fi…