-
**Describe the bug**
Audio generated for the `gu-IN` locale using the voice `gu-IN-DhwaniNeural` contains about 3 seconds of silence at the end of the audio file. The same generation, performed using `gu-IN-NiranjanNe…
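
To put a number on the trailing silence when reproducing this, a minimal sketch along the following lines may help. It assumes the output was saved as a 16-bit PCM WAV file. The function name `trailing_silence_seconds`, the amplitude `threshold`, and the file name in the usage comment are illustrative choices, not part of any SDK.

```python
import array
import sys
import wave


def trailing_silence_seconds(path: str, threshold: int = 500) -> float:
    """Return how many seconds at the end of a 16-bit PCM WAV file stay
    at or below `threshold` in absolute amplitude (assumed to mean silence)."""
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        channels = wav.getnchannels()
        rate = wav.getframerate()
        samples = array.array("h", wav.readframes(wav.getnframes()))

    # WAV data is little-endian; array("h") uses the native byte order.
    if sys.byteorder == "big":
        samples.byteswap()

    # Walk backwards until a sample is louder than the threshold.
    last_loud = len(samples) - 1
    while last_loud >= 0 and abs(samples[last_loud]) <= threshold:
        last_loud -= 1

    silent_samples = len(samples) - 1 - last_loud
    # Samples are interleaved per channel, so convert to whole frames first.
    return (silent_samples // channels) / rate


# Hypothetical usage, with a placeholder file name:
# print(trailing_silence_seconds("output.wav"))
```

Running it on the output of both voices would give a direct comparison of the trailing silence in each file.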
-
I found that SampleRNN needs to be run in parallel to generate quickly. It takes only about 500 seconds to generate 200 utterances, each containing 8 seconds of speech. But it will be ve…
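
For reference, the figures quoted above work out to a real-time factor of roughly 0.31 for the batched run. The short sketch below only redoes that arithmetic; the function name `real_time_factor` is illustrative and not part of SampleRNN.

```python
def real_time_factor(wall_seconds: float, n_utterances: int,
                     seconds_per_utterance: float) -> float:
    """Return generation time divided by the duration of audio produced.

    Values below 1.0 mean generation is faster than real time.
    """
    audio_seconds = n_utterances * seconds_per_utterance
    return wall_seconds / audio_seconds


# Numbers from the report: 500 s of wall time for 200 utterances of 8 s each.
rtf = real_time_factor(500, 200, 8)
print(rtf)        # 0.3125 (500 s / 1600 s of audio)
print(500 / 200)  # 2.5 s of wall time per utterance, amortized over the batch
```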
-
## 🐛 Bug
I am trying to use the model that you shared [here](https://dl.fbaipublicfiles.com/joint_speech_text_4_s2t/iwslt/iwslt_data/checkpoint17.pt) to generate translations for the speech that I …
-
### There are some existing datasets that we can leverage directly, such as:
- https://www.openslr.org/104/ contains aligned Hindi-English extracted from spoken tutorials of technical topics and lec…
-
**Which AWS Services is the feature request for?**
aws-android-sdk-transcribe - AWS Transcribe Medical Streaming
**Is your feature request related to a problem? Please describe.**
No, I'd like t…
-
## Description
Develop backend functionalities for text navigation, SSML tag processing, and voice generation, as outlined in `draft1.py`. This includes integrating with speech synthesis OpenAPI and …
-
Following the documentation, when I reached the `Visualise the generated motions` step, I got the following output:
(csmg) xht@xht-Z590-GAMING-X:~/SourceCode/Co-Speech-Motion-Generation/src$ bash visualise.sh
making video
Traceback (most recent call last):
Fi…
-
# Speech Separation
Speech separation is the task of obtaining clean, single-speaker speech from a speech mixture of multiple overlapping speakers.
## Task Objective
**Why is this task needed…
-
Path: /api-reference/sound-generation
Missing:
get: `/v1/sound-generation/history`
get: `/v1/sound-generation/history/{clip-id}`
get: `/v1/sound-generation/history/{clip-id}/audio`
-
Hello @lucidrains
I would like to test the pre-trained models for speech generation.
How would I be able to do that?