-
Hi @KdaiP nice work, just like to know is this architecture is intended to support zero-shot TTS or normal multi-speaker kind of TTS,
-
I noticed that the project provides an extensive set of functionalities for voice conversion and text-to-speech, and I am specifically interested in using it for Cantonese text and speech processing, …
-
Hello, I just started to learn voice conversion.And I want to know how to write a demo by using this frame? How do I use another person’s voice to speak the content of the person’s speech with the voi…
-
- Enable users to provide audio input for their performance reviews and self-reviews, in addition to the existing text-based input.
- Use the `streamlit-audio-recorder` library to allow audio recordin…
-
When I try to test the API locally and fetch a YouTube video with URL, and have **vocab** in the request body, such as:
```
{
"compression_ratio_threshold": 2.4,
"condition_on_previous_text…
-
Free form speech inputs will need to be processed to separately identify numeric and unit components. When specifying the unit components using speech, the user should be able to construct arbitrary …
-
Hello,
we are a bit confused about the function of the `--language` flag. Does it
- restrict the transcription to the specified language
or
- translate whatever language it recognizes to the sp…
-
Steps to reproduce
------------------
1. (How do you make the issue happen? Does it happen every time you try it?)
2. (Make sure to go into as much detail as needed to reproduce the issue. Postin…
-
There is an existing rudimentary conversion to "plain text". A conversion to SSML, Speech Synthesis Markup Language [1], would be a manageable project for someone with some experience with XSLT.
T…
-
### Description
The goal is to develop a Tibetan text-to-speech (TTS) model that can convert Tibetan text into Tibetan speech. This project involves training a TTS model using filtered good audio qual…