-
# Speech Emotion Captioning
Speech emotion captioning is to describe the emotion in speech using natural language.
## Task Objective
Compared with traditional speech emotion recognition(wher…
-
Proprietary music generation is far ahead of open source (see Suno, Udio et al).
Using your encodec method, please include text-to-music with English synthetic Singing somehow. I'm not sure of the…
-
### Describe the bug
I M trying to use this repo for urdu language i have found some pretrained module on hugging face
but i m unable to use
i dont have any prior knowledge of python i m not fami…
-
Thank you very much for this wonderful program, it has very high accuracy levels and is helping me so much in many ways :)
But unfortunately Speech Note keeps inserting words that I didn't speak ra…
-
# Phone/Phoneme segment counting
This task is to count the number of phoneme segments in a given speech sample. This task is essential for evaluating the ability of models in the benchmark to accurat…
-
1. use CosyVoice Chinese woman to generate audio (first video), then use OpenVoice ToneColorConverter to generate audio(third video) according target_se(second video) that has serious electrical tone…
-
I tested some videos
if the silence duration is long , then enable vad_filter will be effective
but if video is as normal, then enable vad_filter may cause more timestamp mismatch
is there …
-
This is aggregated issue to request support for new languages. If you see one of the following errors:
> Synchronization between languages xxx - yyy is currently not supported.
> Synchronization …
sc0ty updated
3 weeks ago
-
We want to:
- Offer free LLM api to community
- Build up open dataset
- Collecting feedback of research team model
Probably will be hosted on cloud
-
Currently, there is no functional way to change the Text-to-Speech (TTS) language in our application. While the system is intended to support French ("fr") as a language option, this setting is not be…