-
Hello,
we are a bit confused about the function of the `--language` flag. Does it
- restrict the transcription to the specified language
or
- translate whatever language it recognizes to the sp…
-
Create bindings for https://github.com/ggerganov/whisper.cpp
- [x] Simple golang bindings with tests
- [x] Some examples (main, sample) based off of these
- [x] Integrate with ffmpeg for audio conver…
-
How to test on my own data? I have a "Source Speaker / Speech" and a "Target Speaker / Speech", I want to generate the "Conversion", as shown on the demo page https://auspicious3000.github.io/autovc-d…
-
The current response time for a single sentence in "Bark" is several seconds. Some companies' speech conversion interfaces can return results in milliseconds. I also hope that "Bark" can achieve text-…
-
Hi @adelacvg
Can we use this kind model for speech to speech (Voice conversion).
-
Hi,
I'm currently trying to replicate the performance of Qwen2-Audio on the AIR Bench. However, I noticed that the repository at [AIR-Bench](https://github.com/OFA-Sys/AIR-Bench/blob/main/score_cha…
-
When I try to test the API locally and fetch a YouTube video with URL, and have **vocab** in the request body, such as:
```
{
"compression_ratio_threshold": 2.4,
"condition_on_previous_text…
-
- Video type conversion
- Video compression
- Conduct text-text, speech-text, speech-speech analysis
- Storage to database
-
# Project Description
## Project Overview
Develop a _Speech to Text conversion_ system using **Azure AI Speech Studio** and Azure Cognitive Services. Leverage Azure's speech recognition services to …
-
Hello, I just started to learn voice conversion.And I want to know how to write a demo by using this frame? How do I use another person’s voice to speak the content of the person’s speech with the voi…