-
**Is your feature request related to a problem? Please describe.**
Some users want to compare Rats to each other for example.
**Describe the solution you'd like**
To accommodate this, it is inter…
-
To the Microsoft Support Team,
We have been using ConversationTranscriber of the Azure Speech SDK, to implement Diarization in our project, and have encountered an issue in which we need your assis…
-
https://github.com/coqui-ai/STT
https://github.com/coqui-ai/TTS
Chromium does not include text to speech or speech to text. Firefox does not have it either, text to speech, speech to text are a vi…
-
I'm looking to create custom [viseme](https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-synthesis-markup-structure#viseme-element) animations for lip syncing in my project.
…
-
使用了固定的音色,也固定了seed,但多次合成音色很不问题。
一句话按标点分割后,分段流式合成,合成完之后听起来音色不够稳定。
```
random.seed(seed)
np.random.seed(seed)
torch.manual_seed(seed)
spk = torch.load('****.pt', map_location=select_device(…
-
### Check for previous/existing GitHub issues/module proposals
- [X] I have checked for previous/existing GitHub issues/module proposals.
### Check this module doesn't already exist in the modul…
-
### Describe the bug
1. The TTS Speech service seems to limit the audio files to a maximum length of 10 mins. This is regardless of a free or paid account - https://learn.microsoft.com/en-us/azure/ai…
-
我现在遇到的问题就是 用API生成的音频 ,音量有时候大有时候小
有没有参数可以控制,非常感谢!!!
-
is there a possibility to get JSON format as output with start time end time and it's value (resp word)
-
### Project Name
VidSage
### Description
# VidSage: Video Insights using Graph RAG
https://www.youtube.com/watch?v=IUSCWtB9jWk
VidSage focuses on processing video data, storing it in Azur…