-
Could you add a native speech to speech / audio-to-audio support with encoder (tokenizer) and decoder (back to audio waves)
I was able to implement a decoder only model, I first used audio codec to…
-
**IN ORDER TO ASSIST YOU, PLEASE PROVIDE THE FOLLOWING:**
- Speech SDK log taken from a run that exhibits the reported issue.
[log.txt](https://github.com/Azure-Samples/cognitive-services-speec…
-
### Subtitle Quality Enhancement Using Machine Learning
**Description:** Develop a machine learning model that can automatically enhance the quality of subtitle files by correcting errors, improvin…
-
# URL
- https://arxiv.org/abs/2411.04996
# Authors
- Weixin Liang
- Lili Yu
- Liang Luo
- Srinivasan Iyer
- Ning Dong
- Chunting Zhou
- Gargi Ghosh
- Mike Lewis
- Wen-tau Yih
- Luk…
-
Hi there,
I'm having issues with audio files that contain mixed languages, specifically, I have one audio file that starts with a speech in Japanese and then it switches to English for the rest of …
-
As we discussed previously: https://github.com/kubeflow/training-operator/pull/2021#issuecomment-1987733922 we want to add more AI/ML examples to the Kubeflow Training Operator. Right now, most of our…
-
What is your question?
I am experiencing an issue with the pretrained neural network facebook/mms-tts-deu. When generating speech, it sometimes alternates between male and female voices, making the o…
-
Hello,
I am using the below code to build a voice agent, most of the code has been gathered from different examples. I am facing the following problems:
1- interruption handling is bad compared to e…
-
### Describe the bug
Sometimes the speech pauses then the speaker continues but it's neither written nor is it any language, but it's clearly the same speaker. Unless you want to create a horror mo…
-
**Description**
Develop a system to detect specific danger phrases in user speech using advanced speech recognition and natural language processing models such as DeepSpeech or WaveNet.
**Motivati…