speech-language-model Search Results

1000+ results for speech-language-model


espnet/espnet #5871

Speech-to-Speech/Audio-to-Audio support

Could you add native speech-to-speech / audio-to-audio support with an encoder (tokenizer) and a decoder (back to audio waves)? I was able to implement a decoder-only model; I first used an audio codec to…

nanowell updated 3 months ago
4
Azure-Samples/cognitive-services-speech-sdk #2288

Embedded model fails to load from paths with non-ASCII chara…

**IN ORDER TO ASSIST YOU, PLEASE PROVIDE THE FOLLOWING:** - Speech SDK log taken from a run that exhibits the reported issue. [log.txt](https://github.com/Azure-Samples/cognitive-services-speec…

bpasero updated 5 months ago
13
CCExtractor/Subtitle-Resync #4

Project idea :- Subtitle Quality Enhancement Using Machine L…

### Subtitle Quality Enhancement Using Machine Learning **Description:** Develop a machine learning model that can automatically enhance the quality of subtitle files by correcting errors, improvin…

PRIYANSHU2026 updated 2 months ago
1
AkihikoWatanabe/paper_notes #1505

Mixture-of-Transformers: A Sparse and Scalable Architecture …

URL: https://arxiv.org/abs/2411.04996 Authors: Weixin Liang, Lili Yu, Liang Luo, Srinivasan Iyer, Ning Dong, Chunting Zhou, Gargi Ghosh, Mike Lewis, Wen-tau Yih, Luk…

AkihikoWatanabe updated 6 days ago
1
m-bain/whisperX #466

Detected language issue

Hi there, I'm having issues with audio files that contain mixed languages. Specifically, I have one audio file that starts with speech in Japanese and then switches to English for the rest of …

adrianguanipa updated 6 months ago
3
kubeflow/training-operator #2040

Add more AI/ML Training Examples

As we discussed previously: https://github.com/kubeflow/training-operator/pull/2021#issuecomment-1987733922 we want to add more AI/ML examples to the Kubeflow Training Operator. Right now, most of our…

andreyvelich updated 1 week ago
8
facebookresearch/fairseq #5498

Facebook/mms-tts-deu speaks two voices at once, male and fem…

What is your question? I am experiencing an issue with the pretrained neural network facebook/mms-tts-deu. When generating speech, it sometimes alternates between male and female voices, making the o…

moseich updated 1 month ago
1
pipecat-ai/pipecat #640

Websocket not working as expected

Hello, I am using the code below to build a voice agent; most of it was gathered from different examples. I am facing the following problems: 1. Interruption handling is bad compared to e…

sadimoodi updated 1 week ago
1
coqui-ai/TTS #3572

[Bug] The voice-cloned speaker continues with garbage after …

### Describe the bug Sometimes the speech pauses, then the speaker continues, but what follows is neither in the written text nor in any language, although it's clearly the same speaker. Unless you want to create a horror mo…

Bardo-Konrad updated 2 months ago
7
TAHIR0110/ThereForYou #151

Keyword Spotting for Danger Phrases

**Description** Develop a system to detect specific danger phrases in user speech using advanced speech recognition and natural language processing models such as DeepSpeech or WaveNet. **Motivati…

ShaikArshidBanu updated 5 months ago
1
