speech-language-model Search Results

1000+ results
for speech-language-model

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

langgenius/dify #11046

can't connect an audio file in speech to text tool

### Self Checks - [X] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general). - [X] I have s…

lucaseatp updated 2 days ago
1
KoljaB/RealtimeSTT #150

Can the model's automatic translation to the specified langu…

from RealtimeSTT import AudioToTextRecorder import pyperclip def process_text(text): pyperclip.copy(text) if __name__ == '__main__': print("Wait until it says 'speak now'") r…

Pixellevel updated 1 week ago
7
Osmodium/W40KRogueTraderSpeechMod #8

Feature Request: Integrate Text-to-Speech AI for Better Use…

## **Summary** First of all, thank you for the outstanding work on this mod. It's impressive, and I appreciate the effort and dedication that has gone into its development. Let's jump straight t…

magyargergo updated 1 week ago
1
edwko/OuteTTS #32

Model loading Error

# Code **model_config = outetts.HFModelConfig_v1(model_path=r"D:\model\tts\OuteAI\OuteTTS-0.1-350M", language="en", wavtokenizer_model_path=r"D:\model\tts\\OuteAI\wavtokenizer_large_speech_320_24k.c…

wukonggeo updated 11 hours ago
1
BradyFU/Awesome-Multimodal-Large-Language-Models #184

Add SALMONN, video-SALMONN, video-SALMONN 2

Hello! Could you please add SALMONN series models? Title | Venue | Date | Code | Demo -- | -- | -- | -- | -- [SALMONN: Towards Generic Hearing Abilities for Large Language Models](https://arxiv.o…

TCL606 updated 4 weeks ago
1
microsoft/vscode #218202

"VS Code Speech" Extension Crashes on Activation

Does this issue occur when all extensions are disabled?: Yes/No - VS Code Version: - OS Version: ## Bugreport: "VS Code Speech" Extension Crashes on Activation ### Desc…

fussbanana updated 6 days ago
4
CheshireCC/faster-whisper-GUI #227

0.8.1转写速度非常慢

4080显卡，速度可能不到原来的1%，堪比用CPU跑。但看显卡占用又跑满了，找不到原因。是否没有正确调用到打包里的PyTorch和TensorFlow所致？ fasterwhispergui.log如下： None of PyTorch, TensorFlow >= 2.0, or Flax have been found. Models won't be available and …

syazyz updated 2 weeks ago
5
SYSTRAN/faster-whisper #1085

Running the same code twice giving two different results

Hi, I am running faster-whisper on an audio file like follows ``` segments, info = model.transcribe(wav, task="transcribe", language="hi",beam_size=1, word_timestamps=True,max_new_tokens=50 ) ```…

bchinnari updated 1 month ago
3
myshell-ai/MeloTTS #214

Japanese sounds unnatural

I have combined the phoneme sets for all three langauges, English, Chinese, Japanese and started fine tuning using a datset comprised of all three speech languages The base model I use is the chine…

michaellin99999 updated 1 week ago
2
microsoft/cognitive-services-speech-sdk-go #108

Auto Detect Source language is giving result only in the las…

I'm using Azure Speech to Text SDK version 1.21 in Golang. Here I'm adding a feature of Auto-detect language for the audio file using ```Go engLangConfing, err := speech.NewSourceLanguageConf…

arvind-prajapati updated 5 days ago
2

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for speech-language-model

1000+ results
for speech-language-model