speech-language-model Search Results

1000+ results
for speech-language-model

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

mkiol/dsnote #158

Strange insertion of words not resembling what I spoke, even…

Thank you very much for this wonderful program, it has very high accuracy levels and is helping me so much in many ways :) But unfortunately Speech Note keeps inserting words that I didn't speak ra…

Getarhubar updated 1 week ago
7
pipecat-ai/rtvi-web-demo #6

Unable to change TTS language - French language setting not …

Currently, there is no functional way to change the Text-to-Speech (TTS) language in our application. While the system is intended to support French ("fr") as a language option, this setting is not be…

ttamoud updated 2 months ago
2
alphacep/vosk-api #1540

using this stuff for a newbie

hello, i'm new to speech recognition, vosx and python, but i want to translate speech from a simple video i downloaded from the internet (and later even tts'ing to my language or even speech to speech…

elemich updated 8 months ago
1
OpenPecha/Requests #351

RFW0097: Improve the scaling of AI models API.

# RFW0097: *Improve the scaling of AI models API* ## Named Concepts API (Application programming interface): is a set of rules and protocols that defines how two software systems can communicate w…

tenzin3 updated 10 months ago
2
m-bain/whisperX #298

A "phantom language" can ruin the whole transcription

I am using WhisperX v2.0.1 with the option "detect language" (by omitting the "--language" option from the command). I use the "detect language" option for a video in which two languages are spoken, E…

oep42 updated 5 months ago
3
modelscope/ms-swift #2237

llama-omni微调相关问题

1，是只能微调LLM吗，我使用模型只能使用文字对话，不能传输语音数据。意思是只能使用LLM，没有speech encoder,speech adaptor?现阶段是否有论文上的stag1的微调，请告知，谢谢。 ![image](https://github.com/user-attachments/assets/80829c0d-cf0a-4cf1-b2a5-a087f1037f6f)

sixbenzene updated 12 hours ago
18
SYSTRAN/faster-whisper #230

enable vad_filter cause timestamp mismatch

I tested some videos if the silence duration is long , then enable vad_filter will be effective but if video is as normal, then enable vad_filter may cause more timestamp mismatch is there …

iorilu updated 2 months ago
7
leon-ai/leon #495

After I turn on STT in .env, the server can not be started u…

### Specs - Leon version:1.0.0-beta.9+dev - OS (or browser) version:ubantu 22.04 - Node.js version:v18.16.0 - Complete "leon check" (or "npm run check") output: xu@xu-ThinkPad-Edge-E431:/…

xuguoliang1964 updated 1 week ago
5
Daisie-Bell/DataModels #3

Metamersion Bot Specification

**Datamodels needed:** OpenAI ElevenLab Text to Speech VLM - visual language model (OpenAI GPT-4V) Whisper Speech to Text Basis for bot behavior OpenAI GPT-4 phenomenological problem interviewer prom…

elacosse updated 9 months ago
2
sir-kokabi/test2 #1

test

## Text To Speech Preprocessing - [ParsiNorm](https://github.com/haraai/ParsiNorm) - Persain Text Pre-Proceesing Tool - [Persian Tools](https://github.com/persian-tools/py-persian-tools) - An anthol…

sir-kokabi updated 1 year ago
1

上一页 1...15 16 17 18 19 20 21...100 下一页

1000+ results for speech-language-model

1000+ results
for speech-language-model