audio-text Search Results

1000+ results
for audio-text

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

openai/openai-realtime-api-beta #25

Request: Enhance audio-text synchronization for RESPONSE_AUD…

Objective: Improve the ability to align text and audio deltas for smoother playback and interruption handling. Proposed solutions (in order of preference): - Implement corresponding event_ids bet…

opchronatron updated 1 month ago
2
pytube/pytube #2075

[BUG]

I'm trying to run Multimodal RAG for processing videos using OpenAI GPT4V and LanceDB vectorstore https://github.com/run-llama/llama_index/blob/main/docs/docs/examples/multi_modal/multi_modal_video…

ixn3rd3mxn updated 1 week ago
1
gpt-omni/mini-omni2 #35

Is it possibel to do RAG woth this model?.

Hi, thank you for your excellent work. As we know, in text-to-text models, we can perform Retrieval-Augmented Generation (RAG). For more clarification, I have my personal data in text format, but to m…

ParthArora11 updated 4 days ago
1
gpt-omni/mini-omni #101

Question regarding data format and loss calculation in stage…

In stage 1, only ASR and TTS is used. ASR is Audio -> Text, so loss is only calculated for text tokens, not for audio tokens right? TTS is Text -> Audio, but mini-omni outputs text and audio sim…

sphmel updated 3 weeks ago
7
Mintplex-Labs/anything-llm #2347

[FEAT]: Audio to Text - Whisper / LocalAi

### What would you like to see? Hello everyone, First of all, thank you for this superb project. Would it be possible to use LocalAI for Whisper? Currently the model is Xenova Whisper which uses th…

czerr updated 1 month ago
3
omnivore-app/omnivore #4416

Text to speech audio on iOS not working

Over the last week or so, text-to-speech stopped working on my device. I usually use Alloy for audio generation, and now that voice, along with most other English voices, display this upon attempting …

raynergit updated 2 weeks ago
10
fedirz/faster-whisper-server #160

faster-whisper-server output seems something wrong

After a certain segment, all subsequent recognized texts are incorrect： ``` from openai import OpenAI client = OpenAI(api_key="cant-be-empty", base_url="http://192.168.31.100:8000/v1/") …

burness updated 3 days ago
1
LuanRT/YouTube.js #804

Formato de áudio não encontrado

### Steps to reproduce case 'playyy': { if (args.length < 1) return reply("Insira o comando, e em seguida um nome para a pesquisa!"); const { Innertube } = require('youtubei.js'); co…

tonykx updated 6 hours ago
2
LostRuins/koboldcpp #1197

[Question] Any plans to support models other than GGUF and m…

Hugginface has most models in some other formats. For example, the auto-to-text/text-to-audio model facebook/seamless-m4t-v2-large is in .safetensors format: https://huggingface.co/facebook/seamles…

yurivict updated 2 weeks ago
1
psychopy/psychopy #6943

[Bug]: The sound stimulus does not play.

### PsychoPy Version 2024.2.1 ### What OS are your PsychoPy running on? Windows 10 ### Bug Description My python version: 3.10.11 I'm using a VENV virtual environment. I recently changed the v…

JeongWoo7780 updated 2 weeks ago
6

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for audio-text

1000+ results
for audio-text