text-to-audio Search Results

1000+ results
for text-to-audio

Best match

SubtitleEdit/subtitleedit #9023

FEATURE REQUEST: Forced Alignment feature using Aeneas

Can we have a forced alignment feature using something like **[aeneas](https://github.com/readbeyond/aeneas)**? This tool seems to work really well at automagically synchronizing audio to text a…

MbuguaDavid updated 3 days ago
1
QwenLM/Qwen2-Audio #90

Local path problem

After changing the code to read from a local path: conversation = [ {'role': 'system', 'content': 'You are a helpful assistant.'}, {"role": "user", "content": [ {"type": "audio", "audio_url": audio_path…

jiahui-w updated 1 week ago
1
OpenPecha/stt-split-audio #32

STT0073: LLM-Based Correction of Inference Transcriptions Us…

### Description: Develop a process that improves the quality of inference transcriptions for audio files using Claude AI by aligning them with a verified transferred text. The transferred text is know…

jim-gyas updated 58 minutes ago
4
rpbouman/huey #263

AttributeUi: Rendering Options

Rendering options would allow the user to control how the item values are transformed to output. Concrete examples would be: - urls. these could be used to render a hyperlink, or an image or some o…

rpbouman updated 16 hours ago
1
creativeplatform/crtv3 #73

Feature Request: Enable Subtitles/VTT Files with Livepeer Pl…

**Description:** We would like to request the integration of subtitles or VTT files with the Livepeer player to support closed captioning. This feature would enhance accessibility by providing audio-t…

sirgawain0x updated 4 days ago
1
haoheliu/AudioLDM #101

Text-guided Audio-to-Audio Style Transfer

If I just want to apply Text-guided Audio-to-Audio Style Transfer to a long text, would it be feasible to transition seamlessly from one audio clip to another as the prompt changes?

PHOENIXFURY007 updated 3 months ago
1
Audio-AGI/FlowSep #2

Problem with the lass_inference

Hello, I installed FlowSep and ran the file `lass_inference.py` like this: ```shell python3 lass_inference.py --text 'text_of_the_audio' --audio 'path_to_the_audio' ``` but I got this error: `…

Brodvd updated 1 day ago
1
gpt-omni/mini-omni2 #35

Is it possible to do RAG with this model?

Hi, thank you for your excellent work. As we know, in text-to-text models, we can perform Retrieval-Augmented Generation (RAG). For more clarification, I have my personal data in text format, but to m…

ParthArora11 updated 1 week ago
1
pytube/pytube #2075

[BUG]

I'm trying to run Multimodal RAG for processing videos using OpenAI GPT4V and LanceDB vectorstore https://github.com/run-llama/llama_index/blob/main/docs/docs/examples/multi_modal/multi_modal_video…

ixn3rd3mxn updated 2 weeks ago
1
tarasglek/chatcraft.org #722

Support importing/pasting/dropping audio files

We just added support for more file types when you attach/paste/drop them. We also have support for turning audio into text, see (src/lib/speech-recognition.ts). Let's add support for importing audi…

humphd updated 6 days ago
9
