audio-text Search Results

1000+ results for audio-text

Best match


shaka-project/shaka-packager #1434

Audio bitrate

How can I choose the audio for each resolution, for example 1080: 192k audio, 720: 128k audio, 480: 96k audio? When I change hls_group_id it appears different in the player. ![Captura de pantalla…

chikenare updated 13 hours ago
1
w3c/wcag #4072

The note on text transcript vs audio description needs to be…

The note in [WCAG 1.2.3](https://www.w3.org/WAI/WCAG21/Understanding/audio-description-or-media-alternative-prerecorded.html) on the differences between text transcript and audio description is highl…

Wildebrew updated 2 days ago
3
NVIDIA/audio-flamingo #12

inference example for interleave multiple audio-text

I am looking at the https://github.com/NVIDIA/audio-flamingo/blob/main/inference/inference_examples.py file and I couldn't find any example that uses interleaving of multiple audios and texts. However, I…

androstj updated 1 month ago
2
Kourva/AwesomeChatGPTBot #98

New tts provider

New tts provider ```python import requests import json import time from pathlib import Path from typing import Generator from playsound import playsound class FailedToGenerateResponseError…

HelpingAI updated 1 month ago
1
huggingface/huggingface.js #921

Add Audio Feature Extraction Task

Currently, the `Feature Extraction` task includes both models for audio and text feature extraction (it is officially placed under the NLP modality). I think it would be nice to have a new task for `A…

ecyht2 updated 6 days ago
1
zehanwang01/OmniBind #2

about pseudo pairs

Great job! I want to know how to get pseudo pairs when I choose one modality (for example, image) as a starting point. I can use audio-image and image-text models to retrieve audio and text, but how ca…

xiaos16 updated 4 weeks ago
4
RVC-Boss/GPT-SoVITS #1480

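With identical parameters, across repeated tests, audio from API inference is noisier than audio from webui inference

- Version: V2 - Split method: no splitting in webui; the API also does no splitting when no split delimiter is passed - All other parameters are identical - Situation: WAV audio generated by the API is noisier than audio from the webui - Tests: adding the webui's audio normalization to api.py does not help either; audio generated in the webui is the best, and even copying all of the webui inference code over does not help - Question: what should api.py do to match the webui's…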

OriX0 updated 1 month ago
22
ucbepic/docetl #11

Support Audio File Inputs as Documents

We need to add support for audio file inputs as documents in our pipeline system. This will allow users to process audio files (e.g., MP3) and automatically transcribe them using services like OpenAI'…

shreyashankar updated 4 hours ago
1
flyerhq/flutter_chat_ui #633

[Feature Request] Customizable and extendable `MessageType` …

Hi, I have a design proposal for customisable and extendable message types. There are pros and cons to this design. At first glance it seems to me that it might not be a breaking change. Origin: It…

UsamaKarim updated 3 weeks ago
2
OpenPecha/tts-model #1

TTS lighter and faster model ( MM24 )

### Description The goal is to develop a Tibetan text-to-speech (TTS) model that can convert Tibetan text into Tibetan speech. This project involves training a TTS model using filtered good audio qual…

gangagyatso4364 updated 4 days ago
4

