-
Hello. I tried using the demo code of CoDi (https://github.com/microsoft/i-Code/tree/main/i-Code-V3) to reproduce results on the AudioCaps dataset. However, I was unable to achieve the results reporte…
-
Hi, I have a similar problem to https://github.com/microsoft/CLAP/issues/24, but I'm using audio shorter than 6 seconds.
MWE:
```python
from msclap import CLAP
import torch
import subprocess
…
```
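One possible workaround (an assumption about the cause, not confirmed msclap behavior) is that CLAP expects a fixed-duration input window, so very short clips may need to be zero-padded before embedding. A minimal padding sketch with NumPy; the 7-second target is a placeholder, not a documented msclap constant:

```python
import numpy as np

def pad_or_trim(waveform: np.ndarray, sample_rate: int, target_seconds: float) -> np.ndarray:
    """Zero-pad (or trim) a mono waveform to exactly target_seconds."""
    target_len = int(sample_rate * target_seconds)
    if len(waveform) >= target_len:
        return waveform[:target_len]
    padding = np.zeros(target_len - len(waveform), dtype=waveform.dtype)
    return np.concatenate([waveform, padding])

# Example: a 2-second clip at 44.1 kHz padded to a hypothetical 7-second window
clip = np.random.randn(2 * 44100).astype(np.float32)
padded = pad_or_trim(clip, 44100, 7.0)
print(padded.shape)  # (308700,)
```

The padded array could then be written back to a temporary file (or tensor) before calling the CLAP embedding API.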
-
Hi Unsloth!
I came across this interesting model on reddit: https://www.reddit.com/r/LocalLLaMA/comments/1ez8rmu/llama31_just_got_ears_early_experiments/
It allows Text and Audio as input, and o…
-
I'm always frustrated when I want to quickly learn the meaning of a word or phrase but have to look it up manually and read the definition. It would be much more convenient if I could just input a str…
-
**Summary:**
Currently, the project relies on YouTube’s captioning system for lyrics extraction. However, only a limited number of YouTube videos have captions enabled, restricting the number of song…
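One way to reduce the dependency on YouTube captions is a fallback chain: try captions first, then fall back to another source such as local speech-to-text. A hedged sketch with stubbed sources (the function names and simulated returns are hypothetical, not the project's actual API):

```python
from typing import Callable, Optional

# Hypothetical sources; real implementations would call the YouTube captions
# endpoint and an ASR model respectively. Each returns lyrics or None on failure.
def from_captions(video_id: str) -> Optional[str]:
    return None  # simulate a video with captions disabled

def from_asr(video_id: str) -> Optional[str]:
    return "transcribed lyrics for " + video_id  # simulate an ASR transcript

def extract_lyrics(video_id: str, sources: list[Callable[[str], Optional[str]]]) -> str:
    """Try each lyrics source in order; return the first non-empty result."""
    for source in sources:
        result = source(video_id)
        if result is not None:
            return result
    raise LookupError(f"no lyrics source succeeded for {video_id}")

print(extract_lyrics("abc123", [from_captions, from_asr]))  # transcribed lyrics for abc123
```

Ordering cheap sources (captions) before expensive ones (ASR) keeps the common case fast while covering caption-less videos.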
-
New TTS provider
```python
import requests
import json
import time
from pathlib import Path
from typing import Generator
from playsound import playsound
class FailedToGenerateResponseError…
```
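The rest of the snippet is truncated, but the imports suggest the usual provider shape: a custom error plus a class that requests audio and writes it to disk with retries. A stdlib-only skeleton under those assumptions (the endpoint, class name, and stubbed request are placeholders, not the actual provider):

```python
import time
from pathlib import Path

class FailedToGenerateResponseError(Exception):
    """Raised when the TTS endpoint does not return usable audio."""

class NewTTSProvider:
    def __init__(self, api_url: str = "https://example.com/tts", retries: int = 3):
        # `api_url` is a placeholder, not a real endpoint.
        self.api_url = api_url
        self.retries = retries

    def _request_audio(self, text: str) -> bytes:
        # A real provider would POST `text` to self.api_url (e.g. with requests)
        # and return the response body; stubbed here for a runnable sketch.
        return b"RIFF" + text.encode()

    def tts(self, text: str, out_path: Path) -> Path:
        """Fetch audio for `text`, retrying with exponential backoff."""
        for attempt in range(self.retries):
            audio = self._request_audio(text)
            if audio:
                out_path.write_bytes(audio)
                return out_path
            time.sleep(2 ** attempt)
        raise FailedToGenerateResponseError(f"no audio after {self.retries} attempts")
```

Playback (e.g. via `playsound`) would then take the returned path.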
-
Currently, the `Feature Extraction` task includes both models for audio and text feature extraction (it is officially placed under the NLP modality). I think it would be nice to have a new task for `A…
-
Great job! I want to know how to get pseudo pairs when I choose one modality (for example, image) as a starting point. I can use the audio–image and image–text models to retrieve audio and text, but how ca…
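One common approach (a sketch of the general retrieval idea, not necessarily this repo's exact procedure) is to use the shared modality as a bridge: embed the image, retrieve its nearest neighbor in an audio bank and in a text bank, and treat the two retrieved items as a pseudo audio–text pair. A toy version with random embeddings standing in for the two models' outputs:

```python
import numpy as np

def nearest(query: np.ndarray, bank: np.ndarray) -> int:
    """Index of the bank row most cosine-similar to the query vector."""
    q = query / np.linalg.norm(query)
    b = bank / np.linalg.norm(bank, axis=1, keepdims=True)
    return int(np.argmax(b @ q))

# Toy embedding banks in a shared space (stand-ins for the audio–image
# and image–text models' embedding outputs).
rng = np.random.default_rng(0)
audio_bank = rng.normal(size=(100, 64))
text_bank = rng.normal(size=(100, 64))

image_emb = rng.normal(size=64)
# Bridge through the image: its nearest audio and nearest text together
# form one pseudo audio–text pair.
pseudo_pair = (nearest(image_emb, audio_bank), nearest(image_emb, text_bank))
```

In practice the audio and text would come from separately trained encoders, so a threshold on the similarity scores helps filter out low-confidence pairs.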
-
### Description
The goal is to develop a Tibetan text-to-speech (TTS) model that can convert Tibetan text into Tibetan speech. This project involves training a TTS model using filtered good audio qual…
-
How about adding Text-to-Speech alternatives to OpenAI, such as Deepgram and fish.audio? Similarly, other LLMs could be added as well.
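Supporting multiple backends usually comes down to a common interface that each service implements. A minimal sketch of that shape (class names and stubbed bodies are hypothetical; real implementations would call each service's API):

```python
from abc import ABC, abstractmethod

class TTSProvider(ABC):
    """Common interface so OpenAI, Deepgram, fish.audio, etc. are interchangeable."""
    @abstractmethod
    def synthesize(self, text: str) -> bytes: ...

class DeepgramTTS(TTSProvider):
    def synthesize(self, text: str) -> bytes:
        return b"deepgram:" + text.encode()  # stub; real code calls Deepgram's API

class FishAudioTTS(TTSProvider):
    def synthesize(self, text: str) -> bytes:
        return b"fish:" + text.encode()  # stub; real code calls fish.audio's API

def speak(provider: TTSProvider, text: str) -> bytes:
    """Caller code depends only on the interface, not a concrete backend."""
    return provider.synthesize(text)

audio = speak(DeepgramTTS(), "hello")
```

The same pattern would extend to swapping LLM backends: one abstract interface, one concrete class per vendor.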