audio-to-text Search Results

1000+ results for audio-to-text


jianfch/stable-ts #404

Transcription quality

Why is the quality of `stable-ts` transcription much worse than that of `openai/whisper`? New lines of text are added where they should not be, numbers like `0.003` and `0.05` are defined as `0 0 3` a…

qo4on updated 11 hours ago
2
dynamic-superb/dynamic-superb #55

[Task] Text-to-Audio Generation

# Task Name: Text-to-Audio Generation The task aims to generate general audio based on the given holistic text description. ## Task Objective The primary goal of the Text-to-Sound (TTA) Gener…

Baiiiiiiiiii updated 3 months ago
5
OpenPecha/news-with-audio-data #1

TTS data preparation from News data

**Description**: We currently have news full audio and corresponding news transcript. We would like to get the news text and audio data split into segments to train our STT and TTS model. **Implemen…

kaldan007 updated 1 hour ago
4
Azure-Samples/cognitive-services-speech-sdk #2615

Diarization in Speech SDK for overlapping audio of multiple …

To the Microsoft Support Team, We have been using ConversationTranscriber of the Azure Speech SDK, to implement Diarization in our project, and have encountered an issue in which we need your assis…

ShyamalG97 updated 5 minutes ago
1
livepeer/go-livepeer #3185

ai_ prometheus metrics produced with inconsistent `pipeline`…

It's maybe not a bug but an inconvenience resulting in multiple similar tags being produced. * `pipeline` value produced in [monitor.AIProcessingError(err.Error(), pipeline, ...](https://github.co…

pwilczynskiclearcode updated 4 days ago
2
ookgezellig/Zimmerman-en-Space-podcast #2

Add audio transcriptions to all episodes

Add audio transcriptions to all episodes * Ask HZ for written out texts --> not available * Generate texts via [Whisper AI](https://openai.com/index/whisper/) or similar speech-to-text software

ookgezellig updated 3 days ago
8
marawanxmamdouh/ConvoNerd #19

Sub-Feature 2: Audio File Handling - Speech-to-Text Conversi…

**Parent ticket:** [Feature: Audio File Handling](https://github.com/marawanxmamdouh/ConvoNerd/issues/17) ### Description: Implement a speech-to-text module to transcribe audio content into text. …

marawanxmamdouh updated 1 day ago
7
w3c/wcag #4072

The note on text transcript vs audio description needs to be…

The note in [WCAG 1.2.3](https://www.w3.org/WAI/WCAG21/Understanding/audio-description-or-media-alternative-prerecorded.html) on the differences between text transcript and audio description is highl…

Wildebrew updated 1 day ago
4
Azure/azure-sdk-for-python #36880

[Call Automation] Played text-to-speech audios are not prese…

- **Package Name**: azure.communication.callautomation - **Package Version**: 1.2.0 - **Operating System**: Ubuntu 20.24 - **Python Version**: 3.11 **Describe the bug** The recordings that are …

estebanz01 updated 2 weeks ago
4
Secreto31126/whatsapp-api-js #374

ClientMessageRequest has incomplete types, why?

I have a requirement to save the raw data sent to WhatsApp so it can be recovered later on. However, I just noticed that the ClientMessageRequest type has an incomplete data structure, which means that text,…

tecoad updated 1 week ago
3
