benchmarking-speech Search Results

vocodedev/vocode-core #487

[Feature]: Extend transcription provider support for Benchma…

### Brief Description Enhance Vocode's benchmarking capabilities by integrating additional transcription services such as Azure Speech to Text, Rev.ai, and others. This expansion will allow users to …

applebaconsoda123 updated 4 months ago

SpeechColab/GigaSpeech #117

GigaSpeech on HuggingFace

The GigaSpeech dataset is now available on HuggingFace Hub. --- ### Highlights of GigaSpeech on HuggingFace * Easy to use (a two-liner in Python) * Smoother and faster downloading from US & EU, eve…

dophist updated 1 month ago

emanuelevivoli/awesome-comics-understanding #1

[add papers] new 2024 papers

At [ICDAR 2024](https://icdar2024.net/), a number of papers on comics/manga understanding, analysis, and synthesis were published. In particular, the MANPU workshop accepted papers are listed …

emanuelevivoli updated 1 week ago

calpoly-csai/swanton #19

Mozilla Text-To-Speech

## Objective Test the Mozilla Text-To-Speech module as an offline TTS option that sounds more natural and human-like. ## Key Result A function which takes a string as input and …

chidiewenike updated 3 years ago

fastaudio/fastai2_audio #13

Datasets Needed for Tutorials and How-to-guides

We need to decide on datasets to use in the library. The primary purposes of the datasets will be 1. Benchmarking results to show efficacy of the library 2. Benchmarking results to see which tran…

rbracco updated 4 years ago

AI4Bharat/NPTEL2020-Indian-English-Speech-Dataset #11

New SOTA result for NPTEL2020

Dear people at NPTEL2020. We have run the Pure Set against our ASR API (the www.speechly.com API). To our surprise, we got the best WER result (0.2103) so far (at least from the list that is in the …

bigdatabaracus updated 2 years ago

egovernments/egov-rnd #20

Voice-Based Form Filling Component for Flutter to Enhance Di…

Description In light of the digital transformation of public services, this project endeavours to create a UI component tailored for Flutter applications. The main objective is to address the chall…

Ramkrishna-egov updated 1 month ago

microsoft/onnxruntime #6744

onnxruntime-gpu (cudaexecutionprovider) usage of cudnn autot…

Does onnxrt enable `cudnn.benchmark = True` (in PyTorch lingo)? (I found https://github.com/microsoft/onnxruntime/pull/712 which suggests that benchmarking is done) We're observing that onnxrt-gpu …

vadimkantorov updated 2 years ago

GasimV/Commercial_Projects #2

Speech Processing Models

`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…

GasimV updated 3 months ago

gitmylo/audio-webui #176

Installation Issue.

**Describe the bug** Audio-Webui does not install the requirements properly, precisely on audiolm, saying it failed to install. **To Reproduce** Steps to reproduce the behavior: 1. Go to 'audio-…

PericoSpart updated 1 month ago

290 results for benchmarking-speech
