-
### Brief Description
Enhance Vocode's benchmarking capabilities by integrating additional transcription services such as Azure Speech to Text, Rev.ai, and others. This expansion will allow users to …
-
The GigaSpeech dataset is now available on HuggingFace Hub.
---
### Highlights of GigaSpeech on HuggingFace
* Easy to use (a two-liner in Python)
* Smoother and faster downloading from US & EU, eve…
-
At [ICDAR 2024](https://icdar2024.net/), a number of papers on comics/manga understanding, analysis, and synthesis were published. In particular, the papers accepted at the MANPU workshop are listed …
-
## Objective
Test the Mozilla Text-To-Speech module as an offline TTS option which sounds more natural and human-like.
## Key Result
A function which takes a string as input and …
-
We need to decide on datasets to use in the library. The primary purposes of the datasets will be:
1. Benchmarking results to show efficacy of the library
2. Benchmarking results to see which tran…
-
Dear people at NPTEL2020. We have run the Pure Set against our ASR API (the www.speechly.com API). To our surprise, we got the best WER result (0.2103) so far (at least from the list that is in the …
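
For readers comparing numbers like the WER above, here is a minimal sketch of how word error rate is commonly computed: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` is hypothetical, and the whitespace tokenization with no text normalization is an assumption; it is not necessarily the scoring procedure used for the result reported above.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: words are split on whitespace and compared exactly
    (no lowercasing, punctuation stripping, or other normalization).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution ("sat" -> "sit") and one deletion ("the") over 6 words.
    print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Because insertions count as errors, WER computed this way can exceed 1.0 when the hypothesis is much longer than the reference.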
-
Description
In light of the digital transformation of public services, this project endeavours to create a UI component tailored for Flutter applications. The main objective is to address the chall…
-
Does onnxrt enable `cudnn.benchmark = True` (in PyTorch lingo)? (I found https://github.com/microsoft/onnxruntime/pull/712 which suggests that benchmarking is done)
We're observing that onnxrt-gpu …
-
`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…
-
**Describe the bug**
Audio-Webui does not install the requirements properly; specifically, audiolm fails to install.
**To Reproduce**
Steps to reproduce the behavior:
1. Go to 'audio-…