-
# ❓ Questions & Help
Hi, I cannot save the model and vocab when using spm.SentencePieceTrainer.Train.
## Details
This is my config:
python3 ./openspeech_cli/hydra_train.py dataset=librispeech…
-
Extract the English subset of the Multilingual LibriSpeech dataset (https://huggingface.co/datasets/facebook/multilingual_librispeech). The resulting data must look like this:
```
{
'audio': the…
```
-
Extract the French subset of the Multilingual LibriSpeech dataset (https://huggingface.co/datasets/facebook/multilingual_librispeech). The resulting data must look like this:
```
{
'audio': the …
```
-
The goal is to discretize the speech data from the French Librispeech dataset you previously worked on (@abheesht17 & Adithiya): https://huggingface.co/datasets/abheesht/librispeech_fr. Remember that…
-
```
from datasets import load_dataset
from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor
import soundfile as sf
import torch
from jiwer import wer
librispeech_eval = load_dataset("li…
```
-
https://www.openslr.org/12
-
Hello, and thanks for your work and for sharing the pretrained models.
1. I noticed that all the models shared have `2spk` in their names, does that mean that they only support two speakers?
2. If I want …
-
### System Info
DGX V100 and DGX A100
### Who can help?
@ncomly-nvidia to add more folks.
### Information
- [ ] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An of…
-
Hi, thank you for the amazing work. May I ask two questions about Table 1?
1. Could you please provide a more detailed description of the ```baseline acoustic codec``` in your paper? Does it
- ex…
-
# Task Name
Audio Spatial Distance Prediction
## Task Objective
Audio Spatial Distance Prediction is a task that aims to predict the spatial distance to the sound source based on the giv…
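As a toy illustration of the task setup (not part of this task card), distance regression can be sketched from a single energy feature: assuming, purely for demonstration, that the received amplitude falls off as 1/distance in a free field, a linear fit in log space recovers distance from log RMS energy. All signals and values here are synthetic.

```python
import numpy as np

rng = np.random.default_rng(0)

def rms(x):
    """Root-mean-square energy of a waveform."""
    return np.sqrt(np.mean(x ** 2))

# Synthetic training data: a unit-power source attenuated as 1/distance
# (free-field amplitude law).
distances = rng.uniform(1.0, 10.0, size=200)
signals = [rng.standard_normal(16000) / d for d in distances]

# Feature: log RMS energy. Under the 1/d model, log_rms ≈ -log(d),
# so distance is linear in the feature in log space.
log_rms = np.array([np.log(rms(s)) for s in signals])
log_d = np.log(distances)

# Least-squares fit: log_d ≈ a * log_rms + b
a, b = np.polyfit(log_rms, log_d, deg=1)

# Predict distance for a held-out synthetic signal emitted at distance 5.
test_signal = rng.standard_normal(16000) / 5.0
pred = np.exp(a * np.log(rms(test_signal)) + b)
```

A real system would of course use learned features and labeled recordings; this only shows the shape of the regression target.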