-
### Describe the bug
I'm trying to implement the recipe https://github.com/speechbrain/speechbrain/tree/develop/recipes/LibriSpeech/ASR/transducer but the WER and train loss are very high. After runn…
-
# ❓ Questions & Help
HI, I can not save model and vocab when using spm.SentencePieceTrainer.Train.
## Details
This is my config:
python3 ./openspeech_cli/hydra_train.py dataset=librispeech…
-
https://www.openslr.org/12
-
### Describe the bug
I am trying to download the `librispeech_asr` `clean` dataset, which results in a `FSTimeoutError` exception after downloading around 61% of the data.
### Steps to reproduce the…
-
Extract the English subset in the Multilingual Librispeech dataset (https://huggingface.co/datasets/facebook/multilingual_librispeech). The resulting data must look like this:
```
{
'audio': the…
-
Extract the French subset in the Multilingual Librispeech dataset (https://huggingface.co/datasets/facebook/multilingual_librispeech). The resulting data must look like this:
```
{
'audio': the …
-
Hello, I used the checkpoint file you trained with librispeech to infer the Chinese audio and it still works well. Is that what you expected? Because your dataset doesn't seem to use Chinese, only Eng…
-
### System Info
DGX V100 and DGX A100
### Who can help?
@ncomly-nvidia to add more folks.
### Information
- [ ] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An of…
-
The goal is to discretize the speech data from the French Librispeech dataset you previously worked on (@abheesht17 & Adithiya): https://huggingface.co/datasets/abheesht/librispeech_fr. Remember that…
-
```
from datasets import load_dataset
from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor
import soundfile as sf
import torch
from jiwer import wer
librispeech_eval = load_dataset("li…