filterbank Search Results

1000+ results
for filterbank

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

flashlight/wav2letter #678

Decode does not transcribe entire audio (Seq2Seq)

I'm trying to transcribe an audio clip which is about 2min long with my Seq2Seq model. The transcription is almost perfect but it stops after a few seconds. How can I decode the entire clip? I've t…

BoneGoat updated 4 years ago
8
flashlight/wav2letter #513

fine tune pretrained model

I have some problem with fine tuning wav2letter's pretrained models. I download seq2seq_tds model baseline_dev-other.bin to a folder. Running following command: build/Train fork /w2l/am/baseline…

jianminsun updated 4 years ago
3
flashlight/wav2letter #762

specAugment with Transformer

I am trying to achieve some good results on Libri 100 Hour data using transformer + CTC architecture provided in https://github.com/facebookresearch/wav2letter/blob/master/recipes/models/sota/2019/li…

rajeevbaalwan updated 4 years ago
20
pytorch/audio #574

Randomness in torch.compliance.kaldi.fbanks

## 🐛 Bug Fbanks computed on same waveform change from one run to another. **SOLVED**: My bad, I didn't know that the dither parameter, if different from zero, introduces noise. Thank you for yo…

dros1986 updated 4 years ago
2
flashlight/wav2letter #340

data preparation

I tried preparing data using the script `recipes/librispeech/data/prepare_data.py` and for some reason, it was only writing the .lst files and not the other files. After spending some time trying …

SY-nc updated 4 years ago
17
pytorch/audio #292

Incorrect mel filterbank when f_max != sample_rate // 2

## 🐛 Bug `MelScale` and thus `MelSpectrogram` will not use the correct filterbank when `f_max` is not equal to `sample_rate // 2`. ## To Reproduce Steps to reproduce the behavior: The foll…

jongwook updated 5 years ago
2
flashlight/wav2letter #897

W2lListFilesDataset.cpp:105] Could not read file ''

I am running Resnet CTC training using user's audios. I am using a very small number of audios for testing for now. I have got an bug: Could not read file '' Any ideas about that? Thank y…

ML6634 updated 3 years ago
11
freewym/espresso #17

Verify WER by scoring with Kaldi

Hi authors, I'm using **Librispeech run.sh** recipe. I trained the acoustic model (speech_conv_lstm_librispeech) using 4 GPU 1080ti But I'm facing this error while doing kaldi scoring. local/score.s…

ahmedalbahnasawy updated 4 years ago
7
devanshkv/fetch #6

Using predict.py

I have a .fil file, from the Parkes FRB010125 data. The file size is a few MB and contains a single burst. Can I use a model of FETCH to find if the filterbank file contains an FRB or not? If yes, how…

parulj3795 updated 4 years ago
1
google/uis-rnn #71

Embedding Extraction Procedure

> The flowchart of our diarization system is provided in Fig. 1. In this system, audio signals are first transformed into frames of width 25ms and step 10ms, and log-mel-filterbank energies of dimensi…

divyeshrajpura4114 updated 4 years ago
1

上一页 1...81 82 83 84 85 86 87...100 下一页

1000+ results for filterbank

1000+ results
for filterbank