-
I have been trying to implement paper "Deep clustering: Discriminative embeddings for segmentation and separation", but I am not able to create batches because each audio file have different no of fra…
-
hello, I am studying librispeech recipe and have 2 questions.
Q1.
Why use the Average ASR / LM model in the decoding stage ?
Does it mean that we store a snapshot of the appropriate weights foun…
ghost updated
4 years ago
-
**System information**
- OS Platform and Distribution (e.g., Linux Ubuntu 16.04): Windows 10
- TensorFlow installed from (source or binary): `conda install tensorflow`
- TensorFlow version (CPU) : …
-
Recently, I change some code in the multichannel end-to-end chime4 asr. I estimate the TDOA and improve the beamforming using steer vector. I check you code and plot the enhanced FFT Spectrogram. I j…
-
```
2020-04-01 04:01:15,648 (encoders:239) INFO: BLSTM with every-layer projection for encoder
2020-04-01 04:01:15,954 (asr_init:220) WARNING: loading ['enc.enc.'] from model: /data/karan/Yashsh/Too…
-
**You must follow the issue template and provide as much information as possible. otherwise, this issue will be closed.
请按照 issue 模板要求填写信息。如果没有按照 issue 模板填写,将会忽略并关闭这个 issue**
## Check List
Thanks…
-
I'm interested in implementing a real-time, streaming version of the separation method.
Do you have any advice on how to extract the model weights for this?
Would it be best to retrain, and sa…
-
#Progress
* Installed and tested the espnet with simple examples (get familiar with the CLSP GRID) and also tested the remote debug with the servers in CASIA, using the resources there to conduct the…
-
Hi, is it possible to use ESPnet for realtime ASR?
I find that Kaldi can do online decoding by using https://github.com/alumae/kaldi-gstreamer-server and
https://github.com/alumae/gst-kaldi-nnet2-…
-
Hi guys,
I'd like to contribute my models i've trained on wmt17 news commentary for chinese-english.
Are there any guidelines on dataset? I randomly split dataset 80% training 10% validation 10…