-
How can I get the transcription of a single WAV file? Is there another way to run `asr_recog.py` on a single WAV file? I have looked into the Kaldi tools used to extract filterbank and pitch features l…
-
Hey @reuben and Mozilla team, in the original Baidu DeepSpeech-1 paper the input is filterbank features rather than MFCCs. You also pushed some changes to use them. Have you compared the results of both on…
-
Include some other chroma variants, as described in [Mueller and Ewert, 2011](http://ismir2011.ismir.net/papers/PS2-8.pdf):
- [x] Pitch
- [x] Chroma
- [x] CENS
- [ ] CRP
- [ ] LBC [Ni, Y.; McVicar, M.…
-
It would be useful to have documentation on each of the key flags for Train and for Decoder. Some are self-explanatory, but others aren't clear or are missing information on the valid range or supporte…
-
For reproducibility, it would be useful to have these two mel filters. These are the two types of filters I have seen most often, so correct me if there are other, more important types of mel filters.
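The issue text does not name the two filter types. As an illustration only, the sketch below assumes they are the two mel scales most often seen in practice: the HTK formula and the Slaney (Auditory Toolbox) formula. The function names `hz_to_mel_htk` and `hz_to_mel_slaney` are hypothetical and not taken from any library mentioned here.

```python
import math


def hz_to_mel_htk(f_hz):
    """HTK-style mel scale: 2595 * log10(1 + f / 700)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def hz_to_mel_slaney(f_hz):
    """Slaney-style mel scale: linear below 1 kHz, logarithmic above."""
    f_sp = 200.0 / 3.0               # Hz per mel in the linear region
    min_log_hz = 1000.0              # start of the logarithmic region
    min_log_mel = min_log_hz / f_sp  # mel value at 1 kHz (about 15)
    logstep = math.log(6.4) / 27.0   # step size in the logarithmic region
    if f_hz < min_log_hz:
        return f_hz / f_sp
    return min_log_mel + math.log(f_hz / min_log_hz) / logstep


if __name__ == "__main__":
    # Compare the two scales at a few frequencies (assumed test points).
    for f in (0.0, 500.0, 1000.0, 4000.0, 8000.0):
        print(f, hz_to_mel_htk(f), hz_to_mel_slaney(f))
```

The two scales produce different band edges for the same number of filters, which is why stating the variant matters when reproducing results.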
-
## Describe the question
A clear and concise description of what the question is.
**Summary of work:**
Audio signal is transformed into frames (log-mel-filterbank energies features) with frame widt…
-
Dear all,
Kindly help me with this.
I am trying to train my model, but I am stuck on a few errors. Can you please elaborate on how you solved this problem?
Below is my log on console. I get…
-
Hello, I have a small problem with GNU Radio Companion. I built everything from source using this command:
```
cmake -DPYTHON_EXECUTABLE=/usr/bin/python2.7 -DPYTHON_INCLUDE_DIR=/usr/include/python2.…
-
#### Description
The transformation of a windowed Fourier spectrogram into a mel-frequency spectrogram returns a NumPy array in double precision (type `numpy.float64`) even when this windowed Fourie…
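
The behaviour described above follows from NumPy's type promotion rules. The minimal sketch below uses a stand-in filterbank matrix (`mel_basis`, a hypothetical name with made-up values, not the library's actual filterbank) to show that multiplying a `float64` matrix by a `float32` spectrogram yields `float64`, and that casting the filterbank to the input's dtype first keeps the result in single precision.

```python
import numpy as np

# Stand-in mel filterbank (4 mel bands x 9 frequency bins), float64 by default.
mel_basis = np.full((4, 9), 0.25, dtype=np.float64)

# Stand-in single-precision spectrogram (9 frequency bins x 3 frames).
S = np.ones((9, 3), dtype=np.float32)

# float64 @ float32 promotes to float64.
mel_default = mel_basis @ S

# Casting the filterbank to the input dtype preserves float32.
mel_cast = mel_basis.astype(S.dtype) @ S

print(mel_default.dtype, mel_cast.dtype)
```

Whether casting the filterbank is the right fix for the library in question is a separate decision; the sketch only shows where the promotion comes from.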