-
Multi-frequency Cepstral Coefficients - a common set of speech features.
-
Set the filterbank and transform to MelFilterbank and dct, respectively? There's always the Cepstrogram class if other parameters are needed / wanted.
Move the FFCC_\* constants into the class.
-
Hi,
thank you very much for this very useful project.
I started doing some speech recognition experiments with the MFCC features implemented in torchaudio. In particular, I tried the librosa ones …
-
# MFCCs - ratsgo's speechbook
articles about speech recognition
[https://ratsgo.github.io/speechbook/docs/fe/mfcc](https://ratsgo.github.io/speechbook/docs/fe/mfcc)
-
**Description:** I intend to work on implementing the MFCC for feature extraction from audio signals. The MFCC is a popular technique for extracting features from audio signals such as voice or music,…
-
Hi,
I'm trying to do some experiments with the kaldi-compliant MFCCs, but I run into some possible issues:
1- When I run the following code
```
file='/home/mirco/datasets/TIMIT/test/dr5/fnl…
-
Kaldi has several features available for use, these should be options in MFA:
- [ ] MFCC + pitch features
- [ ] LDA on MFCC features
- [ ] PLP features
The current default is MFCCs + deltas.
All fea…
-
Hi!
Firstly, thanks for releasing this work on GitHub -- it's been super useful in my research.
The problem seems to be in the call to `wavfile_to_mfccs` in the TIMIT pre-processing [here](ht…
-
Hi Shawn,
I have trained a multiclass classifier for speech recognition using your tutorials with tensorflow. The model can predict but it always outputs a single class. I suppose the problem is with…
-
Hi, I cannot make same result of librosa's mfccs. This maybe relate to #206
Librosa code:
```python
def getMfcc(file_name):
X, sample_rate = sf.read(file_name, dtype='float32')
mfccs = l…