-
## Hello, i being able to perform extractions of MFCCs with a lot of performance on android smartphones.
I advise you to look at an implementation I made in this [repository](https://github.com/cer…
-
`DataLoader` class to load MFCC features and e2e-format FSTs from disk into a pair of features Tensor and FST and create minibatches.
We can follow the way Deepspeech does this i.e. to use an SCP-l…
-
Hey, first of all, great work!
Two things bug me though:
1. What's the semantic value of the HuBERT model you trained if it's using the first RVQ layer of the _acoustic_ tokenizer? I.e. the acoust…
-
Hello!
ASVTorch generates 24 MFCCs, so the MFCCS are (n, 24) shape. Your input is (200, 30). Where is the 30 from? Can you please provide some test samples?
-
```
HCompV -A -T 1 -C /nfs/home-GLITE/sagrid005/home_cream_125980864/CREAM125980864/cvn_vectors -k */mfccs/%%%_*.mfc -q v -S /nfs/home-GLITE/sagrid005/home_cream_125980864/all.mfccs.lst
HMM Def Error…
-
## What is the issue about?
- [x] Bug
- [ ] Feature request
- [ ] Usage question
- [ ] Documentation
- [ ] Contributing / Development
## What part(s) of Essentia.js is involved?
- [x] essenti…
-
I would like to start drafting a new package for speech signal processing, focused mainly on speech feature extraction (MFCCs, LPCs, fundamental frequency, etc). @davidavdav has a lot of work on MFCCs…
-
Hey guys,
I am a bit lost. I would like to port python tflite code to rust. I have a model with:
**Input**
conv2d_6_input
type float32[1,16,35,1]
**Output**
float32[1,1]
The corresponding pyt…
-
test_load_audio.py line 13
pair = range(2, 2 + data_length)
need changed to
pair = range(2, 1 + data_length)
-
One essential difference between htk's variant of MFCC and other implementations is the preemphasis of high frequencies. This is done by means of a IIR filter (as far as I understand) as explained in …